J
JS Bangs
In the continuing development of Lingua:
honology, I'm starting to
consider what the benefits would be of moving my file-parsing formats to
XML from the current custom format.
Currently, two of the sub-modules do some form of file-parsing, and the
formats they use are described at:
http://search.cpan.org/author/JASPAX/Lingua-Phonology-0.25/Phonology/Features.pm#loadfile
http://search.cpan.org/author/JASPAX/Lingua-Phonology-0.25/Phonology/Symbols.pm#loadfile
The existing formats are concise and human-readable, but completely
custom. As I'm thinking of adding file-parsing to Lingua:
honology::Rules
(and perhaps other modules), I was looking for something more reusable,
general, and powerful (especially since the Rules submodule will require
some fairly complex parsing rules). If I use XML, I can pass parsing
duties off to XML::Whatever, but I'm concerned that the costs (in terms of
verbosity) will outweight the benefits of portability and extensibility.
For example, I can currently write the following line in a file to be
parsed by Lingua:
honology::Symbols:
d +anterior -distributed voice
In XML, this might have to be as verbose as:
<symbol label="d`">
<feature name="anterior" value="+" \>
<feature name="distributed" value="-" \>
<feature name="voice" \>
</symbol>
Which is significantly heavier and less clear. I'm rather torn on this, so
I was wondering what insight the minds here have to offer. Many thanks--
--
Jesse S. Bangs (e-mail address removed)
http://students.washington.edu/jaspax/
http://students.washington.edu/jaspax/blog
Jesus asked them, "Who do you say that I am?"
And they answered, "You are the eschatological manifestation of the ground
of our being, the kerygma in which we find the ultimate meaning of our
interpersonal relationship."
And Jesus said, "What?"
consider what the benefits would be of moving my file-parsing formats to
XML from the current custom format.
Currently, two of the sub-modules do some form of file-parsing, and the
formats they use are described at:
http://search.cpan.org/author/JASPAX/Lingua-Phonology-0.25/Phonology/Features.pm#loadfile
http://search.cpan.org/author/JASPAX/Lingua-Phonology-0.25/Phonology/Symbols.pm#loadfile
The existing formats are concise and human-readable, but completely
custom. As I'm thinking of adding file-parsing to Lingua:
(and perhaps other modules), I was looking for something more reusable,
general, and powerful (especially since the Rules submodule will require
some fairly complex parsing rules). If I use XML, I can pass parsing
duties off to XML::Whatever, but I'm concerned that the costs (in terms of
verbosity) will outweight the benefits of portability and extensibility.
For example, I can currently write the following line in a file to be
parsed by Lingua:
d +anterior -distributed voice
In XML, this might have to be as verbose as:
<symbol label="d`">
<feature name="anterior" value="+" \>
<feature name="distributed" value="-" \>
<feature name="voice" \>
</symbol>
Which is significantly heavier and less clear. I'm rather torn on this, so
I was wondering what insight the minds here have to offer. Many thanks--
--
Jesse S. Bangs (e-mail address removed)
http://students.washington.edu/jaspax/
http://students.washington.edu/jaspax/blog
Jesus asked them, "Who do you say that I am?"
And they answered, "You are the eschatological manifestation of the ground
of our being, the kerygma in which we find the ultimate meaning of our
interpersonal relationship."
And Jesus said, "What?"