D
David Winter
As a technical author and translator, I am highly interested in single
source/multi format publishing. Meaning: I'd like to keep manuals,
technical specifications etc. in multiple languages (English, French)
in a *single* repository (<- files or database) and generate documents
in the various languages and target formats (XHTML, PDF, HTML Help,
Text) on demand.
I am not a programmer, though, and can't develop my own tools, but of
course I am willing to invest money and spend time learning.
I understand that I could use an existing XML Schema such as DocBook or
cook my own and then use XSLT to generate the various output formats.
Since I'm not keen on reinventing the wheel, I'd like to ask you what
would be a good (proven) way to achieve the following. I am looking for
a set of tools and technologies that will work together reliably, and I
assume others have solved these problems before. I'd be grateful if
someone could answer a few of the following questions.
1) Authoring tool?
I guess using a native XML editor from the start would be a better
approach than exporting from some proprietary format such as
FrameMaker. I have considered <oxygen/> and the XML Mind Editor. Are
these good editors for daily work on big, complex documents? What other
products would you recommend for a user fluent with plain text editors,
Frame and Dreamweaver? (A *cough* WYSIWYG environment (using some CSS)
would be appreciated.)
2) Appropriate XML Schema/DTD: DocBook or ..?
DocBook is impressing, but - forgive my blasphemy - seems a bit baroque
while missing pieces I would need for certain clients/technologies. Now
this may seem a bit megalomaniac, but if I wanted to build my own XML
Schema - what tools should I use? The Altova product suite seems
professional, but maybe overkill for a freelancer. What would you
suggest?
3) XSLT
I understand the XSLT processor does most of the magic that turns XML
into target formats. Assuming you'd want XHTML, pretty PDFs and HTML
Help - what would be my weapon of choice as a non-programmer? I'd like
to be able to modify PDF and HTML output, so a "blackbox" app is out of
question.
4) Multilingual documents
To prevent version drift, I would like to keep the text for all
languages in the same file. I.e. the (imaginary) <head1> tag should
hold both the English "Introduction" and the French "Préliminaire".
What's the best approach to achieve this? I can hardly have two <head1
lang="FOO"> tags when my DTD/Schema allows only one. Namespaces?
5) Index/TOC/Document outline
A (multi-level) Index, Table of Contents and maybe a (collapsible)
outline view of a document - does XSLT take care of these? Are there
e.g. sample XSLT stylesheets that can generate a hyperlinked outline of
an XML document in HTML?
6) Conditional Text
What I mean here is text that can be filtered out when generating
target formats. Assuming I want to do something like "Only generate the
digest version of the manual" - does DocBook allow me to tag sections
as "Only for Digest Version"? What would be the generic approach to do
this in XML, and how can I combine them on rendering ("Only for PDF"
AND "Digest")?
7) CAT translation
Integration of Translation Memory Tools: Is there an easy way to feed
XML (e.g. DocBook) documents into CAT tools? Ideally, this would accept
<para lang="EN">Source</para> and generate <para
lang="FR">Target</para> from a TU database.
Thank you for helping.
source/multi format publishing. Meaning: I'd like to keep manuals,
technical specifications etc. in multiple languages (English, French)
in a *single* repository (<- files or database) and generate documents
in the various languages and target formats (XHTML, PDF, HTML Help,
Text) on demand.
I am not a programmer, though, and can't develop my own tools, but of
course I am willing to invest money and spend time learning.
I understand that I could use an existing XML Schema such as DocBook or
cook my own and then use XSLT to generate the various output formats.
Since I'm not keen on reinventing the wheel, I'd like to ask you what
would be a good (proven) way to achieve the following. I am looking for
a set of tools and technologies that will work together reliably, and I
assume others have solved these problems before. I'd be grateful if
someone could answer a few of the following questions.
1) Authoring tool?
I guess using a native XML editor from the start would be a better
approach than exporting from some proprietary format such as
FrameMaker. I have considered <oxygen/> and the XML Mind Editor. Are
these good editors for daily work on big, complex documents? What other
products would you recommend for a user fluent with plain text editors,
Frame and Dreamweaver? (A *cough* WYSIWYG environment (using some CSS)
would be appreciated.)
2) Appropriate XML Schema/DTD: DocBook or ..?
DocBook is impressing, but - forgive my blasphemy - seems a bit baroque
while missing pieces I would need for certain clients/technologies. Now
this may seem a bit megalomaniac, but if I wanted to build my own XML
Schema - what tools should I use? The Altova product suite seems
professional, but maybe overkill for a freelancer. What would you
suggest?
3) XSLT
I understand the XSLT processor does most of the magic that turns XML
into target formats. Assuming you'd want XHTML, pretty PDFs and HTML
Help - what would be my weapon of choice as a non-programmer? I'd like
to be able to modify PDF and HTML output, so a "blackbox" app is out of
question.
4) Multilingual documents
To prevent version drift, I would like to keep the text for all
languages in the same file. I.e. the (imaginary) <head1> tag should
hold both the English "Introduction" and the French "Préliminaire".
What's the best approach to achieve this? I can hardly have two <head1
lang="FOO"> tags when my DTD/Schema allows only one. Namespaces?
5) Index/TOC/Document outline
A (multi-level) Index, Table of Contents and maybe a (collapsible)
outline view of a document - does XSLT take care of these? Are there
e.g. sample XSLT stylesheets that can generate a hyperlinked outline of
an XML document in HTML?
6) Conditional Text
What I mean here is text that can be filtered out when generating
target formats. Assuming I want to do something like "Only generate the
digest version of the manual" - does DocBook allow me to tag sections
as "Only for Digest Version"? What would be the generic approach to do
this in XML, and how can I combine them on rendering ("Only for PDF"
AND "Digest")?
7) CAT translation
Integration of Translation Memory Tools: Is there an easy way to feed
XML (e.g. DocBook) documents into CAT tools? Ideally, this would accept
<para lang="EN">Source</para> and generate <para
lang="FR">Target</para> from a TU database.
Thank you for helping.