abiword-user Mailing List Archive: Re: What can I parse abiword with?

From: J.M. Maurer <uwog_at_uwog.net>
Date: Thu Feb 01 2007 - 12:17:42 CET

On Thu, 2007-02-01 at 04:13 -0600, Lars Eighner wrote:
> Anyone interested in following my progress on this project, my "conjured
> documentation" of AbiWord, or development of a usable DTD can follow any or
> all of that at <http://larseighner.com/AbbyWriter/> (unless I get a real
> typewriter first, in which case I will be doing real work).
>
>
> On Tue, 30 Jan 2007, Dominic Lachowicz wrote:
>
> > I don't know if this addresses your question or not, but our DTD is
> > hopelessly out of date, and trying to validate against it is probably a lost
> > cause. However, ABW documents are well-formed XML, and any conforming XML
> > parser should be able to parse them.
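
As a minimal sketch of that suggestion, the following Python function runs a
document through the expat parser from the standard library and reports the
position of the first well-formedness error. The function name
check_well_formed is hypothetical. The check covers well-formedness only and
does not validate against the AbiWord DTD.

```python
import xml.parsers.expat


def check_well_formed(data):
    """Return None if data (bytes) is well-formed XML, else an error message.

    The message carries the line and column that expat reports, so it can
    point at the spot where a generated document went wrong.
    """
    parser = xml.parsers.expat.ParserCreate()
    try:
        # True marks this as the final (and only) chunk of input.
        parser.Parse(data, True)
    except xml.parsers.expat.ExpatError as err:
        reason = xml.parsers.expat.ErrorString(err.code)
        return "line %d, column %d: %s" % (err.lineno, err.offset, reason)
    return None
```

Calling check_well_formed(open("candidate.abw", "rb").read()) on a generated
file (the file name is only an example) returns None for a well-formed
document and a one-line diagnostic otherwise.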

FYI: I updated our DTD 2 weeks ago, but I didn't do all of it. It's
still missing some tiny pieces, nothing that couldn't be added within 2
hours.

Marc

> > There's no API for generating documents to speak of. The best advice I can
> > give you at the moment is:
> >
> > 1) Your documents seem simple enough. Make 1 or 2 sample documents using
> > AbiWord, and then craft your program to generate those sorts of documents
> > using print() or some XML building API.
> > 2) If your documents really are plaintext or something similar (maybe
> > HTML?), you can use AbiWord to convert that to whatever format you desire.
> > If you use AbiWord 2.5, you don't even need the input or output document to
> > hit the disk.
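
Option 1 above could be sketched roughly as follows, using an XML building
API rather than print() so that characters such as & and < are escaped
automatically. This is an illustration under assumptions, not the AbiWord
format definition: the namespace URI and the element names abiword, section
and p are assumed here and should be checked against a sample document saved
from AbiWord itself. The function name text_to_abw is hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: the namespace used by AbiWord documents. Verify against a
# sample .abw file saved from AbiWord before relying on it.
AWML_NS = "http://www.abisource.com/awml.dtd"


def text_to_abw(text):
    """Wrap each blank-line-separated paragraph of plain text in a <p> element.

    Returns the serialized document as UTF-8 bytes. Only a bare skeleton is
    produced; a real document will likely need more attributes and elements,
    which a sample saved from AbiWord will show.
    """
    # Serialize the namespace as the default one (no prefix).
    ET.register_namespace("", AWML_NS)
    root = ET.Element("{%s}abiword" % AWML_NS)
    section = ET.SubElement(root, "{%s}section" % AWML_NS)
    for block in text.split("\n\n"):
        words = block.split()
        if not words:
            continue  # skip empty blocks caused by extra blank lines
        para = ET.SubElement(section, "{%s}p" % AWML_NS)
        # Join the lines of a paragraph into one line of text.
        para.text = " ".join(words)
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)
```

Building the tree first and serializing it in one step means the output is
always well-formed, which removes one class of errors that hand-written
print() calls can introduce.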
> >
> > Good luck,
> > Dom
> >
> > On 1/30/07, Lars Eighner <abbynormal@larseighner.com> wrote:
> >>
> >>
> >> It seems I cannot parse abiword documents with ONSGMLS.
> >>
> >> Attributes id and xid are given type ID, but they are all numeric,
> >> including the first character. When I add the numeric characters
> >> to NAMESTRT in the sgml declaration, ONSGMLS pukes. If I change
> >> the (local) awml.dtd so the type is CDATA, of course ONSGMLS will
> >> parse it, but then it does not check id and xid for uniqueness.
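
The uniqueness check that is lost when the type is changed to CDATA can be
approximated outside the parser. The sketch below, in Python, collects the id
and xid attribute values from a parsed document and reports any value that
occurs more than once. It assumes document-wide uniqueness per attribute name,
which is what the ID type would demand; the thread does not establish whether
AbiWord itself requires this. The function name duplicate_ids is
hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def duplicate_ids(xml_text, attrs=("id", "xid")):
    """Return {attribute name: sorted list of values that occur more than once}.

    Values are compared as strings, so purely numeric ids such as "1" are
    handled without any change to the DTD or the SGML declaration.
    """
    root = ET.fromstring(xml_text)
    report = {}
    for attr in attrs:
        counts = Counter(
            elem.get(attr) for elem in root.iter() if elem.get(attr) is not None
        )
        dupes = sorted(value for value, n in counts.items() if n > 1)
        if dupes:
            report[attr] = dupes
    return report
```

An empty result means no duplicates were found for either attribute.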
> >>
> >> The reason I want to do this is that I want to filter essentially plain
> >> text documents through a Perl script to produce an abiword document,
> >> so that I can compose in a text-mode editor such as JOE. The
> >> abiword GUI is practically unusable, and I have to build abiword
> >> templates by hand anyway. The abiword format is, of course, very
> >> brittle, and abiword offers no diagnostics when it rejects a document.
> >> It would be helpful to parse the candidate documents first to eliminate
> >> the obvious errors.
> >>
> >> What will parse abiword documents correctly and throw meaningful
> >> error messages?
> >>
> >> --
> >> Lars Eighner
> >> http://www.larseighner.com/index.html
> >> 8800 N IH35 APT 1191 AUSTIN TX 78753-5266
> >>
> >> -----------------------------------------------
> >> To unsubscribe from this list, send a message to
> >> abiword-user-request@abisource.com with the word
> >> unsubscribe in the message body.
> >>
> >
> >
> >
> >
>

Received on Thu Feb 1 12:17:39 2007

This archive was generated by hypermail 2.1.8 : Thu Feb 01 2007 - 12:17:39 CET
