Re: What can I parse abiword with?

From: J.M. Maurer <uwog_at_uwog.net>
Date: Thu Feb 01 2007 - 12:19:20 CET

On Thu, 2007-02-01 at 12:17 +0100, J.M. Maurer wrote:
> On Thu, 2007-02-01 at 04:13 -0600, Lars Eighner wrote:
> > Anyone interested in following my progress on this project, my "conjured
> > documentation" of AbiWord, or development of a usable DTD can follow any or
> > all of that at <http://larseighner.com/AbbyWriter/> (unless I get a real
> > typewriter first, in which case I will be doing real work).
> >
> >
> > On Tue, 30 Jan 2007, Dominic Lachowicz wrote:
> >
> > > I don't know if this addresses your question or not, but our DTD is
> > > hopelessly out of date, and trying to validate against it is probably a lost
> > > cause. However, it is valid XML and any conforming XML parser should be able
> > > to parse the ABW documents.
>
> FYI: I updated our DTD 2 weeks ago, but I didn't do all of it. It's
> missing some tiny pieces, nothing that couldn't be added without 2
> hours.

It can be found where btw: http://www.abisource.com/awml.dtd

Marc

> Marc
>
> > > There's no API for generating documents to speak of. The best advice I can
> > > give you at the moment is:
> > >
> > > 1) Your documents seem simple enough. Make 1 or 2 sample documents using
> > > AbiWord, and then craft your program to generate those sorts of documents
> > > using print() or some XML building API.
> > > 2) If your documents really are plaintext or something similar (maybe
> > > HTML?), you can use AbiWord to convert that to whatever format you desire.
> > > If you use AbiWord 2.5, you don't even need the input or output document to
> > > hit the disk.
> > >
> > > Good luck,
> > > Dom
> > >
> > > On 1/30/07, Lars Eighner <abbynormal@larseighner.com> wrote:
> > >>
> > >>
> > >> It seems I cannot parse abiword documents with ONSGMLS.
> > >>
> > >> Attributes id and xid are given type ID, but they are all numeric,
> > >> including the first character. When I add the numeric characters
> > >> to NAMESTRT in the sgml declaration, ONSGMLS pukes. If I change
> > >> the (local) awml.dtd so the type is CDATA, of course ONSGMLS will
> > >> parse it, but then it does not check id and xid for uniqueness.
> > >>
> > >> The reason I want to do this, is I want to filter essentially plain
> > >> text documents through a perl script to produce an abiword document.
> > >> (So that I can compose in a text-mode editor such as JOE, as the
> > >> abiword GUI is practically unusable, and I have to build abiword
> > >> templates by hand anyway. The abiword format is, of course, very
> > >> brittle, and abiword offers no diagnostics when it rejects a document.
> > >> It would be helpful to parse the candidate documents first to eliminate
> > >> the obvious errors.)
> > >>
> > >> What will parse abiword documents correctly and throw meaningful
> > >> error messages?
> > >>
> > >> --
> > >> Lars Eighner
> > >> http://www.larseighner.com/index.html
> > >> 8800 N IH35 APT 1191 AUSTIN TX 78753-5266
> > >>
> > >> -----------------------------------------------
> > >> To unsubscribe from this list, send a message to
> > >> abiword-user-request@abisource.com with the word
> > >> unsubscribe in the message body.
> > >>
> > >
> > >
> > >
> > >
> >
>
> -----------------------------------------------
> To unsubscribe from this list, send a message to
> abiword-user-request@abisource.com with the word
> unsubscribe in the message body.

-----------------------------------------------
To unsubscribe from this list, send a message to
abiword-user-request@abisource.com with the word
unsubscribe in the message body.
Received on Thu Feb 1 12:19:57 2007

This archive was generated by hypermail 2.1.8 : Thu Feb 01 2007 - 12:19:57 CET