[MTOS-dev] XML parsing

Byrne Reese byrne at sixapart.com
Thu Mar 6 21:35:04 PST 2008


I remember pushing hard for this as well, especially because WordPress WXR
files can become quite large, and a DOM parser requires you to load the
whole document into memory before you can even access its contents.

SAX was the obvious alternative - it meant that there was virtually no size
limit that needed to be imposed on WXR files.
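
To make the difference concrete, here is a minimal sketch of the streaming
style in Python's standard xml.sax module. It is not the importer's actual
code (that is Perl); the handler class, the callback, and the sample
document are all made up for illustration. The only thing taken from WXR is
that it is RSS-based, so posts arrive as <item> elements. The handler keeps
just the current item's title in memory and hands it off as soon as the
item closes, so memory use does not grow with the size of the file.

```python
import io
import xml.sax


class ItemTitleHandler(xml.sax.ContentHandler):
    """Hypothetical handler: reports the <title> of each <item> as it ends."""

    def __init__(self, on_item):
        super().__init__()
        self.on_item = on_item   # callback invoked once per finished <item>
        self.in_item = False
        self.in_title = False
        self.title_parts = []

    def startElement(self, name, attrs):
        if name == "item":
            self.in_item = True
            self.title_parts = []
        elif name == "title" and self.in_item:
            self.in_title = True

    def characters(self, content):
        # Text may arrive in several chunks, so accumulate the pieces.
        if self.in_title:
            self.title_parts.append(content)

    def endElement(self, name):
        if name == "title":
            self.in_title = False
        elif name == "item":
            self.in_item = False
            self.on_item("".join(self.title_parts))


def import_titles(stream):
    """Parse a file-like object and return the item titles in document order."""
    titles = []
    xml.sax.parse(stream, ItemTitleHandler(titles.append))
    return titles


# Made-up sample input; a real export would be read from a file instead.
SAMPLE = b"""<?xml version="1.0"?>
<rss><channel>
<title>Example blog</title>
<item><title>First post</title></item>
<item><title>Second post</title></item>
</channel></rss>"""

titles = import_titles(io.BytesIO(SAMPLE))
```

In a real importer the callback would save each entry rather than append
it to a list; the list is only there to keep the sketch short.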

Even WordPress requires that MT import files be less than 2MB. We don't
suffer from that restriction.

Byrne


On 3/6/08 5:06 PM, "Fumiaki Yoshimatsu" <fyoshimatsu at sixapart.com> wrote:

>>>  While we don't parse XML in any of the core functions, we do in:
>>> 
>>>  * the Atompub server(s)
>>>  * profile exchange for OpenID commenters
>> 
>> There is also the XML-RPC services and Feeds.App Lite.
> 
> Backup/Restore parses XML purely with SAX, following Tim's advice
> on the code in the wheeljack branch
> (it originally used XML::XPath).
> The WordPress Importer plugin parses WXR in mostly the same way.
> 
> Fumiaki
> 
> _______________________________________________
> MTOS-dev mailing list
> MTOS-dev at sixapart.com
> http://www.sixapart.com/mailman/listinfo/mtos-dev


