Skip to main content



      Home
Home » Newcomers » Newcomers » How do you parse multiple XML files with Eclipse? Can Rinzo do?(Looking for a Eclipse plugin that can parse multiple XML files making up a corpus)
How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719695] Mon, 11 January 2016 09:02 Go to next message
Eclipse UserFriend
Hello

I was looking for an app that could parse multiple XML files making up a huge collection of text called the British National Corpus, and then found a plugin for XML editing called Rinzo.

The site on Rinzo claims to parse large XML but does not explain if it can parse multiple XML files at one sitting and doesn't look easy to use. If someone out there has ever used this plugin, please let me know if it can do so. Also, if anyone knows any other way to parse multiple XML files with Eclipse, please post the technique as I could not find any.

Thank you for reading.
Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719707 is a reply to message #1719695] Mon, 11 January 2016 10:33 Go to previous messageGo to next message
Eclipse UserFriend
Parse them to what end?
Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719785 is a reply to message #1719707] Mon, 11 January 2016 22:07 Go to previous messageGo to next message
Eclipse UserFriend
Thanks for the reply.

I would like to get Eclipse to find certain words across multiple XML files and to display the lines containing the words if possible.

[Updated on: Mon, 11 January 2016 22:08] by Moderator

Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719786 is a reply to message #1719785] Mon, 11 January 2016 22:14 Go to previous messageGo to next message
Eclipse UserFriend
On 01/11/2016 08:07 PM, anon anon wrote:
> Thanks for the reply.
> I would like to get Eclipse to find certain terms across multiple XML
> files and to display the lines containing the terms if possible.

fgrep? Here are some commands for you:

fgrep -Hn term *.xml # (in same subdirectory)

find . -name '*.xml' -exec fgrep -Hn term {} \; # (in multiple
subdirectories under the current subdirectory)
Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719801 is a reply to message #1719785] Tue, 12 January 2016 02:51 Go to previous messageGo to next message
Eclipse UserFriend
The built-in basic File Search can restrict itself to whole words. I don't think there's a search feature (from eclipse.org's XML Editor and tools) that specifically deals with XML syntax other than the one for detecting Tasks (looking for keywords specifically in comments, e.g. "TODO"). Even that has a practical limitation of 2GB since it can't report locations past there.

[Updated on: Tue, 12 January 2016 02:53] by Moderator

Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719850 is a reply to message #1719785] Tue, 12 January 2016 10:06 Go to previous messageGo to next message
Eclipse UserFriend
On 01/11/2016 08:07 PM, anon anon wrote:
> Thanks for the reply.
> I would like to get Eclipse to find certain terms across multiple XML
> files and to display the lines containing the terms if possible.

Anon Anon,

If you are interested in writing something to do this, I can help you,
particularly in Java having done this myself, though I'm not willing to
do the work for you. I can give you massive amounts of SAX handler and
symbol-table code for such a project.

This isn't an Eclipse concern, so contact me directly if you care.

Cheers,

Russ Bateman
icon6.gif  Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719909 is a reply to message #1719786] Tue, 12 January 2016 23:39 Go to previous messageGo to next message
Eclipse UserFriend
Thank you, I would like to take up this option of running fgrep.
Being a complete beginner, I have so far only got up to creating a project and cannot figure fgrep can be run. Could you possibly please give me a little more advice about how running fgrep in the IDE?

The installation I have on my Mac is Eclipse for PHP Developers.

[Updated on: Tue, 12 January 2016 23:40] by Moderator

Re: How do you parse multiple XML files with Eclipse? Can Rinzo do? [message #1719957 is a reply to message #1719909] Wed, 13 January 2016 08:39 Go to previous message
Eclipse UserFriend
fgrep is a tool that's run from the command line on a *nix system. You can run it on Windows by installing Cygwin.
Previous Topic:How to import a text document and put each line in an array
Next Topic:webview upload image
Goto Forum:
  


Current Time: Wed Jul 23 20:57:01 EDT 2025

Powered by FUDForum. Page generated in 0.10823 seconds
.:: Contact :: Home ::.

Powered by: FUDforum 3.0.2.
Copyright ©2001-2010 FUDforum Bulletin Board Software

Back to the top