Skip to main content



      Home
Home » Archived » BIRT » How to best handle badly formatted data
How to best handle badly formatted data [message #182975] Mon, 07 August 2006 06:10 Go to next message
Eclipse UserFriend
Originally posted by: exquisitus.transentia.com.au

I am curently handling CSV data using the flat file data source.

Turns out SOME rows in the data are 'bad' (ie don't match the
expected/required # fields).

I am open to suggestions as to how best to handle this situation in BIRT
2.1.

I anticipate developing a POJO that can reject bad lines (perhaps
providing a collection of 'BadLine' objects; at minimum a count of
rejected lines) while presenting a collection of 'GoodData' objects for
further processing.

Is this naive or reinventing the wheel?

Cheers,

Exquisitus
Re: How to best handle badly formatted data [message #183101 is a reply to message #182975] Mon, 07 August 2006 12:13 Go to previous messageGo to next message
Eclipse UserFriend
You may be able to use a filter. How do you know if a row is bad?

Jason

"exquisitus" <exquisitus@transentia.com.au> wrote in message
news:575750d809582c7e0379e1dfb0c27675$1@www.eclipse.org...
>I am curently handling CSV data using the flat file data source.
>
> Turns out SOME rows in the data are 'bad' (ie don't match the
> expected/required # fields).
>
> I am open to suggestions as to how best to handle this situation in BIRT
> 2.1.
> I anticipate developing a POJO that can reject bad lines (perhaps
> providing a collection of 'BadLine' objects; at minimum a count of
> rejected lines) while presenting a collection of 'GoodData' objects for
> further processing.
>
> Is this naive or reinventing the wheel?
>
> Cheers,
>
> Exquisitus
>
Re: How to best handle badly formatted data [message #183190 is a reply to message #182975] Mon, 07 August 2006 14:59 Go to previous message
Eclipse UserFriend
Originally posted by: ablancas_ at_pervasive_dot_com.x.y.zzz

You might want to do the data analysis ahead of running the report, perhaps staging the clean data to a second file for
input to BIRT (same for bad data or the results of the validation). Data analysis is probably to much of a burden for a
chart or report engine to handle.


"exquisitus" <exquisitus@transentia.com.au> wrote in message news:575750d809582c7e0379e1dfb0c27675$1@www.eclipse.org...
>I am curently handling CSV data using the flat file data source.
>
> Turns out SOME rows in the data are 'bad' (ie don't match the expected/required # fields).
>
> I am open to suggestions as to how best to handle this situation in BIRT 2.1.
> I anticipate developing a POJO that can reject bad lines (perhaps providing a collection of 'BadLine' objects; at
> minimum a count of rejected lines) while presenting a collection of 'GoodData' objects for further processing.
>
> Is this naive or reinventing the wheel?
>
> Cheers,
>
> Exquisitus
>
Previous Topic:package selection
Next Topic:Setting a timeout on a dataset query
Goto Forum:
  


Current Time: Sun Jun 08 13:33:23 EDT 2025

Powered by FUDForum. Page generated in 0.03879 seconds
.:: Contact :: Home ::.

Powered by: FUDforum 3.0.2.
Copyright ©2001-2010 FUDforum Bulletin Board Software

Back to the top