Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
AW: [smila-user] JDBC Crawler

Hi Andreas,

 

trying your jdbc source (without the duplicated <xml> header), I got an error in the log that the Database Connection pattern is

not valid:

 

2009-09-28 14:58:05,116 ERROR [RMI TCP Connection(5)-172.24.187.83          ]  internal.JaxbPluginContext                    - Line:Col[50:64]:cvc-pattern-valid: Value 'jdbc:sqlserver://afa-23592:1433;DatabaseName=KinkonGmbHTestDB;integratedSecurity=true;' is not facet-valid with respect to pattern ...

jdbc:sqlserver://[\w\.\]+:\d+(;(DatabaseName|HostProcess|NetAddress|Password|PortNumber|ProgramName|SelectMethod|SendStringParametersAsUnicode|ServerName|User)=[\w\i]+)*|jdbc:odbc:[\w\.\-]+ ...

 

When changing your Database Connection to e.g. :

                <Database Connection="jdbc:sqlserver://afa-23592:1433;DatabaseName=KinkonGmbHTestDB"  ...

 

everything looks fine and the data source id is shown as availabe Crawler Task in the JConsole.

 

Cheers,

 Andreas

 

Von: smila-user-bounces@xxxxxxxxxxx [mailto:smila-user-bounces@xxxxxxxxxxx] Im Auftrag von Andreas.Schultz@xxxxxxxxxxx
Gesendet: Montag, 28. September 2009 15:06
An: smila-user@xxxxxxxxxxx; smila-dev@xxxxxxxxxxx
Betreff: [smila-user] JDBC Crawler

 

Hi all,

 

I currently try to use the JDBC Crawler of SMILA without any success so far.

 

In more detail:

I try to connect to a MS SQL Server Express 2008 DB on a separate computer/server.

In general, a direct jdbc connect using a little test program has been successful.

I used the JDBC example from the SMILA web site as a template, but

1)      the (jdbc-)crawler does not appear within the jconsole CrawlerController->Operations-> getAvailableCrawlerTasks

2)      starting the crawling-process leads to an error:

Error loading DataSource with DataSourceId 'kinkonnewsjdbc': javax.xml.bind.UnmarshalException

 - with linked exception:

[org.xml.sax.SAXParseException: The processing instruction target matching "[xX][mM][lL]" is not allowed.]

 

 

 

 

 

my crawler-config named “kinkonnewsjdbc.xml” looks like:

 

<?xml version="1.0" encoding="UTF-8"?>

<!--

/***********************************************************************************************************************

 * Copyright (c) 2008 empolis GmbH and brox IT Solutions GmbH. All rights reserved. This program and the accompanying

 * materials are made available under the terms of the Eclipse Public License v1.0 which accompanies this distribution,

 * and is available at http://www.eclipse.org/legal/epl-v10.html

 *

 * Contributors: Ivan Churkin (brox IT Solutions GmbH) - initial creator

 **********************************************************************************************************************/

-->

<?xml version="1.0" encoding="UTF-8"?>

<DataSourceConnectionConfig

  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"

  xsi:noNamespaceSchemaLocation="../org.eclipse.smila.connectivity.framework.crawler.jdbc/schemas/JdbcDataSourceConnectionConfigSchema.xsd">

                <DataSourceID>kinkonnewsjdbc</DataSourceID>

                <SchemaID>org.eclipse.smila.connectivity.framework.crawler.jdbc</SchemaID>

                <DataConnectionID>

                               <Crawler>JdbcCrawler</Crawler>

                </DataConnectionID>

                <!--CompoundHandling>No</CompoundHandling-->

                <DeltaIndexing>full</DeltaIndexing>

                <Attributes>

                               <Attribute Name="Leading_ID"

      HashAttribute="true"

                                               KeyAttribute="true"

      Type="String">

                                               <ColumnName>nMessageID</ColumnName>

                                               <SqlType>string</SqlType>

                               </Attribute>

                               <Attribute Name="Category"

                                               HashAttribute="false"

                                               KeyAttribute="false"

                                               Type="String">

                                               <ColumnName>category</ColumnName>

                                               <SqlType>string</SqlType>

                               </Attribute>

                               <Attribute Name="Content"

                                               HashAttribute="false"

                                               KeyAttribute="false"

                                               Type="String">

                                               <ColumnName>content</ColumnName>

                                               <SqlType>string</SqlType>

                               </Attribute>

                </Attributes>

                <Process>

                               <Selections>

                                               <Grouping>

                                                               <Stepping>20</Stepping>

                                                               <SQL>SELECT nMessageID FROM dbo.SMILA_BookmarksView ORDER BY nMessageID ASC</SQL>

                                               </Grouping>

                                               <SQL>

                                                               SELECT nMessageID, category, content

                                                               FROM dbo.SMILA_BookmarksView where id BETWEEN %01min AND %01max

                                               </SQL>

                               </Selections>

                               <Database Connection="jdbc:sqlserver://afa-23592:1433;databaseName=KinkonGmbHTestDB;integratedSecurity=true;"

                                               FetchSize="100000"

                                               User=""

                                               Password=""

                                               JdbcDriver="com.microsoft.sqlserver.jdbc.SQLServerDriver" />

<!--Database Connection="jdbc:derby:crawlerTestDerbyDB" User=""

                                               Password="" FetchSize="100000"

                                               JdbcDriver="org.apache.derby.jdbc.EmbeddedDriver" /-->                                     

                </Process>

</DataSourceConnectionConfig>

 

 

 

Has anybody an idea?

 

Best & thanks

Andreas Schultz
Senior Software Developer

- - - - Bitte beachten Sie meine neuen Kontaktdaten - - - -


Empolis GmbH  |  Meisenstr. 90 | 33607 Bielefeld  |  Germany
AN ATTENSITY GROUP COMPANY
Phone +49 (0)521 55 785 413|  Fax +49 (0)521 55 785 121
andreas.schultz@xxxxxxxxxxx

 

www.empolis.com
Sitz Kaiserslautern  |  Amtsgericht Kaiserslautern HRB 30711  |  Geschäftsführer: Dr. Stefan Wess, Dr. Peter Tepassé

 

………………………………………………………………………………………………………………………………………………………………………………………………………..

Know. Right. Now.

Das ist unsere Philosophie. Empolis, an Attensity Group Company, bietet eine integrierte Suite von Geschäftsanwendungen,

die mit Hilfe patentierter semantischer Informations-Technologien die exponentiell wachsende Menge unstrukturierter
Daten analysiert, interpretiert und automatisiert verarbeitet. Entscheider, Experten, Mitarbeiter und Kunden erhalten so
stets situations- und aufgabengerecht genau das Wissen, das für ihre Arbeit relevant ist.

………………………………………………………………………………………………………………………………………………………………………………………………………..

Abonnieren Sie unseren monatlichen Newsletter: http://www.empolis.de/newsletter.html

 


Back to the top