Solr version 0.02
=================

The README is used to introduce the module and provide instructions on
how to install the module, any machine dependencies it may have (for
example C compilers and installed libraries) and any other information
that should be provided before the module is installed.

A README file is required for CPAN modules since CPAN extracts the
README file from a module distribution so that people browsing the
archive can use it to get an idea of the module's uses. It is usually a
good idea to provide version information here so that people can
decide whether fixes for the module are worth downloading.

INSTALLATION

To install this module type the following:

   perl Makefile.PL
   make
   make test
   make install

DEPENDENCIES

This module requires these other modules and libraries:

  POSIX
  File::Slurp
  Log::Log4perl
  LWP::UserAgent
  HTTP::Request
  XML::Simple

NAME

Solr - Perl extension for interfacing with Solr, a Lucene-based search
engine.

SYNOPSIS

  use Solr;

  my $solr = Solr->new(schema  => "/home/garafolat/cpan/Solr/schema.xml",
                       port    => "8302",
                       url     => "http://solr.foobar.com",
                       log_dir => "/var/opt/logs");

  # NOTE: timeout is an optional parameter for each method. LWP uses it
  # as the time to wait for a response result code from Solr. It
  # defaults to 600 seconds if not supplied.

  # Takes a reference to a hash of hashes of fields and values, creates
  # Solr XML, and posts it to the server defined in the constructor.
  # Note: an update is performed by adding a document with an existing
  # uniqueID field.
  $solr->add($hashRef, $timeout);

  # Simply posts a file. The file must be in valid Solr XML add syntax
  # matching your Solr server's schema for field requirements.
  $solr->add_by_file($file, $timeout);

  # Posts a file to the Solr server. Differs from add_by_file only in
  # logging.
  $solr->post_file($file, $timeout);

  # Deletes by the field declared as uniqueID in Solr's schema.xml.
  $solr->delete_by_id($id, $timeout);

  # Deletes by the query string provided.
  $solr->delete_by_query($query, $timeout);

  # Commits adds and deletes.
  $solr->commit();

  # Issues a post to the Solr server that causes it to perform an
  # optimize. timeout is an optional parameter, expressed in seconds,
  # after which LWP stops waiting for a return result code from Solr.
  # Defaults to 600 if not supplied.
  $solr->optimize($timeout);

For more details on Solr's XML posting used by these methods see:
http://wiki.apache.org/solr/UpdateXmlMessages#head-f5553a8214869b11708ad5476e05e7b05a024c23

A Note On Logging

Solr.pm uses Log4perl for logging. Log4perl is a Perl port of log4j.
To disable logging, set the optional initialization variable
disable_logging=>1. Further reading on Log4perl can be found at:

  http://search.cpan.org/~mschilli/Log-Log4perl-1.14/lib/Log/Log4perl.pm
  http://www.perl.com/pub/a/2002/09/11/log4perl.html?page=1
  http://log4perl.sourceforge.net/releases/Log-Log4perl/docs/html/Log/Log4perl/FAQ.html

NOTE on Adding documents

Currently there is no error checking in the add_by_file and add
methods. It is possible to implement such checking by using the schema
object to compare the fields in the schema with the data structure
passed into add or with the file referenced in add_by_file. I haven't
implemented this yet, as simply catching a nonzero result code from
the Solr server is sufficient for my needs in notifying of an error at
this time.

DESCRIPTION

This module provides a set of methods for adding (updating) and
deleting entries in an existing Solr server.

EXPORT

None by default.

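As a rough illustration of the "hash of hashes of fields and values" that add() is described as taking, the sketch below builds a Solr <add> message from such a structure. It is not taken from Solr.pm: build_add_xml and xml_escape are hypothetical helpers, and the layout of the hash (outer key labels a document, inner hash maps field name to value) is an assumption, since this README does not spell it out. Only the <add><doc><field name="..."> message shape comes from Solr's update XML format.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Solr.pm): escape characters that are
# special in XML text and attribute values.
sub xml_escape {
    my ($text) = @_;
    $text =~ s/&/&amp;/g;
    $text =~ s/</&lt;/g;
    $text =~ s/>/&gt;/g;
    $text =~ s/"/&quot;/g;
    return $text;
}

# Hypothetical helper (not part of Solr.pm): turn a hash of hashes into
# a Solr <add> message. ASSUMPTION: each outer key only labels one
# document, and each inner hash maps a field name to a single value.
sub build_add_xml {
    my ($docs) = @_;
    my $xml = "<add>";
    for my $key (sort keys %$docs) {
        my $fields = $docs->{$key};
        $xml .= "<doc>";
        for my $name (sort keys %$fields) {
            $xml .= sprintf '<field name="%s">%s</field>',
                xml_escape($name), xml_escape($fields->{$name});
        }
        $xml .= "</doc>";
    }
    $xml .= "</add>";
    return $xml;
}

# Example input; the field names must exist in your own schema.xml.
my %docs = (
    doc1 => { id => "1001", title => "Cats & dogs" },
);
print build_add_xml(\%docs), "\n";
```

Keys are sorted only so that the generated XML is stable between runs. With the module itself, a reference to a structure like %docs would be passed to $solr->add, assuming the layout above matches what add expects.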

THINGS TO DO

  - Add options to pass optional parameters to Solr in the various
    post methods
  - Add Solr search query functionality and parsing of:
      a) XML results
      b) user-defined result formats
  - Incorporate error checking of adds and deletes:
      a) Level 1, check for proper syntax
      b) Level 2, compare posts against the schema as a sanity check

If you'd like to make contributions to the project, have found bugs,
or have suggestions for additions, please feel free to email me at the
address below.

SEE ALSO

See http://wiki.apache.org/solr/FrontPage for additional documentation
on setting up and using Solr.

AUTHOR

Timothy Garafola, timothy.garafolat@cnet.com

COPYRIGHT AND LICENSE

Copyright (C) 2008 by CNET Networks

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.