Hadoop Professionals

A Community for Hadoop Users

Jason Venner

Notes and Presentation from the HUG talk on Nov 18th - integrating solr & katta for scaled reliable search

I have attached the slides to this discussion and will attach key pieces of the configuration files shortly.

Tags: hadoop, katta, reliabile, scale, search, solr

Attachments:

Reply to This

Replies to This Discussion

There are 2 solr config files in the setup.
We have the solrconfig.xml that is bundled as part of each index, and we have a solrconfig that is used by each front solr instance.

The way this works is that each solr index is hosted by a katta node, and handles a special type of request the KattaRequest. Each of these indexes is a named shard.

In addition to these embedded solr instances, there are 1 or more front end solr instances. These instances provide a user interface, I commonly run them in jetty, using a hacked version of the bundled examples.
These solr instances take the incoming requests and distributed the queries to the specified shard names. The results are collected, merged and returned to the requesting user/agent.

Reply to This

Configuration files
front-end-solr-config.xml is the solconfig for the solar instances that serve user queries
zoo.cfg is the configuration for my zookeeper cluster
katta.zk.properties is the katta configuration that points to the zoo.cfg nodes
katta.master.properties is the katta file that sets the distribution policy for the index shards
back-end-solr-config.xml is the solrconfig for my individual shards.
Attachments:

Reply to This

more files
Attachments:

Reply to This

Reply to This

RSS

Groups

© 2010   Created by Jason Venner.   Powered by .

Badges  |  Report an Issue  |  Terms of Service

Sign in to chat!