Hadoop Professionals

A Community for Hadoop Users

Notes and Presentation from the HUG talk on Nov 18th - Integrating Solr & Katta for Scaled, Reliable Search

I have attached the slides to this discussion and will attach key pieces of the configuration files shortly.

Tags: hadoop, katta, reliable, scale, search, solr


Attachments:


Replies to This Discussion

There are two Solr config files in the setup.
One is the solrconfig.xml that is bundled as part of each index, and the other is the solrconfig that is used by each front-end Solr instance.

The way this works is that each Solr index is hosted by a Katta node and handles a special type of request, the KattaRequest. Each of these indexes is a named shard.

In addition to these embedded Solr instances, there are one or more front-end Solr instances. These instances provide the user interface; I commonly run them in Jetty, using a hacked version of the bundled examples.
These front-end instances take the incoming requests and distribute the queries to the specified shard names. The results are collected, merged, and returned to the requesting user/agent.
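To make the fan-out and merge step concrete, here is a minimal sketch in Python. It is not the Katta or Solr API: the shard names, the in-memory SHARDS table, and the functions query_shard and distributed_search are all hypothetical stand-ins. It only illustrates the idea of sending one query to several named shards, taking each shard's top hits, and merging them by score.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for index shards hosted on Katta nodes.
# Each shard maps to a list of (doc_id, score) pairs.
SHARDS = {
    "shard-0": [("doc-a", 0.91), ("doc-b", 0.40)],
    "shard-1": [("doc-c", 0.75), ("doc-d", 0.10)],
    "shard-2": [("doc-e", 0.88)],
}


def query_shard(shard_name, query, rows):
    """Return the shard's top `rows` hits as (score, doc_id, shard_name)."""
    hits = SHARDS[shard_name]  # a real shard would evaluate `query` here
    top = heapq.nlargest(rows, hits, key=lambda hit: hit[1])
    return [(score, doc_id, shard_name) for doc_id, score in top]


def distributed_search(query, shard_names, rows=3):
    """Fan the query out to every named shard, then merge by score."""
    with ThreadPoolExecutor(max_workers=len(shard_names)) as pool:
        partials = list(
            pool.map(lambda name: query_shard(name, query, rows), shard_names)
        )
    merged = [hit for partial in partials for hit in partial]
    # Tuples compare on their first element, so this keeps the best scores.
    return heapq.nlargest(rows, merged)


results = distributed_search("hadoop", ["shard-0", "shard-1", "shard-2"], rows=3)
for score, doc_id, shard_name in results:
    print(f"{score:.2f}  {doc_id}  ({shard_name})")
```

Each shard only has to return its own top `rows` hits, because no document outside a shard's local top `rows` can appear in the global top `rows`.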
Configuration files
front-end-solr-config.xml is the solrconfig for the Solr instances that serve user queries
zoo.cfg is the configuration for my ZooKeeper cluster
katta.zk.properties is the Katta configuration that points to the nodes listed in zoo.cfg
katta.master.properties is the Katta file that sets the distribution policy for the index shards
back-end-solr-config.xml is the solrconfig for my individual shards
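Since the Katta configuration has to point at the same ZooKeeper nodes that zoo.cfg defines, a small sanity check can help keep the two in sync. The sketch below is an illustration only: the hostnames and the helper zk_connect_string are hypothetical, and the sample zoo.cfg is not the attached file. It reads the standard clientPort and server.N entries from a zoo.cfg and builds the usual host:port,host:port connect string that ZooKeeper clients expect.

```python
def zk_connect_string(zoo_cfg_text):
    """Build a ZooKeeper connect string from the text of a zoo.cfg file."""
    client_port = "2181"  # conventional client port, used if clientPort is absent
    hosts = []
    for raw in zoo_cfg_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = (part.strip() for part in line.split("=", 1))
        if key == "clientPort":
            client_port = value
        elif key.startswith("server."):
            # server.N=host:peerPort:electionPort -> keep only the host
            hosts.append(value.split(":")[0])
    return ",".join(f"{host}:{client_port}" for host in hosts)


# Hypothetical three-node ensemble; hostnames and paths are placeholders.
EXAMPLE_ZOO_CFG = """\
tickTime=2000
dataDir=/var/lib/zookeeper
clientPort=2181
initLimit=10
syncLimit=5
server.1=zk1.example.com:2888:3888
server.2=zk2.example.com:2888:3888
server.3=zk3.example.com:2888:3888
"""

connect = zk_connect_string(EXAMPLE_ZOO_CFG)
print(connect)
```

The resulting string can then be compared by hand against the server list in katta.zk.properties.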
Attachments:
more files
Attachments:


© 2012   Created by Jason Venner.
