
Stop words can also serve another purpose. You can filter out words that are so common in a particular set of data that the system can't handle them in a useful way. For example, consider the word "fish" in our dataset. It's probably very common. With only 500 fish being indexed it's not really going to make much difference, but what if we were indexing five million fish, and each one had the word "fish" in the description even just five times? That's 25 million occurrences of the word "fish". Eventually we might start to hit the upper limit of what Solr can handle. The word "fish" in this case is probably also not very useful in a search query. You're browsing a fish database. Are you really likely to search for the query fish and expect any meaningful results? Likely it would instead return every result. It would be like going to Drupal.org and searching for the word "drupal" and expecting to get something useful. Not going to happen.
Solr has the ability to read in a list of stop words, or words that should be ignored during indexing, so that those words do not clutter your index and are removed from influencing result relevancy. In this tutorial we'll take a look at configuring stop words for Solr.
First, we'll use the Solr web UI to see the most common terms in our index for the body field. Then, based on that list, and the list of common stop words provided by the Solr team, we'll configure our stopwords.txt file. Finally, we'll re-index all the content of our site so that it makes use of the new stop words configuration and re-examine the most common terms noting that our stop words no longer appear in the list.
By the end of this tutorial you should be able to use the Solr web UI to get a list of the most common terms in your index, and know how to add terms to Solr's stopwords.txt file to prevent them from showing up in your index.
To learn more please visit https://drupalize.me
drupal console D7 Search API and Solr YouTube | |
2 Likes | 2 Dislikes |
1,228 views views | 8.87K followers |
How-to & Style | Upload TimePublished on 29 Jun 2015 |
Related keywords
search by image,solrock,solrac,search party,training day,searching filme,solrock pokemon,solriamfetol,search www,search for a cure,http://solerelief.com,training shed,training significado,search engine marketing,sol relief,drupal bootstrap,training with hinako,search console google,search and destroy,drupal tutorial,solrepublic,training camp,training que es,search engine,training wheels,http://drupal.org,search manager,training gym,search icon,drupal شرح,search www drama,solr query,drupal developer,search engine optimization,drupal php,solr documentation,solr search engine,drupal commerce,training shoes,training point,drupal 7 exploit,solrcloud,solr download,solr vs elasticsearch,drupal modules,training force,training center garmin,search family,training peaks,solriamfetol cost,training deporte,searching,solrock pokemon go,training center,drupal download,search twitter,drupal 7,solresol,search significado,search console,drupal 8,drupal themes,training mask,drupal themes free,solrx,drupal 7 download,search tradução,drupal theme,training wheels lyrics,drupal core,training me,solrock weakness,training wheels letra,drupal exploit,search modes,training traduccion,solr tutorial,drupal 8 download,solrenview,
Không có nhận xét nào:
Đăng nhận xét