Configuring searchable attributes the right way
Setting searchable attributes is one of the most important aspects of configuring your search and one of the first steps you need to take. It goes much further than telling Algolia which attributes to search: it also defines which ones to look at first and what strategy to adopt. A small change to your searchable attributes can lead to entirely different results, so it's crucial to understand how they work.
Besides deciding which attributes Algolia should search, there are two aspects you can control with searchable attributes:
- The relative importance of each attribute
- Whether the position of the match in the attribute matters
Searchable attributes with different priorities
The order of searchable attributes directly affects search relevance. Attributes higher in the list are more relevant than attributes further down. Therefore, you should set attributes higher in the searchable attributes list when their content is more relevant to your users.
Consider a movie database website where users can search movies by title, actors, and director. You may be tempted to prioritize actors before directors. Yet, consider the following dataset:
[
  {
    "title": "Mystic River",
    "director": "Clint Eastwood",
    "actors": ["Sean Penn", "Tim Robbins", "Kevin Bacon", "Laurence Fishburne"]
  },
  {
    "title": "In the Line of Fire",
    "director": "Wolfgang Petersen",
    "actors": ["Clint Eastwood", "John Malkovich", "Rene Russo"]
  },
  {
    "title": "Invictus",
    "director": "Clint Eastwood",
    "actors": ["Morgan Freeman", "Matt Damon"]
  },
  {
    "title": "Million Dollar Baby",
    "director": "Clint Eastwood",
    "actors": ["Clint Eastwood", "Hilary Swank", "Morgan Freeman"]
  }
]
For a movie search, setting the title as the first searchable attribute makes sense. But what if someone searched for "Clint Eastwood"? Would they primarily be looking for movies with Clint Eastwood, by Clint Eastwood, or both? In other words, how should you rank the director and actors attributes?
By putting director first, movies that Clint Eastwood directed would come before those where he appeared as an actor. This may or may not be desirable. For example, if a user types "jane", do you want to first display movies with director Jane Campion or actress Jane Fonda?
There's no one-size-fits-all approach, and the strategy depends on your use case. Defining a priority order in your searchable attributes isn't trivial. It can vary depending on your data, how it's structured, what your users search for, and what they expect as results.
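To see how attribute order changes results for the "Clint Eastwood" query, the following Python sketch models attribute priority in a deliberately simplified way: each record is scored by the position of the first searchable attribute that contains the query, and lower positions rank first. It runs locally on the dataset above and doesn't call Algolia. The helper names (first_matching_attribute, rank_titles) are hypothetical, and the real engine applies several other ranking criteria.

```python
# Simplified, local model of attribute priority. This is not Algolia's engine.
movies = [
    {"title": "Mystic River", "director": "Clint Eastwood",
     "actors": ["Sean Penn", "Tim Robbins", "Kevin Bacon", "Laurence Fishburne"]},
    {"title": "In the Line of Fire", "director": "Wolfgang Petersen",
     "actors": ["Clint Eastwood", "John Malkovich", "Rene Russo"]},
    {"title": "Invictus", "director": "Clint Eastwood",
     "actors": ["Morgan Freeman", "Matt Damon"]},
    {"title": "Million Dollar Baby", "director": "Clint Eastwood",
     "actors": ["Clint Eastwood", "Hilary Swank", "Morgan Freeman"]},
]


def first_matching_attribute(record, query, searchable_attributes):
    """Return the index of the first attribute containing the query, or None."""
    needle = query.lower()
    for position, name in enumerate(searchable_attributes):
        value = record.get(name, "")
        values = value if isinstance(value, list) else [value]
        if any(needle in text.lower() for text in values):
            return position
    return None


def rank_titles(records, query, searchable_attributes):
    """Rank matching records by attribute position; ties keep dataset order."""
    scored = []
    for record in records:
        position = first_matching_attribute(record, query, searchable_attributes)
        if position is not None:
            scored.append((position, record["title"]))
    # sorted() is stable, so records with the same position stay in input order.
    return [title for _, title in sorted(scored, key=lambda pair: pair[0])]


director_first = rank_titles(movies, "clint eastwood", ["title", "director", "actors"])
actors_first = rank_titles(movies, "clint eastwood", ["title", "actors", "director"])
print(director_first)
print(actors_first)
```

With director first, the three movies Clint Eastwood directed come before "In the Line of Fire". With actors first, the two movies he appears in move to the top.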
Using the API
To make some attributes searchable, you must set searchableAttributes during indexing.
$index->setSettings([
  'searchableAttributes' => [
    "title",
    "director",
    "actors"
  ]
]);
Using the dashboard
You can also set your searchable attributes in your Algolia dashboard.
- Select the Search product icon on your dashboard.
- On the Index page, select your index.
- Click the Configuration tab.
- In the Searchable Attributes section, click the Add a searchable attribute button.
- Add attributes you want to make searchable, one after the other, by order of importance.
- Save your changes.
Searchable attributes with the same priority
Sometimes, setting an attribute before or after another doesn't make sense because you want them to be equally considered. Here's another dataset, this time with records representing movies, actors, and directors:
[
  {
    "movie_title": "John Wick",
    "cast": ["Keanu Reeves", "Michael Nyqvist", "Alfie Allen"]
  },
  {
    "actor_name": "John Cleese",
    "filmography": ["Monty Python and the Holy Grail", "Life of Brian", "Harry Potter and the Philosopher's Stone"]
  },
  {
    "director_name": "John Carpenter",
    "filmography": ["Halloween", "The Thing", "Escape from New York"]
  }
]
If someone typed "John", it wouldn't necessarily make more sense to show movies with "John" in the title than actors or directors named John. Instead, you may want to rely on other criteria, like popularity, to decide what comes first.
In this case, it makes sense to set movie_title, actor_name, and director_name at the same level. This way, if someone types "John", the engine considers all three records equal in the attribute ranking criterion and moves on to the next ranking criterion to break the tie.
Attributes with the same priority are always unordered.
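The tie described above can be sketched in Python. This simplified model treats each entry of searchableAttributes as one priority level, splits comma-separated entries into attributes of equal priority, and falls back to a popularity value to break ties. The popularity numbers and the helper names are made up for illustration; they aren't part of the dataset or of Algolia's API.

```python
# Simplified, local model of same-priority attributes. Not Algolia's engine.
records = [
    {"movie_title": "John Wick",
     "cast": ["Keanu Reeves", "Michael Nyqvist", "Alfie Allen"],
     "popularity": 80},  # hypothetical value
    {"actor_name": "John Cleese",
     "filmography": ["Monty Python and the Holy Grail", "Life of Brian"],
     "popularity": 95},  # hypothetical value
    {"director_name": "John Carpenter",
     "filmography": ["Halloween", "The Thing", "Escape from New York"],
     "popularity": 70},  # hypothetical value
]

searchable_attributes = ["movie_title,actor_name,director_name", "cast", "filmography"]


def parse_levels(entries):
    """Split each entry on commas: attributes in one entry share a priority level."""
    return [[name.strip() for name in entry.split(",")] for entry in entries]


def matching_level(record, query, levels):
    """Return the first priority level with an attribute containing the query."""
    needle = query.lower()
    for level, names in enumerate(levels):
        for name in names:
            value = record.get(name, "")
            values = value if isinstance(value, list) else [value]
            if any(needle in text.lower() for text in values):
                return level
    return None


def display_name(record):
    return record.get("movie_title") or record.get("actor_name") or record.get("director_name")


levels = parse_levels(searchable_attributes)
matches = [(matching_level(record, "john", levels), record) for record in records]
matches = [(level, record) for level, record in matches if level is not None]
tied_levels = [level for level, _ in matches]

# All three records match at level 0, so popularity decides the order.
ranked = sorted(matches, key=lambda match: (match[0], -match[1]["popularity"]))
ranked_names = [display_name(record) for _, record in ranked]
print(tied_levels)
print(ranked_names)
```

Because all three records match at the same level, the attribute criterion can't separate them, and the next criterion (here, the assumed popularity value) determines the order.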
Using the API
$index->setSettings([
  'searchableAttributes' => [
    "movie_title,actor_name,director_name",
    "cast",
    "filmography"
  ]
]);
Using the dashboard
You can also set your searchable attributes in your Algolia dashboard.
- Select the Search product icon on your dashboard.
- On the Index page, select your index.
- Click the Configuration tab.
- In the Searchable Attributes section, click the Add a searchable attribute button.
- Type attributes directly in the input field as a comma-separated list, for example, director,cast.
- Save your changes.
Understanding word position
Searchable attributes can be ordered or unordered:
- ordered means that the engine considers matches at the beginning of an attribute more important than matches in the middle or the end. When an attribute is ordered, the earlier a match occurs, the higher the engine ranks it.
- unordered means that the position of the match within the attribute doesn't affect ranking.
You should usually set the attribute to be unordered since the position of the matching word in the attribute often doesn't matter.
Consider the following dataset:
[
  {
    "title": "Avengers: Infinity War",
    "cast": ["Robert Downey Jr.", "Chris Hemsworth", "Mark Ruffalo", "Chris Evans", "Scarlett Johansson"]
  },
  {
    "title": "World War Z",
    "cast": ["Brad Pitt", "Mireille Enos", "Daniella Kertesz", "James Badge Dale", "Ludi Boeken"]
  },
  {
    "title": "War of the Worlds",
    "cast": ["Tom Cruise", "Dakota Fanning", "Justin Chatwin", "Miranda Otto", "Tim Robbins"]
  },
  {
    "title": "Lost in Translation",
    "cast": ["Scarlett Johansson", "Bill Murray", "Akiko Takeshita"]
  },
  {
    "title": "North",
    "cast": ["Elijah Wood", "Jason Alexander", "Julia Louis-Dreyfus", "Marc Shaiman", "Scarlett Johansson"]
  }
]
If the user typed "war," it wouldn't make sense to return "World War Z" before "Avengers: Infinity War" just because the query happens to appear earlier in the title. The ranking should rely on other criteria like popularity or reviews, so it makes sense to set title as unordered.
Things are different with the cast attribute. If the user typed "Scarlett Johansson," chances are they would be more interested in movies where Scarlett Johansson has a leading role than movies where she played small parts.
Assuming that the order of actors and actresses in the cast attribute reflects role size, keeping cast ordered makes more sense.
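The difference between ordered and unordered can be sketched in Python. In this simplified model, an ordered attribute scores a match by where it occurs (word position for a title, entry position for a cast list), while an unordered attribute gives every match the same score. This illustrates the idea only: it isn't how Algolia computes positions internally, and the helper names are hypothetical.

```python
# Simplified, local model of ordered vs. unordered attributes. Not Algolia's engine.
movies = [
    {"title": "Avengers: Infinity War",
     "cast": ["Robert Downey Jr.", "Chris Hemsworth", "Mark Ruffalo", "Chris Evans", "Scarlett Johansson"]},
    {"title": "World War Z",
     "cast": ["Brad Pitt", "Mireille Enos", "Daniella Kertesz", "James Badge Dale", "Ludi Boeken"]},
    {"title": "War of the Worlds",
     "cast": ["Tom Cruise", "Dakota Fanning", "Justin Chatwin", "Miranda Otto", "Tim Robbins"]},
    {"title": "Lost in Translation",
     "cast": ["Scarlett Johansson", "Bill Murray", "Akiko Takeshita"]},
    {"title": "North",
     "cast": ["Elijah Wood", "Jason Alexander", "Julia Louis-Dreyfus", "Marc Shaiman", "Scarlett Johansson"]},
]


def match_position(tokens, query, ordered):
    """Return the match position if ordered, 0 for any match if unordered, else None."""
    needle = query.lower()
    for index, token in enumerate(tokens):
        if needle in token.lower():
            return index if ordered else 0
    return None


def rank_titles(records, attribute, query, ordered):
    """Rank matching records by match position; ties keep dataset order."""
    scored = []
    for record in records:
        value = record[attribute]
        # Strings are split into words; lists are used entry by entry.
        tokens = value if isinstance(value, list) else value.split()
        position = match_position(tokens, query, ordered)
        if position is not None:
            scored.append((position, record["title"]))
    return [title for _, title in sorted(scored, key=lambda pair: pair[0])]


title_ordered = rank_titles(movies, "title", "war", ordered=True)
title_unordered = rank_titles(movies, "title", "war", ordered=False)
cast_ordered = rank_titles(movies, "cast", "scarlett johansson", ordered=True)
print(title_ordered)
print(title_unordered)
print(cast_ordered)
```

With an ordered title, "War of the Worlds" ranks first only because "war" is its first word. With an unordered title, the three movies tie and later criteria decide. With an ordered cast, "Lost in Translation" comes first because Scarlett Johansson is listed first.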
Using the API
You can set attributes as unordered or ordered with searchableAttributes when indexing. If unordered isn't set on an attribute, it defaults to ordered when set by the API.
$index->setSettings([
  'searchableAttributes' => [
    "unordered(title)",
    "cast"
  ]
]);
Using the dashboard
- Select the Search product icon on your dashboard.
- On the Index page, select your index.
- Click the Configuration tab.
- Go to the Searchable Attributes section. If not already done, add attributes in order of importance.
- For each searchable attribute, consider whether it should be ordered or unordered.
- Save your changes.
When using the dashboard, the UI defaults to setting your attribute as unordered. Select ordered from the menu to change it.
Successful strategies
Setting searchable attributes is more an art than a science and highly depends on your use case. There's no magic formula that works for everyone. However, the following tips will get you started.
Keeping as few searchable attributes as possible
As a rule of thumb, the more searchable attributes, the noisier the search. It may be tempting to add as many searchable attributes as possible to get as many matches as you can. Yet, this reduces search relevance by creating "noise" (littering good results with not-so-good ones).
Instead, be conservative with your searchable attributes and focus on what your users tend to search for. Some attributes, like image URLs or review scores, shouldn't be searchable because they don't have textual value. Some other attributes, like the plot summary of a movie, might look interesting but could be unnecessary. How often do you find a movie by searching for the synopsis? Probably far less often than with the title or lead actors.
The same advice goes for the length of the attribute values. Suppose you have a "movies" index where the records have an actors attribute. If this attribute lists the entire cast, it produces noisier matches than if it only listed the leading roles. Most users have leading roles in mind when searching for an actor.
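One way to keep an attribute short is to trim it before sending records to Algolia. The Python sketch below keeps only the first few entries of actors, assuming the list is already sorted by role size. The function name and the default of three entries are arbitrary choices for illustration.

```python
def keep_leading_roles(record, max_roles=3):
    """Return a copy of the record whose actors list holds at most max_roles entries."""
    trimmed = dict(record)  # shallow copy, so the original record is left untouched
    trimmed["actors"] = list(record.get("actors", []))[:max_roles]
    return trimmed


movie = {
    "title": "Mystic River",
    "director": "Clint Eastwood",
    "actors": ["Sean Penn", "Tim Robbins", "Kevin Bacon", "Laurence Fishburne"],
}
trimmed_movie = keep_leading_roles(movie)
print(trimmed_movie["actors"])
```

If you still need the full cast for display, you could keep it in a separate attribute that isn't searchable.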
Properly setting attribute order
Knowing how to order your searchable attributes can be challenging, especially when you have several. One method is to compare them in pairs, each attribute with the previous one, and move them around accordingly. The process resembles insertion sort.
For example, suppose you have the following attributes in the records of a "movies" index:
director
cast
title
genres
plot_summary
You would:
- Compare the first two: director and cast. If you think cast should come before director (for example, you want to see movies with Clint Eastwood before movies directed by Clint Eastwood), move cast to the first place.
- Compare director and title and decide that the movie's name is more important than the director's. That means you'd move title to the second place and compare it with the previous one (cast).
- Decide whether to prioritize a match on an actor's name or on a movie title. If all records in your index represent movies, it may make sense to prioritize title, so you'd move it first. Then, because you don't have any more attributes to compare title to, you'd move on to genres.
The benefit is that you're making a thorough, granular comparison of each attribute with the others and considering your use case for every pair. This method is much more reliable than doing a global, intuitive sort by what seems more important.
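The pairwise method can be sketched as an insertion sort in Python. Each new attribute is compared with the one before it and moves up while it is judged more important. In practice, every comparison is a judgment call you make for your use case. Here, a hypothetical priority table stands in for those decisions (its values for genres and plot_summary are assumptions, not recommendations).

```python
def order_attributes(attributes, is_more_important):
    """Insertion sort: move each attribute up while it beats the one before it."""
    ordered = []
    for attribute in attributes:
        position = len(ordered)
        # Compare with the previous attribute and keep moving up while it wins.
        while position > 0 and is_more_important(attribute, ordered[position - 1]):
            position -= 1
        ordered.insert(position, attribute)
    return ordered


# Hypothetical stand-in for your own pairwise decisions (higher means more important).
priority = {"title": 5, "cast": 4, "director": 3, "genres": 2, "plot_summary": 1}


def is_more_important(first, second):
    return priority[first] > priority[second]


attributes = ["director", "cast", "title", "genres", "plot_summary"]
ordered_attributes = order_attributes(attributes, is_more_important)
print(ordered_attributes)
```

The resulting list can be used as the value of searchableAttributes. Replacing the priority table with your own answers to each pairwise question changes the outcome.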
Where to put filters?
Sometimes it's better to prioritize filters over all other attributes. This might not seem intuitive at first, as you may think users tend to search by attributes like name or title.
Suppose you have a "movies" index where the records have a genre attribute with values like "crime" and "action." Films fall into these categories, but there are also films whose titles and descriptions use these exact words, for example, "Crime Doesn't Pay" and "Last Action Hero."
In that case, you can decide that when a user searches for words that look like genres, it's best to place filter attributes before the title. This way, whenever someone types "crime," movies in the crime genre rank ahead of movies that only have the word in their title.
Making this decision when setting up searchable attributes can significantly affect your results.
Changing your searchable attributes strategy
Be careful before changing your searchable attributes strategy because this affects your entire index. Don't make decisions based on a single query. Instead, experiment with several queries close to what your users search for (use Algolia analytics to explore their search habits).