Performing search#

The Tonita search API offers the ability to perform search in two ways:

Given a query string, Tonita will return the most relevant listings. Restrictions based on facet value and category can also be applied to narrow the set of eligible listings and make search more precise.
Given the ID of a listing, Tonita will return the listings most similar to it in the vector space we’ve constructed for your data.

Before proceeding with search, we first need a corpus populated with listings to search over. Please see our guides on corpora and listings, or see the Quickstart.

Note

Search is a corpus-specific operation. This means that whenever a search is performed, it is performed only over the listings in a single specified corpus. In particular, you cannot perform search over the listings of multiple corpora in a single API call.

However, developers have complete control over how their corpora are organized, and are therefore free to create a corpus that contains whichever listings they like.

Performing search given a query string#

Suppose you want to find the listings most relevant to a given query string. Let’s start with a simple example:

tonita.search(
    query='sunny 1 bedroom on a quiet street near parks',
    max_results=2,
    categories=["apartment"],
    corpus_id="new_york"
)

Here, we simply want the two listings most relevant to our query string. The results will be returned in a SearchResponse object as follows:

# Example return value:
# SearchResponse(
#     items=[
#         SearchResponseItem(
#             listing_id="qd12309mc",
#             score=0.92,
#             categories=["apartment", "co-op"],
#             snippets=[
#                 Snippet(
#                     display_string="Oversized windows face south."
#                 )
#                 Snippet(
#                     display_string="Only two blocks from Central Park!"
#                 )
#             ],
#         ),
#         SearchResponseItem(
#             listing_id="bn32358ss",
#             score=0.88,
#             categories=["apartment", "condo"],
#             snippets=[
#                 Snippet(
#                     display_string="Overlooks a beautiful park."
#                 )
#                 Snippet(
#                     display_string="Located on a high floor."
#                 )
#             ]
#         )
#     ]
# )

A SearchResponse contains the search results in the form of an array of SearchResponseItems. Each SearchResponseItem contains the following information for a single relevant listing:

listing_id: The ID of the listing.
score: The relevance score of the listing. This score is bounded between 0 and 1, inclusive.
categories: The matching categories that the listings belong to (currently these are not populated in SearchResponse).
snippets: Information extracted from the listing’s data that explain the listing’s relevance. Note that SearchResponseItems are sorted in descending order of relevance score.

Performing retrieval only#

Search can be thought of as progressing in two stages:

A retrieval stage, where listings are retrieved along with raw scores;
A rescoring stage, where we refine the scores of the listings that were retrieved.

The retrieval stage is very fast, whereas the rescoring stage can take more time. In order to perform retrieval only, set the retrieval_only flag of tonita.search() to True in a given call:

tonita.search(
    query='sunny 1 bedroom on a quiet street near parks',
    max_results=2,
    categories=["apartment"],
    retrieval_only=True,
    corpus_id="new_york"
)

Attention

At this time, the retrieval_only flag is applicable only for searches with a query, and its default value is False.

For searches where a listing ID is specified, only raw scores will ever be returned. Therefore, the retrieval_only flag does not apply; richer rescoring options are coming soon.

Note, however, that the ranking of the listings will typically change after scores are refined in the rescoring stage.

Performing search given a listing ID#

The search API also allows us to search by providing a listing ID (in place of a query). In this case, we will return the listings most like the one specified. More technically, the listings we retrieve will be similar in a real-world sense, as captured in the vector space we’ve constructed for your data.

To find listings similar to some listing (say, with ID "foo"), simply pass that listing’s ID to tonita.search():

tonita.search(
    listing_id="foo",
    max_results=2,
    corpus_id="new_york"
)

Here, we are asking for the two listings most similar to the listing with ID “foo” in vector space. The return value will be a SearchResponse, just as above.

Restrictions#

Category restrictions#

Search can be made more precise by specifying category restrictions. Recall that we can specify categories to associate with each listing we upload (see Managing listings). We can specify the categories whose listings we’re interested in for a given search in our call to tonita.search().

Let’s go back to our previous search, and suppose that we’re interested in condominium apartments specifically. We specify this in the call:

tonita.search(
    query='sunny 1 bedroom on a quiet street near parks',
    max_results=2,
    categories=["condo"],
    corpus_id="my_corpus_id"
)

The results will now only contain those listings that belong to the “condo” category:

# Example return value:
# SearchResponse(
#     items=[
#         SearchResponseItem(
#             listing_id="bn32358ss",
#             score=0.88,
#             categories=["apartment", "condo"],
#             snippets=[
#                 Snippet(
#                     display_string="Overlooks a beautiful park."
#                 )
#                 Snippet(
#                     display_string="Located on a high floor."
#                 )
#             ]
#         ),
#         SearchResponseItem(
#             listing_id="ql81799cs",
#             score=0.71,
#             categories=["apartment", "condo"],
#             snippets=[
#                 Snippet(
#                     display_string="Quiet with nearby green space."
#                 )
#             ],
#         ),
#     ]
# )