Efficient Metric Indexing for Similarity Search and Similarity Joins
Implementation Details
1. We use the Yelp dataset because it offers many features.
2. The query is (Restaurants, Phoenix, AZ).
3. The data is collected live from Yelp.
4. The data is pre-processed into numeric form.
5. The records are plotted on a map.
6. We take an origin (latitude, longitude); the results within a 2.5 km radius of it are retrieved for space filling.
7. These results are plotted on the map.
8. We apply a B+ tree to the space-filled results; similarity results are obtained for the given features.
9. A similarity join is applied.
10. We process a range query, and the best-matching record is shown as the result.
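The radius filter, space-filling key, index and range query from the steps above can be sketched in Python. This is only a minimal sketch under stated assumptions, not the project's actual code. The source does not name the space-filling curve, so a Z-order (Morton) curve is assumed. A sorted list searched with bisect stands in for the B+ tree. The function names, record fields, origin coordinates (an approximate Phoenix city center) and the three sample records are hypothetical.

```python
import bisect
import math

EARTH_RADIUS_KM = 6371.0

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def grid_cell(lat, lon, bits=16):
    """Quantize a coordinate pair onto a 2^bits x 2^bits integer grid."""
    scale = (1 << bits) - 1
    x = int((lon + 180.0) / 360.0 * scale)
    y = int((lat + 90.0) / 180.0 * scale)
    return x, y

def interleave_bits(x, y, bits=16):
    """Z-order (Morton) key: x bits go to even positions, y bits to odd ones."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)
        key |= ((y >> i) & 1) << (2 * i + 1)
    return key

def build_index(records, origin, radius_km=2.5):
    """Keep records within radius_km of origin and sort them by curve key.

    The sorted list is an assumed stand-in for the B+ tree's leaf level.
    """
    olat, olon = origin
    entries = []
    for rec in records:
        if haversine_km(olat, olon, rec["lat"], rec["lon"]) <= radius_km:
            key = interleave_bits(*grid_cell(rec["lat"], rec["lon"]))
            entries.append((key, rec["name"]))
    entries.sort()
    return entries

def key_range(index, lo, hi):
    """Return all entries whose curve key lies in the closed interval [lo, hi]."""
    start = bisect.bisect_left(index, (lo,))
    end = bisect.bisect_left(index, (hi + 1,))
    return index[start:end]

# Hypothetical sample data; these are not real Yelp records.
origin = (33.4484, -112.0740)
records = [
    {"name": "Cafe A", "lat": 33.4500, "lon": -112.0700},
    {"name": "Diner B", "lat": 33.4600, "lon": -112.0800},
    {"name": "Grill C", "lat": 33.6000, "lon": -112.2000},  # well outside 2.5 km
]
index = build_index(records, origin)
hits = key_range(index, index[0][0], index[-1][0])
print(hits)
```

A range over curve keys only approximates a spatial range, so in practice the returned candidates would still be checked against the real distance before the best match is reported.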
Python Demo
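A similarity join can be illustrated on its own with a small example. The sketch below is an assumption-laden illustration, not the demo itself: it uses a plain nested-loop join with Euclidean distance, whereas an index-based join would prune most pairs first. The function name, the feature vectors (star rating, price level), and the threshold eps are hypothetical.

```python
import math
from itertools import product

def similarity_join(left, right, eps):
    """Return (left_name, right_name, distance) for every pair within eps.

    Nested-loop join over numeric feature vectors; results are sorted so
    that the closest (best-matching) pair comes first.
    """
    pairs = []
    for (lname, lvec), (rname, rvec) in product(left.items(), right.items()):
        d = math.dist(lvec, rvec)  # Euclidean distance (Python 3.8+)
        if d <= eps:
            pairs.append((lname, rname, d))
    return sorted(pairs, key=lambda p: p[2])

# Hypothetical numeric features: (star rating, price level).
left = {"Cafe A": (4.5, 2.0), "Diner B": (3.0, 1.0)}
right = {"Bistro C": (4.0, 2.0), "Tavern D": (1.0, 4.0)}

matches = similarity_join(left, right, eps=1.0)
best = matches[0] if matches else None
print(best)
```

Sorting by distance puts the best-matching pair first, which mirrors how the final step reports a single best record.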