simSearch can process any file that meets the Platform File Specifications as long as it has an “ID” column which assigns a unique value to each row.
simSearch accepts queries in the form of an object with matching columns to the training data and returns a list of the most similar objects in the data with the distance those objects are from the queried object.
simSearch Sample Output
simSearch’s results are ordered by distance with the nearest neighbor first. The ID column contains the ID of the neighboring object, the distance column contains the distance the returned object is from the queried object.
simSearch also accepts Batch Queries. A Batch Query will take a file of objects and return the k number of nearest neighbors for each object in the file. The output of a Batch Query will be a CSV file with the object ID and the k nearest neighbors with the same ID / distance format used in individual queries.