Data research and development of the data collection mechanismAltexSoft’s team researched, evaluated existing data, and defined key variables that impact user choice. Data scientists considered existing user interaction data and suggested additional types of information to power a recommender engine. Variables included user interactions, metadata (location, device, browser, etc.), and other analytics. The team also helped to design data storage for this purpose and covered all data preparation and cleaning activities to remove irrelevant variables and correct incomplete or inaccurate data.
Designing recommender engine with rapid user profilingOne of the biggest challenges was that most users visit the marketplace just once. To be helpful, the system must gather and analyze individual visitor data rapidly during the first session. The data science team designed an algorithm that instantly generates a user profile, which allows for tailoring property recommendations even for a single visit right after the first interactions. The engine uses a content-based filtering method meaning that it matches user profile data with specific attributes for each property profile.
Integrating the engine into the web service with automatic updatesAs user preferences and specifics of properties change over time, the algorithm connected to a web service is capable of updating itself to account for more recent and relevant data. The engine is deployed as a web service powered by Flask, a Python-based framework. The service is used by the main web application that delivers a personalized list of properties for each visitor. In the marketplace, the recommender system achieved a 15-30 percent higher conversion rate (contacting property sellers) across different markets compared to a non-personalized search.
Approach and Technical Info
The Homsters recommender engine was developed by a team consisting of a machine learning engineer and a machine learning team lead.
The duration of the project was about 4 months with the total scope of work completed in about 1.5 man-months.
The technology stack included: Python, Pandas, scikit-learn, Plot.ly, Matplotlib, Flask, and SQL Server.
Services provided within the project framework: Data Science Consulting.
Wondering how to build
an exclusive software product for your business?
Discuss your project needs with our architects.