• Europes biggest bike selection

    buycycle's personalization engine

    Sebastian Rauner
    Sebastian Rauner
    Apr 4, 2024 7 min
    buycycle's personalization engine

    buycycle emerges as a transformative platform in the pre-owned bicycle market. A market hindered before by high transaction costs that deterred customers from considering used bikes over buying new. These costs, ranging from complex cross-border shipping to concerns over buyer and seller trust and prevalent information inefficiencies, are now being mitigated through buycycle’s comprehensive service offerings.

    By centralizing services such as streamlined international shipping, secure escrow payment systems, and robust insurance options, buycycle aims at reducing the barriers to buying and selling pre-owned bikes. This not only elevates the value proposition of second-hand bicycles but also enhances market efficiency, making pre-owned options more competitive against new purchases.

    Environmental benefits are inherent in our business model, promoting bicycle reuse, reducing CO2 emissions, and supporting sustainable consumption. Economically, buycycle enables more affordable cycling options, fostering broader adoption and healthier lifestyles while lessening dependence on costlier transport.

    One key tool to achieve this is through democratizing bike market information, providing detailed bike insights to inform buyers and sellers alike.

    Get the buycycle App now!

    👉 or 👈

    Democratizing bicycle market information

    buycycle's mission

    A core mission of buycycle is to establish a level playing field by ensuring all market participants have access to comprehensive and symmetrical information. To this end, we offer a detailed buyers guide to empower consumers, provide precise bicycle valuations, and facilitate the connection between buyers and sellers by aligning the right bikes with user preferences. Achieving such transparency is made possible through our commitment to data-driven services, which forms the cornerstone of the buycycle platform.

    The role of personalization

    Personalization emerges as a natural progression in our quest to democratize market information. By leveraging the power of machine learning, we aim to curate a tailored experience for each user, ensuring that the wealth of information available is not only accessible but also actionable. This focus on personalization is not just about enhancing user experience—it’s about transforming data into meaningful interactions that resonate with the individual needs and desires of our customers.

    Data-driven personalization

    Imagine a user overwhelmed by the choice of 20,000 bikes on sale. They need assistance in finding the perfect fit, which may not always be readily available due to the nature of pre-owned stock. The recommendation system is tailored to help users navigate this vast selection effectively. On the other hand we need to ensure if the exact bike a user wants isn’t in stock, they won’t leave the platform empty-handed. Instead, they’ll be presented with a curated selection of alternatives that meet their needs and preferences, making the shopping experience personalized and efficient.

    The recommendation system is designed to address four key aspects:

    • Relevance: Ensuring that the bikes recommended meet the user’s specific needs and preferences.
    • Novelty: Introducing users to bikes they may not have previously considered or known about.
    • Serendipity: Surprising users with bikes that they might find unexpectedly appealing.
    • Diversity: Offering a wide range of bikes to prevent redundancy in recommendations.

    Personalization engine

    buycycle’s personalization engine employs a two-tiered approach to cater to users with varying levels of historic feedback data.

    The first tier is a content-based model that suggests bikes based on specific attributes such as type, size, and brand, ideal for users with clear preferences or limited interaction history.

    The second tier utilizes collaborative filtering, which leverages user behavior and feedback to refine recommendations. This model identifies patterns in user interactions to suggest bikes that similar users have shown interest in, enhancing personalization as more data becomes available.

    Tech stack

    Our system is built as a Flask app, hosted on Kubernetes for scalability and managed with ArgoCD for continuous deployment, ensuring reliability and up-to-date functionality. For a deeper look into our system, visit our repository.

    Content-based

    Our Personalization engine’s content-based filtering is designed to suggest bikes based on their features, independent of user history. This is crucial when user preferences are unknown, focusing instead on the attributes of the bikes as shown in Figure 1. The model compares bikes’ characteristics, such as brand, frame size, brake type, and color, to identify similar options.

    idtemplate_idpricemsrp
    35551181283400.05800.0
    bike_created_at_yearbike_created_at_monthbike_yearsales_country_id
    2023112021137
    bike_type_idbike_category_idmileage_codemotor
    113000_10000
    citycondition_codeframe_sizerider_height_min
    Berlin3.056173.0
    rider_height_maxbrake_type_codeframe_material_codeshifting_code
    183.0hydrauliccarbonelectronic
    bike_component_idcolorfamily_model_idfamily_id
    14#3B82F687273088
    brand_idquality_scoreis_mobilecurrency_id
    453002
    seller_idis_ebikeis_frameset
    15198200

    Excerpt of bike feature data

    Using euclidean distance between the feature vectors, we measure the similarity between bikes in a feature space, ranking them by similarity scores. We found that euclidean distance allows the weighing of features according to their importance over cosine similarity.

    As Figure 1 demonstrates, the content-based recommendations closely match the featured bike in key features such as components level, price, size, and color. This ensures that even without user input, our recommendations are likely to align with what users might find appealing and within their expected budget and aesthetic preferences.

    Content based similarity
    Content based similarity

    In summary, our content-based filtering leverages a detailed bike feature table to present users with bike suggestions that are similar in price, size, and color, ensuring relevance and potential appeal in every recommendation.

    Collaborative filtering

    Our collaborative filtering model personalizes bike recommendations by learning from user behaviors like views, add to favorites and purchases. It uses machine learning, stylized in Figure 2, to predict interests from similar users’ patterns, ensuring relevance.

    For instance, if User A likes both aero and gravel bikes, we might suggest gravel bikes to User B. She is so far only interested in aero bikes but her similarity to user A suggests that she might be interested in gravel bikes too, introducing novelty and serendipity. This method also diversifies options, preventing monotonous suggestions. Our hybrid approach, based on Kula, 2015, combines content-based insights with collaborative filtering, offering users a dynamic selection of pre-owned bikes that align with their preferences.

    Collaborative Filtering
    Collaborative Filtering

    This model takes into account user-item interactions and extracts latent features to inform its recommendations, incorporating elements of swarm intelligence.

    Evaluating recommendation strategies: A/B testing insights

    In our pursuit to refine recommendation accuracy, we conducted a two-week A/B test, as depicted in Figure 3, to compare the effectiveness of purely content-based filtering against a hybrid approach that combines content-based and collaborative filtering. During this trial, we observed user interactions with recommendations generated by both models. The content-based model operated solely on bike features (stable), while the hybrid model also considered user behavior and preferences (canary).

    The A/B test results were compelling, revealing a 40% increase in conversion rates—from viewing recommendations to placing orders, when the hybrid model was employed over the content-based model alone. Figure 3 showcases the A/B test outcomes over the 14-day period, providing clear evidence that integrating collaborative filtering with content-based insights leads to a more engaging and effective recommendation system (blue is the hybrid model, red the content only model).

    A/B Test Results
    A/B Test Results

    Expanding personalization through advanced applications

    buycycle’s inaugural machine learning model, our personalization engine, is a cornerstone in our quest to revolutionize the pre-owned bicycle market. It tackles key challenges such as information inefficiencies by delivering personalized bike suggestions, empowering our users to make well-informed decisions. This advancement is a significant stride towards bolstering buycycle’s dedication to market efficiency, environmental sustainability, and economic accessibility.

    As we refine this first model iteration and embrace cutting-edge technologies, we’re excited to broaden its use, including personalized email campaigns and real-time personalization through streaming data.

    These initiatives are just the beginning of our journey into machine learning. We anticipate starting new endeavors and applying our learnings to continuously innovate, keeping us at the vanguard of the marketplace, enriching the user experience, and ensuring equitable access to cycling options.

    Repository

    You find the personalization engine here and our the data science work in this repository.