This is a autopost bolg frinds we are trying to all latest sports,news,all new update provide for you
Thursday, February 27, 2025
Show HN: Ranked Search for Semi-Structured Data https://ift.tt/qNPSpAM
Show HN: Ranked Search for Semi-Structured Data We’ve been working on a search problem that requires querying both text and numbers simultaneously. For example, in a dataset of clothing items with descriptions and prices, a search for “slim pants for $20” should prioritize skinny jeans for $25 over slim pants for $50 because they are semantically similar and the price is closer. I’ve found that standard embedding models struggle with numerical ordering, while text-to-SQL methods rely on exact matches and often filter out too many results. To solve this, we built a system designed specifically for structured datasets like CSVs or tables. Here’s a demo link where you can upload a small CSV to try out (no login required): https://ift.tt/xD7rVp8 . Unlike most RAG approaches, we process each column independently, handling text with embeddings and numbers with custom scoring. When a user submits a query, we parse it into relevant fields—for instance, extracting “slim pants” as the description and “20” as the price. We then compute cosine similarity between the description embeddings and “slim pants” while also calculating the percent error between the user’s price input and the numerical field. These individual similarity scores are then combined across all columns to generate a final ranking. Right now, our system works best with well-structured data, so some preprocessing is often needed. We’re working on improving this by detecting and restructuring messy data automatically, such as pivoting columns or extracting attributes from large text fields. We’re also adding feedback mechanisms, like a thumbs up/down system, to refine future search results based on user input. I’d love to hear about your experiences with similar search challenges and would appreciate any feedback! https://ift.tt/xD7rVp8 February 27, 2025 at 11:27PM
Subscribe to:
Post Comments (Atom)
Show HN: Tablr – Supabase with AI Features https://ift.tt/ltABMro
Show HN: Tablr – Supabase with AI Features https://www.tablr.dev/ June 30, 2025 at 04:35AM
-
Show HN: Locksmith – detect locks taken by Postgres migrations https://ift.tt/0cBueJt February 10, 2025 at 02:26AM
-
Show HN: I built a FOSS tool to run your Steam games in the Cloud I wanted to play my Steam games but my aging PC couldn’t keep up, so I bui...
-
Show HN: TNX API – Natural Language Interactions with Your Database Hey HN! I built TNX API to make working with databases as simple as aski...
No comments:
Post a Comment