This is an autopost blog, friends. We try to provide all the latest sports, news, and new updates for you.
Friday, December 19, 2025
Show HN: Misata – synthetic data engine using LLM and Vectorized NumPy https://ift.tt/HU8J5xB
Hey HN, I’m the author. I built Misata because existing tools (Faker, Mimesis) are great for random rows but terrible for relational or temporal integrity. I needed to generate data for a dashboard where "Timesheets" must happen after "Project Start Date," and I wanted to define these rules via natural language.

How it works:

LLM Layer: Uses Groq/Llama-3.3 to parse a "story" into a JSON schema constraint config.

Simulation Layer: Uses Vectorized NumPy (no loops) to generate data. It builds a DAG of tables to ensure parent rows exist before child rows (referential integrity).

Performance: Generates ~250k rows/sec on my M1 Air.

It’s early alpha. The "Graph Reverse Engineering" (describe a chart -> get data) is experimental but working for simple curves.

pip install misata

I’d love feedback on the simulator.py architecture. I’m currently keeping data in-memory (Pandas), which hits a ceiling at ~10M rows. Thinking of moving to DuckDB for out-of-core generation next. Thoughts?

https://ift.tt/4PEXrgd December 16, 2025 at 08:08PM
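To make the Simulation Layer idea concrete, here is a minimal sketch of parent-before-child generation with vectorized NumPy. It is not Misata's actual code or API: the table names, the `dependencies` mapping, the row counts, and the date ranges are all assumptions for illustration. It orders tables with the standard-library `graphlib`, samples foreign keys only from existing parent ids, and adds a positive day offset so every timesheet falls after its project's start date.

```python
from graphlib import TopologicalSorter

import numpy as np

rng = np.random.default_rng(seed=0)

# Hypothetical schema: each table maps to the set of parent tables it references.
dependencies = {
    "projects": set(),
    "timesheets": {"projects"},
}

# static_order() yields every parent table before the tables that depend on it.
order = list(TopologicalSorter(dependencies).static_order())

n_projects, n_timesheets = 1_000, 50_000  # illustrative sizes

# Parent table: sequential ids plus a start date drawn uniformly from 2024.
project_ids = np.arange(n_projects)
year_start = np.datetime64("2024-01-01")
project_start = year_start + rng.integers(0, 365, size=n_projects).astype("timedelta64[D]")

# Child table: foreign keys are sampled only from existing parent ids
# (referential integrity), in one vectorized call.
timesheet_project_id = rng.integers(0, n_projects, size=n_timesheets)

# Temporal integrity: look up each row's parent start date by fancy indexing,
# then add an offset of 1 to 180 days so the timesheet is strictly later.
offset = rng.integers(1, 181, size=n_timesheets).astype("timedelta64[D]")
timesheet_date = project_start[timesheet_project_id] + offset
```

Because the parent ids double as array positions here, the lookup `project_start[timesheet_project_id]` replaces a per-row join. With non-sequential keys, a mapping step (for example `np.searchsorted` on sorted ids) would be needed first.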