Tuesday, January 21, 2025

Show HN: Pytest-evals – Simple LLM apps evaluation using pytest https://ift.tt/0WsQuIz

https://ift.tt/safyUEw January 21, 2025 at 11:33PM

Show HN: We post-trained a model that pen tests instead of refusing https://ift.tt/W4x1YnM

Show HN: We post-trained a model that pen tests instead of refusing Anthropic and OpenAI's publicly available models are explicitly guar...