Home / Hacker News / Show HN: Pykoi – a Python library for LLM data collection and fine tuning https://ift.tt/gdAolJc

Show HN: Pykoi – a Python library for LLM data collection and fine tuning https://ift.tt/gdAolJc

August 12, 2023 - Hacker News

Show HN: Pykoi – a Python library for LLM data collection and fine tuning Hi HN, pykoi is an open-source python library for ML scientists. pykoi makes it easier to collect data for LLMs, to use that data for finetuning, and to compare models to each other (e.g. your model pre- and post- finetuning, or your model vs openai vs claude). The library comes from pain points we experienced in LLM development: 1. Collecting feedback data from users isn't as easy as it could be. (The current process usually involves sharing excel files of annotated responses back-and-forth, offering no insight into how users actually engage with your models). 2. RLHF remains complicated to carry out. By complicated , we mean requires a lot of steps, hundreds of configs, lengthy setups, etc. 3. Comparing models to each other as they're used (that is, independent from academic metrics) is full of friction. The current approach: spin up a model, ask questions, write them down. Repeat for other models then compare. At a high-level, we think that the active learning process should be closed-loop: data collection, fine tuning, and inference all feed from the same system. This library is our first step in that direction. The project is still very early but we hope that some if it is useful. Note, we're fully open-source, and actively adding features! Website: https://ift.tt/NniIETK GitHub: https://ift.tt/TluKVpE We would love your feedback! https://ift.tt/Ihsg5Lp August 11, 2023 at 10:42PM

Show HN: Pykoi – a Python library for LLM data collection and fine tuning https://ift.tt/gdAolJc

Reviewed by Manish Pethev on August 12, 2023 Rating: 5

No comments:

If you have any suggestions please send me a comment.

Subscribe to: Post Comments ( Atom )

Facebook SDK

Recent Posts

Show HN: Pykoi – a Python library for LLM data collection and fine tuning https://ift.tt/gdAolJc

No comments:

Flickr

Follow Us

Recent Posts

Facebook

Popular Posts

Ads

Categories

Show Menu

Social Icons

Main Menu

Menu

Social Media Icons

Link List

Report Abuse

Author

Show HN: InBulk – tools to help you perform different tasks in bulk https://ift.tt/iL6BwVg

Contact Form

Search This Blog

Blog Archive

Tags

Random Posts

Tags

Recent

Recent Posts