No video

Data Curation for Open Source LLM Fine Tuning - Data Science Festival

  Рет қаралды 62

Data Science Festival

Data Science Festival

27 күн бұрын

A talk by Clemens Schroeer from Lemon AI.
This session covers Data Curation for Open Source LLM Fine-Tuning.
Everyone wants to fine-tune open source LLMs, but a lack of high quality data makes this hard. Even the data that companies do have is difficult to understand, making it challenging to iterate towards a high quality dataset that will provide good results from fine-tuning. Clemens will share his experience curating datasets to fine-tune models such as Mistral 7B and discuss some of the challenges that should be taken into consideration.
Technical Level: Technical practitioner
This session was part of the Data Science Festival MayDay event 2024. Find out more at datasciencefestival.com/event...
The Data Science Festival is the place for data-driven people to come together, share cutting-edge ideas, and solve real-world problems. We run monthly events, meet-ups, and the biggest free-to-attend data festivals in the UK. Join the community at datasciencefestival.com/

Пікірлер: 1
@kevon217
@kevon217 17 күн бұрын
BERTopic for the win.
WORLD'S SHORTEST WOMAN
00:58
Stokes Twins
Рет қаралды 126 МЛН
НРАВИТСЯ ЭТОТ ФОРМАТ??
00:37
МЯТНАЯ ФАНТА
Рет қаралды 8 МЛН
لااا! هذه البرتقالة مزعجة جدًا #قصير
00:15
One More Arabic
Рет қаралды 13 МЛН
LLAMA-3.1 🦙: EASIET WAY To FINE-TUNE ON YOUR DATA 🙌
15:08
Prompt Engineering
Рет қаралды 11 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 118 М.
Haleon's LLM Translation Tool Development Story
23:37
Data Science Festival
Рет қаралды 50
Identifying Biomarkers of Cardiovascular Diseases with Machine Learning
19:44
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 853 М.
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 921 М.
Navigating the AI Revolution: A Blueprint for Business Success
37:04
Data Science Festival
Рет қаралды 27
Data + AI Summit Keynote Day 1 - Full
2:52:20
Databricks
Рет қаралды 31 М.
LoRA explained (and a bit about precision and quantization)
17:07
WORLD'S SHORTEST WOMAN
00:58
Stokes Twins
Рет қаралды 126 МЛН