Room:
Hiten B
Track:
development and coding / 開発とコーディング

How can we manage AI training data safely and efficiently?

In this session, you’ll learn how to use Drupal as a backend for storing, curating, and exporting high-quality training data for LLMs. It is aimed at developers and teams working with internal knowledge bases, support content, or custom RAG pipelines.
We’ll cover:
- Why Drupal is a strong fit for managing training data
- How to structure prompts, completions, and metadata
- Using permissions and workflows for safe curation
- Exporting JSONL or chunked data for fine-tuning & RAG
- Tips for keeping your data clean, consistent, and useful
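
As a rough illustration of the structuring and export topics above, here is a minimal Python sketch of what a JSONL export and a simple chunking step might look like. Everything in it is an assumption made for illustration: the record shape, the `prompt` / `completion` / `metadata` field names, the `state` key standing in for a moderation state, and the helper names `to_jsonl` and `chunk_text`. It does not show Drupal's own export format or the approach presented in the session.

```python
import json

# Hypothetical records, shaped like curated content after it has been
# pulled out of a CMS and flattened. Field names are assumptions.
RECORDS = [
    {
        "prompt": "How do I reset my password?",
        "completion": "Use the 'Forgot password' link on the login page.",
        "metadata": {"source": "support", "state": "published"},
    },
    {
        "prompt": "Draft question",
        "completion": "Draft answer",
        "metadata": {"source": "support", "state": "draft"},
    },
]


def to_jsonl(records, allowed_state="published"):
    """Serialize reviewed records as JSONL, one JSON object per line.

    Records whose (assumed) moderation state differs from allowed_state,
    or whose prompt or completion is empty, are skipped.
    """
    lines = []
    for record in records:
        if record["metadata"].get("state") != allowed_state:
            continue
        prompt = record["prompt"].strip()
        completion = record["completion"].strip()
        if not prompt or not completion:
            continue
        row = {
            "prompt": prompt,
            "completion": completion,
            "metadata": record["metadata"],
        }
        lines.append(json.dumps(row, ensure_ascii=False))
    return "\n".join(lines)


def chunk_text(text, size=200, overlap=50):
    """Split text into character chunks of at most `size`, with `overlap`
    characters shared between consecutive chunks (a simple RAG-style split)."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("size must be positive and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


if __name__ == "__main__":
    print(to_jsonl(RECORDS))
    print([len(chunk) for chunk in chunk_text("a" * 450)])
```

Filtering on a review state before serializing is one way to keep unreviewed drafts out of a training set. Character-based chunking is the simplest option, and token- or sentence-aware splitting may suit a real pipeline better.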