TechCompenso
Logo TechCompenso
Disponibile su Google Play Entra nel Talent Radar ๐Ÿš€
โœ‰๏ธ
Newsletter settimanale

TechCompenso per Te

Ogni settimana annunci remote-friendly, sia ibridi che full-remote, e consigli di carriera per muoverti meglio nel mercato tech e digital in Italia.

P

Member of Engineering (Pre-training / Data Acquisition)

๐Ÿข Poolside

Full-Remote

๐Ÿ“ Descrizione

You'll be working alongside our pre-training data team, focused on one of the most foundational challenges in training frontier LLMs: acquiring the best possible pre-training data. The data we collect is upstream of everything. It directly shapes the capability of the models we train. As our first dedicated data acquisition engineer, you will spearhead and evolve systems that crawl the web at massive scale, rapidly ingest data from strategic partnerships, and build specialized tooling to maximize recall from high-value sources. You'll collaborate closely with pre-training data researchers and engineers to ensure that our sourcing of data maps to our training needs, to ensure we have the most capable pre-trained models.

๐Ÿ”Ž Informazioni

๐Ÿ–ฅ๏ธ

Modalitร  di lavoro

Full-Remote

๐Ÿ”น Python ๐Ÿ”น AWS ๐Ÿ”น Kubernetes ๐Ÿ”น Docker