Back to positions

ML/NLP Engineer Needed – Low-Resource Language AI & Speech Project

Remote role Full-time Open position

Hello, We are launching a language technology project for Chimini, a low-resource Bantu language, and are seeking an ML/NLP engineer to help us design and implement the foundational phase of the project. Long-Term Vision Our long-term goal is to build: A structured Chimini text + audio corpus A scalable API layer for integration into our own applications Eventually, speech-to-text and text-to-speech capability in Chimini Chimini is historically related to Swahili, but we do not yet know how structurally similar they are. Pronunciation may differ significantly, which may impact model transfer for speech systems. We currently have: Written texts Audio recordings Access to native speakers for transcription and validation Phase 1 (3–6 Months) The objective of Phase 1 is to build a strong ML-ready foundation, including: Designing a scalable database structure for text and audio Preparing and structuring data for NLP workflows Building a clean corpus pipeline (segmentation, transcription storage, metadata) Advising on whether Chimini–Swahili linguistic comparison should be conducted before leveraging transfer learning Evaluating potential approaches: Fine-tuning multilingual models Embedding-based retrieval systems LLM + RAG architectures Longer-term speech model strategy We want the system designed from the beginning to support future ML training and experimentation.

Responsibilities

Define ML/NLP strategy for a low-resource language Recommend architecture for scalable corpus and training workflows Implement foundational data pipelines Advise on transfer learning feasibility from Swahili or multilingual models Provide phased roadmap (short-term vs long-term) Ideal Experience: NLP for low-resource or multilingual languages Speech systems (ASR/TTS) Fine-tuning transformer models Embeddings and vector databases Designing ML pipelines for scalable experimentation We will handle data collection, transcription, and language validation. Please include: Relevant ML/NLP experience Proposed high-level technical approach Estimated timeline for Phase 1 Availability We are looking for someone who can help architect this correctly from the start, with long-term ML scalability in mind. Best regards, Apply tot his job Apply To this Job Apply tot his job Apply To this Job

Further positions

Senior Full Stack Engineer (ruby on rails)

Remote role Full-time

Senior Software Engineer - B2B Tribe

Remote role Full-time

Senior Frontend Engineer, Core Engineering (Remote)

Remote role Full-time

Principal Software Engineer - Core Sharing & Collaboration

Remote role Full-time

Frontend Developer TON Telegram Mini App

Remote role Full-time

2 Open Data Backend Developers (Full Time / Remote)

Remote role Full-time

Principal FrontEnd Engineer - 100% Remote - EMEA

Remote role Full-time

Front End Developer – Mid-level

Remote role Full-time

Senior Backend Developer - C# and.NET 10 (Remote, Full-Time) [AS239]

Remote role Full-time

Full Stack Engineer, PHP & React

Remote role Full-time

Data Engineer (DataStage)

Remote role Full-time

Onboarding Coach

Remote role Full-time

Fort Wayne Virtual Academy | Teacher Multilingual Learners (.5) | 2026-2027 School Year

Remote role Full-time

Remote Part-Time Customer Service Representative – Home‑Based Support for arenaflex Marketplace

Remote role Full-time

[Remote] Senior Backend Engineer II, AI Native, Devices Cloud

Remote role Full-time

Senior Customer Success Manager

Remote role Full-time

Virtual Behavioral Health Assessor - Independent Contractor FT

Remote role Full-time

Remote Customer Service Manager – Leading West Region Support Operations for Premier Home Furnishings E-Commerce Platform

Remote role Full-time

Experienced Customer Support Specialist – Remote Opportunity to Deliver Exceptional Healthcare Experience

Remote role Full-time

Travel Support Specialist

Remote role Full-time