Data Annotation Engineer

Remote Full-time
Veryfi is a YC-funded Silicon Valley startup that uses AI to understand documents like receipts and invoices. As a Data Annotation Engineer at Veryfi, you'll contribute to the evolution of our training data infrastructure and the development of new features and projects. You'll gather, process, and analyze diverse datasets to generate high-quality training data for our machine-learning models. By working deep within our system, you'll also have the autonomy to identify challenges and opportunities, and to take ownership of solutions that refine existing tools and algorithms.

Key Responsibilities:
• Gather, process, and analyze diverse datasets to generate training data that fuels the development of our ML projects.
• Expand and optimize the training data pipelines to improve the speed and accuracy of our processes.
• Collaborate with a cross-functional team to define requirements and prioritize development efforts.

Essential Skills:
• Proficient in Python programming for data handling and processing, with experience using data science tools such as Pandas, NumPy, and SciPy.
• Strong analytical thinking with a focus on delivering results.
• Meticulous attention to detail, ensuring accuracy and precision in all data handling and processing tasks.
• Enthusiastic about learning and adapting to new technologies and methodologies, particularly in the realm of Machine Learning (ML).
• Innovation mindset, adept at challenging existing processes and driving positive change.

Preferred Qualifications:
• Familiarity with regex development, software engineering principles, and Linux command line tools.
• Experience with Natural Language Processing (NLP) techniques and libraries, including the use of Large Language Models (LLMs) and supervised learning for document data extraction.
• Effective organizational abilities, capable of managing projects independently from inception to completion.
• Exceptional verbal and written communication skills, effectively communicating problems, proposed solutions, and results to stakeholders in a multicultural environment.

A Bachelor's degree in computer science, engineering, or a related field. Postgraduate studies are a plus but not required.

Keywords: NLP, Patterns Detection, Data Labeling, Software Development, Data Engineering.

