KEY RESPONSIBILITIES
- Work with large language models and vector databases to enhance our natural language understanding capabilities.
- Design, develop, and deploy machine learning models for OCR and NLU tasks related to medical documents.
- Build predictive models using patient history data, with a focus on improving healthcare outcomes.
- Process and analyze text and time series data, deriving insights that drive decision-making.
- Develop and optimize data pipelines for handling complex PDFs with both tabular and text data.
- Collaborate with cross-functional teams to integrate machine learning models into production systems on AWS.
- Continuously explore new machine learning technologies and methodologies to improve our solutions.
PREFERRED SKILLS
- Experience with programmatically handling PDFs of varying structures and content types.
- Knowledge of the latest trends and tools in machine learning and data science.
- Prior experience in the healthcare industry is a plus.
QUALIFICATIONS
- Bachelor’s or Master’s degree in Computer Science, Data Science, Machine Learning, or a related field.
- Minimum of 2 years of experience in machine learning, with a strong background in NLP, NLU and OCR.
- Proficiency in Python and experience with machine learning frameworks such as TensorFlow, PyTorch, or similar.
- Familiarity with large language models, vector databases, and Langchain.
- Experience working with text and time series data, particularly in the healthcare domain.
- Strong problem-solving skills and the ability to work independently as well as in a team.
- Excellent communication skills and the ability to translate complex technical concepts to non-technical stakeholders.
- (Bonus Qualifications) Experience with AWS cloud services, including but not limited to S3, Lambda, EC2 or GCP or Azure Equivalent.