Higharc - Research Intern, Special Projects

Remote - USA+ Equity2mo ago

Remote Intern NA Artificial Intelligence Nonprofit Research Intern Python Training Development Reporting Transformers vLLM

Upload My Resume

Drop here or click to browse · Tap to choose · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• Active enrollment in a PhD program in Computer Science, Machine Learning, or a related field at a U.S. institution. • Strong Python programming skills and experience building deep learning training workflows (PyTorch preferred). • Solid understanding of computer vision, transformers, representation learning, and ML experimentation practices. • Demonstrated research ability through publications, strong preprints, open-source research code, or equivalent evidence of research impact. • Comfort working in conventional research and engineering stacks (RoboFlow, Modal, WandB, or similar). • A plus if you also bring: • Experience with vision-language models or multimodal foundation models. • Experience designing and deploying semi-supervised learning methods (pseudo-labeling, self-training, distillation, consistency regularization) and familiarity with common failure modes such as confirmation bias, noisy pseudo-labels, and calibration drift. • Experience with multi-GPU training and large-scale experimentation. • Familiarity with AEC data and workflows — CAD/BIM concepts, plan understanding, or domain-specific labeling. • Working at Higharc • Higharc has been remote first since our founding in 2018. We offer flexible hours so you can do your best work without missing out on life. Higharc offers competitive salaries with significant equity, in a fast-growing, well-funded company. • Personal healthiness is an important value for us- we provide comprehensive medical, dental, and vision coverage, with unlimited PTO, and meaningful maternity/paternity leave to all U.S based employees that are full-time. You'll also have access to other big-company benefits such like short and long-term disability plans and a 401K. Haven't worked remotely before? We provide a stipend to create the ideal home office.

Responsibilities

• You'll work directly with Higharc's Special Projects team across the full research lifecycle — from problem formulation and dataset strategy through training, evaluation, and publication-quality write-ups. The primary research areas are Vision-Language Models for architectural drawings (visually grounded retrieval, grounded document QA, and visual-to-structured-output) and semi-supervised learning for instance segmentation. The key deliverable is a publishable research contribution alongside reproducible code, ready for integration with Higharc ML products. Expect to: • Build data pipelines and extract data from Higharc's existing datasets. • Design, implement, and execute semi-supervised and weakly-supervised VLM and segmentation training pipelines. • Develop evaluation suites and error taxonomies for targeted multimodal tasks. • Run rigorous ablations and scaling experiments, track results, and maintain reproducibility and research hygiene throughout. • Document findings and present results through technical reports, demos, and a submission-ready draft. • You have strong research instincts, the ability to work independently on open-ended problems, and a track record of producing rigorous, reproducible work. You're comfortable moving quickly without sacrificing research hygiene. • Active enrollment in a PhD program in Computer Science, Machine Learning, or a related field at a U.S. institution. • Strong Python programming skills and experience building deep learning training workflows (PyTorch preferred). • Solid understanding of computer vision, transformers, representation learning, and ML experimentation practices. • Demonstrated research ability through publications, strong preprints, open-source research code, or equivalent evidence of research impact. • Comfort working in conventional research and engineering stacks (RoboFlow, Modal, WandB, or similar). • A plus if you also bring: • Experience with vision-language models or multimodal foundation models. • Experience designing and deploying semi-supervised learning methods (pseudo-labeling, self-training, distillation, consistency regularization) and familiarity with common failure modes such as confirmation bias, noisy pseudo-labels, and calibration drift. • Experience with multi-GPU training and large-scale experimentation. • Familiarity with AEC data and workflows — CAD/BIM concepts, plan understanding, or domain-specific labeling. • Working at Higharc • Higharc has been remote first since our founding in 2018. We offer flexible hours so you can do your best work without missing out on life. Higharc offers competitive salaries with significant equity, in a fast-growing, well-funded company. Personal healthiness is an important value for us- we provide comprehensive medical, dental, and vision coverage, with unlimited PTO, and meaningful maternity/paternity leave to all U.S based employees that are full-time. You'll also have access to other big-company benefits such like short and long-term disability plans and a 401K. Haven't worked remotely before? We provide a stipend to create the ideal home office.

Get Started Free

No credit card. Takes 10 seconds.

Requirements

Responsibilities