Software Engineer – Data
Employment Type: Direct Hire
Job ID: 140879
Date Added: 10/21/2025
Location: Dallas, TX – Onsite 4 days a week
The Company
Headquartered in Dallas, TX, our client is a technology and strategy consultancy that aims to give its clients a competitive edge by solving complex problems with data, software, and strategy. They specialize in areas like technology strategy, product development, software engineering, and digital transformation, with a particular emphasis on AI, MLOps, and Data Engineering. The firm's clientele spans various industries, including AgTech, Healthcare, Logistics, and Financial Services.
Platform / Stack
You will work with technologies that include PyTorch, Vision Transformers (ViT), and PaddleOCR.
Compensation Expectation: $120,000 – $160,000
What You'll Do As a Computer Vision Engineer:
- Model Design & Training: Develop and fine-tune computer vision and OCR models for blueprint and document analysis.
- Image Processing Pipelines: Implement contour detection, edge extraction, vectorization, segmentation, and geometric normalization workflows.
- Multimodal & Vision-Language AI: Integrate state-of-the-art models like Florence, CLIP, SAM2, Kosmos, and GroundingDINO into multimodal pipelines for enriched understanding and captioning of visual layouts.
- Architecture Innovation: Apply and extend Vision Transformer (ViT) and diffusion-based architectures for segmentation, classification, and region labeling.
- Dataset Creation: Design and manage dataset labeling workflows, leveraging synthetic data generation and feedback-driven annotation.
- Pipeline Orchestration: Build and maintain scalable training and inference pipelines, integrating MLOps best practices.
- Cross-Functional Collaboration: Partner with AI Strategy and Product teams to ensure technical success aligns with client value and ROI metrics.
- Thought Leadership: Contribute to the company’s IP, internal symposiums, and technical mentorship programs.
Core Technologies:
- Frameworks & Libraries: PyTorch, OpenCV, Pillow, pandas, numpy
- OCR Tools: PaddleOCR, EasyOCR, or Tesseract
- Segmentation & Detection: Semantic and instance segmentation, form and layout detection models
- SOTA Models: Florence, SAM2, CLIP, Kosmos, GroundingDINO
- Architectures: Vision Transformers (ViT), diffusion-based encoders/decoders
- Pipelines: Training orchestration, data versioning, multimodal integration
Qualifications:
You could be a great fit if you have:
- 4–7 years of hands-on experience in computer vision or applied deep learning
- Proven ability to deliver production-grade CV systems (document understanding, OCR, semantic segmentation, etc.)
- Deep experience with image preprocessing, feature extraction, and multimodal integration
- Track record of training or fine-tuning transformer-based models for visual reasoning tasks
- Solid understanding of model optimization, deployment, and evaluation
- Clear communication skills, including the ability to explain model design choices and tradeoffs to non-technical audiences
- Bachelor’s or Master’s in Computer Science, Electrical Engineering, or related field (PhD preferred)
- Portfolio Requirement: Include a link to a GitHub repository, published paper, demo notebook, or any prior work that demonstrates applied computer vision expertise. Candidates without visible work samples will not be reviewed.
This client requires that a background check be completed. The check protects our company/client and its stakeholders by ensuring that we hire individuals with a trustworthy history, which helps maintain a safe and secure workplace. This proactive measure minimizes potential risks and promotes a culture of integrity within the organization.
Benefits Offered:
Employer provides access to:
- Medical & Dental benefits
- 401k
- PTO

