Work Type: Contractor | Permanent RemoteCompensation: USD 50 – 125/hourHours: 10 to 40 hours/week (Partial PST overlap required)Experience Required: 10 – 25 YearsContract Duration: 1 Month (Extension based on performance)Notice Period: Immediate preferredNoteThis is a contract-based, fully remote opportunity. Contractors must be citizens or valid work permit holders in the US, Canada, Australia, or approved Western European countries. No medical benefits or paid leave. The contractor is responsible for managing their own taxes and compliance. Payments are made based on actual hours worked.
Job OverviewWe're looking for highly experienced Fullstack Engineers to work on frontier AI projects, especially around improving Large Language Models (LLMs) for software engineering tasks. You'll support the development of LLM training and evaluation datasets by assessing code generated by models, building agent-based tools, and analyzing complex real-world software. You’ll work with teams pushing the boundary of AI-assisted development to identify weaknesses in model behavior and provide detailed code assessments. The goal is to build smarter, more context-aware AI coding tools that function in realistic development environments.
Key ResponsibilitiesWork across multiple LLM-related projects aimed at improving AI model performance on code. Lead end-to-end engineering efforts for agent use cases like home automation, coding copilots, and creative assistants. Review and rank model-generated code snippets using a structured evaluation system. Evaluate code diffs for correctness, style, maintainability, and performance. Build scalable fullstack applications to support dataset pipelines and tooling. Collaborate with researchers to uncover edge cases and complex code behaviors. Write clear, structured rationales to explain code assessment outcomes. Must-Have Skills10+ years of experience in software engineering with strong fullstack capabilities.
At least 2–3 years as a full-time employee (not contractor) at a top-tier tech company (e. g. , Google, Meta, Microsoft, Amazon, Stripe, etc. ). Deep expertise in software architecture, debugging, and code review. Proven ability to assess large, realistic codebases and evaluate code quality. Strong written and oral communication for clear and logical evaluations. Hands-on experience with Git, code versioning, modern frameworks, and cloud platforms.
Customize your resume to highlight skills and experiences relevant to this specific position.
Learn about the company's mission, values, products, and recent news before your interview.
Ensure your LinkedIn profile is complete, professional, and matches your resume information.
Prepare thoughtful questions to ask about team dynamics, growth opportunities, and company culture.