Mezure: Competence evaluation for post-AI hiring

Define competence on your terms

Answer a few questions and Mezure assembles the assignment, environment, and rubric for you. Tune anything you like, then send candidates a single link. The bar is yours to set.

Create assessment

Autosaved to your workspace. Close this dialog and continue from the sidebar.

What kind of engineering competence are you trying to evaluate?

Start from a common assessment shape. You can still edit every answer as the flow continues.

DebuggingEngineer fixes a focused failure with tests already present.

Feature buildEngineer adds a scoped product behavior to existing code.

From scratchEngineer starts with an empty or minimal project.

Candidate review

mezure.dev/dashboard/reviews/sub_7f3a9c

Gregory House

Search ranking regression by gregory.house@company.com

All changes from the candidate

Search files

src/search/rank-results.ts+41 -8

- const freshnessBoost = result.cached ? 0.18 : 0;

+ const freshnessBoost = result.cached ? 0.04 : 0.16;

const semanticScore = cosineSimilarity(query, result.embedding);

+ const keywordScore = exactMatchBoost(query, result.title);

return semanticScore + freshnessBoost + keywordScore;

M: Off-by-one risk if the ranked list is empty.

M: Ask why keyword boost is safer than another semantic cutoff.

src/search/cache.ts+27 -6

- const freshnessBoost = result.cached ? 0.18 : 0;

+ const freshnessBoost = result.cached ? 0.04 : 0.16;

const semanticScore = cosineSimilarity(query, result.embedding);

+ const keywordScore = exactMatchBoost(query, result.title);

return semanticScore + freshnessBoost + keywordScore;

M: Candidate changed cache behavior without documenting rollout risk.

src/search/scoring.ts+18 -4

- const freshnessBoost = result.cached ? 0.18 : 0;

+ const freshnessBoost = result.cached ? 0.04 : 0.16;

const semanticScore = cosineSimilarity(query, result.embedding);

+ const keywordScore = exactMatchBoost(query, result.title);

return semanticScore + freshnessBoost + keywordScore;

tests/search-ranking.test.ts+32 -4

describe("ranking regression", () => {

+ it("prefers fresh exact matches over stale semantic hits", () => {

+ expect(rankResults(query, fixtures)[0].id).toBe("fresh");

docs/ranking-notes.md+14

- const freshnessBoost = result.cached ? 0.18 : 0;

+ const freshnessBoost = result.cached ? 0.04 : 0.16;

const semanticScore = cosineSimilarity(query, result.embedding);

+ const keywordScore = exactMatchBoost(query, result.title);

return semanticScore + freshnessBoost + keywordScore;

Book a private demo

Mezure is onboarding paid pilots with engineering teams hiring in the agentic era. Share your work email and we'll follow up to scope a pilot around your hiring process.

Evaluate competence on the agentic era.

AGI research agent harness

The full workspace, already instrumented.

Define competence on your terms

Create assessment

What kind of engineering competence are you trying to evaluate?

Measure competence on the correct abstraction layer.

Book a private demo