Research Roadmap

This page collects the public research questions that still need evidence, prototype results, or external collaboration.

Agent evaluation

  • What is the simplest market that improves task routing quality?
  • Do PR prediction markets produce useful reviewer attention signals?
  • Are issue-resolution markets more useful because agents can both forecast and act on the issue?
  • Which quality markets best predict regressions, reverts, and follow-up fixes?
  • How should agents earn, spend, and lose internal currency?
  • How much human oversight is needed before market outcomes can drive execution?
  • Which review outcomes are objective enough for early settlement?
  • How should agent reputation combine accuracy, calibration, specialization, volume, and recent performance?
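
To make the reputation question concrete, the sketch below separates two of the listed inputs, accuracy and calibration, for binary forecasts. It assumes accuracy is measured with the Brier score and calibration with a binned expected calibration error; both choices, the function names, and the default bin count are illustrative assumptions, not a method the roadmap prescribes.

```python
def brier_score(forecasts, outcomes):
    """Mean squared error between probabilities and 0/1 outcomes (lower is better)."""
    return sum((p - o) ** 2 for p, o in zip(forecasts, outcomes)) / len(forecasts)


def expected_calibration_error(forecasts, outcomes, n_bins=5):
    """Weighted gap between mean forecast and observed frequency per probability bin."""
    bins = [[] for _ in range(n_bins)]
    for p, o in zip(forecasts, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 falls in the last bin
        bins[idx].append((p, o))

    total = len(forecasts)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        mean_p = sum(p for p, _ in bucket) / len(bucket)
        mean_o = sum(o for _, o in bucket) / len(bucket)
        ece += (len(bucket) / total) * abs(mean_p - mean_o)
    return ece


# Hypothetical agent history: forecast probabilities and resolved outcomes.
forecasts = [0.9, 0.8, 0.3, 0.6]
outcomes = [1, 1, 0, 0]
print(brier_score(forecasts, outcomes))
print(expected_calibration_error(forecasts, outcomes))
```

The two numbers can disagree: an agent that always forecasts 0.5 on a balanced set of questions is well calibrated but has a mediocre Brier score, which is one reason the question of how to combine them stays open.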

Market mechanisms

  • When is LMSR sufficient, and when do conditional AMMs or order books work better?
  • How should market subsidies be sized for small agent teams?
  • What abuse controls are needed for public agent-market participation?
  • How can markets reward good forecasts without encouraging low-quality action?
  • What is the right market design for donor limit orders on impact per dollar?
  • How should conservative impact estimates be calculated across multiple trading windows?
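
As background for the LMSR and subsidy questions above, here is a minimal sketch of the standard logarithmic market scoring rule: the cost function C(q) = b * ln(sum_i exp(q_i / b)), the prices it implies, and the market maker's worst-case loss of b * ln(n) for n outcomes. The function names and example numbers are illustrative assumptions; the roadmap does not specify an implementation.

```python
import math


def lmsr_cost(quantities, b):
    """LMSR cost function C(q) = b * ln(sum_i exp(q_i / b))."""
    m = max(q / b for q in quantities)  # subtract the max for numerical stability
    return b * (m + math.log(sum(math.exp(q / b - m) for q in quantities)))


def lmsr_prices(quantities, b):
    """Instantaneous prices: a softmax over q_i / b, so they sum to 1."""
    m = max(q / b for q in quantities)
    weights = [math.exp(q / b - m) for q in quantities]
    total = sum(weights)
    return [w / total for w in weights]


def trade_cost(quantities, outcome, shares, b):
    """What a trader pays to buy `shares` of `outcome` from the current state."""
    after = list(quantities)
    after[outcome] += shares
    return lmsr_cost(after, b) - lmsr_cost(quantities, b)


def max_subsidy(n_outcomes, b):
    """Worst-case market-maker loss when starting from uniform prices."""
    return b * math.log(n_outcomes)


# Hypothetical two-outcome market with liquidity parameter b = 10.
state = [0.0, 0.0]
print(lmsr_prices(state, 10.0))
print(trade_cost(state, 0, 5.0, 10.0))
print(max_subsidy(2, 10.0))
```

With b = 10 and two outcomes, the worst-case loss is about 6.93 units of the settlement currency. Under this rule, sizing a subsidy amounts to choosing b: a larger b gives deeper liquidity and slower price movement at the cost of a larger bounded loss.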

Bayesian markets

  • Which belief-network structures are expressive enough for real work but still tractable?
  • When should exact inference be required, and when is it safe to show approximate results?
  • How should deterministic replay and hash stability be maintained as formulas become more expressive?
  • What UI makes conditional belief updates understandable to humans?
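
One common way to approach hash stability is to hash a canonical serialization of the market state, so that replaying the same events always reproduces the same digest. The sketch below assumes a JSON-serializable state, SHA-256, and prices stored as decimal strings rather than floats; the `state_hash` helper and the state layout are hypothetical, not the project's actual format.

```python
import hashlib
import json


def state_hash(state):
    """SHA-256 over a canonical JSON encoding (sorted keys, fixed separators)."""
    canonical = json.dumps(
        state,
        sort_keys=True,         # key order must not affect the hash
        separators=(",", ":"),  # no whitespace variation between encoders
        ensure_ascii=True,      # stable byte output for non-ASCII text
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical market states that differ only in key insertion order.
a = {"market": "m1", "prices": {"yes": "0.6", "no": "0.4"}}
b = {"prices": {"no": "0.4", "yes": "0.6"}, "market": "m1"}
print(state_hash(a) == state_hash(b))
```

Keeping numbers as strings avoids platform-dependent float formatting. More expressive formulas would need their own canonical form before hashing, which is part of what the open question asks.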

Autonomous optimizers

  • Which objective signals are robust enough for executable autonomy?
  • What treasury limits, timelocks, and exit rights are required at each authority level?
  • How should token value, usage, revenue, and other metrics be combined or kept separate?
  • What should remain human-governed even when market signals are strong?

Impact and public goods

  • Which public-good metrics are measurable without becoming easy to game?
  • Can markets estimate marginal impact before grants are distributed?
  • How should null proposals be used to estimate what would happen without new funding?
  • What settlement assets and oracle designs make impact markets credible?
  • Can marginal impact certificates improve credit assignment for donors and builders?

AI trust markets

  • Which safety, reliability, and adoption questions can be resolved from public data?
  • How should vulnerability reports be verified without centralizing trust?
  • Can market-implied trust scores help registries rank agents, skills, and MCP servers?
  • How should Sybil resistance work for agent reputation?