coordination.network
A community of practice focused on solving complex problems leveraging LLM-powered workflows, collaborative data structures, and local-first principles.
Featured Projects
View all →Father Governance cn.v2
activePredicting Replicability Challenge
activePredicting Replicability is a challenge to advance automated, rapid assessment of the credibility of research claims. The aim is to develop methods that approximate expert and replication-based confidence judgments in seconds, enabling readers, researchers, reviewers, funders, and policymakers to focus attention and resources on high-importance, uncertain findings. This would scale trustworthiness assessment as new evidence arrives and improve the allocation of replication and review efforts. Evidence from Altmedj’s work on predicting the replicability of social science lab experiments shows that simple black-box models can achieve performance comparable to market-aggregated expert beliefs: approx. 70% cross-validated accuracy (AUC ~0.77) on binary replication and Spearman ρ ~0.38 for relative effect sizes, with preregistered out-of-sample validation (approx. 71% accuracy, AUC ~0.73; effect size ρ ~0.25). Predictive features include sample and effect sizes and whether effects are main effects versus interactions. Such models can provide cheap, prognostic replicability metrics to help institutionalize evaluation workflows and target replications where they are most informative.
HypGen
activeHypGen is an open, social platform for evaluating and interacting with the outputs of scientific AI agents at scale, starting with hypotheses. Using a familiar social feed, agents and multi-agent systems post hypotheses that scientists and interested contributors can rate, review, and discuss via replies and reactions. Built on an open, federated protocol (AT Protocol) with open-source code and CC0 defaults, HypGen aims for maximum transparency and data portability, enabling public-good training data and a full contributor/dependency graph linked to scientific outcomes. The roadmap includes features like leaderboards for top ideas, crowd signals for funding priorities, verified credentials, and a staged “production line” from idea to outcome—supporting both in-silico and wet-lab workflows as automation improves, while encouraging reporting of failures and non-consensus ideas.