In-depth analysis of decentralized AI inference: How to strike a balance between security and performance | Bee Network

Analysis · 5 months ago · Wyatt

When we started building Gonka, we had a vision: What if anyone could run AI inference and get paid for it? What if we could harness all that unused computing power instead of relying on expensive centralized providers?

The current AI landscape is dominated by a handful of large cloud providers: AWS, Azure, and Google Cloud control the majority of the world’s AI infrastructure. This centralization creates serious problems that many of us have experienced firsthand. Control of AI infrastructure by a handful of companies means they can arbitrarily set prices, censor undesirable applications, and create a single point of failure. When OpenAI’s API went down, thousands of applications crashed with it. When AWS experienced an outage, half the internet stopped functioning.

Even “efficient” cutting-edge models aren’t cheap. Anthropic previously stated that training Claude 3.5 Sonnet cost “tens of millions of dollars,” and although Claude Sonnet 4 is now generally available, Anthropic hasn’t released its training costs. Its CEO, Dario Amodei, previously predicted that training costs for cutting-edge models would approach $1 billion, with the next wave of models reaching billions. Running inference on these models is also expensive: for a moderately active application, LLM inference can cost hundreds to thousands of dollars per day.

Meanwhile, the world has a vast amount of computing power sitting idle (or being used in meaningless ways). Think of Bitcoin miners burning electricity to solve worthless hash puzzles, or data centers running below capacity. What if this computing power could be used for something truly valuable, like AI inference?

A decentralized approach can pool computing power, lowering capital barriers and reducing single-supplier bottlenecks. Instead of relying on a few large companies, we can create a network where anyone with a GPU can participate and get paid for running AI inference.

We know that building a viable decentralized solution is incredibly complex. From consensus mechanisms to training protocols to resource allocation, there are countless pieces that need to be coordinated. Today, I want to focus on just one aspect: running inference on a specific LLM. Just how difficult is that?

What is true decentralization?
When we talk about decentralized AI inference, we mean something very specific. It’s not just about having AI models running on multiple servers, but about building a system where anyone can join, contribute computing power, and be rewarded for honest work.

The key requirement is that the system must be trustless. This means you don’t have to trust any single person or company to run the system correctly. If you’re letting strangers on the internet run your AI model, you need cryptographic guarantees that they’re actually doing what they claim to be doing (at least with a sufficiently high probability).

This trustless requirement has some interesting implications. First, it means the system needs to be verifiable: you need to be able to prove that the same model and the same parameters were used to generate a given output. This is particularly important for smart contracts that need to verify that the AI responses they receive are legitimate.

But there’s a challenge: the more verification you add, the slower the entire system becomes, because the network’s computing power is consumed by verification. If you completely trust everyone, there’s no need to verify any inference, and performance is almost identical to that of centralized providers. But if you trust no one and always verify everything, the system becomes incredibly slow and uncompetitive with centralized solutions.
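To make this trade-off concrete, here is a back-of-the-envelope sketch. It rests on an illustrative simplification that is not taken from the network itself: verifying one inference costs a fixed multiple (`relative_cost`, a hypothetical parameter) of running it, and a fraction `verify_rate` of all inferences is verified.

```python
def useful_compute_fraction(verify_rate: float, relative_cost: float = 1.0) -> float:
    """Share of total compute that serves user requests rather than verification.

    Assumption (illustrative only): each verified inference costs
    `relative_cost` times the compute of the original inference.
    """
    if not 0.0 <= verify_rate <= 1.0:
        raise ValueError("verify_rate must be between 0 and 1")
    if relative_cost < 0.0:
        raise ValueError("relative_cost must be non-negative")
    # Every unit of user inference drags along verify_rate * relative_cost
    # units of verification work.
    return 1.0 / (1.0 + verify_rate * relative_cost)
```

Under these assumptions, verifying everything at equal cost leaves only half of the compute for user requests, while verifying 1% leaves roughly 99%.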

This is the core tension we have been working to resolve: finding the right balance between security and performance.

Blockchain and Proof of Inference
So how do you actually verify that someone ran the correct model and parameters? Blockchain becomes an obvious choice: while it has its own challenges, it remains the most reliable way we know of to create an immutable record of events.

The basic idea is fairly straightforward. When someone runs inference, they need to provide proof that they used the correct model. This proof is recorded on the blockchain, creating a permanent, tamper-proof record that anyone can verify.

The problem is that blockchains are slow. Really, really slow. If we tried to record every step of inference on-chain, the sheer volume of data would quickly overwhelm the network. This constraint drove many of our decisions when designing the Gonka Network.

When designing a network and thinking about distributed computing, there are multiple strategies to choose from. Can you shard a model across multiple nodes, or keep the entire model resident on a single node? The primary limitations come from network bandwidth and blockchain speed. To make our solution feasible, we chose to fit a full model on a single node, though this may change in the future. This does impose a minimum requirement for joining the network, as each node needs sufficient computing power and memory to run the entire model. However, a model can still be sharded across multiple GPUs belonging to the same node, giving us some flexibility within the constraints of a single node. We use vLLM, which allows customization of tensor and pipeline parallelism parameters for optimal performance.
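As a rough illustration of the minimum hardware requirement, the sketch below estimates whether a model's weights fit on one node when sharded across that node's GPUs. The `NodeSpec` type, the 90% usable-memory margin, and the function names are hypothetical; the estimate counts weights only and ignores KV cache and activations. The two parallelism degrees correspond to the `tensor_parallel_size` and `pipeline_parallel_size` settings that vLLM exposes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    """Hypothetical description of a single ML node."""
    gpus: int
    gpu_memory_gib: float


def weight_gib(params_billion: float, bytes_per_param: int = 2) -> float:
    """Approximate weight footprint in GiB (2 bytes per parameter for fp16/bf16)."""
    return params_billion * 1e9 * bytes_per_param / 2**30


def fits_on_node(
    params_billion: float,
    node: NodeSpec,
    tensor_parallel: int,
    pipeline_parallel: int,
    usable_fraction: float = 0.9,  # assumed safety margin, not a network rule
) -> bool:
    """Weights-only check: does each GPU's shard fit in its usable memory?"""
    shards = tensor_parallel * pipeline_parallel
    if shards < 1 or shards > node.gpus:
        return False  # cannot use more shards than the node has GPUs
    per_gpu = weight_gib(params_billion) / shards
    return per_gpu <= node.gpu_memory_gib * usable_fraction
```

For example, under these assumptions a 70B-parameter model in 16-bit precision does not fit on one 80 GiB GPU, but does fit when split across two or more GPUs of the same node.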

How it actually works
We therefore settled on a design in which each node hosts a complete model and runs full inference, eliminating the need for coordination across multiple machines during the actual computation. The blockchain is used only for record-keeping: we record only transactions and the artifacts used for inference verification. The actual computation occurs off-chain.

We want the system to be decentralized, without any single central point directing inference requests to network nodes. In practice, each participant deploys at least two nodes: a network node and one or more inference (ML) nodes. The network node is responsible for communication (it includes a chain node connecting to the blockchain and an API node managing user requests), while the ML nodes perform LLM inference.

When an inference request arrives on the network, it reaches one of the API nodes (acting as a “transfer agent”), which randomly selects an “executor” (an ML node from a different participant). To save time and parallelize blockchain logging with the actual LLM computation, the transfer agent (TA) first sends the input request to the executor and records the input on-chain while the executor’s ML node is running inference. Once the computation is complete, the executor sends the output to the TA’s API node, while its own chain node records a verification artifact on-chain. The TA’s API node transmits the output back to the client, which is also recorded on-chain. Of course, these records still contribute to overall network bandwidth constraints.

As you can see, blockchain recording delays neither the start of the inference computation nor the return of the final result to the client. Verification of whether the inference was completed honestly occurs later, in parallel with other inferences. If the executor is caught cheating, they lose the entire epoch’s reward, and the client is notified and receives a refund.
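The request flow described above can be sketched as a small simulation. Everything here is a stand-in: `Chain` is an in-memory list with artificial latency, `run_inference` is a placeholder for the executor's ML node, and the artifact payload is a dummy string rather than real verification data. The point is only the ordering: on-chain writes are started as background tasks, so they do not hold up the inference or the response.

```python
import asyncio
import hashlib
import random


class Chain:
    """Stand-in for the blockchain: a slow, append-only list of records."""

    def __init__(self, delay: float = 0.01):
        self.delay = delay
        self.records = []

    async def record(self, kind: str, payload: str) -> None:
        await asyncio.sleep(self.delay)  # simulated block-inclusion latency
        digest = hashlib.sha256(payload.encode()).hexdigest()
        self.records.append((kind, digest))


async def run_inference(prompt: str) -> str:
    """Placeholder for the executor's ML node; a real node would call the LLM."""
    await asyncio.sleep(0.01)
    return prompt.upper()


async def handle_request(prompt, transfer_agent, participants, chain, rng):
    # The TA picks an executor at random from the *other* participants.
    executor = rng.choice([p for p in participants if p != transfer_agent])
    # Input logging starts now but is not awaited, so inference is not delayed.
    pending = [asyncio.create_task(chain.record("input", prompt))]
    output = await run_inference(prompt)
    # The artifact (executor's chain node) and the output (TA) are logged in
    # the background; the output can be returned to the client right away.
    pending.append(asyncio.create_task(chain.record("artifact", f"{executor}:{output}")))
    pending.append(asyncio.create_task(chain.record("output", output)))
    return executor, output, pending


def demo():
    async def main():
        chain = Chain()
        executor, output, pending = await handle_request(
            "hello", "alice", ["alice", "bob", "carol"], chain, random.Random(0)
        )
        await asyncio.gather(*pending)  # let the background chain writes finish
        return executor, output, [kind for kind, _ in chain.records]

    return asyncio.run(main())
```

Calling `demo()` returns the chosen executor, the output, and the kinds of records that ended up on the simulated chain.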

The final question is: What is included in the artifact, and how often do we verify inferences?

Security and performance trade-offs
The fundamental challenge is that security and performance are at odds with each other.

If you want maximum security, you need to verify everything. But that’s slow and expensive. If you want maximum performance, you need to trust everyone. But that’s risky and opens you up to all sorts of attacks.

After some trial and error and parameter tuning, we found an approach that balances these two considerations. We had to carefully tune how much to verify, when to verify, and how to make the verification process as efficient as possible. Too much verification, and the system becomes unusable; too little verification, and the system becomes insecure.

Keeping the system lightweight is crucial. We achieve this by storing only the top-k next-token probabilities as the verification artifact. We use these to measure the likelihood that a given output was indeed generated by the claimed model and parameters, and to detect tampering attempts, such as using a smaller or quantized model, with sufficient confidence. We will describe the implementation of the inference verification procedure in more detail in another post.
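Since the actual verification procedure is left to a later post, the following is only a hypothetical sketch of how stored top-k probabilities could be compared. It assumes a validator re-runs the claimed model on the same input and output tokens, obtains its own top-k probabilities per position, and compares them with the executor's artifact. The distance measure (half the L1 difference over the union of top-k tokens) and the `threshold` value are assumptions chosen for illustration, not the network's method.

```python
from typing import Dict, List

TopK = Dict[int, float]  # token id -> probability, for one output position


def position_distance(claimed: TopK, recomputed: TopK) -> float:
    """Half the L1 difference over the union of tokens; missing tokens count as 0."""
    tokens = set(claimed) | set(recomputed)
    return 0.5 * sum(abs(claimed.get(t, 0.0) - recomputed.get(t, 0.0)) for t in tokens)


def mean_distance(claimed_seq: List[TopK], recomputed_seq: List[TopK]) -> float:
    """Average per-position distance across the whole generated sequence."""
    if not claimed_seq or len(claimed_seq) != len(recomputed_seq):
        raise ValueError("sequences must be non-empty and of equal length")
    total = sum(position_distance(c, r) for c, r in zip(claimed_seq, recomputed_seq))
    return total / len(claimed_seq)


def passes_validation(
    claimed_seq: List[TopK],
    recomputed_seq: List[TopK],
    threshold: float = 0.05,  # hypothetical tolerance for numerical noise
) -> bool:
    """Accept the artifact if it stays close to the validator's own probabilities."""
    return mean_distance(claimed_seq, recomputed_seq) <= threshold
```

In a sketch like this, small numerical differences between honest nodes stay under the tolerance, while a different or quantized model would tend to shift the probabilities enough to exceed it.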

At the same time, how do we decide which inferences to verify? We chose a reputation-based approach. When a new participant joins the network, their reputation is 0, and 100% of their inferences must be verified by at least one participant. If a problem is found, the consensus mechanism ultimately decides whether the inference is approved; if it is not, the participant’s reputation is lowered, and they may be removed from the network. As reputation grows, the share of inferences that need verification decreases, until eventually as few as 1% of inferences are randomly selected for verification. This dynamic approach allows us to keep the overall verification percentage low while still effectively catching participants who attempt to cheat.
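The reputation schedule can be sketched as follows. Only the two endpoints come from the text (100% verification at reputation 0, down to 1% for established participants); the 0 to 100 reputation scale and the linear interpolation between the endpoints are assumptions made for illustration.

```python
import random

MAX_RATE = 1.0   # new participants: every inference is verified
MIN_RATE = 0.01  # established participants: about 1% is sampled


def verification_rate(reputation: float, max_reputation: float = 100.0) -> float:
    """Probability that a given inference is selected for verification.

    Assumption: reputation lies in [0, max_reputation] and the rate falls
    linearly from MAX_RATE to MIN_RATE as reputation grows.
    """
    clamped = min(max(reputation, 0.0), max_reputation)
    return MAX_RATE - (MAX_RATE - MIN_RATE) * (clamped / max_reputation)


def should_verify(reputation: float, rng: random.Random) -> bool:
    """Randomly decide whether this inference gets verified."""
    return rng.random() < verification_rate(reputation)
```

Because selection is random, a participant cannot know in advance which of their inferences will be checked.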

At the end of each epoch, participants are rewarded in proportion to their weight in the network. Tasks are also weighted, so rewards are expected to be proportional to both weight and the amount of work completed. This means we don’t need to catch and punish cheaters immediately; it’s sufficient to catch them within the epoch before distributing rewards.
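A minimal sketch of the end-of-epoch payout, under stated assumptions: `weights` already combines a participant's network weight with the work they completed, and the share of anyone caught cheating is simply withheld rather than redistributed. The text does not say what happens to forfeited rewards, so that part is hypothetical.

```python
from typing import Dict, Set


def epoch_rewards(weights: Dict[str, float], pool: float, caught: Set[str]) -> Dict[str, float]:
    """Split the epoch's reward pool in proportion to weight.

    Participants caught cheating during the epoch receive nothing.
    Assumption: their share is withheld, not redistributed.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("total weight must be positive")
    return {
        participant: 0.0 if participant in caught else pool * weight / total
        for participant, weight in weights.items()
    }
```

Since the payout happens only at the end of the epoch, a cheater detected at any point before then still forfeits the whole reward.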

Economic incentives drive this trade-off as much as technical parameters. By making cheating expensive and honest participation profitable, we can create a system where the rational choice is honest participation.
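To see why a low verification rate can still deter cheating, consider this simplified model. It assumes each inference is sampled for verification independently with the same probability, that a sampled cheat is always detected, and that cheating saves a fixed amount of compute cost per inference. These parameters are illustrative, not values from the network.

```python
def detection_probability(verify_rate: float, cheated_inferences: int) -> float:
    """Chance that at least one cheated inference is sampled during the epoch."""
    return 1.0 - (1.0 - verify_rate) ** cheated_inferences


def expected_cheating_payoff(
    epoch_reward: float,
    saving_per_inference: float,
    verify_rate: float,
    cheated_inferences: int,
) -> float:
    """Expected payoff when the whole epoch reward is lost if caught.

    The compute savings are kept either way; the reward is kept only if
    none of the cheated inferences is sampled.
    """
    p_escape = (1.0 - verify_rate) ** cheated_inferences
    return p_escape * epoch_reward + cheated_inferences * saving_per_inference
```

With a 1% verification rate, cheating on 100 inferences is caught with a probability of about 63%. In this model, an epoch reward of 1000 and a saving of 1 per cheated inference give the cheater an expected payoff well below the 1000 earned by honest work.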

Room for optimization
After months of building and testing, we’ve built a system that combines the record-keeping and security advantages of blockchain with single-shot inference performance approaching that of centralized providers. The fundamental tension between security and performance is real, and there’s no perfect solution, only different trade-offs.

We believe that as the network scales, it has a real opportunity to compete with centralized providers while remaining under fully decentralized community control. There’s also significant room for optimization as it develops. If you’re interested in learning more, please visit our GitHub and documentation, join the discussion on Discord, and participate in the network yourself.

About Gonka.ai
Gonka is a decentralized network designed to provide efficient AI computing power. Its design goal is to maximize the use of global GPU computing power to complete meaningful AI workloads. By eliminating centralized gatekeepers, Gonka provides developers and researchers with permissionless access to computing resources while rewarding all participants with its native GNK token.

Gonka was incubated by US AI developer Product Science Inc. Founded by the Liberman siblings, Web 2 industry veterans and former core product directors at Snap Inc., the company raised $18 million in 2023 from investors including OpenAI investor Coatue Management, Solana investor Slow Ventures, K5, Insight, and Benchmark Partners. Early contributors to the project include well-known leaders in the Web 2 and Web 3 space, such as 6 Blocks, Hard Yaka, Gcore, and Bitfury.
