1. Private Protocol Metrics
听云Network
  • 中文版
    • Release Notes
      • 2026年
      • 2025年
      • 2024年
    • 产品介绍
      • 产品概述
        • 产品概述
      • 工作原理
        • 监测原理
        • 监测点部署
      • 产品架构
        • 产品架构
      • 典型场景
        • CDN加速评估及选型
        • 网站对标分析
        • 网站应用压力测试分析
        • 对网站进行性能评估和优化
        • 具备可观测混合压测场景
        • DNS劫持监测-CNAME监测
        • VPN质量监控
        • WIFI质量监控
        • 短信服务质量监控
        • API主动监控
        • 证书监控
        • MTR网络链路监控
        • 企业业务质量监控
        • 端到端场景监测
        • whois域名监控场景
        • IPV6支持率监控
      • 核心优势
        • 端到端监测
        • 市场地位
        • 数据保障
        • 本土化服务
      • 功能模块
        • 即时监测
          • 即时监测
        • 持续监测
          • 持续监测介绍
          • 监测任务类型
          • 监测结果分析
        • 私有节点监测
          • 私有节点监测
        • 域名分析
          • 域名分析
        • CDN调优
          • CDN调优
        • 劫持监测
          • 劫持监测
        • 警报通知
          • 警报通知
        • 报告服务
          • 报告服务
        • API服务
          • API服务
    • 使用指南
      • 持续监测图表
        • 公共操作
        • 概览
        • 趋势图
        • 地域图
        • 地图
        • 主机图
        • 错误图
        • 散点图
        • 元素
      • 任务类型
        • 监测任务类型介绍
        • 固网业务监测
        • 移动业务监测
      • 持续监测
        • 持续监测介绍
        • 新建任务
        • 查询任务
        • 管理任务
        • 绑定任务
        • 节点组管理
      • 即时监测
        • 即时监测介绍
        • 配置即时监测
        • 查看监测结果
      • 系统管理
        • 系统管理介绍
        • 自有点监测
        • 用户管理
        • 使用量查询
        • 授权码管理
        • 地图参数
      • 域名分析
        • 域名分析
      • CDN调优
        • CDN调优介绍
        • CDN评估
        • 算法建议
        • 策略调优
      • 警报
        • 警报介绍
        • 性能警报
        • 错误警报
        • 智能警报
        • 警报日志
        • 警报推送
      • 报告
        • 报告介绍
        • 创建快速报告
        • 创建智能报告
        • 导出报告
        • 图表池
      • API服务
        • API服务
      • 其他功能
        • 分享当前页面
        • 移动端快速访问
      • WebRecorder脚本录制器(插件版)
        • 介绍
        • 安装
        • 创建项目及界面展示
        • 脚本录制
        • 配置任务
        • 注意事项
        • 脚本录制器支持的命令
        • 脚本录制器对事务验证码的支持
    • 指标说明
      • PC监控
        • 页面指标
        • 文件指标
        • 事务指标
        • 流媒体指标
        • Ping指标
        • MTR指标
        • 私有协议指标
          • 私有协议指标解释概述
          • Websocket 插件配置说明
          • DIG 插件配置说明
          • Socket(TCP/UDP) 插件配置说明
          • Telnet 插件配置说明
          • FTP 插件配置说明
          • SMTP/IMAP/POP3 插件配置说明
          • MTR 插件配置说明
          • Traceroute 插件配置说明
          • DeepSeek 插件配置说明
          • TCP Send 插件配置说明
          • UDP 插件配置说明
          • HTTP(SM) 插件配置说明
          • HTTP3 插件配置说明
          • OCSP 插件配置说明
          • WIFI 插件配置说明
          • VPN 插件配置说明
      • 移动(H5)监控
        • 页面指标
        • 文件指标
        • 事务指标
        • 流媒体指标
        • Ping指标
        • 短信指标
    • 脚本录制器
      • NBR脚本录制器工具的下载
      • NBR脚本录制器的使用-----简版
    • 常见问题
      • 功能分析
        • 创建任务
        • 创建节点组
        • 高级参数----结束条件
        • 高级参数----诊断
        • 报表无数据之校验节点DNS功能
        • 流媒体监测,配置任务后无数据
        • 平台是否支持查看“指定Cname是否生效”
        • 真机单文件任务下载崩溃
        • 听云检测和浏览器实际访问数据对不上
        • 监测地址写不全导致的报错
        • 绑定Host未生效,出现了列表之外的主机IP
        • 绑定Host未生效,出现了列表之外的主机IP
        • 不同任务类型,相同IP解析结果不同
        • 复制任务,无法做绑定
        • 即时监测IPV6页面级别解释
        • 私有协议Socket(TCP/UDP)
      • 错误分析
        • 打开流媒体失败
        • 打开流媒体失败
        • 打开流媒体失败
        • 页面任务元素出现阻塞时间的分析思路
        • 无法解析域名和服务器地址
        • 无法解析服务器名称或地址
        • 无法解析服务器名称或地址
        • 无法解析服务器名称或地址
        • 无法解析服务器名称或地址
        • 无法与服务器建立连接
        • 无法与服务器建立连接
        • 无法与服务器建立连接
        • 无法与服务器建联连接
        • 与服务器建立连接失败
        • 操作超时
        • 操作超时之Chrome90
        • 任务超时
        • 任务超时
        • 服务器连接被重置
        • 与服务器连接被重置
        • 与服务器连接被重置
        • 应用的证书无效
        • 应用证书无效
        • 应用证书无效
        • 安全频道支持出错
        • 安全频道支持出错
        • 安全频道支持出错
        • 证书错误
        • 私有化平台页面访问报-100错误
        • 404 - Not Found错误
        • HTTP/1.1 403 Forbidden
        • HTTP412错误
        • 元素报表数值对不上
      • 数据分析
        • 如何区分HLS格式是直播还是点播
        • 报表数据与客户服务器端统计不一致
        • DNS时间过长的分析思路
        • 绑定对比任务的消耗点数不同
        • 4G真机监测下客户反馈http via不正确
        • 节点IP定位不准确
        • 私有化平台任务无数据
        • CDN厂商CNAME无响应问题
        • 单文件下载任务, 创建后无数据
        • SSL握手时间长
        • 监控数据量少,和节点组频率对应不上
        • 跨省跨运营商现象分析
        • 流媒体任务,三个任务数据量相差较大
        • 相同文件下载,不同厂商字节数下载不同
        • 单文件下载,配置MD5,下载不全未报错。
        • DNS时间较长问题分析
        • PING任务警告,Request timed out
        • PING任务丢包百分之百
        • 元素瀑布图中有重复元素加载
        • 散点数据和抓包数据对不上
        • 抓包中如何查看传输的内容
        • 任务无数据/数据少
        • 相同URL,额外等待时间影响onload时间
        • 听云监测数据和浏览器访问数据总下载字节数相差较大
        • DNS时间较长
        • 跨运营商访问
        • 页面任务元素出现阻塞时间
        • 抓包中出现TCP ZERO window
      • 报表权限
      • 节点分析
        • 数据访问发现有跨区访问
        • 节点劫持
      • 告警分析
        • 运营商链路故障引起告警
        • 客户应用500,平台未告警
      • 问题百问
        • Network问题Q&A(1)
        • Network问题Q&A(2)
  • English
    • Release Notes
      • 2026
      • 2025
      • 2024
    • Product Introduction
      • Product Overview
        • Product Overview
      • How It Works
        • Monitoring Principles
        • Probe Deployment
      • Product Architecture
        • Product Architecture
      • Typical Scenarios
        • CDN Acceleration Evaluation and Selection
        • Website Benchmarking Analysis
        • Website Application Stress Test Analysis
        • Website Performance Evaluation and Optimization
        • Observable Hybrid Load Testing Scenario
        • DNS Hijacking Monitoring - CNAME Monitoring
        • VPN Quality Monitoring
        • WIFI Quality Monitoring
        • SMS Service Quality Monitoring
        • API Active Monitoring
        • Certificate Monitoring
        • MTR Network Link Monitoring
        • Enterprise Business Quality Monitoring
        • End-to-End Scenario Monitoring
        • Whois Domain Monitoring Scenario
        • IPv6 Support Rate Monitoring
      • Core Advantages
        • End-to-End Monitoring
        • Market Position
        • Data Protection
        • Localized Service
      • Feature Modules
        • Instant Testing
          • Instant Testing
        • Synthetic Monitoring
          • Synthetic Monitoring Introduction
          • Monitoring Task Types
          • Monitoring Result Analysis
        • Private Probe Monitoring
          • Private Probe Monitoring
        • Domain Analysis
          • Domain Analysis
        • CDN Tuning
          • CDN Tuning
        • Hijacking Monitoring
          • Hijacking Monitoring
        • Alert Notifications
          • Alert Notifications
        • Report Service
          • Report Service
        • API Service
          • API Service
    • User Guide
      • Synthetic Monitoring Charts
        • Common Operations
        • Overview
        • Trend Chart
        • Region Chart
        • Map
        • Host Chart
        • Error Chart
        • Scatter Plot
        • Elements
      • Task Types
        • Monitoring Task Types Introduction
        • Fixed Network Service Monitoring
        • Mobile Service Monitoring
      • Synthetic Monitoring
        • Synthetic Monitoring Introduction
        • Create Task
        • Query Tasks
        • Manage Tasks
        • Bind Task
        • Probe Group Management
      • Instant Testing
        • Instant Testing Introduction
        • Configure Instant Testing
        • View Monitoring Results
      • System Management
        • System Management Introduction
        • Private Probe Monitoring
        • User Management
        • Usage Query
        • Authorization Code Management
        • Map Parameters
      • Domain Analysis
        • Domain Analysis
      • CDN Tuning
        • CDN Tuning Introduction
        • CDN Assessment
        • Algorithm Recommendations
        • Policy Tuning
      • Alerts
        • Alert Introduction
        • Performance Alert
        • Error Alert
        • Intelligent Alert
        • Alert Log
        • Alert Push
      • Reports
        • Report Introduction
        • Create Quick Report
        • Create Smart Report
        • Export Report
        • Chart Pool
      • API Service
        • API Service
      • Other Features
        • Share Current Page
        • Mobile Quick Access
      • WebRecorder Script Recorder (Plugin Version)
        • Configure Task
        • Create Project and Interface Display
        • Introduction
        • Script Recording
        • Commands Supported by Script Recorder
        • Script Recorder Support for Transaction Verification Codes
        • Notes
        • Installation
    • Metric Descriptions
      • PC Monitoring
        • Page Metrics
        • File Metrics
        • Transaction Metrics
        • Streaming Media Metrics
        • Ping Metrics
        • MTR Metrics
        • Private Protocol Metrics
          • Private Protocol Metrics Overview
          • WebSocket Plugin Configuration Guide
          • DIG Plugin Configuration Guide
          • Socket(TCP/UDP) Plugin Configuration Guide
          • Telnet Plugin Configuration Guide
          • FTP Plugin Configuration Guide
          • SMTP/IMAP/POP3 Plugin Configuration Guide
          • MTR Plugin Configuration Guide
          • Traceroute Plugin Configuration Guide
          • DeepSeek Plugin Configuration Guide
          • TCP Send Plugin Configuration Guide
          • UDP Plugin Configuration Guide
          • HTTP(SM) Plugin Configuration Guide
          • HTTP3 Plugin Configuration Guide
          • OCSP Plugin Configuration Guide
          • WIFI Plugin Configuration Guide
          • VPN Plugin Configuration Guide
      • Mobile (H5) Monitoring
        • Page Metrics
        • File Metrics
        • Transaction Metrics
        • Streaming Media Metrics
        • Ping Metrics
        • SMS Metrics
    • Script Recorder
      • Download the NBR Script Recorder Tool
      • Using the NBR Script Recorder - Short Version
    • FAQs
      • Feature Analysis
        • Create Task
        • Create Probe Group
        • Advanced Parameters - End Conditions
        • Advanced Parameters - Diagnostics
        • Verify Probe DNS When Report Has No Data
        • No Data After Configuring a Streaming Media Monitoring Task
        • Does the Platform Support Checking Whether a Specified CNAME Takes Effect
        • Real Device Single File Task Download Crash
        • Tingyun Monitoring Does Not Match Actual Browser Access Data
        • Error Caused by Incomplete Monitoring Address
        • Bound Host Does Not Take Effect and Host IP Outside List Appears
        • Bound Host Does Not Take Effect and Host IP Outside List Appears
        • Same IP Has Different Resolution Results Across Task Types
        • Copied Task Cannot Be Bound
        • Instant Testing IPv6 Page-Level Explanation
        • Private Protocol Socket (TCP/ UDP)
      • Error Analysis
        • Failed to open streaming media
        • Failed to open streaming media (2)
        • Failed to open streaming media (3)
        • Troubleshooting Blocking Time for Page Task Elements
        • Unable to Resolve Server Name or Address
        • Unable to Resolve Server Name or Address (2)
        • Unable to Resolve Server Name or Address (3)
        • Unable to Resolve Server Name or Address (4)
        • Unable to Resolve Server Name or Address (5)
        • Unable to Establish Connection with Server
        • Unable to Establish Connection with Server (2)
        • Unable to Establish Connection with Server (3)
        • Unable to Establish Connection with Server (4)
        • Failed to establish connection with server
        • Operation Timeout
        • Chrome 90 Operation Timeout
        • Task timeout
        • Task timeout (2)
        • Server Connection Reset
        • The connection to the server was reset
        • The connection to the server was reset (2)
        • Invalid Application Certificate
        • Invalid Application Certificate (2)
        • Invalid Application Certificate (3)
        • Secure channel support error
        • Secure channel support error (2)
        • Secure channel support error (3)
        • Certificate Error
        • -100 Error When Accessing Private Platform Page
        • 404 - Not Found error
        • HTTP/1.1 403 Forbidden
        • HTTP412 error
        • Element Report Values Do Not Match
      • Data Analysis
        • How to Distinguish Live and VOD HLS Formats
        • Report Data Does Not Match Customer Server Statistics
        • Troubleshooting Long DNS Time
        • Different Point Consumption for Bound Comparison Tasks
        • Incorrect HTTP Via Reported in 4G Real Device Monitoring
        • Inaccurate Probe IP Location
        • No Data for Tasks on Private Platform
        • CDN Vendor CNAME No Response Issue
        • No Data After Creating Single File Download Task
        • Long SSL Handshake Time
        • Low Monitoring Data Volume Does Not Match Probe Group Frequency
        • Cross-Province and Cross-Carrier Phenomenon Analysis
        • Large Data Volume Difference Across Three Streaming Media Tasks
        • Different Vendors Download Different Byte Counts for Same File
        • Single File Download with MD5 Configured Does Not Report Incomplete Download
        • Long DNS Time Analysis
        • Ping Task Warning - Request Timed Out
        • Ping Task 100 Percent Packet Loss
        • Duplicate Elements Loaded in Element Waterfall Chart
        • Scatter Data Does Not Match Packet Capture Data
        • How to View Transmitted Content in Packet Capture
        • No or Insufficient Task Data
        • Additional Wait Time Affects Onload Time for Same URL
        • Large Difference Between Tingyun Monitoring and Browser Total Download Bytes
        • Long DNS Time
        • Cross-Carrier Access
        • Blocking Time Appears for Page Task Elements
        • TCP Zero Window Appears in Packet Capture
      • Probe Analysis
        • Cross-Region Access Found in Data Access
        • Probe Hijacking
      • Alert Analysis
        • Alert Caused by Carrier Link Failure
        • Customer Application 500 with No Platform Alert
      • Q&A
        • Network Q&A (1)
        • Network Q&A (2)
  1. Private Protocol Metrics

DeepSeek Plugin Configuration Guide

1. Overview#

nbdeepseek is a Network and generation performance measurement plug-in for the DeepSeek / OpenAI compatible streaming Chat Completion API. It measures the full link indicators from the Network layer (DNS, TCP, SSL, TTFB) to the LLM generation layer (first token delay, inference speed, content speed) by sending the real streaming conversation Request.

2. Detailed explanation of input parameters#

2.1 API connection parameters#

Parameter nameAliastypeIs it required?Default valueExplanation and Impact
base_urlurlstringyes—API base URL, such as https://api.deepseek.com/v1. The plugin will POST request to {base_url}/{endpoint}. If filled in incompletely, it may result in Request 404 or connection failure.
endpoint—stringnochat/completionsAPI endpoint path, spliced ​​after base_url. If the target service uses non-standard endpoints (such as custom gateways), this parameter can be modified.
api_keykeystringyes—Bearer Token, used for Authorization: Bearer <api_key> Request header. If missing or invalid, the API will return 401 Unauthorized Error.

2.2 Request content parameters#

Parameter nameTypeRequiredDefault valueExplanation and impact
modelstringno""The Data Model name used by Request is filled in the model field of JSON. If it is empty, some gateways may report an error, and some may use the default Data Model.
promptstringno""User message content, fill in the messages[0].content field of JSON. Directly affects the LLM's generated content length and Theme, thereby significantly affecting the number of Tokens and the generation time. It is recommended to fix prompt to obtain comparable baseline data.

2.3 Timeout and Network parameters#

Parameter nameTypeRequiredDefault valueExplanation and impact
timeoutintegerno300000Overall download timeout in milliseconds (5 minutes). The total time limit from sending Request to fully receiving the streaming response. If the content generated by LLM is long or Network is slow, it needs to be increased appropriately, otherwise it will be forcibly interrupted and a timeout of Error will be reported.
readtimeutintegerno30000Socket single read timeout, unit milliseconds. Note the spelling is readtimeut (not readtimeout). If the interval between two SSE data blocks exceeds this value, the read timeout Error will be triggered. Used to detect long pauses in streaming responses.

2.4 Validation and Quality Control Parameters#

Parameter nameTypeRequiredDefault valueExplanation and impact
checkstringno""Response content validation rules. Used to ensure that LLM returns the expected content and not garbled characters or Error information.
• Plain Text: Perform substring inclusion matching (case sensitive).
• Regular Expression: Use the /pattern/ package, such as /hello.+/, to perform regular matching. If
fails to be verified, the plug-in will report 699002 Error code.
cspmintegerno100Content Speed Percentage Multiplier (Content Speed Percentage Multiplier**). Quality control threshold for detecting Exception responses.
calculation logic: If content speed / reasoning speed * 100 - 100 >= cspm, that is, the content generation speed is faster than the inference speed by more than cspm%, then the judgment result is Exception and reported to 612280 (client Error).
The larger the value, the greater the speed difference tolerated; if set to 0, any content speed ​​> reasoning speed ​​will report an error.

3. Detailed explanation of output indicators#

3.1 Numerical performance indicators#

StageIndicator nameUnitDescription
0Total time spentmsThe overall time taken from initiating HTTP request to fully receiving the streaming response. It is the core metric for measuring the end-to-end performance of APIs.
1DNS Lookup TimemsThe time taken to resolve the base_url domain name to IP. If using IP direct connection, it is close to 0.
2Connection Establishment TimemsThe time taken to establish the TCP three-way handshake. Reflects the direct connection quality from layer Network to the target server.
3SSL/TLS handshake timemsThe HTTPS encrypted handshake takes time. Including certificate exchange, key negotiation, etc.
4Request sending timemsThe time taken to send the HTTP POST request header and body. Usually very small, but if the upstream bandwidth is limited or the Body is very large (such as an image URL), it may increase significantly.
5Time to First Byte (TTFB)msThe time from sending Request to receiving the first response byte. Reflects the response speed of the server in processing the first packet and is a key indicator of API latency.
6Remaining time to receivemsThe time from the first byte until the entire streaming response is fully received. Mainly affected by LLM generation speed and generation length.
7First Token DelaymsThe time from Request to the receipt of the first SSE data block with actual content (delta.content or delta.reasoning_content). This is a core measure of LLM response sensitivity.
8Reasoning Token Speedtokens/s × 1000The speed of generation of inference content (reasoning_content). The value needs to be divided by 1000 to get the true tokens/s. Throughput reflecting the thought process of Data Model.
9Content Token Speedtokens/s × 1000The speed of generating formal reply content (content). The value needs to be divided by 1000. Throughput reflecting the text output of Data Model.
10average overall speedtokens/s × 1000The speed of the total number of tokens (inference + content) divided by the total elapsed time. The value needs to be divided by 1000. Used to horizontally compare the comprehensive generation efficiency of different Data Model/services.
11Reasoning Token quantitytokensThe accumulated number of reasoning_content Tokens. Split estimates by whitespace characters from the content in the SSE stream.
12Content Token quantitytokensThe accumulated number of content Tokens. Same as above, splitting estimates by whitespace characters.
13Inference generation timemsThe time window from the start of output of reasoning_content to the end of reasoning_content.
14Content generation timemsThe time window from the start of outputting content to the end of content.
15Completion speedtokens/s × 1000The completion speed (completion_tokens / total elapsed time × 1000) calculated based on the usage.completion_tokens returned by the API. The difference from stage 10 is the use of official statistics rather than streaming estimates.

3.2 Text and Profiling information#

StageDescription
0Log/Debug Information. General Log, debugging output, response content when debug=true, etc.
1Server IP. base_url Actual IP address after domain name resolution.
2Target Host name. The domain name part of base_url.

3.3 Error code#

info valuemeaningTrigger condition
612280Client ErrorCommon client issues such as settings missing (no URL or key), content speed Exception (cspm verification failed), etc.
699001Reasoning ErrorAfter the streaming response ends, the total number of Tokens is < 1 or the number of content Tokens is < 1. Usually it means that Data Model returns empty, timeout interrupt or stream Exception ends.
699002Content check failedThe text or regular expression specified by the check parameter is not matched in the final generated content.
Standard ErrorNetwork/HTTP ErrorDNS failure (such as 612007), connection failure, SSL failure, HTTP 4xx/5xx, read timeout, etc.

4. Speed ​​calculation and verification logic#

4.1 Token counting method#

During the streaming reception process, the plug-in splits and counts delta.reasoning_content and delta.content of each SSE chunk according to the regular \s+ (blank characters). This may differ from the tokenizer count in the official usage, so:
Stage 11/12 (number of reasoning/content tokens) is streaming estimate.
stage 15 (Completion speed) is based on the official usage.completion_tokens, more accurate but may only be received after the stream ends.

4.2 cspm verification example#

Assume cspm=100:
reasoning speedcontent speedcalculateresult
1000 (1.0 t/s)1500 (1.5 t/s)1500/1000*100-100 = 5050 < 100, passed
1000 (1.0 t/s)2500 (2.5 t/s)2500/1000*100-100 = 150150 >= 100, failed, reported 12280

5. Typical Configuration Example#

5.1 Basic DeepSeek API test#

base_url=https://api.deepseek.com/v1
api_key=sk-xxxxxxxxxxxxxxxx
model=deepseek-chat
prompt=Please explain in one sentence what artificial intelligence is
timeout=120000
readtimeut=30000

5.2 Testing with content verification#

base_url=https://api.deepseek.com/v1
api_key=sk-xxxxxxxxxxxxxxxx
model=deepseek-chat
What is prompt=1+1 equal to? Please answer numbers only
check=2
cspm=100

5.3 Regular content verification#

base_url=https://api.deepseek.com/v1
api_key=sk-xxxxxxxxxxxxxxxx
model=deepseek-chat
prompt=Please describe the sun
check=/sun.+star/

5.4 Long text generation test#

base_url=https://api.deepseek.com/v1
api_key=sk-xxxxxxxxxxxxxxxx
model=deepseek-chat
prompt=Please write a 500-word essay about cloud computing
timeout=300000
readtimeut=60000

9. Precautions#

1.
Streaming response dependency: The plug-in forces the use of stream=true. If the target API does not support SSE (Server-Sent Events) streaming return, the content will not be parsed correctly, which may result in 699001 (the number of Tokens is 0).
2.
Token counting difference: The streaming estimated Token number is different from the official usage counting method. When comparing, please distinguish between stage 11/12 (estimated) and stage 15 (based on official completion_tokens).
3.
readtimeut spelling: The settings key is readtimeut (missing the letters o), please strictly follow this spelling, otherwise the Default value 30000 will be used. This value detects the quiet time between SSE data blocks, not the overall timeout.
4.
Reasonable value of cspm: If the content of Data Model itself is generated much faster than inference (such as some lightweight Data Model), cspm=100 may be a false positive. It is recommended to observe the normal ratio through debugging mode before setting the threshold.
5.
Callback Threads safety: All callbacks are triggered in the internal Network/parsing Threads, and the host program needs to ensure the Threads safety of the callback function.
上一页
Traceroute Plugin Configuration Guide
下一页
TCP Send Plugin Configuration Guide
Built with