5.1.6 Release Notes
tros-jazzy
Version: 2.5.4 (2026-05-19)
Changes (RDK S600 platform):
- Adapted to system SDK
RKD S600 v5.1.0.
Version: 2.5.1 (2026-03-10)
Changes:
- Adapted to system SDK
RKD S600 v5.0.1. - Refactored
mipi_camvideobuff management from stream capture to message publishing, as well as the stitching thread; changed calibration info topics for stitched images to "image_combine_raw/left/camera_info" and "image_combine_raw/right/camera_info".
New Features:
- Added detection algorithms yolov2, yolov3, yolov5; segmentation algorithm unet; classification algorithm mobilenet.
- Added open-vocabulary object detection algorithm DOSOD.
- Added human detection and tracking algorithm (Ultralytics YOLO Pose).
- Added human instance tracking algorithm reid.
- Added segment-anything algorithm MobileSAM.
Version: 2.5.0 (2025-12-02)
New Features:
- Support for
RDK S600platform.
tros-humble
Version: 2.5.3 (2026-05-08)
New Features (RDK S100 platform):
- Added mono_edgetam_prompt: supports two prompt styles via prompt_mode—point prompts or box prompts. Exports prompt initialization results (in-memory feature files) for downstream tracking, with optional local rendering output.
- Added mono_edgetam_track to load features and use them for tracking initialization. Supports continuous tracking in local image and subscribed stream modes. Updates tracking memory frame by frame and publishes segmentation/tracking results.
Changes (RDK S100 platform):
- Adapted to system version: RDKS100_V4.0.6.
Version: 2.5.2 (2026-03-17)
New Features (RDK X5 platform):
- Added epipolar alignment detection to the stereo depth estimation algorithm; adjusted subscribed messages per hobot_mipi_cam to "image_combine_raw/left/camera_info" and "image_combine_raw/right/camera_info"
- Stereo OCC algorithm supports mipi cameras.
- Object detection algorithm supports
yolo26. sensevoice_ros2algorithm adds configuration options for Chinese/English mode.- Added hand keypoint and gesture recognition algorithm based on
palm_detection_mediapipeandhand_landmarks_mediapipe.
Changes (RDK X5 platform):
- Adapted to system version: RDK 3.5.0 (Linux SDK V1.1.2)
- Refactored
mipi_camvideobuff management from stream capture to message publishing and the stitching thread. Changed calibration info topics for stitched images to "image_combine_raw/left/camera_info" and "image_combine_raw/right/camera_info". Refactored X5 eeprom read handling. Added subdirectory publishing on X5. Added combined 132gs imu data publishing.
Version: 2.4.6 (2026-02-02)
Changes (RDK S100 platform):
- Adapted to system version
V4.0.5andOEversionV3.7.0.
Version: 2.4.5 (2025-10-28)
Bug Fixes (RDK X5 platform):
- Fixed a bug in the image processing acceleration module
hobot_cvwhen using vse to accelerate image resize. - Fixed incorrect inference latency statistics in the on-board model inference framework
dnn_node. - Fixed runtime loading failures caused by incorrect installation paths for
ros component soin MIPI image capture and algorithm modules. - Optimized image codec configuration: removed invalid configuration parameters and added debug configuration parameters.
Version: 2.4.4 (2025-10-24)
New Features (RDK S100 platform):
- On-board algorithm inference framework
dnn_nodeaddsROIinference mode; the number of outputoutput tensors isoutput_size x roi_size. - Added human detection and tracking (Ultralytics YOLO Pose) algorithm example.
- Added human instance tracking algorithm example, which extracts human features using the
reidmodel and stores, manages, and queries features via anSQlitedatabase. The launch file disables ROI-based human tracking in the human detection and tracking nodemono2d_body_detection; inputROIsize is limited to less than 3.5x the actual model input size. - Added hand keypoint and gesture recognition (mediapipe) algorithm example. Implements basic palm detection, pre/post-processing, and
ai msgpublishing; supports mipi camera, usb camera, and local image replay; supports zero-copy and non-zero-copy image acquisition. - Added DeepSeek large language model algorithm example. Supports human-machine dialogue;
RDK S100andRDK S100Pnewly supportDeepSeek_R1_Distill_Qwen_1.5BandDeepSeek_R1_Distill_Qwen_7Bmodels.
Version: 2.4.3 (2025-9-15)
New Features (RDK X5 platform):
- MIPI image capture supports launching multi-channel image capture.
- Stereo MIPI image capture supports
sc132gsstereo camera.
Version: 2.4.2 (2025-8-29)
New Features (RDK S100 platform):
- MIPI image capture supports
230aistereo module. - YOLO object detection supports
yolo11andyolov12algorithms; added component-based launch scripts supporting 4K image inference withmipi cam. - EdgeSAM segment anything algorithm adds
edgesamfor model inference; addsnv12format data input; adds padding for segmentation results for joint visualization in stereo depth use cases. - Added text-image feature retrieval algorithm for text-image feature extraction and retrieval.
- Added DOSOD algorithm, D-Robotics open-vocabulary detection DOSOD on-device deployment package; added re-parameterization quantization method for customizing detection categories in DOSOD models.
- Added stereo OCC, integrating D-Robotics stereo
OCCnetwork. - Vision-language model algorithm adds
smolvlm2model support, with image replay and subscription modes; adds ability to output complete topics afterllmmodel inference completes. - Added stereo depth estimation algorithm.
Version: 2.4.1 (2025-7-30)
New Features (RDK X5 platform):
- MIPI image capture supports
imx415module. - EdgeSAM segment anything algorithm adds
edgesamfor model inference; addsnv12format data input; adds padding for segmentation results for joint visualization in stereo depth use cases. - Added human instance tracking algorithm, which extracts human features using the
reidmodel and stores, manages, and queries features via anSQlitedatabase. - Added stereo OCC algorithm, integrating D-Robotics stereo
OCCnetwork. - Vision-language model adds
smolvlm2model support, with image replay and subscription modes; adds ability to output complete topics afterllmmodel inference completes.
Version: 2.4.0 (2025-5-12)
New Features:
- Support for
RDK S100platform.
Version: 2.3.3 (2025-4-30)
New Features:
- Support for
RDK X5 Moduleplatform. - Added ASR open-source solution based on
sensevoice_cpp, supporting command word and ASR data publishing. - Stereo depth estimation algorithm optimizes post-processing time; added V2.3 model version.
- Added on-device vision-language model algorithm example based on
llama.cpp.
Version: 2.3.2 (2025-1-15)
Changes:
- Stereo depth estimation algorithm updated stereo model to improve depth estimation quality.
- Multi-channel video analysis algorithm application example: optimized processing pipeline and WEB visualization.
- Stereo utility package removed launch files that could not start.
New Features:
- Added ZED camera image capture to launch ZED camera for stereo images as input to stereo depth estimation algorithm.
- Added DOSOD algorithm, D-Robotics open-vocabulary detection DOSOD on-device deployment package.
Bug Fixes:
- Fixed crash in yolov8-seg image segmentation algorithm post-processing caused by out-of-bounds boxes.
- Image codec fixed incorrect frame rate statistics.
- Stereo MIPI image capture fixed i2c detection issue; added lpwm switch configuration.
Version: 2.3.1 (2024-11-20)
Changes:
- Upgraded dependent
opencvversion from 3.4.5 to 4.5.4 (latest release version used on Ubuntu 22.04).
New Features:
- Tools support publishing
bgr/rgbformat message data; support configuring frame_id for published messages. - Human detection and tracking algorithm supports configuring subscribed message topics; supports component mode; pre-processing supports scaling input images before inference; launch scripts support compressed image replay with configurable image paths.
- On-board algorithm model inference and deployment framework fixed incorrect inference latency calculation in multi-threaded inference; supports configuring task count in configuration files.
- Image codec uses frame_id from subscribed image messages as output image message frame_id; supports publish frame drop control.
- Gesture recognition algorithm supports configuring post-processing thresholds at startup; supports dynamic gesture recognition.
- Added face age detection algorithm for detecting human age.
- Added face 106 keypoint detection algorithm for detecting 106 facial keypoints.
- Added perception message fusion Node to subscribe to multiple PerceptionTargets topics, perform time alignment and deduplication, then fuse and publish as a single topic. See multi-algorithm inference for application reference.
- Added perception message filter Node using OneEuroFilter strategy to smooth points and boxes, correcting positions of human, face, hand detection boxes and keypoints in perception results to fix jitter. See multi-algorithm inference for application reference.
- Added stereo utility package for capturing stereo and depth images.
- Added multi-channel video analysis algorithm application example that pulls multiple h264 and h265 streams via rtsp protocol for inference and visualizes perception results on the WEB.
Bug Fixes:
- MIPI image capture fixed
imx219module startup failure. - Hand keypoint detection algorithm pre-processing adds hand box expansion to fix incorrect keypoint output.
Version: 2.3.0 (2024-09-19)
New Features:
- Support for
RDK X5platform. - Data acquisition adds stereo MIPI image capture capability.
- Algorithm repository adds
yolov8andyolov10object detection,yolov8-segimage segmentation reference algorithms. - Algorithm repository adds YOLO-World algorithm for open-vocabulary input detection.
- Algorithm repository adds optical flow estimation algorithm for optical flow detection.
- Algorithm repository adds segment anything algorithm for universal segmentation.
- Algorithm repository adds text-image feature retrieval algorithm for text-image feature extraction and retrieval.
- Algorithm repository adds stereo depth estimation algorithm implementing vision-based depth estimation.
Version: 2.2.0 (2024-04-11)
Changes:
- Based on TROS Foxy 2.1.3, adapted to Ubuntu 22.04 and ROS2 Humble.
- TROS installation path changed from
/opt/trosto/opt/tros/humble, consistent with ROS2 installation path hierarchy and naming. - No longer provides
tros-ros-basepackage (includes rclcpp, rclpy, ros2cli and other ROS2 base packages); uses standard ROS2 distribution packages; installing TROS Humble automatically installs dependent ROS2 Humble. - Uses ROS2 fastdds zero-copy communication for modules involving image data such as data acquisition, image codec, and algorithm examples.
- Zero-copy communication QoS Reliability changed from
RMW_QOS_POLICY_RELIABILITY_RELIABLE(rclcpp::QoS()) toRMW_QOS_POLICY_RELIABILITY_BEST_EFFORT(rclcpp::SensorDataQoS()) to avoid potential stability risks when using zero-copy. - Refactored
hobot_dnnto use lower-level on-board inference frameworklibdnninstead ofeasydnn. hobot_audioupgraded speech algorithm SDK to use lower-level on-board inference frameworklibdnninstead ofeasydnn.hobot_triggeradapted to ROS2 Humble version rosbag2.
New Features:
robot_dev_configadded bloom build and packaging scripts for ARM platform TROS compilation and packaging.hobot_mipi_camnode added frame_ts_type configuration option, supporting realtime (for communication latency calculation) and sensor (default, for sensor timestamp synchronization) parameters.- Added
hobot_shmnode for configuring ROS2 zero-copy environment.
Bug Fixes:
- Fixed compatibility issues introduced by compiler upgrade.
- Fixed path dependency issues in some ROS2 pkg on-board compilation.
tros-foxy
Version: 2.1.3 (2024-03-11)
Changes:
- JPEG compressed image data type changed from
sensor_msgs::msg::Imageto standardsensor_msgs::msg::CompressedImage, enabling tools such as foxglove and ros2 rqt to view TROS-published JPEG images. Affects hobot_websocket, hobot_codec, hobot_image_publisher, hobot_usb_cam modules. - Unified use of jpeg/mjpeg configuration options for publishing/subscribing JPEG compressed images; removed jpeg-compressed/mjpeg-compressed configuration options. Affects hobot_codec and hobot_usb_cam modules.
- Introduced TROS_DISTRO environment variable to indicate TROS distribution. After running
source /opt/tros/setup.bash/source /opt/tros/local_setup.bash, theTROS_DISTROenvironment variable is empty. Configuration file paths for hobot_codec, hobot_audio, hobot_mipi_cam, hobot_usb_cam and other modules changed from/opt/tros/libto/opt/tros/${TROS_DISTRO}/lib.
Version: 2.1.2 (2024-01-19)
New Features:
- Refactored
hobot_usb_camwith support for more format configurations and transcoding. hobot_audioupdated speech SDK, supporting both 2mic and 4mic microphone boards; added micphone_name configuration for device ID.
Bug Fixes:
hobot_rgbd_camnode fixed incorrect step field in sent data messages.hobot_ttsupdated audio playback function calls to fix playback failures on newer system versions.hobot_llmremoved config device tree file and updated README; newer system versions can set ION memory size via command-line tools.
Version: 2.1.1 (2023-11-03)
New Features:
- Added
hobot_chatbotnode that invokes intelligent speech, large language model, and text-to-speech modules to enable on-board voice chat.
Bug Fixes:
- Text-to-speech
hobot_ttsnode fixed application exit caused by certain characters.
Version: 2.1.0 (2023-09-14)
Changes:
tros-ros-baseupdated to latest ROS2 foxy source, compatible with latest ROS2 foxy packages.- Using ROS2 foxy packages only requires
source /opt/tros/setup.bash; no longer need scripts to create symlinks.
New Features:
- Text-to-speech
hobot_ttsnode added parameter to specify audio playback device. - Added large language model
hobot_llmnode for on-device LLM experience. - Image codec
hobot_codecnode configuration parameterin_formataddedjpeg-compressedoption; subscribes to topic data type based on configuration option.
Bug Fixes:
- MIPI image capture
hobot_mipi_camnode fixed incorrect step field in RGB format data messages.
Version: 2.0.2 (2023-08-28)
Changes:
- ROS2 source configured during tros.b installation (
/etc/apt/sources.list.d/ros2.list) changed to Tsinghua mirror, fixing slow and failed ROS2 package installation.
New Features:
- Added permission check when tros.b startup scripts configure the environment (
source /opt/tros/setup.bashandsource /opt/tros/local_setup.bash). If the current account lacks root permissions, automatically enters the process to switch to root account, fixing tros.b usage failures due to insufficient permissions. - Intelligent speech algorithm
hobot_audionode added audio device number parameter configuration for easier secondary development. - Event trigger
hobot_triggernode added ability to assign tasks to Trigger module via std_msg topics, standardizing Trigger configuration.
Bug Fixes:
- Fixed image acceleration processing
hobot_cvnode failure when performing crop&resize simultaneously. - Fixed MIPI image capture
hobot_mipi_camnode error log output at startup. - Fixed invalid launch file configuration for data visualization message conversion
hobot_visualizationnode.
Version: 2.0-Release (2.0.1) (2023-06-10)
Changes:
- Upgraded speech algorithm to optimize ASR (Automatic Speech Recognition) performance.
- Optimized algorithm example
model_nameconfiguration option to automatically parsemodel_namefrom model files, fixing model loading failures caused by incorrect parameter configuration and improving ease of secondary development. - tros.b installation package no longer includes nav2 packages; users install the latest ROS2 nav2 packages directly on RDK via apt, fixing stability issues in older nav2 versions.
New Features:
- Added
hobot_triggerandhobot_visalizationnodes for Trigger event triggering, rosbag data acquisition and visualization, helping users locate, reproduce, and visualize perception, planning, and control issues in robot scenarios. Users can also develop data triggering, recording, and real-time transmission capabilities. - USB image capture node auto-adapts USB camera device numbers, lowering the barrier for USB camera usage.
- Added Visual Inertial Odometry (VIO) algorithm node for low-cost, robust, high-precision robot localization based on vision.
- Added text-to-speech
hobot_ttsnode to convert text to speech for playback. - Added LiDAR object detection algorithm
hobot_centerpointnode. - Added BEV perception algorithm
hobot_bevnode. - Added stereo depth estimation algorithm
hobot_stereonetnode.
Bug Fixes:
- Upgraded
RDK X3easydnn (version 1.6.1) and dnn (version 1.18.4), fixing operator crash issues and supporting more operators. - Fixed incorrect depth data published by RGBD image capture node.
Other Updates:
- Optimized human detection and tracking algorithm node to adapt output perception result coordinates based on input image resolution.
- Fixed orb_slam3 algorithm compilation failure caused by incorrect compilation script path.
Version: 2.0-Beta (2.0.0) (2023-05-29)
2.0-Beta (2.0.0) is the first 2.x version of tros.b. Users of tros.b 1.x are recommended to upgrade to 2.x.
Changes:
- Code hosting platform moved from Gitlab to GitHub for easier secondary development by more developers.
- Integrated more efficient package management to speed up version upgrades and simplify robot application installation.
New Features:
- Support for the new RDK X3 Module core board development kit.
- hobot_audio added ASR recognition result output for voice application development.
Bug Fixes:
- Fixed dnn_node built-in MobileNet_SSD model post-processing crash in multi-threaded scenarios.
- Fixed dnn_node DDR input model inference failure on X86 platform
- Fixed hobot_codec and hobot_image_publisher compilation failures on X86 platform.
Other Updates:
- Updated example launch scripts to reference dependent module launch scripts and configure parameters.
- websocket updated display-side D-Robotics logo.