Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Sponsored by ARK] Bidirectional DShot #23863

Open
wants to merge 16 commits into
base: main
Choose a base branch
from

Conversation

dakejahl
Copy link
Contributor

@dakejahl dakejahl commented Oct 29, 2024

First of all a big thank you to @julianoes for the initial implementation. This implementation differs in that it uses up to 4 DMA channels for Capture Compare input on all 4 channels of the first timer. The limitation to a single timer is due to the limited available DMA channels on most boards (4 without PX4IO on ARKV6X) This will work down to 1 DMA channel by only capturing the input from a single timer channel.

Implementation
If Bidirectional DShot is not enabled the implementation is the same as it was before. Timers will be configured for DShot mode if the timer supports Burst Mode and if there is enough DMA channels available (1 DMA per timer). If Bidirectional Dshot is enabled, only the first timer will be configured for DShot due to DMA channel limitations. Burst mode uses 1 DMA, and CaptureCompare mode uses 4 (re-using the DMA used during Burst). An hrt callback is used to process the eRPM frames since DShot frames have predictable timing but unpredictable pulse count. This webpage was a useful resource.

Further discussion
Initially I wanted to support bidirectional dshot on all timers with DMA simultaneously. This requires triggering each timer (burst/captcomp) sequentially so that we can reuse the available DMA channels between the timers.

Timer1 Burst (1DMA) --> Timer1 CaptComp (x4 DMA) --> Timer2 Burst (1 DMA) --> Timer2 CaptComp (x4 DMA)

I made it pretty far with this implementation but decide to scrap it because I kept running into race conditions and this development has already taken quite some time. I think it is still possible to do, but is out of scope of this PR. The code is structured in a way that should allow fairly easy extensibility to support this in the future.

Issues
An issue I found but am not fixing here is that DShot.cpp configures all of the channels on timers that have DShot enabled. This results in a pulse train on every channel output on that timer, regardless of if the output function is enabled. The fix is not super easy as it requires extra logic in order to properly initialize the timer channels for burst mode, since the enabled outputs need to be sequential. This is the current implementations behavior so I am leaving it as I found it.

Targets Tested
ARKV6X with 4 DMA channels
ARKV6X with 2 DMA channels
CubeOrange with 4 DMA channels

Screenshots
image
image

@dakejahl dakejahl force-pushed the dev/bidirectional_dshot_single_timer branch 2 times, most recently from e4681c4 to 16dc6d4 Compare October 29, 2024 22:58
@DronecodeBot
Copy link

This pull request has been mentioned on Discussion Forum for PX4, Pixhawk, QGroundControl, MAVSDK, MAVLink. There might be relevant details there:

https://discuss.px4.io/t/px4-sync-q-a-oct-30-2024/42184/1

@julianoes
Copy link
Contributor

Thanks for your work @dakejahl. I'll review and test this on the Holybro 6Xs as well as Cubes tomorrow.

@julianoes
Copy link
Contributor

@dakejahl This implementation differs in that it uses up to 4 DMA channels for Capture Compare input on all 4 channels of the first timer.

Do I remember it right that I did the capture round robin style, so one channel per output pulse. Is that what you mean with this?

And so if I understand you right, we need 4 DMA channels to enable this? Which means that we need to disable DMA for serial ports or sensors to enable this, right?

And what are the steps to enable this? It's not on by default, right?

@dakejahl
Copy link
Contributor Author

dakejahl commented Oct 30, 2024

@julianoes your impl was doing round robin using a single DMA and reading from the burst register. You were re-using the 1 DMA from the burst output for the input capture.

And so if I understand you right, we need 4 DMA channels to enable this? Which means that we need to disable DMA for serial ports or sensors to enable this, right?

The init logic determines if DMA is available for each channel, so this will work with only 1 DMA (the 1 used for burst output anyway). If there are more DMA available, then great, but they are not strictly required as the channel will be marked in the captcomp_channels field. For example on the ARKV6X jetson carrier and basic carrier we do not have a PX4IO, so this leaves us with 4 DMA, so it works out of the box with all 4 channels. If the ARKV6X was paired with a carrier with PX4IO, then we would only have 2 DMA available and only Timer1 Ch1 and Ch2 would be used for CaptComp. I tested simulating having a PX4IO by adding some test code to allocate 2 extra DMA channels during init, and the driver worked as expected by only capturing on Ch1 and Ch2.

And what are the steps to enable this? It's not on by default, right?

DSHOT_BIDIR_EN parameter enables this, it is disabled by default

@julianoes
Copy link
Contributor

and the driver worked as expected by only capturing on Ch1 and Ch2.

Aha! So then you only have feedback for part of the ESCs essentially?

DSHOT_BIDIR_EN parameter enables this, it is disabled by default

Thanks, of course! 🤦‍♂️

@dakejahl
Copy link
Contributor Author

Aha! So then you only have feedback for part of the ESCs essentially?

yes exactly, you'd only have feedback from 1. This leads me to a question I have: is there a notch filter per ESC when using the gyro dynamic notch filter? (IMU_GYRO_DNF_EN). We could support round robin for the 1 - 3 DMA use case.

@julianoes
Copy link
Contributor

We could support round robin for the 1 - 3 DMA use case.

That's what I'm thinking, but I'm not aware how the dynamic notch filtering works.

@dakejahl
Copy link
Contributor Author

That's what I'm thinking, but I'm not aware how the dynamic notch filtering works.

I bet @dagar or @bresch knows. Is there a notch filter applied for each ESC? Is it worth it to round robin the ESC RPM if we don't have 4 DMA available?

It would be pretty easy to update the logic to round robin with the available DMA:

1 DMA: each channel is captured on every 4th measurement --> C x x x C x x x C x x x C ...
2 DMA: each channel is capture on every other measurement --> C x C x C x C x C x...
3 DMA: every third measurement is skipped --> C C C x C C C x C C C x C ...

@jgw365
Copy link

jgw365 commented Oct 31, 2024

截图 2024-10-31 20-19-57

The output of DShot in a px4_fmu-v6c board is fine, but I got incorrect eRPM data on channel 2 with 10% CRC errors, and the other channels can't be read. The code I used is an old PX4 with the DShot part of your code, and there are no warning errors.

@jgw365
Copy link

jgw365 commented Oct 31, 2024

截图 2024-10-31 20-19-57

The output of DShot in a px4_fmu-v6c board is fine, but I got incorrect eRPM data on channel 2 with 10% CRC errors, and the other channels can't be read. The code I used is an old PX4 with the DShot part of your code, and there are no warning errors.

Could you please provide me with some advice on how to solve this? @dakejahl

@jgw365
Copy link

jgw365 commented Oct 31, 2024

图片
The log records the RPM during arming.

@dakejahl
Copy link
Contributor Author

The code I used is an old PX4 with the DShot part of your code

@jgw365 I'm sorry but I really can't help you with a backport. Please test the PR as it is, and let me know if works on the fmu-v6c or not and we can go from there.

@jgw365
Copy link

jgw365 commented Nov 1, 2024

Okay, thank you a lot. I cloned your commit in the branch 'dev/bidirectional_dshot_single_timer' and used the command 'make px4_fmu-v6c upload' to upload it to my drone. After plugging in a battery, it seems that the channels are not connected correctly, but the motor output is functioning properly by the actuator test in QGroundControl (QGC).
截图 2024-11-01 10-26-56
图片
@dakejahl

@jgw365
Copy link

jgw365 commented Nov 1, 2024

A few weeks ago, I used julianoes‘ code from the branch 'pr-bidirectional-dshot'. I was able to achieve normal DShot ERPM; however, I noticed that there were many spikes caused by CRC errors, with a rate of about 10%.

@dakejahl
Copy link
Contributor Author

dakejahl commented Nov 5, 2024

@jgw365 it looks like it does indeed work. The fmu-v6x only has 2 DMA channels available so it's only capturing the eRPM from the first 2 channels. Please read the above comments discussing this. I might end up implementing round-robin for the less than 4 DMA cases such as this.

@jgw365
Copy link

jgw365 commented Nov 6, 2024

I might end up implementing round-robin for the less than 4 DMA cases such as this.

Got it, thank you very much! Looking forward to seeing your progress in development.

@julianoes julianoes self-requested a review November 12, 2024 21:41
@dakejahl
Copy link
Contributor Author

@jgw365 can you test again? I pushed a commit which fixes a bug I introduced when trying to fix the F1 compilation

@dakejahl
Copy link
Contributor Author

Alright I've implemented and tested round robin @julianoes

This allows individual boards to override the number of DShot channels
and hence avoid round robin capture of the RPM feedback.
This slows down the ESC_STATUS publication in the case of round robin
capture. E.g. for 800 Hz output with one DMA channel, the ESC_STATUS is
now published at 200 Hz.
Instead of publishing both bidirectional dshot updates as well as
telemetry updates, we now combine the data from both streams, and
publish whenever we get RPM updates, as the latter arrives with higher
rate, e.g. 200 Hz with round robin, or faster otherwise.

When combining the data, we take RPM from bidirectional dshot, and the
rest from telemetry.

When we have only one of the two, either telemetry or bidirectional
dshot, we just publish that one.
@AlexKlimaj AlexKlimaj force-pushed the dev/bidirectional_dshot_single_timer branch from df39300 to fdb94f9 Compare November 26, 2024 19:16
@AlexKlimaj
Copy link
Member

Testing this on an ARK Pi6X Flow.

It looks like with bidirectional dshot we are only getting erpm and everything else is still over serial telem?

With only serial telemetry.

TOPIC NAME                        INST #SUB RATE #Q SIZE
esc_status                           0    8  100  1  336

With bidirectional dshot at 600 and serial telemetry. BOARD_DMA_NUM_DSHOT_CHANNELS not defined

TOPIC NAME                        INST #SUB RATE #Q SIZE
esc_status                           0    8  200  1  336 

With just bidirectional Dshot. BOARD_DMA_NUM_DSHOT_CHANNELS not defined

TOPIC NAME                        INST #SUB RATE #Q SIZE
esc_status                           0    8  201  1  336 

INFO  [arch_dshot] Bidirectional DShot enabled, only one timer will be used
DEBUG 797461     Allocated DMA UP Timer Index 0
DEBUG 797504     Freed DMA UP Timer Index 0
DEBUG 797541     Allocating DMA CH for Timer Index 0 Channel 3
DEBUG 797665     Allocated DMA CH Timer Index 0 Channel 3
INFO  [arch_dshot] Limiting DMA channels to 1
DEBUG 798008     Freed DMA CH Timer Index 0 Channel 3
DEBUG 798114     DShot initialized OutputChannel 0 (bidirectional)
DEBUG 798167     DShot initialized OutputChannel 1 (bidirectional)
DEBUG 798220     DShot initialized OutputChannel 2 (bidirectional)
ekf2 [778:237]
DEBUG 914783     DShot initialized OutputChannel 3 (bidirectional)

Adding BOARD_DMA_NUM_DSHOT_CHANNELS 4

INFO  [arch_dshot] Bidirectional DShot enabled, only one timer will be used
DEBUG 797486     Allocated DMA UP Timer Index 0
DEBUG 797642     Freed DMA UP Timer Index 0
DEBUG 797686     Allocating DMA CH for Timer Index 0 Channel 3
DEBUG 797737     Allocated DMA CH Timer Index 0 Channel 3
DEBUG 797784     Allocating DMA CH for Timer Index 0 Channel 2
DEBUG 797834     Allocated DMA CH Timer Index 0 Channel 2
DEBUG 797880     Allocating DMA CH for Timer Index 0 Channel 1
DEBUG 797930     Allocated DMA CH Timer Index 0 Channel 1
ekf2 [778:237]
DEBUG 914785     Allocating DMA CH for Timer Index 0 Channel 0
DEBUG 914918     Allocated DMA CH Timer Index 0 Channel 0
INFO  [arch_dshot] Limiting DMA channels to 4
DEBUG 915072     Freed DMA CH Timer Index 0 Channel 3
DEBUG 915117     Freed DMA CH Timer Index 0 Channel 2
DEBUG 915162     Freed DMA CH Timer Index 0 Channel 1
DEBUG 915207     Freed DMA CH Timer Index 0 Channel 0
DEBUG 915254     DShot initialized OutputChannel 0 (bidirectional)
DEBUG 915306     DShot initialized OutputChannel 1 (bidirectional)
DEBUG 915358     DShot initialized OutputChannel 2 (bidirectional)
DEBUG 915410     DShot initialized OutputChannel 3 (bidirectional)

TOPIC NAME                        INST #SUB RATE #Q SIZE
esc_status                           0    8  802  1  336

nsh> dshot status
INFO  [dshot] Outputs initialized: yes
INFO  [dshot] Outputs used: 0xf
INFO  [dshot] Outputs on: yes
dshot: cycle: 100145 events, 2703812us elapsed, 27.00us avg, min 20us max 84us 3.000us rms
INFO  [mixer_module] Param prefix: PWM_MAIN
control latency: 100149 events, 19487041us elapsed, 194.58us avg, min 167us max 317us 4.345us rms
INFO  [mixer_module] Switched to rate_ctrl work queue
Channel Configuration:
Channel 0: func: 102, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 1: func: 103, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 2: func: 104, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 3: func: 101, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 4: func:   0, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 5: func:   0, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 6: func:   0, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
Channel 7: func:   0, value: 0, failsafe: 0, disarmed: 0, min: 109, max: 1999
INFO  [dshot] telemetry on: /dev/ttyS2
INFO  [dshot] Number of successful ESC frames: 49732
INFO  [dshot] Number of timeouts: 27
INFO  [dshot] Number of CRC errors: 0
INFO  [arch_dshot] dshot driver stats:
INFO  [arch_dshot] Bidirectional DShot enabled
INFO  [arch_dshot] Available DMA: 4
INFO  [arch_dshot] Timer 0, Channel 0: read 99596, failed nibble 0, failed CRC 0, invalid/zero 591
INFO  [arch_dshot] Timer 0, Channel 1: read 99595, failed nibble 0, failed CRC 0, invalid/zero 595
INFO  [arch_dshot] Timer 0, Channel 2: read 99594, failed nibble 0, failed CRC 0, invalid/zero 598
INFO  [arch_dshot] Timer 0, Channel 3: read 99593, failed nibble 0, failed CRC 0, invalid/zero 604
nsh> 

@AlexKlimaj
Copy link
Member

Do we want to have the default logging rate lower. Then add unlimited rate logging to the high rate topics.

@AlexKlimaj
Copy link
Member

AlexKlimaj commented Nov 26, 2024

Tested on the Pi6X Flow with the ARK 4IN1 ESC.

https://review.px4.io/plot_app?log=5af34fd5-77d6-4530-a3a5-e6634dd56299

image

@DronecodeBot
Copy link

This pull request has been mentioned on Discussion Forum for PX4, Pixhawk, QGroundControl, MAVSDK, MAVLink. There might be relevant details there:

https://discuss.px4.io/t/px4-sync-q-a-nov-27-2024/42604/1

@AlexKlimaj AlexKlimaj force-pushed the dev/bidirectional_dshot_single_timer branch from 6f099a8 to 03528b3 Compare November 27, 2024 18:42
@julianoes
Copy link
Contributor

Bench tested on CubeOrange and CubeOrange+, works for both on the first 4 Aux (FMU) outputs!

(I realized that in order for motor test to work, I had to choose a multirotor airframe, so SYS_AUTOSTART 4001.)

@julianoes
Copy link
Contributor

Tested Holybro 6X and 6C, works fine for both as well.

@julianoes
Copy link
Contributor

I also tested Holybro Kakute H7v1.3, and as expected it unfortunately only works for the first two outputs on the timer 0. The next two outputs are on timer 1.

@hamishwillee
Copy link
Contributor

I also tested Holybro Kakute H7v1.3, and as expected it unfortunately only works for the first two outputs on the timer 0. The next two outputs are on timer 1.

@julianoes When this goes in, worth adding to the Kakute docs?

@DronecodeBot
Copy link

This pull request has been mentioned on Discussion Forum for PX4, Pixhawk, QGroundControl, MAVSDK, MAVLink. There might be relevant details there:

https://discuss.px4.io/t/px4-sync-q-a-dec-11-2024/42790/1

@julianoes
Copy link
Contributor

@hamishwillee I'm planning to try to make this work for 4 outputs for the Kakutes.

@DronecodeBot
Copy link

This pull request has been mentioned on Discussion Forum for PX4, Pixhawk, QGroundControl, MAVSDK, MAVLink. There might be relevant details there:

https://discuss.px4.io/t/px4-sync-q-a-dec-18-2024/42886/1

@DronecodeBot
Copy link

This pull request has been mentioned on Discussion Forum for PX4, Pixhawk, QGroundControl, MAVSDK, MAVLink. There might be relevant details there:

https://discuss.px4.io/t/why-the-esc-status-frequency-is-much-lower-than-dshot-frequency-when-bidirectional-dshot-is-enabled/43024/3

@RENyunfan
Copy link

Tested Holybro 6X and 6C, works fine for both as well.

Hi julianoes, I would like to test 6d1c183 on pixhawkv6x. However, I could not even build the firewire which reports error on flash size:

s/libxx/libxx.a  NuttX/nuttx/drivers/libdrivers.a  NuttX/nuttx/libs/libc/libc.a  NuttX/nuttx/drivers/libdrivers.a  NuttX/nuttx/libs/libc/libc.a  NuttX/nuttx/sched/libsched.a  -lm  -lgcc && :
Memory region         Used Size  Region Size  %age Used
        ITCM_RAM:          0 GB        64 KB      0.00%
           FLASH:     1968072 B      1920 KB    100.10%
       DTCM1_RAM:          0 GB        64 KB      0.00%
       DTCM2_RAM:          0 GB        64 KB      0.00%
        AXI_SRAM:      100860 B       512 KB     19.24%
           SRAM1:          0 GB       128 KB      0.00%
           SRAM2:          0 GB       128 KB      0.00%
           SRAM3:          0 GB        32 KB      0.00%
           SRAM4:          2 KB        64 KB      3.12%
          BKPRAM:          0 GB         4 KB      0.00/opt/gcc-arm-none-eabi-9-2020-q2-update/bin/../lib/gcc/arm-none-eabi/9.3.1/../../../../arm-none-eabi/bin/ld: px4_fmu-v6x_default.elf section `.data' will not fit in region `FLASH'
/opt/gcc-arm-none-eabi-9-2020-q2-update/bin/../lib/gcc/arm-none-eabi/9.3.1/../../../../arm-none-eabi/bin/ld: region `FLASH' overflowed by 1992 bytes
collect2: error: ld returned 1 exit status
%
ninja: build stopped: subcommand failed.
make: *** [Makefile:227: px4_fmu-v6x_default] Error 1

Did I miss anything?

@dakejahl
Copy link
Contributor Author

dakejahl commented Jan 3, 2025

We should get this in as is and add multi-timer support once myself or Julian finds time

@RENyunfan
Copy link

We should get this in as is and add multi-timer support once myself or Julian finds time

Thank you, and thanks to Julian as well! We would greatly appreciate the opportunity to use bidirectional DShot in our research. Looking forward to it!

@Aaron1356
Copy link

Everything looks good for a Pixhawk 6C. I checked that the RPMs were in the log file as well. Just a note about the logs I have the motors spin when armed.

https://review.px4.io/plot_app?log=0df2557f-0bf0-4bb1-bc1b-a0435d0fc055

@AlexKlimaj AlexKlimaj enabled auto-merge (squash) January 9, 2025 00:19
@AlexKlimaj AlexKlimaj requested a review from dagar January 9, 2025 00:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants