comments: true
description: RT-DETR, developed by Baidu, is a real-time object detector based on Vision Transformers that delivers low-latency, high-accuracy detection using pre-trained models.
keywords: RT-DETR, Baidu, Vision Transformers, object detection, real-time performance, CUDA, TensorRT, IoU-aware query selection, Ultralytics, Python API, PaddlePaddle

Baidu's RT-DETR: A Vision Transformer-Based Real-Time Object Detector

Overview

Real-Time Detection Transformer (RT-DETR), developed by Baidu, is a state-of-the-art end-to-end object detector that delivers real-time performance while maintaining high accuracy. It builds on Vision Transformers (ViT) and processes multi-scale features efficiently by decoupling intra-scale interaction from cross-scale fusion. Because inference speed can be adjusted flexibly by using a different number of decoder layers, without retraining, RT-DETR is well suited to real-time object detection. On accelerated backends such as CUDA with TensorRT, the model outperforms many other real-time object detectors.

Figure: Overview of Baidu's RT-DETR. The RT-DETR model architecture diagram shows the last three stages of the backbone, {S3, S4, S5}, as the input to the encoder. The efficient hybrid encoder transforms the multi-scale features into a sequence of image features through attention-based intra-scale feature interaction (AIFI) and the CNN-based cross-scale feature-fusion module (CCFM). IoU-aware query selection then picks a fixed number of image features to serve as the initial object queries for the decoder. Finally, the decoder, together with auxiliary prediction heads, iteratively refines the object queries to produce boxes and confidence scores (see the source).

Key Features

  • Efficient Hybrid Encoder: Baidu's RT-DETR uses an efficient hybrid encoder that processes multi-scale features by decoupling intra-scale interaction from cross-scale fusion. This Vision Transformer-based design reduces computational cost and makes real-time object detection possible.
  • IoU-aware Query Selection: Baidu's RT-DETR improves object query initialization through IoU-aware query selection. This lets the model focus on the most relevant objects in the scene and improves detection accuracy.
  • Adaptable Inference Speed: Baidu's RT-DETR supports flexible adjustment of inference speed by using a different number of decoder layers, with no retraining required. This adaptability makes it practical to apply in a wide range of real-time object detection scenarios.
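
To make the query-selection step more concrete, here is a minimal, simplified sketch of the inference-time idea: score every encoder feature, then keep the top-K as initial object queries. It is not the Ultralytics or PaddlePaddle implementation. The function name `select_initial_queries` and the toy scores are hypothetical, and the IoU-aware part (a training objective that pushes classification scores to agree with localization quality) is not modeled here.

```python
from typing import List, Tuple


def select_initial_queries(
    class_scores: List[List[float]], num_queries: int
) -> List[Tuple[int, float]]:
    """Return (feature_index, score) for the top `num_queries` encoder features.

    class_scores[i][c] is a hypothetical score of encoder feature i for class c.
    """
    # Score each feature by its most confident class.
    best = [(i, max(scores)) for i, scores in enumerate(class_scores)]
    # Highest score first; ties are broken by the lower feature index.
    best.sort(key=lambda item: (-item[1], item[0]))
    return best[:num_queries]


# Toy example: four encoder features, two classes (made-up numbers).
scores = [
    [0.10, 0.20],
    [0.90, 0.05],
    [0.30, 0.60],
    [0.05, 0.02],
]
selected = select_initial_queries(scores, num_queries=2)
print(selected)  # [(1, 0.9), (2, 0.6)]
```

In the real model the selected features, not just their indices, are passed to the decoder, which then refines them layer by layer.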

Pre-trained Models

The Ultralytics Python API provides pre-trained PaddlePaddle RT-DETR models at different scales:

  • RT-DETR-L: 53.0% AP on COCO val2017, 114 FPS on a T4 GPU
  • RT-DETR-X: 54.8% AP on COCO val2017, 74 FPS on a T4 GPU
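
When judging whether a model fits a real-time budget, it can help to read the throughput figures as per-frame latency. The short sketch below only does that arithmetic on the FPS numbers quoted above. The latencies are derived averages, not separate measurements, and the helper name `latency_ms` is illustrative.

```python
# Throughput figures quoted above (T4 GPU); latency is derived, not measured.
REPORTED_FPS = {"RT-DETR-L": 114, "RT-DETR-X": 74}


def latency_ms(fps: float) -> float:
    """Convert frames per second to average milliseconds per frame."""
    return 1000.0 / fps


latencies = {name: round(latency_ms(fps), 1) for name, fps in REPORTED_FPS.items()}
print(latencies)  # {'RT-DETR-L': 8.8, 'RT-DETR-X': 13.5}
```

Both values sit well under the roughly 33 ms per frame that a 30 FPS video stream allows, with RT-DETR-X trading about 4.7 ms per frame for 1.8 points of AP.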

Usage Examples

The examples below show basic RT-DETR training and inference. For full documentation on these and other modes, see the Predict, Train, Val, and Export docs pages.

!!! Example

=== "Python"

    ```python
    from ultralytics import RTDETR

    # Load a COCO-pretrained RT-DETR-l model
    model = RTDETR('rtdetr-l.pt')

    # Display model information (optional)
    model.info()

    # Train the model on the COCO8 example dataset for 100 epochs
    results = model.train(data='coco8.yaml', epochs=100, imgsz=640)

    # Run inference with the RT-DETR-l model on the 'bus.jpg' image
    results = model('path/to/bus.jpg')
    ```

=== "CLI"

    ```bash
    # Load a COCO-pretrained RT-DETR-l model and train it on the COCO8 example dataset for 100 epochs
    yolo train model=rtdetr-l.pt data=coco8.yaml epochs=100 imgsz=640

    # Load a COCO-pretrained RT-DETR-l model and run inference on the 'bus.jpg' image
    yolo predict model=rtdetr-l.pt source=path/to/bus.jpg
    ```

Supported Tasks and Modes

This table lists each model type, its pre-trained weights, the task it supports, and the modes it can be used in (Inference, Validation, Training, and Export), with supported modes marked by a ✅ emoji.

| Model Type | Pre-trained Weights | Tasks Supported | Inference | Validation | Training | Export |
|---|---|---|---|---|---|---|
| RT-DETR Large | rtdetr-l.pt | Object Detection | ✅ | ✅ | ✅ | ✅ |
| RT-DETR Extra-Large | rtdetr-x.pt | Object Detection | ✅ | ✅ | ✅ | ✅ |
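
The table lists Validation and Export alongside training and inference, which the usage examples above do not cover. The following is a minimal sketch of those two modes, assuming the `ultralytics` package is installed and that `rtdetr-l.pt` and `coco8.yaml` can be fetched on first use. The helper name `validate_and_export` and the choice of ONNX as the export format are illustrative assumptions, not part of the original documentation.

```python
def validate_and_export(weights="rtdetr-l.pt", data="coco8.yaml", export_format="onnx"):
    """Validate an RT-DETR checkpoint, then export it (hypothetical helper)."""
    # Imported lazily so that defining this helper does not require ultralytics.
    from ultralytics import RTDETR

    model = RTDETR(weights)

    # Validation mode: evaluate the model on the dataset's validation split.
    metrics = model.val(data=data)

    # Export mode: convert the model to the requested deployment format.
    exported = model.export(format=export_format)
    return metrics, exported
```

Calling `validate_and_export()` with its defaults would validate RT-DETR-L on COCO8 and then attempt an ONNX export. See the Val and Export docs pages for the arguments each mode accepts.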

Citations and Acknowledgements

If you use Baidu's RT-DETR in your research or development work, please cite the original paper:

!!! Quote ""

=== "BibTeX"

    ```bibtex
    @misc{lv2023detrs,
          title={DETRs Beat YOLOs on Real-time Object Detection},
          author={Wenyu Lv and Shangliang Xu and Yian Zhao and Guanzhong Wang and Jinman Wei and Cheng Cui and Yuning Du and Qingqing Dang and Yi Liu},
          year={2023},
          eprint={2304.08069},
          archivePrefix={arXiv},
          primaryClass={cs.CV}
    }
    ```

We would like to thank Baidu and the PaddlePaddle team for creating and maintaining Baidu's RT-DETR, a Vision Transformer-based real-time object detector and a valuable resource for the computer vision community.

Keywords: RT-DETR, Transformer, ViT, Vision Transformers, Baidu RT-DETR, PaddlePaddle, Paddle Paddle RT-DETR, real-time object detection, Vision Transformers-based object detection, pre-trained PaddlePaddle RT-DETR models, Baidu's RT-DETR usage, Ultralytics Python API
