Threading Building Blocks (TBB) is a C++ template library developed by Intel for parallel programming on multi-core processors. Using TBB, a computation is broken down into tasks that can run in parallel. The library manages and schedules threads to execute these tasks.




A TBB program creates, synchronizes and destroys graphs of dependent tasks according to algorithms, i.e. high-level parallel programming paradigms (also known as algorithmic skeletons). Tasks are then executed respecting graph dependencies. This approach groups TBB in a family of solutions for parallel programming that aim to decouple the programming from the particulars of the underlying machine.

TBB implements work stealing to balance a parallel workload across available processing cores in order to increase core utilization and therefore scaling. Initially, the workload is evenly divided among the available processor cores. If one core completes its work while other cores still have a significant amount of work in their queues, TBB reassigns some of the work from one of the busy cores to the idle core. This dynamic capability decouples the programmer from the machine, allowing applications written using the library to scale to the available processing cores with no changes to the source code or the executable program file. In a 2008 assessment of the work stealing implementation in TBB, researchers from Princeton University found that it was suboptimal for large numbers of processor cores, with up to 47% of computing time spent in scheduling overhead when running certain benchmarks on a 32-core system.[4]
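The general idea of work stealing can be sketched in standard C++ without TBB. The code below is a toy model, not TBB's scheduler: the names `WorkQueue` and `sum_with_work_stealing` are hypothetical, the queues are protected by plain mutexes, and all work is deliberately placed on worker 0 (rather than divided evenly) so that the other workers have to steal in order to do anything.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <deque>
#include <mutex>
#include <thread>
#include <vector>

// One queue per worker, holding pending task payloads (here: ints to add).
struct WorkQueue {
    std::deque<int> tasks;
    std::mutex mutex;

    // The owning worker takes work from the back of its own queue.
    bool pop_back(int& out) {
        std::lock_guard<std::mutex> lock(mutex);
        if (tasks.empty()) return false;
        out = tasks.back();
        tasks.pop_back();
        return true;
    }

    // An idle worker (a "thief") takes work from the front of another queue.
    bool steal_front(int& out) {
        std::lock_guard<std::mutex> lock(mutex);
        if (tasks.empty()) return false;
        out = tasks.front();
        tasks.pop_front();
        return true;
    }
};

// Sums `values` with `num_workers` threads that balance load by stealing.
long long sum_with_work_stealing(const std::vector<int>& values,
                                 std::size_t num_workers) {
    assert(num_workers > 0);
    std::vector<WorkQueue> queues(num_workers);

    // Unbalanced start: everything on worker 0 (no threads exist yet, so no lock).
    for (int v : values) queues[0].tasks.push_back(v);

    std::atomic<long long> total{0};
    auto worker = [&](std::size_t self) {
        for (;;) {
            int task = 0;
            bool found = queues[self].pop_back(task);
            // Own queue is empty: scan the other workers and try to steal.
            for (std::size_t i = 1; !found && i < num_workers; ++i) {
                found = queues[(self + i) % num_workers].steal_front(task);
            }
            // Tasks never spawn new tasks here, so empty everywhere means done.
            if (!found) return;
            total.fetch_add(task, std::memory_order_relaxed);
        }
    };

    std::vector<std::thread> threads;
    for (std::size_t w = 0; w < num_workers; ++w) threads.emplace_back(worker, w);
    for (std::thread& t : threads) t.join();
    return total.load();
}
```

The caller's code is the same whether it asks for 1 worker or 16. A production scheduler would typically avoid a lock on every queue operation, which this sketch does not attempt.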

TBB, like the STL (and the part of the C++ standard library based on it), uses templates extensively. This has the advantage of low-overhead polymorphism, since templates are a compile-time construct which modern C++ compilers can largely optimize away.
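The kind of compile-time polymorphism described here can be illustrated in plain C++. This is a sketch of the general technique, not TBB code: `for_each_index`, `Scale`, and `scaled` are hypothetical names. The loop template accepts any callable type, and because that type is fixed at compile time the call can be inlined instead of going through a virtual function.

```cpp
#include <cstddef>
#include <vector>

// A generic loop skeleton: Body may be any type callable with an index.
// The concrete Body type is known at compile time, so no virtual dispatch
// is needed and the compiler is free to inline body(i).
template <typename Body>
void for_each_index(std::size_t begin, std::size_t end, const Body& body) {
    for (std::size_t i = begin; i < end; ++i) body(i);
}

// A user-defined body object (hypothetical): multiplies one element by a factor.
struct Scale {
    std::vector<double>& data;
    double factor;
    void operator()(std::size_t i) const { data[i] *= factor; }
};

// Returns a copy of `v` with every element multiplied by `factor`.
std::vector<double> scaled(std::vector<double> v, double factor) {
    for_each_index(0, v.size(), Scale{v, factor});
    return v;
}
```

A lambda works equally well as the body, for example `for_each_index(0, n, [&](std::size_t i) { out[i] = 0.0; });`, without any change to the template.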

Intel TBB is available commercially as a binary distribution with support,[5] and as open-source software in both source and binary forms.[6]

TBB does not provide guarantees of determinism or freedom from data races.[7]

Library contents

TBB is a collection of components for parallel programming:

  • Basic algorithms: parallel_for, parallel_reduce, parallel_scan
  • Advanced algorithms: parallel_while, parallel_do, parallel_pipeline, parallel_sort
  • Containers: concurrent_queue, concurrent_priority_queue, concurrent_vector, concurrent_hash_map
  • Memory allocation: scalable_malloc, scalable_free, scalable_realloc, scalable_calloc, scalable_allocator, cache_aligned_allocator
  • Mutual exclusion: mutex, spin_mutex, queuing_mutex, spin_rw_mutex, queuing_rw_mutex, recursive_mutex
  • Atomic operations: fetch_and_add, fetch_and_increment, fetch_and_decrement, compare_and_swap, fetch_and_store
  • Timing: portable fine-grained global time stamp
  • Task scheduler: direct access to control the creation and activation of tasks

Systems supported

The TBB commercial release 3.0 supports Windows (XP or newer), OS X (version 10.5.8 or higher) and Linux, using Visual C++ (version 8.0 or higher, on Windows only), Intel C++ Compiler (version 11.1 or higher) or the GNU Compiler Collection (gcc).[8] Additionally, the TBB open source community has contributed patches for Solaris,[9] PowerPC, Xbox 360, QNX Neutrino, and FreeBSD.

See also

  • Cilk/Cilk Plus
  • Intel Parallel Studio XE
  • Intel Integrated Performance Primitives (IPP)
  • Intel Data Analytics Acceleration Library (DAAL)
  • Intel Math Kernel Library (MKL)
  • Intel Parallel Advisor
  • Intel Parallel Inspector
  • Intel VTune Amplifier
  • Intel Concurrent Collections (CnC)
  • Algorithmic skeleton
  • Parallel computing
  • List of C++ multi-threading libraries
  • List of C++ template libraries
  • Parallel Patterns Library
  • Grand Central Dispatch (GCD)


External links

  • Official website
  • tbb on GitHub
  • Official website at Intel

