What Is Video Transcoding? How It Works for IPTV and OTT

By Alex Topilski, Founder · June 1, 2026

Over 80% of global internet traffic is video. Behind every stream - a live football match, a 4K movie, a VOD catalog - a transcoding pipeline is running invisibly, converting raw video into the exact format each subscriber's device can play. For IPTV and OTT operators, transcoding is not a background detail: it determines stream quality, CDN costs, device compatibility, and how many concurrent viewers your infrastructure can handle at $0.20 per subscriber.

What Is Video Transcoding?

Transcoding is the process of converting a video stream or file from one codec, resolution, or bitrate to another. A broadcast-quality HEVC source arriving at 50 Mbps is decoded and re-encoded into multiple output renditions - 1080p H.264 at 4 Mbps, 720p at 2 Mbps, 480p at 800 Kbps - so that each subscriber device receives the highest quality its connection can sustain. The three core operations are:

  • Decoding - the input stream is decoded from its original format (MPEG-2, H.264, HEVC, or RTMP).
  • Encoding - the decoded frames are re-encoded into one or more output codecs and bitrates.
  • Packaging - encoded streams are wrapped into delivery containers (HLS segments, MPEG-DASH fragments) for CDN distribution.

Transcoding differs from transmuxing, which only rewraps a stream into a different container without re-encoding. Transmuxing is faster but cannot change codec, resolution, or bitrate - and therefore cannot create the adaptive renditions that make quality streaming possible.

Why Transcoding Is Critical for IPTV and OTT

Most content sources do not arrive in a format suitable for direct delivery. Satellite broadcasts use MPEG-2 at 15-50 Mbps. RTSP camera feeds produce H.264 at variable bitrates. 4K HDR masters are HEVC files at 20-80 Mbps. Without transcoding, a subscriber on a 4G connection receives a 50 Mbps stream that buffers constantly, iOS and Android devices may not support the source codec at all, and your CDN bill is calculated on the raw bitrate rather than an optimized delivery bitrate. At scale - 500 subscribers each pulling 50 Mbps - that is 25 Gbps of CDN egress per second, orders of magnitude more expensive than a properly transcoded ABR stream.

With transcoding, a single 50 Mbps source becomes an adaptive bitrate (ABR) ladder. Each viewer's player fetches the rendition their current connection supports and switches renditions automatically as bandwidth changes. The viewer never sees a spinner; the operator's CDN costs drop by 60-80% compared to delivering the raw source bitrate.

Adaptive Bitrate Streaming: HLS and DASH

ABR is the delivery model every modern streaming service uses. The transcoder produces multiple renditions of the same content at different resolutions and bitrates, packaged into an HLS (HTTP Live Streaming) or MPEG-DASH manifest. The manifest lists every rendition and the URL for each segment. The subscriber's player reads the manifest, selects the appropriate rendition, and switches automatically as network conditions change.

Rendition Resolution Typical Bitrate
4K UHD 3840×2160 15-25 Mbps
Full HD 1920×1080 3-6 Mbps
HD 1280×720 1.5-3 Mbps
SD 854×480 600 Kbps-1.5 Mbps
Mobile 640×360 300-600 Kbps

HLS is required for Apple devices (iOS, Safari, tvOS) and is also the safer choice for maximum compatibility. MPEG-DASH provides more flexibility for DRM integration on Android, Smart TVs, and browsers. Most production IPTV/OTT platforms deliver both: HLS for Apple, DASH for everything else.

Codecs: H.264, H.265, AV1, and VP9 Compared

The codec determines compression efficiency, device compatibility, and how much compute power transcoding requires. Choosing the wrong codec for your subscriber base either inflates CDN costs (too little compression) or breaks playback on key devices (too little compatibility).

Codec Also Known As Device Support Compression vs H.264 Encode Cost
H.264 AVC Universal Baseline Low
H.265 HEVC Apple, modern Smart TVs ~40% smaller Medium
VP9 VP9 Chrome, Android, most Smart TVs ~35% smaller Medium
AV1 AV1 Chrome, Firefox, Android (2023+) ~50% smaller Very high

H.264 remains the universal baseline: it plays natively on every smartphone, Smart TV, set-top box, and browser made in the last decade. For an operator launching their first IPTV service, H.264 is the safest output codec - maximum reach with the lowest transcoding compute cost.

H.265 delivers significant bandwidth savings for operators delivering 4K content to Apple devices and modern Smart TVs, where hardware HEVC decoding is standard. AV1 is the next generation - dramatically better compression - but its encoding is 5-10× more CPU-intensive than H.265, making GPU hardware acceleration mandatory for any live transcoding use case.

Hardware vs Software Transcoding

Software-only transcoding runs on general-purpose server CPUs. A modern 32-core CPU handles roughly 20-40 simultaneous 1080p H.264 encode jobs in real time. Add H.265 output and that number drops to 8-15 jobs. For an IPTV operator running 50 live channels, each producing 4 ABR renditions, that means 200 simultaneous encode jobs - requiring 5-10 high-end CPU servers running at 100% utilization, 24/7.

Hardware-accelerated transcoding offloads encode and decode work to dedicated silicon: NVIDIA NVENC (supports H.264, H.265, AV1), Intel Quick Sync (H.264, H.265), and AMD VCE. A single NVIDIA RTX 4080 handles 80-120 simultaneous 1080p H.264 encode jobs at a fraction of the electricity cost of CPU-only. For operators running 50 or more live channels simultaneously, hardware transcoding is not an optimization - it is a requirement. CPU-only pipelines cannot sustain the load at an acceptable cost per channel.

Transcoding in a Live IPTV/OTT Workflow

A production IPTV/OTT transcoding pipeline runs in sequence from ingest to playback. Understanding each stage helps operators diagnose failures and plan capacity correctly:

  1. Ingest - the source stream (RTSP, RTMP, HLS pull, satellite SDI) arrives at the media server input.
  2. Decode - the media server decodes the input codec in hardware or software.
  3. Filter - optional processing: watermark overlay, logo insertion, ad splice points.
  4. Encode - the media server re-encodes frames into ABR renditions at configured bitrates and resolutions.
  5. Package - outputs are segmented into HLS or DASH fragments and a manifest is generated.
  6. Distribute - segments are pushed to a CDN or origin server for delivery to subscriber devices.
  7. Playback - subscriber devices fetch segments via the manifest URL, switching renditions automatically based on bandwidth.

Each live channel requires this pipeline running continuously 24/7. A service with 100 live channels producing 4 ABR renditions each is running 400 simultaneous encode jobs at any moment. Catchup/DVR recording adds a second write path: the same transcoded segments are simultaneously stored for time-shifted playback within a configurable window - typically 7 days.

How FastoCloud Handles Transcoding for CrocOTT Operators

FastoCloud is the GStreamer-based media server built to integrate with CrocOTT middleware. It handles transcoding, restreaming, catchup/DVR recording, and CDN origin from a single Linux deployment. The Community edition starts at $25/month and covers a solid channel lineup on a standard Linux server with NVIDIA GPU acceleration enabled. The PRO ML edition adds GPU-accelerated features including automated thumbnail generation and content analysis for larger deployments.

FastoCloud integrates natively with CrocOTT's admin panel. Operators configure transcoding pipelines, monitor per-stream health, and manage catchup windows directly from their middleware dashboard - no separate infrastructure console. The complete stack runs on a single Linux server (Ubuntu 20.04+). Total launch cost is the one-time $300 CrocOTT setup fee plus the FastoCloud subscription, with no per-channel encoding fees on top. Compare this to cloud transcoding vendors charging $0.50-$3.00 per hour per channel: a 50-channel lineup at $1.00/hour costs $1,200 per day in encoding alone. Details on the full setup are on the how it works page.

CrocOTT middleware handles subscriber management, EPG, billing, and content delivery at $0.20 per active subscriber per month. The transcoding layer (FastoCloud) is a separate infrastructure contract, giving operators the flexibility to scale each component independently. If you want to understand every capability in the stack before committing, the feature list covers each module in detail. To see how CrocOTT compares against Setplex, MwareTV, Flussonic, and other middleware platforms on transcoding and media server integration, the comparison page has the full breakdown.

What to Look for When Choosing a Transcoding Solution

Not all media servers handle IPTV and OTT workloads equally well. These are the questions that matter most when evaluating a transcoding platform:

  • Hardware acceleration support - does the server support NVENC, Quick Sync, and AMD VCE, or CPU-only? CPU-only is a deal-breaker for anything above 20 live channels.
  • Codec output flexibility - can it produce H.264, H.265, and AV1 outputs simultaneously from a single input stream?
  • ABR ladder configuration - can you define custom bitrate ladders per channel or content type, or is the ladder fixed?
  • DRM integration - does transcoding integrate with your content packaging and DRM step, or is that a separate vendor contract?
  • Catchup and DVR - can the server simultaneously transcode live and record for time-shifted playback within the same pipeline?
  • Per-stream monitoring - does the platform provide real-time health metrics, alert on stream drops, and surface per-channel error rates from a central dashboard?

The Bottom Line

Transcoding is the engine room of every IPTV and OTT service. Without it, you cannot deliver adaptive quality across the range of devices and connections your subscribers use, control CDN costs, or accept the variety of source formats that real-world content arrives in. The transcoding architecture you choose also determines per-channel infrastructure cost - and that cost compounds fast across a 50- or 100-channel lineup operating around the clock.

The right stack combines a hardware-accelerated media server (FastoCloud, built on GStreamer, starts at $25/month), an ABR packaging pipeline producing both HLS and DASH, and middleware (CrocOTT) that connects subscriber management, content delivery, DRM, and billing into a single admin panel. If you want to see what this looks like in practice, start a free CrocOTT trial - no credit card required.