gotgcall

package module

v0.8.0 Latest Latest Go to latest Published: Jun 9, 2026 License: GPL-3.0 Imports: 13 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/annihilatorrrr/gotgcall

Links

Open Source Insights

README ¶

gotgcall — Pure-Go Telegram Group Call & Voice Chat Streaming Library

gotgcall is a pure-Go library for streaming audio and video into Telegram group calls (voice chats and video chats). It is a drop-in alternative to ntgcalls / pytgcalls built for Go music bots, livestream bots, and broadcast tooling.

Use it to:

Build a Telegram music bot in Go that joins a voice chat and plays MP3/FLAC/M4A/Opus/HLS or any audio.
Stream a live video broadcast (mp4/mkv/webm/RTMP/RTSP) into a Telegram group call.
Push a "go live" RTMP broadcast to a channel via phone.GetGroupCallStreamRtmpUrl.
Wrap any ffmpeg pipeline as a streaming source — atempo, scaling, hardware encoders, atomic source switches.

No libwebrtc, no cgo, no native build chain. CGO_ENABLED=0 go build produces a single static binary on every supported platform. WebRTC runs on pion v4; ffmpeg is invoked as a runtime binary for transcoding only — nothing is linked in.

Status

Work in progress. Built for my own bots; the API is intentionally close to ntgcalls so existing code translates with minimal change.

Install
Architecture at a glance
Quick start
Sources
- FromFile / FromURL
- FromShell — single custom ffmpeg leg
- FromShells — dual ffmpeg legs
- EncodeOptions
Client options
- Enabling debug logs
- UDP mux & scaling
Lifecycle
- WebRTC mode
- RTMP mode
Pause / Resume / Mute
Callbacks
Server-side media-state changes
Errors
Concurrency model
Goroutine budget
Networking
Performance tuning
A/V sync
Pitfalls
Performance vs ntgcalls
Why pure Go
FAQ
See also
License

Install

go get github.com/annihilatorrrr/gotgcall

ffmpeg must be on PATH at runtime (or set gotgcall.WithFFmpegPath("/path/to/ffmpeg")). New() fails fast if the binary isn't found, so the error surfaces at startup rather than on the first stream.

Requires Go 1.26+ (uses errors.AsType[T] and a few stdlib features added in 1.26).

Architecture at a glance

                       ┌──────────────────────────────┐
                       │           Client             │  one process-wide handle
                       │  (gotgcall.go)               │  multiplexes any number of calls
                       └──────────────────────────────┘
                            │           │
                            ▼           ▼
                   ┌──────────────┐  ┌──────────────┐
                   │  GroupCall   │  │   RTMPCall   │  per-chat call instance
                   │  (WebRTC)    │  │  (FFmpeg→RTMP│
                   └──────────────┘  └──────────────┘
                            │              │
                            │              └── single ffmpeg push to Telegram's RTMP URL
                            ▼
                   ┌──────────────┐   ┌──────────────┐
                   │   Streamer   │──▶│ pion Track   │──▶ Telegram SFU
                   │ (paces opus/ │   │ Local Static │
                   │  ivf frames) │   │   Sample     │
                   └──────────────┘   └──────────────┘
                            ▲
                            │ media.Sample (Opus / VP8)
                            │
                   ┌──────────────┐   ┌──────────────┐
                   │ FrameReader  │◀──│ ShellReader  │◀── ffmpeg subprocess
                   │ (OGG / IVF)  │   │ (stdout pipe)│
                   └──────────────┘   └──────────────┘

Blob-only signaling. The library never imports gogram or any MTProto layer. CreateCall(chatID) returns a JSON string; the caller passes it to phone.JoinGroupCall via their own MTProto stack, then hands the response back via Connect(chatID, respJSON). This keeps the library MTProto-version-independent.

One WebRTC stack per call. Send-only audio (Opus PT=111) and video (VP8 PT=100). All calls share one wrtc.Factory (and optionally one UDP socket; see WithSharedUDPMux).

Native WebRTC stack — no pion/webrtc.PeerConnection. Connections are built directly on pion/ice, pion/dtls, pion/srtp, and pion/rtp. The high-level pion/webrtc PeerConnection went through SDP offer/answer with the offerer hardcoded as ICE-CONTROLLING, which conflicted with Telegram's own CONTROLLING role and produced a 487 STUN-error storm pion couldn't recover from. Composing the lower-level packages directly lets us run as ICE-CONTROLLED via ice.Agent.Accept from the first packet, sidestepping the conflict entirely.

ffmpeg outputs ENCODED Opus (OGG) and VP8 (IVF), not raw PCM/YUV. The Track sink expects already-encoded frames, so ffmpeg does the encoding and we skip a Go-side Opus encoder (which would force cgo). This also saves ~48× pipe bandwidth versus PCM.

Factory-side monitor (one goroutine, all calls). A single per-Factory ticker (wrtc/keepalive.go) does two jobs: it generates VP8 padding every ~2 s on every active video track so Telegram's SFU doesn't garbage-collect the SSRC binding during long quiet stretches, and it force-closes any PC stuck out of Connected past 15 s so leaked ICE agents can't outlive the SetSource gate. See Goroutine budget for the full picture.

Quick start

client, err := gotgcall.New()
if err != nil { log.Fatal(err) }
defer client.Close()

client.OnStreamEnd(func(chat int64, t gotgcall.StreamType, d gotgcall.Device, err error) {
    log.Printf("stream end: %v", err)
})
client.OnConnectionChange(func(chat int64, info gotgcall.NetworkInfo) {
    log.Printf("conn state: %s", info.State)
})
client.OnUpgrade(func(chat int64, state gotgcall.MediaState) {
    // Spontaneous transitions only — video leg died mid-stream or ICE
    // failed while video was active. User-initiated SetSource/Pause/
    // Mute/Stop are silent (your code already knows it triggered them).
})

// 1. Local-side JSON.
localParams, _ := client.CreateCall(chatID)

// 2. Drive Telegram via your MTProto layer (gogram, etc.).
//    Pass localParams to phone.JoinGroupCall; read the response.
remoteParams := joinViaYourMTProto(localParams)

// 3. Finish the WebRTC handshake.
client.Connect(chatID, remoteParams)

// 4. Stream.
client.SetStreamSources(chatID, gotgcall.FromFile("song.mp3", gotgcall.EncodeOptions{}))

// 5. Pause / resume / mute / change source any time.
client.Pause(chatID)
client.Resume(chatID)
client.SetStreamSources(chatID, gotgcall.FromURL("https://stream.example.com/radio.m3u8", gotgcall.EncodeOptions{}))

// 6. Stop tears down the call.
client.Stop(chatID)

See examples/bot/ for a runnable skeleton against gogram (own go.mod so the example doesn't taint the library's dependency tree).

Sources

All sources target Opus-in-OGG (audio) and/or VP8-in-IVF (video) on ffmpeg's stdout. The library will not accept raw PCM/YUV — the frame readers can't parse them.

`FromFile` / `FromURL`

gotgcall.FromFile("song.mp3", gotgcall.EncodeOptions{})
gotgcall.FromURL("https://stream.example.com/...", gotgcall.EncodeOptions{})

Anything ffmpeg can decode is fair game — mp3, m4a, flac, ogg, opus, wav, webm, mp4, mkv, mov, m3u8 (HLS), live RTMP/RTSP, etc.

Defaults to audio only, regardless of what the container holds. Opt in to video extraction:

client.SetStreamSources(chatID, gotgcall.FromFile("movie.mp4", gotgcall.EncodeOptions{
    Tracks: gotgcall.TrackAudio | gotgcall.TrackVideo,
    // Or just TrackVideo — TrackVideo implies TrackAudio (a video file is a
    // video file with audio).
}))

Fast-start probing (-analyzeduration 0 -probesize 64k) is on by default for every source — cuts ~1-2 s off ffmpeg's startup latency vs the stock defaults (5 s + 5 MB). HLS sources additionally get -user_agent, -protocol_whitelist file,http,https,tcp,tls, -rw_timeout 10s, -http_persistent 1; HTTP/HTTPS sources get -reconnect 1 -reconnect_at_eof 1 -reconnect_streamed 1 -reconnect_delay_max 5 -timeout 10s so transient network blips don't kill the stream.

Both FromFile and FromURL return seekable sources. Pause records the elapsed offset and Resume re-spawns ffmpeg with -ss <offset> injected before the input.

`FromShell` — single custom ffmpeg leg

gotgcall.FromShell("ffmpeg -i thing.mp3", gotgcall.TrackAudio)

FromShell parses the cmdline as a shell-like argv (handles double-quoted args, plus \" and \\ escape sequences for filenames containing literal " or \ — e.g. a Telegram audio titled (From "Foo") that would otherwise slice the path mid-string when the embedded quote toggled the quote state) and spawns it directly via exec, NOT via /bin/sh. Shell metacharacters in filenames can't inject commands; use %q for filenames!

Missing essentials are filled in automatically:

Input-side (always on): fast-start probing + -err_detect ignore_err before -i.
Output-side (audio): -c:a libopus, -application audio, -frame_duration 20, -page_duration 20000, -mapping_family 0, -ar 48000, -ac 2, -f ogg, pipe:1.
Output-side (video): -c:v libvpx, -deadline realtime, -f ivf, pipe:1.

So the minimum command works:

gotgcall.FromShell(`ffmpeg -i "song.mp3"`, gotgcall.TrackAudio)

…and is equivalent to the fully-spelled-out form:

gotgcall.FromShell(`ffmpeg -analyzeduration 0 -probesize 64k -err_detect ignore_err `+
    `-i "song.mp3" -vn -c:a libopus -b:a 64k -application audio `+
    `-frame_duration 20 -page_duration 20000 -mapping_family 0 `+
    `-ar 48000 -ac 2 -f ogg pipe:1`, gotgcall.TrackAudio)

Video-only example:

gotgcall.FromShell(`ffmpeg -i "movie.mp4" -an -c:v libvpx -deadline realtime `+
    `-b:v 800k -vf scale=1280:720 -r 30 -f ivf pipe:1`, gotgcall.TrackVideo)

A single FromShell call produces a single output (audio OR video). Raw PCM/YUV output codecs (-c:a pcm_*, -f rawvideo, etc.) are rejected up front with a useful error.

`FromShells` — dual ffmpeg legs

For ntgcalls-style "microphone + camera" patterns where you want full control over both legs:

gotgcall.FromShells(
    `ffmpeg -i "x.mp4"`,                                            // audio leg
    `ffmpeg -i "x.mp4" -vf scale=1280:720 -b:v 1500k`,              // video leg
)

Each cmd goes through the same auto-flag injection as FromShell. Either string may be empty to skip that track.

For the convenience path use FromFile/FromURL with Tracks: TrackVideo and let the library construct both ffmpeg commands for you.

`EncodeOptions`

type EncodeOptions struct {
    VideoBitrateKbps int   // default 800
    VideoWidth       int   // default 1280
    VideoHeight      int   // default 720
    VideoFPS         int   // default 30
    AudioBitrateKbps int   // default 128 (music-grade; bump to 192+ for transparent quality, Telegram fmtp accepts up to 510)
    AudioChannels    int   // default 2
    Tracks           Track // default TrackAudio; TrackVideo implies +TrackAudio
}

Set on the constructor (FromFile/FromURL); rides with the Source. FromShell / FromShells ignore EncodeOptions because you control ffmpeg directly.

Client options

gotgcall.New(
    gotgcall.WithFFmpegPath("/opt/ffmpeg/bin/ffmpeg"),  // override binary lookup
    gotgcall.WithLogger(slog.Default()),                // structured logger
    gotgcall.WithDebugLogs(),                           // shortcut: text handler @ Debug level to stderr
    gotgcall.WithFFmpegStderrLog(),                     // tee ffmpeg stderr → debug log
    gotgcall.WithSharedUDPMux(),                        // one UDP socket for all calls
    gotgcall.WithDTLSCertPool(16),                      // pre-generate N DTLS certs
    gotgcall.WithDispatchBuffer(512),                   // event-dispatcher queue size
    gotgcall.WithICEServers([]gotgcall.ICEServer{       // optional TURN (no STUN needed by default)
        {URLs: []string{"turn:turn.example.com:3478"},
         Username: "u", Credential: "p"},
    }),
    gotgcall.WithNetworkTypes(                          // enable IPv6/TCP for restrictive nets
        gotgcall.NetworkTypeUDP4,
        gotgcall.NetworkTypeUDP6,
        gotgcall.NetworkTypeTCP4,
    ),
    gotgcall.WithICETimeouts(60*time.Second, 120*time.Second, 5*time.Second),
)

Option	Default	Notes
`WithFFmpegPath`	`"ffmpeg"`	`New()` fails fast with `exec.LookPath` if the binary is missing.
`WithLogger`	discard	Receives everything: gotgcall events, ffmpeg stderr/exit, dispatcher, and pion's internal ICE/DTLS/SCTP logs (each tagged with `pion=<scope>`).
`WithDebugLogs`	off	Convenience shortcut that installs a `slog.NewTextHandler(os.Stderr, &slog.HandlerOptions{Level: slog.LevelDebug})`. Use when reporting bugs.
`WithFFmpegStderrLog`	off	Tees ffmpeg stderr line-by-line into the logger at Debug level. Without this, stderr is only surfaced in the final error message (last 512 bytes) when ffmpeg crashes — useless for "stream runs but I hear nothing" symptoms.
`WithSharedUDPMux`	off	Opens one `udp4:0` socket and routes ICE for every call through it. See UDP mux scaling below.
`WithDTLSCertPool`	8	Background goroutine keeps N pre-generated ECDSA-P256 certs ready so `CreateCall` doesn't stall on keygen during bursts. 0 = disabled.
`WithDispatchBuffer`	256	Size of the single callback-dispatcher channel. Larger absorbs bursts of state changes before the consumer drains.
`WithICEServers`	(none)	gotgcall ships no default STUN — pion gathers host candidates only and Telegram's SFU peer-reflexively learns our post-NAT source (matches ntgcalls). Set this when the network blocks UDP host-to-host and you need TURN, or when you want srflx for diagnostic reasons.
`WithNetworkTypes`	UDP4+UDP6	Override the ICE candidate network-type whitelist. Add TCP for restrictive environments where UDP is blocked.
`WithICETimeouts`	60 s / 120 s / 2 s	`(disconnect, failed, keepalive)`. Pass `0` for any value to keep the default; shorten for ultra-responsive UIs, lengthen for unstable networks.
`WithConnectTimeout`	10 s	How long `SetSource` / `Resume` wait for ICE+DTLS to settle before returning `ErrNotConnected`. Matches ntgcalls' own internal timeout.
`WithICEPreConnectDelay`	250 ms	Short pause inside `Connect` so Telegram's SFU has registered our credentials before pion's first STUN binding goes out. Negative value disables.
`WithVerboseConnectionLogs`	off	One-flag bundle: Debug-level slog + per-candidate logs + pion trace. Use when reporting a stuck-in-Connecting bug.

Enabling debug logs

For maximum verbosity when reporting a bug:

client, err := gotgcall.New(
    gotgcall.WithVerboseConnectionLogs(), // ICE + DTLS + per-candidate trace
    gotgcall.WithFFmpegStderrLog(),       // ffmpeg stderr line-by-line
)

Pion's internal lines come in tagged with pion=<scope> (e.g. pion=ice, pion=dtls); filter by attribute if it's too much:

slog.SetDefault(slog.New(slog.NewTextHandler(os.Stderr, &slog.HandlerOptions{
    Level: slog.LevelDebug,
    ReplaceAttr: func(_ []string, a slog.Attr) slog.Attr {
        if a.Key == "pion" && a.Value.String() == "dtls" {
            return slog.Attr{} // drop dtls flight chatter
        }
        return a
    },
})))

UDP mux & scaling

The README said "use WithSharedUDPMux at 100+ calls". That was a conservative guess — the real picture:

Default (one socket per call):

1 UDP socket = 1 file descriptor + 1 ephemeral port per call.
Linux defaults: ulimit -n 1024 (raise to 65535), ephemeral port range 32768–60999 (~28000 usable).
Practical ceiling without any tuning: ~900 calls (bounded by FDs, leaving room for other FDs).
After ulimit -n 65535 and net.ipv4.ip_local_port_range="1024 65000": tens of thousands of calls on a beefy server.
Benefit: kernel-level UDP receive-queue per call, parallelism scales with CPU cores naturally.

WithSharedUDPMux (one socket total):

1 UDP socket, 1 FD, 1 port for the entire process — FD/port limits stop mattering.
All traffic funnels through one socket → kernel UDP buffer might become contended at extreme rates.
Per-socket UDP throughput on modern Linux: easily 1–10 Gbps. At ~50 kbps per voice call, that's 20 000–200 000 concurrent voice calls through one socket before throughput becomes the bottleneck.
Best for huge call counts where FD/port pressure is the limiting factor, or where firewall rules need to pin a single port.

Rule of thumb:

< 1000 calls: per-call sockets is fine, simpler, and gives you natural per-call kernel-queue isolation.
1000–10000 calls: either works; WithSharedUDPMux simplifies sysctl tuning.
10000+ calls: WithSharedUDPMux is the easier path; tune the kernel UDP receive buffer (net.core.rmem_max, net.core.rmem_default).

Note: client.Stop(chatID) closes only that call's WebRTC stack (and the per-call socket if not using the shared mux). The shared mux survives every Stop and is only closed when you call client.Close() on the parent client. So you can spin calls up and down freely without leaking or thrashing the shared socket.

Lifecycle

WebRTC mode

The default. Use for normal group voice/video.

localParams, err := client.CreateCall(chatID)
// → send localParams to phone.JoinGroupCall; read remoteParams from response.
err = client.Connect(chatID, remoteParams)
err = client.SetStreamSources(chatID, gotgcall.FromFile("song.mp3", gotgcall.EncodeOptions{}))
// …
err = client.Stop(chatID)

CreateCall returns ErrConnectionExists only if a live call for that chat exists. If the previous call already reached Failed or Closed (e.g. ICE failed permanently behind symmetric NAT), it is reaped automatically and CreateCall returns a fresh handle — no explicit Stop(chatID) required between attempts. Per-chat creation mutex serialises concurrent calls so it never allocates twice.
Connect is idempotent only in the sense that re-calling it re-SetRemoteDescription's; if you call Connect before CreateCall you get ErrConnectionNotFound.
Stop removes the call from the internal map and clears the per-chat mutex; after Stop you can re-use the same chatID cleanly.
client.AudioSSRC(chatID) returns the audio SSRC for phone.LeaveGroupCall's Source field. RTMP calls return ErrWrongMode.

RTMP mode

For "go live" / host-style broadcasts. Obtain the URL via phone.GetGroupCallStreamRtmpUrl:

err := client.StartRTMP(chatID, rtmpURL)
err  = client.SetStreamSources(chatID, gotgcall.FromFile("movie.mp4", gotgcall.EncodeOptions{}))
// Pause/Resume/Stop work identically. Mute/Unmute are best-effort (RTMP push has
// no per-track control); the lib tracks state but doesn't drop frames.

StartRTMP is serialised with CreateCall via the same per-chat mutex. RTMP transcodes to H.264+AAC and pushes FLV.

Pause in RTMP mode is kill-and-restart-with--ss (Telegram's RTMP ingest times out silent streams, so SIGSTOP can't be used). WebRTC mode uses a channel-based gate that keeps ffmpeg alive — the OS pipe absorbs ~1s of frames during pause.

Pause / Resume / Mute

ok, err := client.Pause(chatID)   // false if already paused
ok, err  = client.Resume(chatID)
ok, err  = client.Mute(chatID)    // mute audio track; video keeps going
ok, err  = client.Unmute(chatID)

WebRTC mode:

Pause gate-blocks the streamer's pull loop. ffmpeg keeps running; its stdout pipe buffers the next ~1s of frames. Resume wakes the loop and the pacing baseline jumps forward over the paused window so we don't burst the buffered frames on resume.
Mute is a flag on the streamer — samples are read at the natural cadence but WriteSample is skipped.

RTMP mode:

Pause records elapsed_ms, kills ffmpeg, frees the connection. Resume spawns a fresh ffmpeg with -ss <elapsed>.

SetStreamSources can be called any time. While paused the new source is recorded but not started; Resume starts it at offset 0 (a new source resets the resume offset — it's a different track).

Callbacks

client.OnStreamEnd(func(chat int64, t StreamType, d Device, err error) {
    // Fires on natural EOF (err == nil) or ffmpeg crash (err != nil).
    // Manual Stop / SetSource don't fire — the caller already knows.
    // For video+audio sources fires twice: first Video, then Audio.
})

client.OnConnectionChange(func(chat int64, info NetworkInfo) {
    // info.State: Connecting | Connected | Disconnected | Failed | Closed | Timeout
})

client.OnUpgrade(func(chat int64, state MediaState) {
    // Mirror of ntgcalls' onUpgrade(MediaState). Fires ONLY on
    // spontaneous transitions: a video leg ending mid-stream (EOF /
    // ffmpeg crash) or the WebRTC PC reaching Failed/Closed while video
    // was active. User-initiated transitions (SetStreamSources, Stop,
    // Pause, Resume, Mute, Unmute) are silent — flip your MTProto
    // participant flags directly in those command handlers, not here.
})

All callbacks fire on a single dispatcher goroutine, so you can safely re-enter the API from inside (e.g. call client.Stop(chat) from inside OnStreamEnd). If your callback panics it is recovered and logged; the dispatcher keeps running.

If the dispatch queue fills up (slow consumer), the dispatcher drops the oldest queued event and logs a warning. Tune with WithDispatchBuffer.

Server-side media-state changes (admin mute, video off)

The library is blob-only and never sees MTProto updates. When Telegram tells you the bot was admin-muted (via your UpdateGroupCallParticipants handler), react directly:

tg.AddRawHandler(&telegram.UpdateGroupCallParticipants{}, func(u telegram.Update, _ *telegram.Client) error {
    upd := u.(*telegram.UpdateGroupCallParticipants)
    for _, p := range upd.Participants {
        // compare p.Peer to your own user id, then:
        if p.Muted {
            client.Pause(chatID)
        } else if p.CanSelfUnmute {
            client.Resume(chatID)
        }
    }
    return nil
})

The OnUpgrade(MediaState) callback fires only for outgoing state changes the library initiates (Mute / Pause / video stream end). Server-side mute / video-stop from Telegram is delivered only via your MTProto UpdateGroupCallParticipants handler — gotgcall stays out of MTProto by design.

Errors

All errors are sentinels — branch with errors.Is:

Error	Returned when
`ErrConnectionExists`	`CreateCall`/`StartRTMP` for a chatID that already has a live call (Failed/Closed calls are auto-reaped, so retries on a dead chat just work).
`ErrConnectionNotFound`	Any method called with an unknown chatID, or after `Stop`.
`ErrConnectionTimeout`	Declared for future use (currently surfaced via `OnConnectionChange(Failed)` after pion's 120 s ICE-failed timeout).
`ErrConnectionFailed`	Same — declared for branching; current ICE-failed manifests as `OnConnectionChange(Failed)`.
`ErrInvalidParams`	Malformed remote JSON in `Connect` (missing ufrag/pwd/fingerprints), or `FromShell` with empty/invalid command.
`ErrFFmpegSpawn`	`exec.Cmd.Start` failed (binary missing / permission denied / OS resource exhaustion).
`ErrFFmpegCrashed`	ffmpeg exited non-zero; wrapped error carries `exit=<code>` and the last 512 bytes of stderr for diagnosis. Surfaced both via `OnStreamEnd` and on the `ShellReader.Read` EOF path (the Reader briefly waits — bounded 200 ms — for the reap goroutine to capture the real exit before returning, so a fast-failing child no longer collapses to a bare `io.EOF` swallowed by the OGG/IVF parser).
`ErrFile`	Source contained no playable audio or video stream (OGG / IVF parse failed).
`ErrClosed`	Any method called after `Client.Close()`.
`ErrNotConnected`	`SetSource` timed out waiting for WebRTC to reach Connected (10 s default; override with `WithConnectTimeout`).
`ErrInternal`	Wrapping for pion API errors that shouldn't happen (e.g. `CreateOffer` failure).
`ErrWrongMode`	WebRTC-only method called on an RTMP call (or vice versa).

Concurrency model

One *Client per process multiplexes any number of group calls.
All public methods are safe for concurrent use.
Per-chat operations are serialised internally via a sync.RWMutex on each call instance.
Concurrent CreateCall / StartRTMP for the same chat are gated by a per-chat creation mutex; the first wins, others get ErrConnectionExists without allocating the underlying WebRTC stack.
The createMu map entry is freed in Stop (you can re-use the chatID cleanly).
Callbacks fire on a single dispatcher goroutine — no inter-callback parallelism, but no risk of deadlocking the producer either.

Goroutine budget

The library is deliberately frugal with goroutines. Inventory:

Per-process / per-Factory (constant, shared across every call):

Goroutine	Where	Purpose
`monitor.run`	`wrtc/keepalive.go`	One timer loop that paces video keepalive padding for every active PC and force-closes any PC stuck out of Connected past 15 s.
`dispatcher.loop`	`utils/synccallback.go`	Serializes every callback (`OnStreamEnd`, `OnConnectionChange`, `OnUpgrade`) so user code can safely re-enter the API.
`certpool.refill`	`wrtc/native/certpool.go`	Pre-generates ECDSA-P256 DTLS certs so `CreateCall` never blocks on keygen during bursts. Skipped entirely when `WithDTLSCertPool(0)`. Sleeps on a buffered channel send when the pool is full.

Per-call (proportional to live call count):

Goroutine	Where	Purpose
2× `streamer.run`	`media/streamer.go`	One for audio, one for video. Paces `media.Sample` frames at wall-clock cadence and writes them to the pion track.
1× `stack.drainInbound`	`wrtc/native/stack.go`	Drains the SRTP packet conn after DTLS finishes. Required so pion ICE's recv buffer doesn't fill — we send only, but Telegram still talks back.

Per-source (only when an ffmpeg subprocess is involved):

Goroutine	Where	Purpose
1× `shellReader.reap`	`io/shell_reader.go`	Blocks on `cmd.Wait` and fires the configured `OnExit` hook once the process is fully reaped. The same goroutine also drives any consumer-supplied lifecycle callback, so RTMPCall doesn't need a separate watcher.

That's it for code we own. pion adds ~5–8 internal goroutines per call (ICE agent task loop, DTLS handshake, srtp readers); those are upstream and not reducible from here.

Notes:

Audio + video streamers stay separate because Source.Next() can block on disk/network I/O; combining them would let one leg starve the other. The cost is one extra goroutine per call.
The keepalive monitor was the obvious goroutine-per-call candidate. It is one goroutine per Factory, not per call — a single ticker iterates a map snapshot. Callers that spin thousands of concurrent calls don't pay for thousands of watchdogs.
pc.Close() unregisters from the monitor synchronously, so dead entries never leak and never re-tick.

Networking

Transport: UDP4 + UDP6 by default. Override with WithNetworkTypes(...) to restrict or add TCP.
STUN / TURN: none by default — host candidates work for the great majority of deployments. Pass WithICEServers(...) if you need TURN for symmetric NAT / blocked UDP.
Interface filter: virtual / VPN interfaces (Docker bridges, WSL, VMware, Tailscale, ZeroTier, OpenVPN, etc.) are skipped automatically. Override is not exposed; report a bug if your interface name is being filtered incorrectly.
UDP mux: default = one socket per call. Pass WithSharedUDPMux() to multiplex all calls through one udp4:0 socket (recommended once you're above ~1 000 concurrent calls — see UDP mux & scaling).
Connect gate: SetSource waits up to 10 s for ICE + DTLS to reach Connected before returning ErrNotConnected. Override with WithConnectTimeout(...). A factory-side watchdog also force-closes any PC stuck out of Connected for 15 s, so a leaked socket can never outlive the gate.
ICE timeouts: 60 s disconnect grace, 120 s before declaring failed, 2 s keepalive. Override with WithICETimeouts(...).
ICE role: hardcoded CONTROLLED. Telegram's SFU is always CONTROLLING; pinning our side means we never hit pion's role-conflict (487 storm) path and the connection converges in tens of milliseconds.

Performance tuning

Cert pool size (WithDTLSCertPool): default 8; raise for very bursty workloads. ECDSA-P256 keygen costs ~10 ms per call — the pool keeps a buffer ready so CreateCall doesn't block on it.
Dispatch buffer (WithDispatchBuffer): default 256. Raise if you see drop warnings under bursty callback fan-out.
Shared UDP mux (WithSharedUDPMux): collapses every call onto one udp4:0 socket via ice.UDPMuxDefault. Cuts socket-table pressure and FD use once you're above ~1 000 concurrent calls. Closed cleanly when the Factory shuts down.
Fast cold-start: FromFile / FromURL already inject -analyzeduration 0 -probesize 64k, cutting ~1–2 s from ffmpeg startup. If you go custom via FromShell, set the same flags.

Memory usage

Measured per-process on Linux/amd64, Go 1.26, GOGC=100. Numbers are RSS (resident set), including ffmpeg subprocesses. Round figures — your workload will move them ±30 %.

State	Go heap (in-process)	ffmpeg RSS (per call)	Total per call
Idle (Factory only, no calls)	~6–8 MB	—	—
One audio-only call (`libopus` from a file/URL)	+~1–2 MB	~6–10 MB	~7–12 MB
One audio+video call (`libopus` + `libvpx` 720p30)	+~2–3 MB	~25–40 MB (1 ffmpeg/leg)	~50–80 MB
One RTMP push (single muxed ffmpeg)	+~1 MB	~20–35 MB	~20–35 MB

Notes:

Go-heap growth is dominated by per-Stack pion buffers (~1 MB ICE + DTLS + SRTP scratch, single 1500-byte SRTP encrypt buffer reused per write).
Audio-only is the cheap path. The 25–40 MB number for video is ffmpeg's libvpx encoder state, not gotgcall.
WithSharedUDPMux saves ~4 KB kernel socket buffer per call (the dominant kernel cost on 10 K+ call boxes; you also save FDs).
Goroutine inventory: see Goroutine budget — 3 shared per Factory, 3 per call, plus pion's ~5–8 ICE/DTLS internals.

Concurrency / scaling ballparks

Concurrent calls	Recommended tuning
1–100	Defaults. Don't touch anything.
100–1 000	`WithSharedUDPMux()`. Raise FD limit (`ulimit -n 65535`).
1 000–10 000	Above + `WithDTLSCertPool(64)`, `WithDispatchBuffer(4096)`. Pin GOMAXPROCS. Watch ffmpeg total RSS — this is the bottleneck.
10 000+	Above + shard across processes; ffmpeg memory dominates at this scale.

A/V sync

Audio and video legs share a wall-clock baseline within microseconds and pace by per-frame duration; drift does not accumulate.
Don't apply different time-distortion filters to the two legs — e.g. atempo=1.25 on audio without setpts=PTS/1.25 on video — they will desync linearly.
In RTMP mode, sync is ffmpeg's responsibility (single muxed push).

Pitfalls

Requesting video on an audio-only source. Don't pass Tracks: TrackVideo unless you know the container actually has video; audio failure alongside an empty video leg surfaces as ErrFile.
Raw PCM/YUV is rejected at construction time. FromShell validates codec/container args and returns ErrInvalidParams pointing at libopus/libvpx.
SetSource gates on ICE Connected (10 s default). Samples written before ICE+DTLS settle are silently dropped; the gate blocks the caller until it's safe to push frames. On failure, ErrNotConnected / ErrConnectionFailed.
Pause in RTMP mode is destructive. ffmpeg is killed and Telegram drops the ingest; resume re-establishes at the captured offset. Listeners hear a brief silence.

Performance vs ntgcalls

Both ship into the same Telegram SFU and use the same Opus + VP8 codecs at the same bitrates, so wire bandwidth is identical. The differences are operational, not protocol-level.

Dimension	ntgcalls (libwebrtc, C++)	gotgcall (pion + ffmpeg subprocess, pure Go)
Per-call CPU (steady state, audio-only)	~1–2 % of one core (in-process Opus encoder, no IPC)	~2–4 % of one core (ffmpeg encodes Opus, pipe IPC to Go, pion packetises)
Per-call memory baseline	~15–25 MB (libwebrtc allocators + jitter buffers)	~5–10 MB Go heap + ~10–20 MB per ffmpeg subprocess (drops to ~5 MB on libopus-only audio)
Cold-start to first packet	~50–150 ms (compiled-in encoder ready immediately)	~80–300 ms (ffmpeg spawn + first OGG page; the `-analyzeduration 0 -probesize 64k` fast-probe flags shave ~1–2 s vs ffmpeg defaults)
Cross-compile / deploy	Requires libwebrtc + glibc + a C++ toolchain on the target ABI; cgo enabled	`CGO_ENABLED=0 GOOS=linux GOARCH=arm64 go build` — single static binary, scp it, run. ffmpeg as a runtime dep (typically already installed).
Binary size	~20–30 MB (libwebrtc + cgo glue)	~12–18 MB Go binary (no ffmpeg bundled)
Subprocess footprint	None (everything in-process)	1 ffmpeg per call for audio, 2 if video is on (one per leg). Easy to inspect/kill with standard Unix tools; isolates encoder crashes from the bot process.
Pause/resume latency	Mute internal pipeline, sub-ms	Pause: gate the streamer (sub-ms, ffmpeg keeps running). Resume: wake the gate (sub-ms). RTMP mode: kill+restart with `-ss`, ~100–300 ms gap.
Concurrent calls per process	Bounded by libwebrtc thread pool sizing (~hundreds without tuning)	Bounded by ffmpeg subprocess count + FDs. With `WithSharedUDPMux` and raised FD limits: tens of thousands. See UDP mux & scaling.
Hot-reload of encoder logic	Recompile + redeploy the whole bot	Swap an ffmpeg flag string at runtime (`FromShell`) — no rebuild

Trade-offs at a glance:

ntgcalls wins on raw CPU/memory per call (in-process encoder, no IPC overhead).
gotgcall wins on operability (static binaries, no C++ chain, ffmpeg-flag flexibility, OS-level subprocess isolation).
For a typical music bot (10–500 concurrent voice calls), the per-call overhead difference is invisible on any reasonable server.
For 10 000+ concurrent calls on one box, ntgcalls' lower per-call memory footprint matters; gotgcall offsets some of this by sharing one UDP socket via WithSharedUDPMux.

The actual numbers above are order-of-magnitude estimates; benchmark on your workload before committing to either.

Why pure Go

ntgcalls works fine but pulls in libwebrtc + glibc + a C++ build chain. Cross-compiling music bots becomes a maintenance burden. gotgcall builds with CGO_ENABLED=0 to a single static binary on every supported platform. The trade-off is ffmpeg as a runtime dependency, which most bot deployments already have anyway.

The WebRTC layer underneath gotgcall is built directly on pion's lower-level packages (pion/ice, pion/dtls, pion/srtp, pion/rtp) — no pion/webrtc.PeerConnection. Connections run as ICE-CONTROLLED via ice.Agent.Accept from the first packet (Telegram is always CONTROLLING), so the role conflict that previously caused stuck-in-connecting failures cannot occur. The native stack also lets the close path target only the topmost installed layer, avoiding the double-close panic in pion/ice v4.2.7's task loop.

FAQ

Is this a port of ntgcalls / pytgcalls to Go?

No — it's an independent implementation with a deliberately ntgcalls-shaped API so existing bot code translates almost line-for-line. ntgcalls wraps libwebrtc (C++); gotgcall uses pion, the pure-Go WebRTC stack.

Does it work with gogram, MTProto-Go, or other MTProto libraries?

Yes — any of them. The library is blob-only: it produces and consumes JSON strings; you handle the MTProto layer (phone.JoinGroupCall / phone.LeaveGroupCall) in your bot using whichever MTProto Go library you prefer. The examples/bot/ directory has a runnable skeleton against gogram.

Can I use this for a Telegram music bot?

That's the primary use case. See examples/bot/ and the FromShell recipes for piping yt-dlp / atempo / loudness-normalised ffmpeg pipelines.

Does it support video chats / livestreams / RTMP push?

Yes — three modes:

WebRTC: send-only audio + video into a normal voice/video chat.
RTMP push: "go live" broadcasts to a channel via Telegram's RTMP ingest URL. See RTMP mode.
Custom ffmpeg: FromShell / FromShells lets you point at any decodable container or live source — HLS, RTSP, MJPEG, screen capture, etc.

Does it support TGCalls / MTProto E2E voice calls?

No — only group calls and channel RTMP livestreams. 1-on-1 MTProto voice/video calls (TGCalls) require a different signalling path that this library does not currently target.

What Go version is required?

Go 1.26 or newer (uses errors.AsType[T] and a few stdlib refinements added in 1.26).

Does it run on Windows?

Yes. Pure-Go means no Make/gcc/clang. Pause/Resume in WebRTC mode uses a channel gate (works on every OS); RTMP mode uses kill+restart-with--ss (also OS-agnostic — SIGSTOP would be killed by Telegram's RTMP ingest timeout anyway).

How many concurrent calls can one process handle?

The library has no hardcoded limit. The practical ceiling is ffmpeg subprocess count + ICE socket count. Use WithSharedUDPMux() to collapse all calls onto one UDP socket once you're above ~100 concurrent calls.

Where do I report bugs?

Open an issue with logs from WithLogger(slog.New(slog.NewTextHandler(os.Stderr, &slog.HandlerOptions{Level: slog.LevelDebug}))) — debug-level logging covers streamer state, ffmpeg exit, ICE transitions.

License

MIT — see LICENSE.

Documentation ¶

Overview ¶

Package gotgcall is a pure-Go library for streaming audio and video into Telegram group calls. The public API mirrors ntgcalls method names so bot code translates one-to-one.

The library is blob-only: signaling JSON is exchanged through your own MTProto client (typically gogram). Two calls are required:

params, _ := client.CreateCall(chatID)
resp, _   := tg.PhoneJoinGroupCall(... Params: &DataJson{Data: params})
client.Connect(chatID, resp.Updates[...].Call.Params.Data)
client.SetStreamSources(chatID, gotgcall.FromFile("song.mp3", gotgcall.EncodeOptions{}))

See README.md for the full pattern.

Index ¶

Constants
Variables
type CallInfo
type Client
- func New(opts ...Option) (*Client, error)
type ConnState
type Device
type EncodeOptions
type ICEServer
type MediaState
type NetworkInfo
type NetworkType
type Option
type SeekableSource
type Source
type StreamType
type Track

Constants ¶

View Source

const (
	NetworkTypeUDP4 = wrtc.NetworkTypeUDP4
	NetworkTypeUDP6 = wrtc.NetworkTypeUDP6
	NetworkTypeTCP4 = wrtc.NetworkTypeTCP4
	NetworkTypeTCP6 = wrtc.NetworkTypeTCP6
)

View Source

const (
	TrackAudio = media.TrackAudio
	TrackVideo = media.TrackVideo

	Audio      = models.Audio
	Video      = models.Video
	Microphone = models.Microphone
	Camera     = models.Camera

	Connecting   = models.Connecting
	Connected    = models.Connected
	Disconnected = models.Disconnected
	Failed       = models.Failed
	Closed       = models.Closed
)

Variables ¶

View Source

var (
	FromFile   = media.FromFile
	FromURL    = media.FromURL
	FromShell  = media.FromShell
	FromShells = media.FromShells
)

View Source

var (
	ErrConnectionExists   = models.ErrConnectionExists
	ErrConnectionNotFound = models.ErrConnectionNotFound
	ErrConnectionFailed   = models.ErrConnectionFailed
	ErrInvalidParams      = models.ErrInvalidParams
	ErrFFmpegSpawn        = models.ErrFFmpegSpawn
	ErrFFmpegCrashed      = models.ErrFFmpegCrashed
	ErrFile               = models.ErrFile
	ErrClosed             = models.ErrClosed
	ErrInternal           = models.ErrInternal
	ErrNotConnected       = models.ErrNotConnected
	ErrWrongMode          = models.ErrWrongMode
)

Functions ¶

This section is empty.

Types ¶

type CallInfo ¶

type CallInfo = models.CallInfo

type Client ¶

type Client struct {
	// contains filtered or unexported fields
}

Client multiplexes many concurrent group calls behind a single process-wide handle. Safe for concurrent use.

func New ¶

func New(opts ...Option) (*Client, error)

New constructs a Client with the given options. Fails fast if the ffmpeg binary isn't on PATH (or wherever WithFFmpegPath points) so callers see the error at startup rather than on first stream.

func (*Client) AudioSSRC ¶

func (c *Client) AudioSSRC(chatID int64) (uint32, error)

AudioSSRC returns the audio SSRC of a WebRTC call. Pass as Source to phone.LeaveGroupCall. Returns ErrWrongMode for RTMP calls.

func (*Client) Calls ¶

func (c *Client) Calls() map[int64]CallInfo

Calls returns a snapshot of all active calls.

func (*Client) Close ¶

func (c *Client) Close() error

Close stops every call and releases resources. Idempotent.

func (*Client) Connect ¶

func (c *Client) Connect(chatID int64, telegramParams string) error

Connect finishes the WebRTC handshake using Telegram's response JSON. On error the call is auto-reaped so the caller can immediately retry CreateCall without coordinating a separate Stop.

func (*Client) CreateCall ¶

func (c *Client) CreateCall(chatID int64) (string, error)

CreateCall starts a new WebRTC group-call instance for chatID and returns the JSON params the caller must pass to phone.JoinGroupCall.

Concurrent CreateCall / StartRTMP calls for the same chat are serialized; the first one wins, others get ErrConnectionExists without allocating a pion PeerConnection.

func (*Client) GetState ¶

func (c *Client) GetState(chatID int64) (MediaState, error)

GetState returns the current media-state (mute/pause flags).

func (*Client) Mute ¶

func (c *Client) Mute(chatID int64) (bool, error)

func (*Client) OnConnectionChange ¶

func (c *Client) OnConnectionChange(fn func(chatID int64, info NetworkInfo))

OnConnectionChange registers a callback for ICE/DTLS state transitions.

func (*Client) OnStreamEnd ¶

func (c *Client) OnStreamEnd(fn func(chatID int64, t StreamType, d Device, err error))

OnStreamEnd registers a callback fired when a track ends from EOF or ffmpeg crash. Manual Stop / SetSource don't fire — the caller already knows they initiated those.

For video+audio sources (vplay) the callback fires twice in fixed order: first StreamType=Video, then StreamType=Audio. Audio-only and video-only sources fire once. Called on the dispatcher goroutine so it is safe to re-enter the Client API from within.

func (*Client) OnUpgrade ¶

func (c *Client) OnUpgrade(fn func(chatID int64, state MediaState))

OnUpgrade registers a callback fired only on *spontaneous* outgoing media-state transitions — ones the library itself initiates without an API call from the caller. User-initiated transitions (SetStreamSources, Stop, Pause, Resume, Mute, Unmute) are silent because the caller already knows they triggered them and can flip the MTProto participant flags in the same code path.

In practice OnUpgrade fires when:

A video leg ends mid-stream (EOF / ffmpeg crash) — VideoStopped flips false→true. (Audio-only sources do not fire here because VideoStopped was already true.)
The WebRTC PC transitions to Failed/Closed while video was active — same VideoStopped flip.

Mirror of ntgcalls' onUpgrade(MediaState). Fires on the dispatcher goroutine, safe to re-enter the Client API from within.

func (*Client) Pause ¶

func (c *Client) Pause(chatID int64) (bool, error)

func (*Client) Resume ¶

func (c *Client) Resume(chatID int64) (bool, error)

func (*Client) SetStreamSources ¶

func (c *Client) SetStreamSources(chatID int64, src Source) error

SetStreamSources installs or replaces the streaming source for chatID. Encode options (FPS, tracks, bitrates) ride along with the Source — set them on the constructor (FromFile/FromURL).

On error the call is auto-reaped (closed and removed from the per-client registry) so the caller can immediately retry CreateCall without first invoking Stop. This covers the failure modes the user would otherwise need to clean up by hand: ICE/DTLS gate timeout, "connection closed during setup", and ffmpeg / source-open errors.

func (*Client) StartRTMP ¶

func (c *Client) StartRTMP(chatID int64, rtmpURL string) error

StartRTMP creates an RTMP-push call for chatID. The caller obtains rtmpURL via phone.GetGroupCallStreamRtmpUrl gogram-side. Serialised with CreateCall via the same per-chat creation mutex.

func (*Client) Stop ¶

func (c *Client) Stop(chatID int64) error

Stop tears down the call and clears the per-chat call entry. The per-chat create-mutex is intentionally kept (see reap) so a concurrent CreateCall parked on mu.Lock() doesn't end up racing a later one on a fresh mutex. The mutex memory is negligible (sizeof sync.Mutex per chatID ever used).

func (*Client) Time ¶

func (c *Client) Time(chatID int64) (uint64, error)

Time returns elapsed ms of media pushed.

func (*Client) Unmute ¶

func (c *Client) Unmute(chatID int64) (bool, error)

type ConnState ¶

type ConnState = models.ConnState

type Device ¶

type Device = models.Device

type EncodeOptions ¶

type EncodeOptions = media.EncodeOptions

type ICEServer ¶ added in v0.6.0

type ICEServer = wrtc.ICEServer

ICEServer is re-exported so callers can configure STUN/TURN without importing pion directly.

type MediaState ¶

type MediaState = models.MediaState

type NetworkInfo ¶

type NetworkInfo = models.NetworkInfo

type NetworkType ¶ added in v0.6.0

type NetworkType = wrtc.NetworkType

NetworkType is re-exported for WithNetworkTypes.

type Option ¶

type Option func(*config)

func WithConnectTimeout ¶ added in v0.6.17

func WithConnectTimeout(d time.Duration) Option

WithConnectTimeout overrides how long SetSource/Resume wait for the WebRTC connection to reach Connected before giving up. Default 10s — matches ntgcalls' own internal connection timeout. With pion running as ICE-CONTROLLED (since v0.6.26) and Telegram's edges responding within 50-300ms in healthy networks, 10s is generous. Set higher on unstable networks where ICE re-pairing on cross-DC moves takes longer.

func WithDTLSCertPool ¶

func WithDTLSCertPool(n int) Option

WithDTLSCertPool sets the size of the pre-generated DTLS certificate pool. Larger pools absorb bigger call-creation bursts without keygen latency. 0 disables pre-generation.

func WithDebugLogs ¶ added in v0.6.0

func WithDebugLogs() Option

WithDebugLogs is a convenience that installs a Debug-level text handler writing to os.Stderr. Equivalent to:

WithLogger(slog.New(slog.NewTextHandler(os.Stderr, &slog.HandlerOptions{Level: slog.LevelDebug})))

Use this when reporting bugs — debug-level output covers ICE/DTLS state, ffmpeg exit codes, streamer pacing, and pion-internal events bridged through the new pion→slog adapter.

func WithDispatchBuffer ¶

func WithDispatchBuffer(n int) Option

WithDispatchBuffer sizes the event dispatcher's channel. Default 256.

func WithFFmpegPath ¶

func WithFFmpegPath(p string) Option

WithFFmpegPath overrides the ffmpeg binary path (default "ffmpeg").

func WithFFmpegStderrLog ¶ added in v0.6.0

func WithFFmpegStderrLog() Option

WithFFmpegStderrLog tees ffmpeg's stderr output to the library logger at Debug level while the process is running. Without this, ffmpeg stderr is only surfaced in the final error message (last 512 bytes) when the subprocess crashes — useful for crash diagnosis but useless for "ffmpeg is running, but I see no audio" symptoms. Enable for verbose diagnosis.

func WithICECandidateLogs ¶ added in v0.6.3

func WithICECandidateLogs() Option

WithICECandidateLogs logs every locally-gathered ICE candidate (host / srflx / relay, address, port, foundation) at Debug level via the PeerConnection's OnICECandidate hook. Pairs well with WithPionTraceLogs for "why is ICE failing" diagnosis: this option shows what we offered, pion-trace shows which pairs were tried, and the remote answer's candidate list (parsed in jsonparams) shows what Telegram returned.

func WithICEMaxBindingRequests ¶ added in v0.6.22

func WithICEMaxBindingRequests(n uint16) Option

WithICEMaxBindingRequests overrides pion's per-pair STUN binding retry budget. Library default is 150 (≈30 s of retries at the 200 ms check interval), matching the standard connect gate so a slow Telegram SFU registration still recovers within the gate window. Pion's own default is 7, which lets each pair die in ~1.4 s if early STUN bindings get error responses (common when the SFU hasn't yet bound our ICE credentials post-JoinGroupCall) — pion then idle-ticks on a dead checklist until the connect gate times out.

Use this when you want a tighter or looser per-pair budget independent of the connect gate. 0 keeps the library default.

func WithICEPreConnectDelay ¶ added in v0.6.22

func WithICEPreConnectDelay(d time.Duration) Option

WithICEPreConnectDelay sleeps inside PeerConnection.Connect, after parsing Telegram's remote params but before pion's SetRemoteDescription kicks off ICE. Gives Telegram's SFU a head-start to register our credentials so the first STUN binding succeeds instead of being rejected with an error response (which would waste retries from the ICEMaxBindingRequests budget).

Library default is 250 ms (imperceptible to users; covers typical SFU registration windows around 150-200 ms post-JoinGroupCall). Pass any negative duration to disable the delay entirely. Pair with WithICEMaxBindingRequests for defense in depth — the delay reduces wasted STUN traffic in the SFU registration window; the higher binding budget recovers when the delay alone wasn't long enough.

func WithICEServers ¶ added in v0.6.0

func WithICEServers(servers []ICEServer) Option

WithICEServers overrides the ICE server list (default: none, host candidates only — matching ntgcalls). Pass STUN/TURN entries for bots behind symmetric NAT or restrictive firewalls.

gotgcall.WithICEServers([]gotgcall.ICEServer{
    {URLs: []string{"turn:turn.example.com:3478"},
     Username: "u", Credential: "p"},
})

func WithICETimeouts ¶ added in v0.6.0

func WithICETimeouts(disconnect, failed, keepalive time.Duration) Option

WithICETimeouts overrides pion's ICE timing. Pass 0 for any value to keep the library default (60s disconnect grace / 120s failed / 2s keepalive). Telegram's edge wobble on rejoin often takes 60-90s to settle, hence the generous defaults. Use longer values on unstable networks where brief connectivity drops shouldn't kill the call; pass shorter values for ultra-responsive UIs that need faster fail-detection.

func WithLogger ¶

func WithLogger(l *slog.Logger) Option

WithLogger sets a structured logger for internal events.

func WithNetworkTypes ¶ added in v0.6.0

func WithNetworkTypes(types ...NetworkType) Option

WithNetworkTypes overrides the ICE candidate network-type whitelist. Default is UDP4+UDP6 (matching ntgcalls' PORTALLOCATOR_ENABLE_IPV6). Telegram's SFU accepts IPv6 candidates and dual-stack hosts get more candidate pairs. Add TCP for restrictive environments where UDP is blocked.

gotgcall.WithNetworkTypes(
    gotgcall.NetworkTypeUDP4,
    gotgcall.NetworkTypeUDP6,
    gotgcall.NetworkTypeTCP4,
)

func WithPionTraceLogs ¶ added in v0.6.3

func WithPionTraceLogs() Option

WithPionTraceLogs remaps pion's Trace-level output (per-ICE-check, per- candidate-pair, per-binding-request) to slog.LevelDebug instead of the default sub-debug level. Use this when ICE is stuck in "Checking" and you need to see exactly which candidate pairs are being tried, which fail, and which (if any) get a response from the remote.

gotgcall.New(gotgcall.WithDebugLogs(), gotgcall.WithPionTraceLogs())

Volume warning: ICE Trace at scale is several hundred lines per call. Use for diagnosis, not steady-state production.

func WithSharedUDPMux ¶

func WithSharedUDPMux() Option

WithSharedUDPMux makes all calls share one UDP socket for ICE traffic. Useful for high-concurrency setups (100+ simultaneous calls).

func WithVerboseConnectionLogs ¶ added in v0.6.25

func WithVerboseConnectionLogs() Option

WithVerboseConnectionLogs is a one-flag bundle for diagnosing "ICE/DTLS did not reach Connected within Ns" failures. It enables:

Debug-level slog handler to stderr (WithDebugLogs)
Per-candidate gather logging (WithICECandidateLogs)
Pion's per-binding-request / per-pair-check trace at Debug (WithPionTraceLogs)

Use when reporting a stuck-in-Connecting bug — the resulting log contains every signal the library can surface about the ICE state machine. Library still also emits an Info-level checking-phase snapshot every ~5 seconds without this flag, so most issues can be triaged from a non-Debug run; flip this when those Info lines aren't enough.

type SeekableSource ¶

type SeekableSource = media.SeekableSource

type Source ¶

type Source = media.Source

type StreamType ¶

type StreamType = models.StreamType

type Track ¶

type Track = media.Track

Source Files ¶

View all Source files

gotgcall.go

Directories ¶

Path	Synopsis
instances Package instances holds the per-chat call state.	Package instances holds the per-chat call state.
io
media
models
utils
wrtc
jsonparams Package jsonparams encodes and decodes the SDP-like JSON envelope that Telegram's group-call signaling uses in place of standard SDP O/A.	Package jsonparams encodes and decodes the SDP-like JSON envelope that Telegram's group-call signaling uses in place of standard SDP O/A.
native

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

README ¶

gotgcall — Pure-Go Telegram Group Call & Voice Chat Streaming Library

Status

Contents

Install

Architecture at a glance

Quick start

Sources

FromFile / FromURL

FromShell — single custom ffmpeg leg

FromShells — dual ffmpeg legs

EncodeOptions

Client options

Enabling debug logs

UDP mux & scaling

Lifecycle

WebRTC mode

RTMP mode

Pause / Resume / Mute

Callbacks

Server-side media-state changes (admin mute, video off)

Errors

Concurrency model

Goroutine budget

Networking

Performance tuning

Memory usage

Concurrency / scaling ballparks

A/V sync

Pitfalls

Performance vs ntgcalls

Why pure Go

FAQ

Is this a port of ntgcalls / pytgcalls to Go?

Does it work with gogram, MTProto-Go, or other MTProto libraries?

Can I use this for a Telegram music bot?

Does it support video chats / livestreams / RTMP push?

Does it support TGCalls / MTProto E2E voice calls?

What Go version is required?

Does it run on Windows?

How many concurrent calls can one process handle?

Where do I report bugs?

See also

License

Documentation ¶

Overview ¶

Index ¶

Constants ¶

Variables ¶

Functions ¶

Types ¶

type CallInfo ¶

type Client ¶

func New ¶

func (*Client) AudioSSRC ¶

func (*Client) Calls ¶

func (*Client) Close ¶

func (*Client) Connect ¶

func (*Client) CreateCall ¶

func (*Client) GetState ¶

func (*Client) Mute ¶

func (*Client) OnConnectionChange ¶

func (*Client) OnStreamEnd ¶

func (*Client) OnUpgrade ¶

func (*Client) Pause ¶

func (*Client) Resume ¶

func (*Client) SetStreamSources ¶

func (*Client) StartRTMP ¶

func (*Client) Stop ¶

func (*Client) Time ¶

func (*Client) Unmute ¶

type ConnState ¶

type Device ¶

type EncodeOptions ¶

type ICEServer ¶ added in v0.6.0

type MediaState ¶

type NetworkInfo ¶

type NetworkType ¶ added in v0.6.0

type Option ¶

func WithConnectTimeout ¶ added in v0.6.17

`FromFile` / `FromURL`

`FromShell` — single custom ffmpeg leg

`FromShells` — dual ffmpeg legs

`EncodeOptions`