GPT-5 (by OpenAI)

GPT-5 (OpenAI)

Advanced multimodal language model for complex reasoning, orchestration, and multimodal tasks.


Quick facts

  • Modes: Text → Text · Image → Text · Video → Text

  • Default output / size: Text responses (token-limited)


What it’s great for

  • Complex coding, reasoning, and orchestration tasks.

  • Video → text transcription & summarization.

  • Multimodal pipeline orchestration.


Copy-and-paste prompts

Summarize this 30s product demo video into a 3-bullet marketing blurb.
Extract timestamps and scene descriptions from this video: Video Block

Parameters

Name
Type
Default
Notes

Prompt

Text

Required


Modes

Mode
Estimated time
Required inputs
Typical use

Text → Text

Streaming

Prompt

Code, summaries, long-form text

Image → Text

Streaming

Images

Image description

Video → Text

Streaming

Videos, Prompt

Transcription & summarization


Output options

Option
Values / notes

Formats

Text / JSON

Notes

Advanced reasoning & tooling integrations available via API


Prompt tips

  • Be explicit about length and format (bullets, code block, JSON).

  • Provide context and any relevant assets.


Last updated

Was this helpful?