Example output pattern:
[
    {
        'text': 'This is the first part of the transcript.',
        'start': 0.0,
        'duration': 5.0
    },
    {
        'text': 'And here is the next part.',
        'start': 5.0,
        'duration': 4.0
    },
    ...
]
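
A minimal sketch of how a list in this shape might be consumed. The sample data copies only the two entries shown above. The helper names `join_text` and `with_end_times` are hypothetical, and treating `start` and `duration` as seconds is an assumption, not something the output itself states.

```python
# Hypothetical sample data mirroring the pattern above (only the two entries shown).
segments = [
    {'text': 'This is the first part of the transcript.', 'start': 0.0, 'duration': 5.0},
    {'text': 'And here is the next part.', 'start': 5.0, 'duration': 4.0},
]


def join_text(segments):
    """Concatenate the 'text' fields into one string, separated by spaces."""
    return ' '.join(seg['text'] for seg in segments)


def with_end_times(segments):
    """Return copies of each entry with an added 'end' key (start + duration).

    Assumes 'start' and 'duration' share the same unit (seconds is assumed here).
    """
    return [{**seg, 'end': seg['start'] + seg['duration']} for seg in segments]


full_text = join_text(segments)
timed = with_end_times(segments)

print(full_text)
for seg in timed:
    print(f"{seg['start']:.1f}-{seg['end']:.1f}: {seg['text']}")
```

`with_end_times` builds new dictionaries rather than modifying the originals, so the input list stays unchanged if it is needed again later.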