OpenAI for Transcription and Summarize of Recordings
You can use OpenAI to summarize your voice recordings, making it easier for you to delete or organize your files.
Stop. Don’t upload sensitive information, I warned you.
I use a single-purpose audio recorder for dictation. By speaking, I can organize my thoughts.
Simple to record.
Difficult to review.
It is a slog.
I decided to use OpenAI to help me transcribe and summarize my recordings.
Python and OpenAI Transcription and Summary
Summarization with Chain of Density and GPT-3.5-Turbo-1106
“A good summary should be detailed and entity-centric without being overly dense and hard to follow”. - Chain of Density Prompt Text Link
from openai import OpenAI
import json
client = OpenAI()
audio_file = open("D:\\RECORDER\\FOLDER_B\\231006_0251.MP3", "rb")
transcript = client.audio.transcriptions.create(
model="whisper-1",
file=audio_file,
response_format="text"
)
chainofdensity = """Link to the Prompt Above"""
response = client.chat.completions.create(
model="gpt-3.5-turbo-1106",
messages=[
{"role": "system", "content": chainofdensity},
{"role": "user", "content": transcript},
]
)
print(transcript)
print(response)
Comparing OpenAI Whisper with Deepgram
I found Deepgram, which is a similar Speech to Text with AI solution. I was able to sign up and upload my recording to the playground.
Let’s compare.
OpenAI and Chain of Density
Creating an effective organization is crucial to avoid overthinking and ensure productivity. Regular feedback is essential for improvement and growth.
Deepgram
The speaker advises the caller to have a clear idea of what they are learning and to use it more often. They stress the importance of distillation and summarization in learning and sharing. The speaker emphasizes the need for feedback and practice to improve their writing.
What do you think?
Direct Message me on X/Twitter or contact me.
Conclusion
In summary, if you use a voice recorder, add more context before recording. OpenAI can transcribe your recording and summarize. You can review the transcription and summary at a glance.