TensorFlow provides Ops to decode and encode audio files using the FFmpeg library. FFmpeg must be locally installed for these Ops to succeed.
Example:
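import tensorflow as tf
from tensorflow.contrib import ffmpeg

# Read the raw bytes of the MP3 file into a string tensor.
audio_binary = tf.read_file('song.mp3')
# Decode into a float32 tensor of shape [length, channel_count].
waveform = ffmpeg.decode_audio(
    audio_binary, file_format='mp3', samples_per_second=44100, channel_count=2)
# Re-encode the decoded waveform as WAV bytes.
uncompressed_binary = ffmpeg.encode_audio(
    waveform, file_format='wav', samples_per_second=44100)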
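Because these Ops depend on a local FFmpeg install, it can help to check for the binary before building the graph. The following is a minimal sketch, not part of the TensorFlow API: the helper name `ffmpeg_available` is hypothetical, and it assumes the Ops find FFmpeg as an executable named `ffmpeg` on the system PATH.

```python
import shutil


def ffmpeg_available(binary_name="ffmpeg"):
    """Return True if an executable named `binary_name` is found on PATH.

    Hypothetical helper; assumes FFmpeg is invoked by that name via PATH.
    """
    return shutil.which(binary_name) is not None


if not ffmpeg_available():
    # Warn early instead of failing later, when the audio Ops are run.
    print("FFmpeg not found on PATH; decode_audio/encode_audio will likely fail.")
```

A check like this only confirms that an executable with that name exists. It does not verify the FFmpeg version or which codecs it was built with.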
© 2018 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/api_guides/python/contrib.ffmpeg