librosa.core.load

librosa.core.load(path, sr=22050, mono=True, offset=0.0, duration=None, dtype=<class 'numpy.float32'>, res_type='kaiser_best')[source]

Load an audio file as a floating point time series.

Audio will be automatically resampled to the given rate (default sr=22050).

To preserve the native sampling rate of the file, use sr=None.

Parameters:
path : string, int, or file-like object

path to the input file.

Any codec supported by soundfile or audioread will work.

If the codec is supported by soundfile, then path can also be an open file descriptor (int), or any object implementing Python’s file interface.

If the codec is not supported by soundfile (e.g., MP3), then only string file paths are supported.

sr : number > 0 [scalar]

target sampling rate

‘None’ uses the native sampling rate

mono : bool

convert signal to mono

offset : float

start reading after this time (in seconds)

duration : float

only load up to this much audio (in seconds)

dtype : numeric type

data type of y

res_type : str

resample type (see note)

Note

By default, this uses resampy’s high-quality mode (‘kaiser_best’).

For alternative resampling modes, see resample

Note

audioread may truncate the precision of the audio data to 16 bits.

See https://librosa.github.io/librosa/ioformats.html for alternate loading methods.

Returns:
y : np.ndarray [shape=(n,) or (2, n)]

audio time series

sr : number > 0 [scalar]

sampling rate of y

Examples

>>> # Load an ogg vorbis file
>>> filename = librosa.util.example_audio_file()
>>> y, sr = librosa.load(filename)
>>> y
array([ -4.756e-06,  -6.020e-06, ...,  -1.040e-06,   0.000e+00], dtype=float32)
>>> sr
22050
>>> # Load a file and resample to 11 KHz
>>> filename = librosa.util.example_audio_file()
>>> y, sr = librosa.load(filename, sr=11025)
>>> y
array([ -2.077e-06,  -2.928e-06, ...,  -4.395e-06,   0.000e+00], dtype=float32)
>>> sr
11025
>>> # Load 5 seconds of a file, starting 15 seconds in
>>> filename = librosa.util.example_audio_file()
>>> y, sr = librosa.load(filename, offset=15.0, duration=5.0)
>>> y
array([ 0.069,  0.1  , ..., -0.101,  0.   ], dtype=float32)
>>> sr
22050