Voice file usually longer as 1 to 2 hours.

Mostly quality of audio is poor except for podcasts. 

Can have multiple speakers.