From about 12 seconds to about 53 seconds is English.
Here's a 11 minute demo.
It's a mixed bag. Some of it sounds really good, some of it sounds good but then it has like a robotic pause in between, and some of it is easily recognizable as not a person.
I guess those are pretty old, it's possible there might be some improvements here or there...