- Get a list of the major nightly news broadcasts in Hong Kong.
- Learn whether the closed captions are digital (can be turned on and off with a remote) or burned in. Wikipedia reports,
- "In some East Asian countries, such as China, Hong Kong and Taiwan, subtitling is common in all taped television programs. In these countries, written text remains mostly uniform while regional dialects in the spoken form can be mutually unintelligible. Therefore, subtitling offers a distinct advantage to aid comprehension. With subtitles, programs in Putonghua, the standard Mandarin, or any dialect can be understood by viewers unfamiliar with it."
- The standard in Hong Kong is DTMB. See
- https://en.wikipedia.org/wiki/Digital_Terrestrial_Multimedia_Broadcast
- http://spectrum.ieee.org/consumer-electronics/standards/does-china-have-the-best-digital-television-standard-on-the-planet
- Get some recordings of broadcast streams—not examples from the web, which do not include digital closed captions.
- CCExtractor does not handle DTMB. We would need to attract funding to add that capability to CCExtractor.
- Also run the recordings through OCR to test OCR extraction.
- Learn which TV tuners will work in Hong Kong.
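The OCR test in the list above could be set up roughly as follows. This is only a sketch: it assumes `ffmpeg` and `tesseract` are installed, that Tesseract's Traditional Chinese language data (`chi_tra`) suits Hong Kong subtitles, and that sampling one frame per second is enough. The function names and file names are hypothetical.

```python
from pathlib import Path


def frame_command(recording, frame_dir, fps=1):
    """Build an ffmpeg command that saves `fps` frames per second as PNG files."""
    # frame_%05d.png is expanded by ffmpeg to frame_00001.png, frame_00002.png, ...
    pattern = str(Path(frame_dir) / "frame_%05d.png")
    return ["ffmpeg", "-i", str(recording), "-vf", f"fps={fps}", pattern]


def ocr_command(frame, lang="chi_tra"):
    """Build a tesseract command for one frame.

    Tesseract appends ".txt" to the output base, so the text lands
    next to the frame under the same file stem.
    """
    frame = Path(frame)
    return ["tesseract", str(frame), str(frame.with_suffix("")), "-l", lang]
```

Each returned list can be passed to `subprocess.run(cmd, check=True)`. Building the commands as lists keeps file names with spaces intact and makes the commands easy to inspect before running them.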
A promising candidate for a Linux capture station is the Raspberry Pi 3. For details, see How to set up a Red Hen capture station.
For CCExtractor, we need version 0.78 or later; see How to install CCExtractor.
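A capture script could check the version requirement before it runs. The sketch below assumes the version string has already been obtained by some other means (for example, read by hand from the program's banner) and that it has the two-part form used by releases such as 0.78 and 0.79. The function names are hypothetical.

```python
MINIMUM_VERSION = (0, 78)  # the minimum stated above


def parse_version(text):
    """Turn a string such as "0.79" into a tuple of integers, e.g. (0, 79)."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(version_text, minimum=MINIMUM_VERSION):
    """Return True when the given version is at least the required minimum."""
    return parse_version(version_text) >= minimum
```

Comparing tuples of integers avoids the pitfalls of comparing version strings or floats directly. It does assume two-digit minor numbers, so a string like "0.8" would be read as minor version 8 rather than 80.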
If we want to test IPTV, see SunTVBox.
We use ccextractor-0.79 to extract the text, first in debug mode: