Hong Kong Capture Station
Task
Task
Can you help establish a Red Hen capture station in Hong Kong?
If so, write to
and we will try to connect you with a mentor.
Related links
Related links
- How to set up a Red Hen capture station
- How to install CCExtractor
- Portugal capture station
- Brazil capture station
More information
More information
Ka Nin (Kenny) Chou at the Hong Kong Polytechnic University reports that, "In Hong Kong, almost all TV shows, including news, have subtitles in Chinese or English." See examples below.
Next Steps
Next Steps
- Get a list of the major nightly news broadcasts in Hong Kong.
- Learn whether the closed captions are digital (can be turned on and off with a remote) or burned in. Wikipedia reports,
- "In some East Asian countries, such as China, Hong Kong and Taiwan, subtitling is common in all taped television programs. In these countries, written text remains mostly uniform while regional dialects in the spoken form can be mutually unintelligible. Therefore, subtitling offers a distinct advantage to aid comprehension. With subtitles, programs in Putonghua, the standard Mandarin, or any dialect can be understood by viewers unfamiliar with it."
- The standard in Hong Kong is DTMB. See
- https://en.wikipedia.org/wiki/Digital_Terrestrial_Multimedia_Broadcast
- Get some recordings of broadcast streams—not examples from the web, which do not include digital closed captions.
- CCextractor does not handle DTMB. We would need to attract funding to add that capacity to DTMB.
- Also run the recordings through ocr to test ocr extraction.
- Learn what TV tuners will work in Hong Kong
Equipment
Equipment
A promising candidate for a Linux capture station the Raspberry Pi 3. For details, see How to set up a Red Hen capture station.
For CCExtractor, we need version 0.78 or later; see How to install CCExtractor.
If we want to test IPTV, see SunTVBox.
Configuration
Configuration
Kernel module
Kernel module
Configure the channels
Configure the channels
Make a recording
Make a recording
Extracting the text
Extracting the text
We use ccextractor-0.79 to extract the text, first in debug mode: