觉醒年代 Age of Awakening is rated 9.3 on Douban, on par with Minning Town, and is almost unheard of in the West. At the same time, the production quality (at least from what I see) is the best the country can offer. But I think the reason they won't bother with English subtitles, is (1) that it's produced by CCTV therefore non-commercial and (2) the nature of the series itself.

That being said, I don't see why not I try and give translating this show a shot, so I translated the trailer and here it is.

I've uploaded the subtitles to a remote Git repository so feel free to contribute whenever you can:

https://forge.chapril.org/agentofchange/age-of-awakening.git

  • dengdidnothingwrong [none/use name]
    hexagon
    ·
    edit-2
    3 years ago

    There's a reason why Chinese GenZ is talking about this drama all the time

    CCTV actually has the entire series on YouTube but with hard-coded Chinese subtitles only, I'm using that as a reference:

    https://youtu.be/playlist?list=PLa213RC5YYIwYGZg9_ffwmsQO_UiEhjxL

    The thing is, I really need some helping hands since it's quite time-intensive to translate so I'm making these clips to grab people's attention and get volunteers

    EDIT: Also, this is the ten minutes clip ComradeSage is referencing: https://ptube.xmanifesto.club/videos/watch/dc4234cd-0846-4975-a0bb-152ad56ca711

      • dengdidnothingwrong [none/use name]
        hexagon
        ·
        3 years ago

        This is actually exactly what I needed, thank you. The issue now is that I'm having troubling installing it, since its opencv dependency requires libstdc++ shared object file but cannot detect it, I'll try to work this out.

          • dengdidnothingwrong [none/use name]
            hexagon
            ·
            edit-2
            3 years ago

            tesseract is already in PATH, I'm referring to OpenCV (or opencv-python). The full traceback is:

            $ python3 ocr.py 
            Traceback (most recent call last):
             File "ocr.py", line 1, in <module>
               from videocr import save_subtitles_to_file
             File "/home/user/.local/lib/python3.8/site-packages/videocr/__init__.py", line 2, in <module>
               from .api import get_subtitles, save_subtitles_to_file
             File "/home/user/.local/lib/python3.8/site-packages/videocr/api.py", line 2, in <module>
               from .video import Video
             File "/home/user/.local/lib/python3.8/site-packages/videocr/video.py", line 6, in <module>
               import cv2
             File "/home/user/.local/lib/python3.8/site-packages/cv2/__init__.py", line 5, in <module>
               from .cv2 import *
            ImportError: libstdc++.so.6: cannot open shared object file: No such file or directory
            

            I'm running Guix System so the problem is probably related to the distribution I use and not the module.

              • dengdidnothingwrong [none/use name]
                hexagon
                ·
                3 years ago

                Unfortunately no, I think the problem is how my distribution packaged opencv, there's something wrong with the compilation in CI

                  • dengdidnothingwrong [none/use name]
                    hexagon
                    ·
                    3 years ago

                    Tesseract is not the problem, openCV is the issue. Anyways, I've managed to run videocr on Ubuntu on a 4-core NUC (not my 12-core machine), it's been 2 hours and it's not finished processing the 5 minute trailer.

                      • dengdidnothingwrong [none/use name]
                        hexagon
                        ·
                        3 years ago

                        I have it working too. The 5-minute video takes at most 20 minutes on a 12-core machine. The problem now is finding the right parameters to get the Chinese subtitles in place. Two subtitles lines are sometimes merged together, not properly timed, and some are ignored completely. Fortunately, when it does correctly extract the subtitle, the text is fine albeit with spaces between them.