[Athen] SRT file formatting question

Ana Thompson thomana at uw.edu
Fri Oct 13 17:52:46 PDT 2017


Cool script, Joseph. I had come up with the reg expressions to run one file at a time, but you did it as a batch. Very cool! Thanks for sharing.

Ana :-]


Ana Thompson, MIS, CSM
Learning Technologist & IT Accessibility Liaison

INFORMATION TECHNOLOGIES
University of Washington Bothell
Office: LBA 204E - 425-352-3794
Mobile: 208-991-3095
Twitter - @EdTechAna
NWeLearn Chair-Elect
NW/MET Director
Ask me about the UW IT Accessibility Liaison Program!
________________________________
From: athen-list <athen-list-bounces at mailman13.u.washington.edu> on behalf of Joseph Polizzotto <jpolizzotto at htctu.net>
Sent: Friday, October 13, 2017 3:13:30 PM
To: Access Technology Higher Education Network
Subject: Re: [Athen] SRT file formatting question

Glad to hear that you found an easy solution.

I also created a script you can run on multiple files for this same purpose. Perhaps this will come in handy for offline use. :)

Best,

Joseph



On Oct 13, 2017, at 1:05 PM, Bourbeau, Maureen <Maureen.Bourbeau at unh.edu> wrote:

Paul,
I had tried find/replace but did not know how to do the wildcard. I will keep this tucked away for future knowledge!
The 3Play conversion tool from Doug worked.
--Maureen

From: athen-list [mailto:athen-list-bounces at mailman13.u.washington.edu] On Behalf Of Paul Linh Nguyen
Sent: Friday, October 13, 2017 3:56 PM
To: Access Technology Higher Education Network <athen-list at u.washington.edu>
Subject: Re: [Athen] SRT file formatting question


Hi Maureen,

If the other suggestions don't work, here is a potential long shot, especially since I haven't done anything like this before, much less worked with .SRT files.

Using your small sample timestamp:
00:02:22.2500000 --> 00:02:24.2550000


I think you could leverage Microsoft Word's Find & Replace feature. It is a little on the advanced side, though, since it uses wildcards.

I was able to automate the conversion, again with just that one sample, to:
00:02:22,250 --> 00:02:24,255

Here is what I did.

(Remember to have a copy of the file so you don't potentially destroy your original!)

Again, with the file opened as plain text in Microsoft Word, I started Find & Replace. I enabled "Use Wildcards" (I have used this page as my reference in the past: http://www.gmayor.com/replace_using_wildcards.htm) and entered the following inputs:
Find what: ([0-9]{2}:[0-9]{2}:[0-9]{2}).([0-9]{3})[0-9]{1,}
Replace with: \1,\2

The above explained:
Parentheses tell Microsoft Word to remember what is matched inside them. Each remembered segment is numbered in order: \1, \2, and so on.

First saved set looks for a set of
##:##:##

Note that the period is outside the parentheses, so it is matched but not saved, which is how it gets discarded.

The second saved set looks for 3 digits directly after the period.

The final [0-9]{1,} matches, but does not save, any digits that follow those three, so they are discarded as well.

Replace with the following: saved set 1, a comma, then saved set 2.

One caveat, though it is highly unlikely to matter: the search checks the whole document indiscriminately. If that exact format somehow shows up in the caption text, it will get changed too if you commit to a "Replace All". I'd imagine the chances of it showing up in regular text are low.
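
For anyone who would rather script this than use Word, the same find-and-replace can be sketched in Python. This is only an illustration of the pattern above, not a tested tool: the function name convert_timestamps is made up for this example. One difference from Word's wildcard mode is that Python's re module treats a bare period as "any character", so the period is escaped here.

```python
import re

# Same idea as the Word wildcard search: save HH:MM:SS, match the period,
# save the first three fractional digits, and match (without saving) the rest.
TIMESTAMP = re.compile(r"(\d{2}:\d{2}:\d{2})\.(\d{3})\d+")


def convert_timestamps(text):
    """Return text with long-fraction timestamps rewritten as HH:MM:SS,mmm."""
    # \1 and \2 refer to the two saved sets, exactly as in Word's "Replace with".
    return TIMESTAMP.sub(r"\1,\2", text)


sample = "00:02:22.2500000 --> 00:02:24.2550000"
print(convert_timestamps(sample))  # prints 00:02:22,250 --> 00:02:24,255
```

Like the Word version, this changes every match in the text it is given, so the same caveat about caption text applies.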

Anyway, I hope you don't have to resort to this, but it is a potential option, and hopefully people are able to suggest something easier.

-Paul Nguyen

On Fri, Oct 13, 2017 at 12:19 PM, Bourbeau, Maureen <Maureen.Bourbeau at unh.edu> wrote:
Happy Friday all,
I have a question that is a bit in the weeds, but I’m sure at least a couple of you might have a good solution or recommendation. We are transitioning from Tegrity to Kaltura for lecture capture. Some of the Tegrity videos had been captioned via 3Play. I would now like to move some of these videos into Kaltura just so we don’t have too many variations for students within Canvas (and because the Kaltura videos are searchable).
Kaltura is using the Cielo24 captioning on the backend and we are either editing the captions or outsourcing.

I’d like to be able to download the SRT file from Tegrity and upload it into Kaltura. Sounds easy enough, but upon closer inspection, the SRT files do not have the same formatting, and therefore the captions do not render in the Kaltura video player. We’d like to keep everything within Kaltura so that we can post videos in the Media Gallery, which makes them easily searchable. See SRT example below:
Tegrity
2
00:02:22.2500000 --> 00:02:24.2550000
Anybody have any questions before we start?

Other
2
00:02:22,250 --> 00:02:24,255
Anybody have any questions before we start?

Note that the milliseconds are kept to only 3 digits, whereas Tegrity uses 7 digits. The milliseconds should also be preceded by a comma, not a period.
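
As a rough sketch of how that difference could be fixed across a whole folder of files, the Python below rewrites only the timing lines (the ones containing "-->"), so caption text is left alone. The function names, the "*.srt" pattern, and the "_fixed" output suffix are all assumptions made up for this illustration. It also assumes the files are UTF-8 and that every Tegrity timestamp has more than three fractional digits.

```python
import re
from pathlib import Path

# HH:MM:SS, a period, three digits to keep, then the extra digits to drop.
LONG_STAMP = re.compile(r"(\d{2}:\d{2}:\d{2})\.(\d{3})\d+")


def fix_srt_text(text):
    """Convert timestamps on timing lines only; leave caption lines alone."""
    fixed = []
    for line in text.splitlines(keepends=True):
        if "-->" in line:  # SRT timing lines look like "start --> end"
            line = LONG_STAMP.sub(r"\1,\2", line)
        fixed.append(line)
    return "".join(fixed)


def fix_srt_folder(folder):
    """Write a converted copy next to every .srt file in folder."""
    written = []
    for path in sorted(Path(folder).glob("*.srt")):
        if path.stem.endswith("_fixed"):
            continue  # skip copies produced by an earlier run
        out = path.with_name(path.stem + "_fixed.srt")  # hypothetical naming
        converted = fix_srt_text(path.read_text(encoding="utf-8"))
        out.write_text(converted, encoding="utf-8")
        written.append(out)
    return written
```

Calling fix_srt_folder on a folder of downloaded files would leave the originals untouched and write the converted copies beside them, which keeps a backup in case something goes wrong.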

Can anyone suggest a conversion tool? The Tegrity file also contains a cc.xml file, but I’m not sure how to convert that to SRT. I tried using Subtitle Edit and it failed.

I appreciate any and all suggestions!
Have a great weekend:)

Maureen Bourbeau
Assistive Technology Specialist
Student Accessibility Services (SAS)
201 Smith Hall, 3 Garrison Avenue
Durham, NH 03824
P: 603.862.2400 | F: 603.862.4043
maureen.bourbeau at unh.edu
www.unh.edu/studentaccessibility



_______________________________________________
athen-list mailing list
athen-list at mailman13.u.washington.edu
http://mailman13.u.washington.edu/mailman/listinfo/athen-list


