[Athen] SRT file formatting question

Joseph Polizzotto jpolizzotto at htctu.net
Fri Oct 13 15:13:30 PDT 2017


Glad to hear that you found an easy solution.

I also created a script you can run on multiple files for this same purpose. Perhaps this will come in handy for offline use. :)

Best,

Joseph




> On Oct 13, 2017, at 1:05 PM, Bourbeau, Maureen <Maureen.Bourbeau at unh.edu> wrote:

>

> Paul,

> I had tried find/replace but did not know how to do the wildcard. I will keep this tucked away for future knowledge!

> The 3Play conversion tool from Doug worked.

> --Maureen

>

> From: athen-list [mailto:athen-list-bounces at mailman13.u.washington.edu] On Behalf Of Paul Linh Nguyen

> Sent: Friday, October 13, 2017 3:56 PM

> To: Access Technology Higher Education Network <athen-list at u.washington.edu>

> Subject: Re: [Athen] SRT file formatting question

>

> Hi Maureen,

>

> If the other suggestions by people don't work, then a potentially long shot, especially since I haven't done anything like this before much less work with .SRT files is as follows:

>

> But using your small sample example of the timestamp being:

> 00:02:22.2500000 --> 00:02:24.2550000

>

> I think you could leverage Microsoft Word's Find & Replace feature. It is a little on the advanced side though using wildcards

>

> I was able to automate, again with just that one sample, to convert to:

> 00:02:22,250 --> 00:02:24,255

>

> Here is what I did.

>

> (Remember to have a copy of the file so you don't potentially destroy your original!)

>

> Again opening the file as a plain text in Microsoft Word, I started the Find & Replace. I enabled "Use Wildcards" (using this page as my reference in the past <https://urldefense.proofpoint.com/v2/url?u=http-3A__www.gmayor.com_replace-5Fusing-5Fwildcards.htm&d=DwMFaQ&c=c6MrceVCY5m5A_KAUkrdoA&r=O-_o_kUhT6ycJMVO6AvHe0jACC01EzuMNkO6emXHl4I&m=sv6F-g16pBMdOvZ_vI0BW6xaxHA10G2G0yyFcj1LzG4&s=uipg00igDC10_55t6x61MNZ_mCoOmPhAH2wYQRu9biM&e=>) the following inputs:

> Find what: ([0-9]{2}:[0-9]{2}:[0-9]{2}).([0-9]{3})[0-9]{1,}

> Replace with: \1,\2

>

> The above explained:

> Parenthesis tells Microsoft Word to remember what is found in the enclosed statements. Each segment remembered will be in the order of \1, \2, and so on.

>

> First saved set looks for a set of

> ##:##:##

>

> Note the period is outside the parenthesis as it will discard that

>

> Second saved set looks for 3 numbers directly after the period

>

> Capture (but subsequently discard) anything that follows after the three digits after the period.

>

> Replace with the following: saved set 1. Insert comma. Insert saved set 2

>

> While highly unlikely...but it checks the whole document indiscriminately. So if somehow that exact format shows up in the captions, it will get changed if you were to commit to a "Replace All". The chances of it showing up in regular text is generally low I'd imagine.

>

> Anyways hope you don't have to resort to such but it is a potential option and people are able to suggest something easier.

>

> -Paul Nguyen

>

> On Fri, Oct 13, 2017 at 12:19 PM, Bourbeau, Maureen <Maureen.Bourbeau at unh.edu <mailto:Maureen.Bourbeau at unh.edu>> wrote:

> Happy Friday all,

> I have a question that is a bit in the weeds, but I’m sure at least a couple of you might have a good solution or recommendation. We are transitioning from Tegrity to Kaltura for lecture capture. Some of the Tegrity videos had been captioned via 3Play. I would now like to move some of these videos into Kaltura just so we don’t have too many variations for students within Canvas. (and because the Kaltura videos are searchable)

> Kaltura is using the Cielo24 captioning on the backend and we are either editing the captions or outsourcing.

>

> I’d like to be able to download the SRT file from Tegrity and upload it into Kaltura. Sounds easy enough, but upon closer inspection, the SRT files do not have the same formatting and therefore does not render the captions in the Kaltura video player. We’d like to keep everything within Kaltura such that we can post videos in the Media Gallery which makes them easily searchable. See SRT example below:

> Tegrity

> 2

> 00:02:22.2500000 --> 00:02:24.2550000

> Anybody have any questions before we start?

>

> Other

> 2

> 00:02:22,250 --> 00:02:24,255

> Anybody have any questions before we start?

>

> Note that the milliseconds are kept to only 3 digits, whereas Tegrity uses 7 digits. The milliseconds should also be preceded by a comma, not a period.

>

> Can anyone suggest a conversion tool? The Tegrity file also contains a cc.xml file but not sure how to convert that to SRT. I tried using Subtitle Edit and it failed.

>

> I appreciate any and all suggestions!

> Have a great weekendJ

>

> Maureen Bourbeau

> Assistive Technology Specialist

> Student Accessibility Services (SAS)

> 201 Smith Hall, 3 Garrison Avenue

> Durham, NH 03824

> P: 603.862.2400 <tel:%28603%29%20862-2400> | F: 603.862.4043 <tel:%28603%29%20862-4043>

> maureen.bourbeau at unh.edu <mailto:maureen.bourbeau at unh.edu>

> www.unh.edu/studentaccessibility <http://www.unh.edu/studentaccessibility>

>

>

>

>

> _______________________________________________

> athen-list mailing list

> athen-list at mailman13.u.washington.edu <mailto:athen-list at mailman13.u.washington.edu>

> http://mailman13.u.washington.edu/mailman/listinfo/athen-list <https://urldefense.proofpoint.com/v2/url?u=http-3A__mailman13.u.washington.edu_mailman_listinfo_athen-2Dlist&d=DwMFaQ&c=c6MrceVCY5m5A_KAUkrdoA&r=O-_o_kUhT6ycJMVO6AvHe0jACC01EzuMNkO6emXHl4I&m=sv6F-g16pBMdOvZ_vI0BW6xaxHA10G2G0yyFcj1LzG4&s=9AIfCBxbbnMt1tW6Rl5cHxb64oJ6d7w_6Vw8xpskqww&e=>

>

> _______________________________________________

> athen-list mailing list

> athen-list at mailman13.u.washington.edu

> http://mailman13.u.washington.edu/mailman/listinfo/athen-list


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman12.u.washington.edu/pipermail/athen-list/attachments/20171013/dd485575/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Correct_SRT_Script.sh
Type: application/octet-stream
Size: 776 bytes
Desc: not available
URL: <http://mailman12.u.washington.edu/pipermail/athen-list/attachments/20171013/dd485575/attachment.obj>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman12.u.washington.edu/pipermail/athen-list/attachments/20171013/dd485575/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 15467 bytes
Desc: not available
URL: <http://mailman12.u.washington.edu/pipermail/athen-list/attachments/20171013/dd485575/attachment.png>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman12.u.washington.edu/pipermail/athen-list/attachments/20171013/dd485575/attachment-0002.html>


More information about the athen-list mailing list