Instructions to Copy Text from
Non-Editable Files or Figures

Your job will be to transcribe text from PDF files and from figures in Office files so our translators can translate and the DTP artist insert into appropriate figures or create PDFs in the target languages.

Therefore, you will receive PDF files and/or Office files to extract text from.

·         Before you begin your work: In Word go to TOOLS | OPTIONS VIEW and activate (check) ALL. This will allow you to see formatting marks. Do NOT skip this step.

If you will transcribe text from a few files

·         Download the file in which you will enter the text right-clicking here and choosing SAVE AS.

·         Save the file using the same name of the file from which you will extract the text, but adding "_TFDTP". For example, if the original file is named "abc123.pdf" then you must save this Word file as "abc123_TFDTP.doc".

1      Select the file in Windows Explorer

2      Press F2

3      Press CTRL + C

4      Go to the Word document you are creating or in which you will paste the file name

5      Press CTRL + V to paste the exact name of the file above each table inside the document for many files, OR,

6      Press CTRL + V to paste the exact name of the file in the document you are saving for each file, replacing the extension with "_TFDTP.doc"

·         Repeat this for as many files you must extract text from. There must be one copy of the Word file you just downloaded for each file you extract text from.

If you will transcribe text from many files (more than 10)

·         If your project has many files you need to download a different file. Right-click here and choose save as. In this file you will proceed as instructed below, but all the text will be entered into this single file. Make sure you replace "Name of File" with each original file name from which you extract text from. Copy and paste "Name of File" and the blue tables as needed in order to add text for new files.

·         Replace "Name of File" with the actual name of the file you are extracting text from.

1      Select the file in Windows Explorer

2      Press F2

3      Press CTRL + C

4      Go to the Word document you are creating or in which you will paste the file name

5      Press CTRL + V to paste the exact name of the file above each table

·         Copy one table below the previous for each file from which you extract text.

GENERAL INSTRUCTIONS

·         For each file you create select TOOLS | OPTIONS | SAVE and uncheck EMBED SMART TAGS

·         The source text must be copied in the SOURCE column.

·         There must not be paragraph marks (¶) inside the cells. Do not enter soft returns (manual line breaks).

·         You should enter only ONE SENTENCE in each cell. This is very important.

·         DO NOT split one sentence in two or more cells. This means you must not enter part of a sentence in one cell and the rest in another.

·         The text to be copied is the text that is not directly editable clicking ONCE on it. For example, there might be some inserted figures or tables that you need to double-click to edit. The text in these figures or tables needs to be transcribed.

·         If possible copy and paste the text from the source file to the Word file.

·         Do not assume that if you save a PDF file as text all the text will be saved in the text file. Part of it may not be saved. Therefore, you must compare the original PDF against the text version you saved. And of course, if there are images or figures in these PDFs those will not be saved as text.

·         If it is not possible to copy and paste (scanned pages, images, etc.) follow a certain pattern to make sure you don't leave out any text. For example, you can transcribe text from figures going in a clockwise fashion.

·         After you transcribe each figure or page go over the figure or page again to make sure you didn't leave any text out.

·         If you notice any errors in the original files (spelling, missing text, etc.) please let us know.

·         Text from screen captures will not be edited by default, and consequently there is no need to transcribe it.

Feel free to contact us if you have any doubts.