Having trouble?
Try the newer version at the University of Alberta:
http://taporware.ualberta.ca
Tools Home : Plain Text Tools : Find Text — Co-occurrence

Click here to show HTML tools HTML Tools

Click here to expand XML tools XML tools

Click here to expand plain text tools Plain Text Tools

Click here to expand other tools Other tools

 Beta tools
 Add Tools Demo
 Manual
 About

Find Text — Co-occurrence
?
Summary

Tool looks for two words a certain distance apart from one another. By entering a primary and secondary pattern, TAPoR will search the document for anywhere that the two patterns are within the user-specified limits of words, sentences, or lines.

Note: The input text format should be plain text. If you submit an XML or HTML text, the tool will strip all the tags, and then process it as plain text. For best results with XML or HTML text, it is suggested to use XML-specific or HTML-specific tools.

For more details, see here.

Walkthrough

Example: fetch text from http://www.gutenberg.org/dirs/etext91/peter16.txt; extract from text strings of text that contain the words `Peter' and `Hook', and where these words are no more than ten words apart.
  1. Source text
    1. Enter `http://www.gutenberg.org/dirs/etext91/peter16.txt' in the Text source URL field;
  2. What to find
    1. Enter `Peter' in the Primary pattern text field;
    2. enter `Hook' in the Co-pattern text field.
  3. Context for concordance
    1. Select Words from the Context drop-down menu;
    2. enter `10' in the Context length text field.
  4. Results
    No help written for this yet.
*
» Source text
  Example: http://taporware.mcmaster.ca/sampleDocs/plainText.txt


?
Summary

Determines the text source. Text can be obtained from a URL or by uploading a file.

Fields

Source URL
Text from the entered URL will be used as the data source for the analysis.

Local file
Use this field to upload a local file for analysis.

Treat XML/HTML as plain text
Enabling this option will strip tags from an HTML or XML document. <p> and <br /> in HTML documents and all tags in XML documents are converted to new lines (i.e. \n).
*
» What to find
?
Summary

Searches for the occurence of two text patterns in the text.

Fields

Primary pattern
Enter a word to search for in co-occurrence with the co-pattern.

Co-pattern
Enter a word to search for in co-occurrence with the primary pattern.
*
» Context for concordance
?
Summary

Allows user to define context type (e.g. words or sentences) and length.

Fields

Context
No help written for this yet

Words
places the search term in context by the specified number of words.

Lines
places the search term in context by the specified number of lines.

Sentences
places the search term in context by the specified number of sentences.

Context Length
Indicates the number of words/lines/sentences to be displayed before and after the search term for context purposes.
» Results
?
Summary

Allows the user to choose how the results will be formatted and whether they should be displayed in a new browser window.

Fields

Display as
Determines the format in which results will be delivered

Open results in new window
Checking this box will display the results in a new window. This option is selected by default. In some cases pop-up blockers may disallow windows from being created, in which case this option may be de-selected.
`*' indicates a required field

 

 

TAPoRware Project, McMaster University,