Having trouble?
Try the newer version at the University of Alberta:
http://taporware.ualberta.ca

Find Co-occurring Words
Summary

This tool finds places where two words occur within a specified distance of each other. Enter a primary pattern and a co-pattern, and TAPoR searches the document for every place where the two patterns fall within the user-specified limit, measured in words, sentences, or lines, or within a surrounding element.


Walkthrough

Example: fetch XML from http://www.xml.com/1999/03/ie5/first-x.xml; extract text from <para> tags; search for passages containing `Microsoft' and `CSS', where `CSS' appears within ten words before or after `Microsoft'.
  1. Source text
    1. Enter `http://www.xml.com/1999/03/ie5/first-x.xml' in the URL field.
  2. Subtext limited to
    1. Enter `para' in the Elements field.
  3. What to find
    1. Enter `Microsoft' in the Primary pattern field;
    2. enter `CSS' in the Co-pattern field.
  4. Context for concordance
    1. Select the Ignore elements option;
    2. select words from the Context drop-down menu;
    3. enter `10' in the Context length text field.
  5. Results
    1. The available display options vary by tool. Select the options that best suit your needs.
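The search in the walkthrough can be pictured with a minimal Python sketch. It is not TAPoRware's implementation: the function name `find_cooccurrences`, the whitespace tokenization, and the punctuation stripping are all assumptions made for illustration. It reports every place where the co-pattern falls within a given number of words before or after the primary pattern.

```python
import re

def find_cooccurrences(text, primary, co_pattern, max_distance=10):
    """Return (primary_index, co_index, context) for each pair of hits
    that lie at most max_distance words apart (hypothetical helper)."""
    words = re.findall(r"\S+", text)
    # Assumption: compare case-insensitively, ignoring edge punctuation.
    normalized = [w.strip(".,;:!?\"'()").lower() for w in words]
    primary_hits = [i for i, w in enumerate(normalized) if w == primary.lower()]
    co_hits = [i for i, w in enumerate(normalized) if w == co_pattern.lower()]

    results = []
    for p in primary_hits:
        for c in co_hits:
            if c != p and abs(c - p) <= max_distance:
                start, end = min(p, c), max(p, c)
                # The context is the run of words spanning both hits.
                results.append((p, c, " ".join(words[start:end + 1])))
    return results

sample = ("Microsoft added partial CSS support to the browser. "
          "Much later, nothing else was said about CSS.")
for hit in find_cooccurrences(sample, "Microsoft", "CSS", max_distance=10):
    print(hit)
```

With a limit of ten words, only the first `CSS' in the sample is reported, because the second one is fifteen words away from `Microsoft'.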
*
» Source text
  Example: http://www.globalautonomy.ca/global1/articles/RA_Balibar_Strangers.xml

Summary

Determines the XML source. Text can be obtained from a URL or by uploading a file.

Fields

Source URL
Text from the entered URL will be used as the XML source for the analysis.

Local file
Use this field to upload a local XML file for analysis.
» Subtext limited to
Summary

Limits text for analysis to text that appears within elements listed in the Elements field. Attribute names and values can also be used to further limit which elements are included in the results.

Fields

Element
The name of the element whose text should be searched. This field is required.

Attribute name
The name of an attribute that the element must carry in order to be included.

Attribute value
The value that the named attribute must have. The attribute name and attribute value are optional, but they must be supplied as a pair.
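As a rough illustration of this filtering, the sketch below uses Python's standard `xml.etree.ElementTree` module. The function name `select_text` and the sample document are hypothetical, and the tool's actual matching rules may differ; the sketch only shows the idea of a required element name with an optional attribute name/value pair.

```python
import xml.etree.ElementTree as ET

def select_text(xml_string, element, attr_name=None, attr_value=None):
    """Collect the text of every matching element (hypothetical helper)."""
    # The attribute name and value are optional but must come as a pair.
    if (attr_name is None) != (attr_value is None):
        raise ValueError("attribute name and value must be given together")

    root = ET.fromstring(xml_string)
    selected = []
    for node in root.iter(element):
        if attr_name is not None and node.get(attr_name) != attr_value:
            continue  # element lacks the requested attribute value
        selected.append("".join(node.itertext()).strip())
    return selected

doc = ('<article><para type="intro">Microsoft shipped IE5.</para>'
       '<para>CSS support improved.</para><note>Ignored.</note></article>')
print(select_text(doc, "para"))
print(select_text(doc, "para", "type", "intro"))
```

The first call keeps both <para> elements; the second keeps only the one whose type attribute is `intro'.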
*
» What to find
(use `,' as delimiter)
Summary

Searches for the co-occurrence of two text patterns in the text.

Fields

Primary pattern
Enter a word to search for in co-occurrence with the co-pattern.

Co-pattern
Enter a word to search for in co-occurrence with the primary pattern.
» Context for concordance



Summary

Determines whether XML elements should be used when searching for a word's context.

Fields

Ignore elements
Leave tags out of the results and just display found text.

Context
The unit used to measure the distance between the primary pattern and the co-pattern: words, lines, or sentences.

Context length
The number of words, lines, or sentences within which the two patterns must co-occur to be returned as a result.

Use elements
Include elements in results.

Closest Element
Displays the element in which the text is found.

Surrounding Element
Specifies the context in which you would like the results to display for concordance.
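To make the interplay of the Context and Context length fields concrete, here is a small Python sketch. The helper names `split_units` and `within_context`, the naive sentence splitter, and the substring matching are assumptions for illustration only, not the tool's documented behaviour.

```python
import re

def split_units(text, unit):
    """Split text into the chosen context unit (hypothetical helper)."""
    if unit == "words":
        return text.split()
    if unit == "lines":
        return [line for line in text.splitlines() if line.strip()]
    if unit == "sentences":
        # Naive assumption: a sentence ends at ., ! or ? followed by whitespace.
        return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    raise ValueError("unit must be 'words', 'lines' or 'sentences'")

def within_context(text, primary, co_pattern, unit, length):
    """True if the two patterns occur at most `length` units apart."""
    units = split_units(text, unit)
    # Assumption: a unit matches if it contains the pattern as a substring.
    p_hits = [i for i, u in enumerate(units) if primary in u]
    c_hits = [i for i, u in enumerate(units) if co_pattern in u]
    return any(abs(i - j) <= length for i in p_hits for j in c_hits)

text = ("Microsoft released a browser. It was fast. It supported CSS.\n"
        "A second line.")
print(within_context(text, "Microsoft", "CSS", "words", 10))
print(within_context(text, "Microsoft", "CSS", "sentences", 1))
```

The same pair of patterns can match under one unit and fail under another, which is why the unit and the length are chosen together.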
» Results
Summary
Allows one to choose how the results will be formatted and whether they should be displayed in a new browser window.

Fields
Display as
Different tools allow you to choose from a variety of output formats. Options include HTML, XML and plain text.

Open results in new window
Checking this box displays the results in a new window. This option is selected by default. If a pop-up blocker prevents the new window from opening, deselect this option.
`*' indicates a required field

 

 

TAPoRware Project, McMaster University,