Having trouble?
Try the newer version at the University of Alberta:
http://taporware.ualberta.ca
Tools Home : XML Tools : Find Text -- Collocation

Click here to show HTML tools HTML Tools

Click here to expand XML tools XML tools

Click here to expand plain text tools Plain Text Tools

Click here to expand other tools Other tools

 Beta tools
 Add Tools Demo
 Manual
 About

Find Collocates
?
Summary

Tool takes a word from the user and returns all of the words directly before and directly after it based on the given context. The results are listed alphabetically, by frequency, or by Z-score (an indication of how far and in what direction that item deviates from its distribution's mean, expressed in units of its distribution's standard deviation).

For more details, see here.

Walkthrough

Example: fetch XML from http://www.xml.com/1999/03/ie5/first-x.xml; extract text from <para> tags; search for instances of `Microsoft'; display list of unique words that appear up to five words before and after each instance of `Microsoft'.
  1. Source text
    1. Enter `http://www.xml.com/1999/03/ie5/first-x.xml' in the URL field.
  2. Subtext limited to
    1. Enter `para' in the Elements field.
  3. What to find
    1. Enter `Microsoft' in the Word/Pattern to find field.
  4. Context for concordance
    1. Select the Ignore elements option;
    2. select words from the Context drop-down menu;
    3. enter `10' in the Context length text field.
  5. Results
    1. Different tools have a variety of options for displaying results. Select options which best apply to your needs.
*
» Source text
  Example: http://www.globalautonomy.ca/global1/articles/RA_Balibar_Strangers.xml

?
Summary

Determines the XML source. Text can be obtained from a URL or by uploading a file.

Fields

Source URL
Text from the entered URL will be used as the XML source for the analysis.

Local file
Use this field to upload a local XML file for analysis.
» Subtext limited to
?
Summary

Limits text for analysis to text that appears within elements listed in the Elements field. Attribute names and values can also be used to further limit which elements are included in the results.

Fields

Element
The element you want to add to the word/pattern (this one is mandatory)

Attribute name
The attribute name you want to added along with the element.

Attribute value
The attribute value you want to assigned to the attribute name. The attribute name and attribute value are optional but they must come in a pair.
*
» What to find
?
Summary

Words (string) or pattern used by the tool as key

Fields

Words/Pattern (or Pattern)
Enter a word or a string or phrase or a pattern to be searched for within the text. Separate multiple words with commas. Quotations are not needed in searching for phrases.
» Context for concordance



?
Summary

Determines whether XML elements should be used when searching for a word's context.

Fields

Ignore elements
Leave tags out of the results and just display found text.

Context
Method of searching for the collocation of the search term.

Context length
The number of words/lines/sentences that are returned as results of the collocation.

Use elements
Include elements in results.

Closest Element
Displays the element in which the text is found.

Surrounding Element
Specifies the context in which you would like the results to display for concordance.
» Results
?
Summary

Allows the user to choose how the results will be formatted and whether they should be displayed in a new browser window.

Fields

Sort
Allows you to sort the results in one of several ways.

Display as
Determines the format in which results will be delivered

Open results in new window
Checking this box will display the results in a new window. This option is selected by default. In some cases pop-up blockers may disallow windows from being created, in which case this option may be de-selected.
`*' indicates a required field

 

 

TAPoRware Project, McMaster University,