15 maggio 2011

Sneakernet

Sneakernet is a term used to describe the physical transfer of electronic information, especially computer files.

The name is a tongue-in-cheek reference to sneakers, as this way of moving information relies on a courier and removable media such as USB drive and compact discs.




8 maggio 2011

Online term extractors: Terminology Extraction by Translated

Terminology Extraction by Translated uses Poisson statistics, the Maximum Likelihood Estimation and Inverse Document Frequency between the frequency of words in a given document and a generic corpus of 100 million words per language. It uses a probabilistic part of speech tagger to take into account the probability that a particular sequence could be a term. It creates n-grams of words by minimizing the relative entropy.

Terminology Extraction by Translated can be also used to improve search results in traditional search engines (es. Google) by giving a better estimation of how much a keyword is relevant to a document.

Texts may only be submitted for analysis through entering the text to analyze into the text window.

Languages supported: English, Italian, French.

Website: Translated

Online term extractors: AlchemyAPI

AlchemyAPI extracts topic keywords from HTML, text, or web-based content.

AlchemyAPI employes sophisticated statistical algorithms and natural language processing technology to analyze data, extracting keywords that can be utilized to index content, generate tag clouds, and more.

API endpoints are provided for performing keyword extraction on Internet-accessible URLs and posted HTML files or text content.

Extracted meta-data may be returned in XML, JSON, RDF, and Microformats rel-tag formats.

Keyword extraction is supported in over a half-dozen different languages, enabling even foreign-language content to be categorized and tagged:
English, French, German, Italian, Portuguese, Russian, Spanish, Swedish.

Website: AlchemyAPI

Online term extractors: TerMine

Particularly suitable for bio medical-area.

Technical terms are important for knowledge mining, especially in the bio-medical area where vast amount of documents are available. The amount of terms (e.g., names of genes, proteins, chemical compounds, drugs, organisms, etc) is increasing at an astounding rate in the bio-medical literature. Existing terminological resources and scientific databases cannot keep up-to-date with the growth of neologisms. A domain independent method for term recognition is very useful to automatically

Texts may be submitted for analysis through any of the following ways:
  • entering the text you would like to analyze in to the topmost text window;

  • specifying a text file (*.txt or *.pdf) from your computer's hard drive;

  • entering a URL of the Web resource (*.html or *.pdf.
Languages supported: all Unicode-compliant languages.


Website: TerMine

Online term extractors: Maui - indexer

Maui automatically identifies main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles. This demo shows how a vocabulary can be used to derive the topics, e.g. High Energy Physics thesaurus or the agricultural thesaurus Agrovoc. It also shows how keyphrases can be extracted from document text.

File formats supported: text, PDF, Microsoft Word file

Website: Maui - indexer

Online term extractors: fivefilters

This is a free software project to enable easy term extraction through a web service.
Given some text it will return a list of terms with (hopefully) the most relevant first.
The list is returned in JSON format. It is a free alternative to Yahoo's Term Extraction service. It is being developed as part of the Five Filters project to promote alternative, non-corporate media.

Languages supported: English

Website: fivefilters

4 maggio 2011

Online term extractors: Vocab Grabber

VocabGrabber analyzes any text you're interested in, generating lists of the most useful vocabulary words and showing you how those words are used in context. Just copy text from a document and paste it into the box, and then click on the "Grab Vocabulary!" button.

VocabGrabber will automatically create a list of vocabulary from your text, which you can then sort, filter, and save. Select any word on the list and you'll see a snapshot of the Visual Thesaurus map and definitions for that word, along with examples of the word in your text.

Languages supported: English.


Website: Vocab Grabber

3 maggio 2011

How to be a certified Terminology Manager

Striving for improving is for me a healthy way to add value and enjoy what I am doing. That's why I decided to get the ECQA certification in Terminology Management. This certification is a a good investment because it can be regarded as specialised professional qualification.

The course comprises 6 units (5 theoretical units and 1 practical unit based on an exercise).
For every unit participants will receive:
 - A set of materials (to be found in a Moodle-based e-learning platform)
 - An online tutorial (in form of a live webinar)
 - And a discussion forum for questions and answers.

UNIT 1: UNDERSTANDING TERMINOLOGY MANAGEMENT

What is terminology?
Why terminology management?
How terminology work is embedded in my organisation and work environment?


UNIT 2: TERMINOLOGY MANAGEMENT SKILLS

How to search and collect terminology?
How to store and retrieve?
How to coin terms?
How to manage monolingual and multilingual terminology?
How to manage terminology projects?


UNIT 3: TERMINOLOGY STRATEGIES FOR BUSINESS PROCESSES

How to present the business case for terminology?
How to calculate and argue costs & return on investments?
How to involve relevant stakeholders?
How to collaborate with relevant organisational units?


UNIT 4: TEAM WORKING & COMMUNICATION SKILLS

How to organise team communication?
How to manage distributed and diverse teams?
Why conflict management?
How to train and motivate your team?


UNIT 5: APPLICATION SCENARIOS

Preparation for Unit 5 and topics for the participants´ projects
Presentation of the participants projects.


UNIT 6: STANDARDS AND LEGAL ISSUES

Which standards are relevant
How to deal with copyright issues in terminology management
What about product liability

The ECQA Certification in Terminology Management is especially suited for professionals who work as Terminologists, Translators, Localisers and in Technical documentation. Also for freelances, in translation companies, in other commercial enterprises (engineers, ICT specialists, etc.), in universities, NGOs or the standardisation framework.


You can find all information about this at: TermNet.

And this is my certificate! :)


Inclusive GIT branch naming

“main” branch is used to avoid naming like “master” and  “slaves” branches “feature branch” for new feature or bug fix   The shift fr...