Project author: nlpie

Project description:
A lightweight (small and dependency-free) Java 8 library for identifying and normalizing numbers and measurements in text.
Language: Java
Project URL: git://github.com/nlpie/biomedicus-measures.git
Created: 2017-08-22T15:14:04Z
Project community: https://github.com/nlpie/biomedicus-measures

License: Apache License 2.0


BioMedICUS Measures

A lightweight (small and dependency-free) Java 8 library for identifying and normalizing numbers,
whether they occur as decimal numbers (“5.8”), fractions (“1/4”), or English numerals
(“forty-two”). It also contains functionality for detecting unit-of-measurement words. It was
developed as a stand-alone component of
BioMedICUS, a biomedical and clinical NLP engine developed by
the NLP-IE Group at the University of Minnesota Institute for Health Informatics.
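
To make the three surface forms concrete, here is a minimal, self-contained sketch of normalizing each one to a numeric value. It is not the library's implementation and does not use its API: the class name `NumberFormsSketch` and the method `normalize` are hypothetical, and the English-numeral handling is assumed to cover only zero through ninety-nine.

```java
import java.math.BigDecimal;
import java.math.MathContext;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

/** Illustrative sketch only; NOT the BioMedICUS Measures implementation. */
public class NumberFormsSketch {

  // Hypothetical lookup table for English numerals from zero to ninety.
  private static final Map<String, Integer> WORDS = new HashMap<>();

  static {
    String[] small = {"zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
        "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen", "sixteen",
        "seventeen", "eighteen", "nineteen"};
    for (int i = 0; i < small.length; i++) {
      WORDS.put(small[i], i);
    }
    String[] tens = {"twenty", "thirty", "forty", "fifty", "sixty", "seventy", "eighty", "ninety"};
    for (int i = 0; i < tens.length; i++) {
      WORDS.put(tens[i], (i + 2) * 10);
    }
  }

  /** Returns the numeric value of the text, or null if the form is not recognized. */
  public static BigDecimal normalize(String text) {
    String s = text.trim().toLowerCase(Locale.ROOT);
    // Decimal numbers such as "5.8" or "12".
    if (s.matches("\\d+(\\.\\d+)?")) {
      return new BigDecimal(s);
    }
    // Fractions such as "1/4"; a zero denominator is rejected.
    if (s.matches("\\d+/\\d+")) {
      String[] parts = s.split("/");
      BigDecimal denominator = new BigDecimal(parts[1]);
      if (denominator.signum() == 0) {
        return null;
      }
      return new BigDecimal(parts[0]).divide(denominator, MathContext.DECIMAL64);
    }
    // English numerals: a single word, or a tens word followed by a word for 1 to 9.
    String[] words = s.split("[\\s-]+");
    if (words.length == 1) {
      Integer value = WORDS.get(words[0]);
      return value == null ? null : BigDecimal.valueOf(value);
    }
    if (words.length == 2) {
      Integer tensValue = WORDS.get(words[0]);
      Integer unitValue = WORDS.get(words[1]);
      if (tensValue != null && unitValue != null
          && tensValue >= 20 && unitValue >= 1 && unitValue <= 9) {
        return BigDecimal.valueOf(tensValue + unitValue);
      }
    }
    return null;
  }

  public static void main(String[] args) {
    System.out.println(normalize("5.8"));       // prints 5.8
    System.out.println(normalize("1/4"));       // prints 0.25
    System.out.println(normalize("forty-two")); // prints 42
  }
}
```

The library itself draws on the SPECIALIST Lexicon number files rather than a hand-written table like the one above, so its coverage should be expected to be much broader than this sketch.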

This project makes use of the 2017 SPECIALIST Lexicon number files. For more information about the
SPECIALIST Lexicon, see their
website and their
Terms & Conditions.

Using in your project

To use the library in a Maven project, add the following to your pom.xml:

    <dependencies>
      <dependency>
        <groupId>edu.umn.biomedicus</groupId>
        <artifactId>biomedicus-measures</artifactId>
        <version>2.0.1</version>
      </dependency>
    </dependencies>

Alternatively, download the .jar and include that in your libraries.

Javadoc

The API documentation for this project is available online.

Detecting numbers in text

    // Create a detector through the factory provided by the library.
    DetectorFactory factory = Numbers.createDetectorFactory();
    CombinedNumberDetector detector = factory.createCombinedNumberDetector();
    // Supply the tokens of the text to search.
    Iterable<Token> tokens = ...
    for (NumberResult numberResult : detector.findNumbers(tokens)) {
      // do something with the detected number
    }
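
The snippet above leaves the construction of `tokens` elided. As a rough illustration of what a token stream with character offsets might look like, the following sketch splits a sentence on whitespace. The `SimpleToken` class and `tokenize` method are hypothetical stand-ins, not the library's `Token` type, whose constructor and accessors are not shown in this README; adapt the idea to whatever `Token` actually requires.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Illustrative sketch only; SimpleToken is a hypothetical stand-in for the library's Token. */
public class TokenSketch {

  /** A token's text plus its begin (inclusive) and end (exclusive) character offsets. */
  public static final class SimpleToken {
    public final String text;
    public final int begin;
    public final int end;

    SimpleToken(String text, int begin, int end) {
      this.text = text;
      this.begin = begin;
      this.end = end;
    }

    @Override
    public String toString() {
      return text + " [" + begin + ", " + end + ")";
    }
  }

  private static final Pattern NON_WHITESPACE = Pattern.compile("\\S+");

  /** Splits the sentence on whitespace, recording where each token sits in the original text. */
  public static List<SimpleToken> tokenize(String sentence) {
    List<SimpleToken> tokens = new ArrayList<>();
    Matcher matcher = NON_WHITESPACE.matcher(sentence);
    while (matcher.find()) {
      tokens.add(new SimpleToken(matcher.group(), matcher.start(), matcher.end()));
    }
    return tokens;
  }

  public static void main(String[] args) {
    for (SimpleToken token : tokenize("Take forty-two mg daily")) {
      System.out.println(token);
    }
  }
}
```

Keeping offsets alongside the text is a common design choice in NLP pipelines because it lets a detected number be mapped back to its exact position in the source document.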

Contact and Support

To report an issue or request an enhancement, open an issue on the Issues tab on GitHub.

BioMedICUS has a Gitter chat and a
Google Group for contacting
developers with questions, suggestions, or feedback.

About Us

BioMedICUS is developed by the
University of Minnesota Institute for Health Informatics NLP/IE Group
with assistance from the
Open Health Natural Language Processing (OHNLP) Consortium.

Contributing

Anyone is welcome and encouraged to contribute. If you discover a bug, or think the project could
use an enhancement, follow these steps:

  1. Create an issue and offer to code a solution. We can discuss the issue and decide whether any
    code would be a good addition to the project.
  2. Fork the project. [https://github.com/nlpie/biomedicus-measures/fork]
  3. Create a feature branch. (git checkout -b feature-name)
  4. Code your solution.
    • Follow the Google style guide for Java.
      There are IDE profiles available here.
    • Write unit tests for any non-trivial aspects of your code. If you are fixing a bug, write a
      regression test: one that confirms the behavior you fixed stays fixed.
  5. Commit to your branch. (git commit -am 'Summary of changes')
  6. Push to GitHub. (git push origin feature-name)
  7. Create a pull request on this repository from your forked project. We will review and discuss
    your code and merge it.