A python framework to transform natural language questions to queries in a database query language.

Last update: Dec 18, 2022

Related tags

Overview

  __ _ _   _  ___ _ __  _   _
 / _` | | | |/ _ \ '_ \| | | |
| (_| | |_| |  __/ |_) | |_| |
 \__, |\__,_|\___| .__/ \__, |
    |_|          |_|    |___/

What's quepy?

Quepy is a python framework to transform natural language questions to queries in a database query language. It can be easily customized to different kinds of questions in natural language and database queries. So, with little coding you can build your own system for natural language access to your database.

Currently Quepy provides support for Sparql and MQL query languages. We plan to extended it to other database query languages.

An example

To illustrate what can you do with quepy, we included an example application to access DBpedia contents via their sparql endpoint.

You can try the example online here: Online demo

Or, you can try the example yourself by doing:

python examples/dbpedia/main.py "Who is Tom Cruise?"

And it will output something like this:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Thomas Cruise Mapother IV, widely known as Tom Cruise, is an...

The transformation from natural language to sparql is done by first using a special form of regular expressions:

person_name = Group(Plus(Pos("NNP")), "person_name")
regex = Lemma("who") + Lemma("be") + person_name + Question(Pos("."))

And then using and a convenient way to express semantic relations:

person = IsPerson() + HasKeyword(person_name)
definition = DefinitionOf(person)

The rest of the transformation is handled automatically by the framework to finally produce this sparql:

SELECT DISTINCT ?x1 WHERE {
    ?x0 rdf:type foaf:Person.
    ?x0 rdfs:label "Tom Cruise"@en.
    ?x0 rdfs:comment ?x1.
}

Using a very similar procedure you could generate and MQL query for the same question obtaining:

[{
    "/common/topic/description": [{}],
    "/type/object/name": "Tom Cruise",
    "/type/object/type": "/people/person"
}]

Installation

You need to have installed docopt and numpy. Other than that, you can just type:

pip install quepy

You can get more details on the installation here:

http://quepy.readthedocs.org/en/latest/installation.html

Learn more

You can find a tutorial here:

http://quepy.readthedocs.org/en/latest/tutorial.html

And the full documentation here:

http://quepy.readthedocs.org/

Join our mailing list

Contribute!

Want to help develop quepy? Welcome aboard! Find us in http://groups.google.com/group/quepy

A python framework to transform natural language questions to queries in a database query language.

Related tags

Overview

What's quepy?

An example

Installation

Learn more

Contribute!

Owner

Machinalis

A minimal code for fairseq vq-wav2vec model inference.

jiant is an NLP toolkit

Longformer: The Long-Document Transformer

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

A framework for implementing federated learning

A fast and lightweight python-based CTC beam search decoder for speech recognition.

gaiic2021-track3-小布助手对话短文本语义匹配复赛rank3、决赛rank4

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

Simple, hackable offline speech to text - using the VOSK-API.

A simple word search made in python

A collection of GNN-based fake news detection models.

A deep learning-based translation library built on Huggingface transformers

easySpeech is an open-source Python wrapper for google speech to text API that doesn't require PyAudio(So you especially windows user don't have to deal with the errors while installing PyAudio) and also works with hugging face transformers

NLPShala , the best IDE for all Natural language processing tasks.

Finds snippets in iambic pentameter in English-language text and tries to combine them to a rhyming sonnet.

This repository contains all the source code that is needed for the project : An Efficient Pipeline For Bloom’s Taxonomy Using Natural Language Processing and Deep Learning

Korea Spell Checker

Azure Text-to-speech service for Home Assistant

Crie tokens de autenticação íntegros e seguros com UToken.

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering