Jobs [permanent position]: NLP

[Job posted on August 11th 2015 by SESAMm on List LN]
SESAMm: Financial Engineering Startup. Financial Indicators – Social Media.

Computational Linguist

Hello,

The start-up SESAMm is looking for a computational linguist to
boost its development. Founded in 2014, SESAMm is one of the most
dynamic FinTech start-ups in the French and Luxembourg
ecosystems. SESAMm designs and markets financial forecasting tools
based on social networks and other sources of textual data. These
software products are aimed at banks and hedge funds.

SESAMm’s goal: to establish itself internationally as the leader in Big
Data technologies for financial markets.

*Role and responsibilities*
Purpose of the position: Head of Linguistics; development of new
sentiment analysis tools based on various data sources, in
particular social networks, to create trading indicators for
banks and hedge funds.

*Main responsibilities:*
– Participation in current and future research projects: review of
existing work, proposal of original solutions, implementation and
evaluation, presentation of results at meetings and
conferences.
– Identification and extraction of relevant data sources
– Development of filters and data preparation
– Development of new computational methods for sentiment
analysis
– Development of new data mining methodologies
using all annotations (meaning, form, structure, etc.)
– Predictive and explanatory modelling (scoring, …), definition
of relevant indicators, analysis of results (segmentation,
targeting, …) and strategic recommendations
– Writing of analysis notes and, where appropriate, publication of
research
– Participation in conferences and symposia

*Candidate profile*

*Education*
– Preferably a doctorate (Bac +8) from a university or engineering
school, or a Master’s degree (Bac +5)
– Desired specialization: language processing, Natural Language
Processing, computational linguistics

*Skills and experience*
– Experience: prior experience in sentiment analysis is
desirable. Experience (academic or professional) in
social network analysis would be a plus.
– Knowledge of NLP, Machine Learning, Artificial Intelligence and
applied statistics
– Interest in large-scale knowledge extraction from the Web and
social networks
– Languages: excellent command of English required; good
summarizing and writing skills
– Computing: scripting languages

Excellent interpersonal skills and strong motivation are required. The
candidate must be able to show autonomy and curiosity in a
constantly changing environment. This position will place the
computational linguist at the heart of the company’s R&D developments and
will be a genuine high-level entrepreneurial experience.
The list of responsibilities described is not necessarily exhaustive and
may be modified according to the company’s constraints
and the evolution of the position.

*Working conditions*
– Full time
– Place of work: Metz (France)
– Equipment provided: computer hardware and
software, office
– Compensation: attractive, profit-sharing / BSA (share warrants) possible
– Duration and availability: permanent contract (CDI), starting October 2015

Join an innovative, ambitious and fast-growing FinTech
start-up.
More information: www.sesamm.com
Send applications to contact@sesamm.com
Application: CV, interviews

*Sylvain FORTE*
President of SESAMm SAS
2 Bis, rue Brûlée
67000 STRASBOURG
+33 (0)6 09 46 46 80
www.sesamm.com

Follow our news on Twitter (https://twitter.com/sesamm_inside),
LinkedIn (https://www.linkedin.com/pub/sesamm-sas/97/ba4/3a5), and Viadéo
(http://www.viadeo.com/fr/profile/sesamm.sas)


Trooclick



Trooclick is a French start-up created in November 2012 by Stanislas Motte and located in Paris.

Areas in NLP

opinion analysis, text mining

Need

Whether you are a CEO, a member of a government or a public figure, it is important to look after your eReputation. With the development of social networks, we have access to a wealth of information about our eReputation, about what people think of us. In return, however, what we say and do spreads on a large scale.

There are listening platforms that identify articles talking about a brand, a business or a personality and that offer opinion analysis tools. The problem with these solutions is that they crawl very widely and can report data of little interest. Their completeness can also be questioned. Although the approach is certainly of interest for “quantifying” data, it yields little from a “qualitative” perspective. First, the result is often a hodgepodge of heterogeneous, highly redundant texts in which few real opinions are expressed (hence the “neutral” category that many opinion analysis systems place next to “positive” and “negative”, a category that has little value in most cases). Finally, eReputation platforms offering “sentiment analysis” apply it to the entire document, which quickly becomes unusable when several points of view are expressed in the same document.

Solution

Trooclick’s Opinion-Driven Search Engine uses natural language processing (NLP) technology to gather quotes from online sources of news and opinions – including content from publishers, blogs, and Twitter. After quotes are extracted, the speakers are categorized (categories include executives, analysts, politicians, journalists…).

Business Plan

The original business plan was to develop the first automated fact-checking app for financial news. « Fact checking is the task of assessing the truthfulness of claims made by public figures such as politicians, pundits, etc. It is commonly performed by journalists employed by news organisations in the process of news article creation. Fact-checking is a time-consuming process » (Vlachos & Riedel 2014)

But the plug-in worked so well that it didn’t find any errors. The reason is that financial companies know their words are scrutinized by regulators and don’t dare to make misstatements. Influential speakers prefer omission to outright errors. The way to combat that problem is by presenting different points of view.

Trooclick’s Opinion-Driven Search Engine can be a solution for social media companies or search giants such as LinkedIn or Yahoo. It can also be a solution for a B-to-B-to-B approach, in which a customer uses Trooclick’s technology to provide its own client companies with easily digestible media monitoring.

Team

Trooclick currently has a team of 18 full-time people, including 10 engineers and PhDs.

1 PhD in Semantic Web
5 NLP engineers

Research and Development issues

Current R&D projects in the Artificial Intelligence field:

  • automated quotes collection
  • automated named entity recognition and classification in speaker categories (journalist, politician…)
  • automated evaluation and scoring of quotes (variety of criteria including instances of anticipation, contrast, causality, etc.)

Trooclick’s Opinion-Driven Search Engine is protected by a US patent obtained in March 2015.

In the long run, Trooclick will develop automatically generated viewpoint summaries. With a huge chunk of readers never making it past the headline, Trooclick sees it as important to quickly summarize the major viewpoints on an issue in the first couple lines of each entry.

Fund raising

In April 2013, Trooclick received financial support from the BPI (French public investment bank) and in June 2013 the French government granted it the Status of “Young Innovative Company” (JEI), recognizing its innovative nature.

Contact

Darcee Meilbeck
Marketing and Sales Assistant
+33 (0)6 59 31 56 59
darcee.meilbeck@trooclick.com

There’s more than one side to every story with Trooclick

I test applications using a natural language interface or a natural language technology and I write an article about my experience with it.

This week, I tested Trooclick, a free Opinion-Driven Search Engine.


A New Approach to Opinion Mining with Trooclick’s Opinion-Driven Search Engine

An important part of our information-gathering behavior has always been to find out what other people think. With the growing availability and popularity of opinion-rich resources such as online review sites and personal blogs, new opportunities and challenges arise as people now can, and do, actively use information technologies to seek out and understand the opinions of others. The sudden eruption of activity in the area of opinion mining and sentiment analysis, which deals with the computational treatment of opinion, sentiment, and subjectivity in text, has thus occurred at least in part as a direct response to the surge of interest in new systems that deal directly with opinions as a first-class object.

There are listening platforms identifying a list of articles talking about a brand, a business or a personality and offering opinion analysis tools: SenticNet, Luminoso and Attensity.

The problem with these solutions is that they crawl very widely and report data of little interest. Their completeness can also be questioned. Although the approach is certainly of interest for “quantifying” data, it yields little from a “qualitative” perspective. First, the result is often a hodgepodge of heterogeneous, highly redundant texts in which few real opinions are expressed (hence the “neutral” category that many opinion analysis systems place next to “positive” and “negative”, a category that has little value in most cases). Finally, eReputation platforms offering “sentiment analysis” apply it to the entire document, which quickly becomes unusable when several points of view are expressed in the same document.
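The last limitation can be illustrated with a toy example. The sketch below is only an illustration under stated assumptions: the tiny polarity lexicon, the `polarity` function and the two sample statements are all hypothetical and do not come from any of the platforms mentioned. It shows how a single document-level score can cancel out two opposing viewpoints that remain visible when each statement is scored separately.

```python
# Toy polarity lexicon (hypothetical; real systems use far richer resources).
LEXICON = {"good": 1, "great": 1, "hope": 1, "bad": -1, "problems": -1, "risky": -1}


def polarity(text):
    """Sum of word polarities: > 0 positive, < 0 negative, 0 neutral."""
    words = (w.strip(".,!?;:\"'").lower() for w in text.split())
    return sum(LEXICON.get(w, 0) for w in words)


# One document containing two opposing viewpoints (invented sample data).
document = [
    ("Analyst A", "This is a good step and a great signal."),
    ("Analyst B", "The plan is risky and will cause problems."),
]

# Document-level: one score for the whole text, so opposing views cancel out.
doc_score = polarity(" ".join(text for _, text in document))

# Statement-level: one score per speaker, so each viewpoint stays visible.
quote_scores = {speaker: polarity(text) for speaker, text in document}

print(doc_score)
print(quote_scores)
```

With this sample data the document as a whole looks neutral, while the per-speaker scores show one clearly positive and one clearly negative opinion.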

Trooclick’s Opinion-Driven Search Engine (beta) uses natural language processing (NLP) technology to gather quotes (Screenshot 1) from online sources of news and opinions – including content from publishers, blogs, and Twitter (soon expanding to radio and TV). After quotes are extracted, the speakers are categorized (16 categories in total, including executives, analysts, politicians and clients). This data is then used to rank news articles for quality. Unsurprisingly, for a company that emphasizes the importance of different points of view for understanding the news, in the Trooclick universe more points of view from more people means a higher ranking. For now, two ranking criteria are active (number of speakers and quote score), with about 30 more in the pipeline (Screenshot 2).

Screenshot 1
Screenshot 2
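
The two active ranking criteria (number of speakers and quote score) could be combined as in the following minimal Python sketch. How Trooclick actually combines them is not described in this article; the `Quote` and `Article` classes, the `rank_key` function, the score range and the ordering rule (distinct speakers first, then mean quote score) are assumptions made purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Quote:
    speaker: str
    score: float  # hypothetical per-quote quality score in [0, 1]


@dataclass
class Article:
    title: str
    quotes: list = field(default_factory=list)


def rank_key(article):
    """Sort key: number of distinct speakers, then mean quote score."""
    speakers = {q.speaker for q in article.quotes}
    mean_score = (
        sum(q.score for q in article.quotes) / len(article.quotes)
        if article.quotes
        else 0.0
    )
    return (len(speakers), mean_score)


def rank_articles(articles):
    """Return articles sorted from highest to lowest rank."""
    return sorted(articles, key=rank_key, reverse=True)
```

Under these assumptions, an article quoting two different speakers ranks above an article quoting a single speaker, whatever the individual quote scores; the quote score only breaks ties.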

The software, based on advanced text mining and semantic analysis technologies, performs several tasks:

  • ranking articles by quality for each event
  • extracting quotes and identifying the speaker, the media outlet and the publication date
  • classifying the speaker into categories (manager, analyst, customer, employee, etc.)

Still in beta, Trooclick’s solution currently covers only English-language media.

Automatic Detection of quotes

Trooclick tracks quotes from news sites and social media. The system identifies quotes in different ways. As a result not only does the site pull up direct quotes like this:

“McDonalds will stop serving antibiotic-raised poultry,” said McDonalds President, Mike Andres.

…But also indirect ones like this:

Mike Andres, McDonalds President, said McDonalds will stop serving antibiotic-raised poultry.

And some quotes come from opinion columns (“Hopefully chicken is just the start – I hope the Big Mac and McRib will be next”) and analysis (“McDonald’s decision to sell milk produced without rBST was a good step because the growth hormone can cause health problems in dairy cows.”)
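
The difference between direct and indirect quotes can be approximated with two regular expressions. This is a minimal sketch and not Trooclick’s actual method, which is not described here: the list of reporting verbs, the two patterns and the `extract_quote` function are assumptions, and a real system would need much broader coverage.

```python
import re

# Hypothetical reporting verbs; a real system would need a much larger list.
VERBS = r"(?:said|says|stated|announced|added)"

# Direct quote: text inside quotation marks, then a verb, then the speaker.
DIRECT = re.compile(
    r'[“"](?P<quote>[^”"]+?),?[”"]\s*' + VERBS + r"\s+(?P<speaker>[^.]+)\."
)

# Indirect quote: the speaker, then a verb, then the reported statement.
INDIRECT = re.compile(
    r'^(?P<speaker>[A-Z][^“"]*?)\s+' + VERBS + r"\s+(?:that\s+)?(?P<quote>[^.]+)\."
)


def extract_quote(sentence):
    """Return (kind, speaker, quote), or None if neither pattern matches."""
    m = DIRECT.search(sentence)
    if m:
        return ("direct", m.group("speaker").strip(), m.group("quote").strip())
    m = INDIRECT.search(sentence)
    if m:
        return ("indirect", m.group("speaker").strip(" ,"), m.group("quote").strip())
    return None


print(extract_quote(
    "“McDonalds will stop serving antibiotic-raised poultry,” "
    "said McDonalds President, Mike Andres."
))
print(extract_quote(
    "Mike Andres, McDonalds President, said McDonalds will stop "
    "serving antibiotic-raised poultry."
))
```

Both example sentences yield the same quote text; only the kind and the way the speaker is located differ.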

Automation of Fact-Checking for Journalism

« Fact checking is the task of assessing the truthfulness of claims made by public figures such as politicians, pundits, etc. It is commonly performed by journalists employed by news organisations in the process of news article creation. Fact-checking is a time-consuming process » (Vlachos & Riedel 2014).

Journalism is about finding facts, interpreting their importance, and then sharing that information with the audience. That’s all journalists do: find, verify, enrich and then disseminate information. It sounds easy, doesn’t it, observing what is going on, asking questions, uncovering facts and then telling the public what we have discovered. But we are dealing with volatile raw material. Handled carelessly, the facts we uncover, research and present have the power to cause misunderstandings, damage and could, potentially, change the course of history. That’s why it’s essential that we apply robust fact-checking to all our journalism.

But fact-checking is a time-consuming process. Automating the process of fact checking has recently been discussed in the context of computational journalism (Cohen et al., 2011; Flew et al., 2012). Inspired by the recent progress in natural language processing, databases and information retrieval, the vision is to provide journalists with tools that would allow them to perform this task automatically.

One way to solve that problem is to present different points of view, as Trooclick’s Opinion-Driven Search Engine does. There are other solutions in the area of fact-checking, such as Truth Teller: this Washington Post initiative transcribes political videos and checks them against a database that draws on PolitiFact, FactCheck.org, and the paper’s own Fact Checker blog. The program then tells viewers which statements are true and which are false. In 2015, the Post plans to annotate videos in real time.

There are also approaches based on crowdsourcing, such as Fiskkit and Grasswire, platforms that invite the public to fact-check breaking news stories.

Trooclick’s Opinion-Driven Search Engine is available online at http://trooclick.com


To contact Trooclick

Web site, Twitter (@Trooclick), email

What do you think about Trooclick’s solution? Please don’t hesitate to leave your comments below.

Jobs [permanent position]: NLP

[Job posted on May 16th 2016 by LinkedIn on List LN]

Applied Scientist, NLP

At LinkedIn, we regularly process the semi-structured content in the
340+ million member profiles and the content they create on LinkedIn,
such as comments, job descriptions, group discussions, and Influencer
posts. We are building an NLP (Natural Language Processing) team at
LinkedIn and this is a great opportunity to get in on the ground floor.

Responsibilities:

– Design and develop NLP systems and tools.
– Research and evaluate different solutions to NLP problems.
– Produce deliverable results and see them through from development to
production.
– Interact and work with remote teams located in different timezones.

Requirements:

– PhD in Computer Science or related discipline.
– Expertise in several of the following domains: sentiment analysis,
information extraction (POS, N/E tagging with HMMs/CRF etc), feature
extraction, classification, tokenization, and processing of
non-English text.
– Machine Learning & Text mining exposure and familiarity with R, Weka,
NLTK etc.
– Solid experience in Java, C++, or another object-oriented language.
– Excellent communication skills, drive and discipline to get things
done.

We’d prefer if you also have:

– Worked with web-scale traffic and data.
– Experience with Hadoop, Pig, or other MapReduce paradigms.
– Experience with Lucene, SOLR or other open-source IR toolkits.
– Published work in academic conferences or industry circles.
– Experience with consumer-facing product development and design.

To apply go to https://www.linkedin.com/jobs2/view/44208053 or to reach
out for more information, please email Deirdre Hogan
(dhogan@linkedIn.com) and Jorge Handl
(jhandl@linkedin.com).

Jobs [temporary position, 1 month]: linguistics/NLP

[Jobs posted on March 22nd 2014 by ELDA on LN list]

Annotator (M/F), French language

ELDA (Evaluation and Language resources Distribution Agency,
http://www.elda.org/) focuses on the distribution and
production of language resources, and on the evaluation of
language technologies.

As part of its production activities, ELDA is offering several
full-time annotator positions (M/F).

_Mission_:

The task is to annotate microblog-type text documents (tweets)
in order to extract opinions and sentiments from them. The work will be
carried out using dedicated software and following specific annotation
conventions. Training on the software and the conventions will be
provided by the employer.

_Profile_:

* Native French speaker
* Good computer skills
* Ability to learn (annotation) rules and to follow them
scrupulously and consistently
* Previous annotation experience would be a plus

_Duration_: Full time, for one and a half months (extendable to 2).

_Location_: the work will be carried out at ELDA’s premises (Paris, 13th arrondissement)

_Salary_: depending on profile and performance.

Applications (CV, cover letter) should be sent to
leixa@elda.org and chomicha@elda.org.
############################

################################
Annotator (M/F), German language

ELDA (Evaluation and Language resources Distribution Agency,
http://www.elda.org/) focuses on the distribution and
production of language resources, and on the evaluation of
language technologies.

As part of its production activities, ELDA is offering several
full-time annotator positions (M/F).

_Mission_:

The task is to annotate microblog-type text documents (tweets)
in order to extract opinions and sentiments from them. The work will be
carried out using dedicated software and following specific annotation
conventions. Training on the software and the conventions will be
provided by the employer.

_Profile_:

* Native German speaker
* Good computer skills
* Ability to learn (annotation) rules and to follow them
scrupulously and consistently
* Previous annotation experience would be a plus

_Duration_: Full time, for one and a half months (extendable to 2).

_Location_: the work will be carried out at ELDA’s premises (Paris, 13th arrondissement)

_Salary_: depending on profile and performance.

Applications (CV, cover letter) should be sent to
leixa@elda.org and chomicha@elda.org.
############################

Jobs [permanent position] : NLP, Machine Learning

[Jobs posted on February 24th 2014 on Knowsis Company]

NLP/Machine Learning Researcher

Who we are

Knowsis is a London based web intelligence company building next generation financial markets data. Our mission is to develop and market products and services that bridge the information gap between the global financial sector and the social web. We are a dynamic and experienced team, with backgrounds in both technology and finance.

The role

We are looking for passionate engineers to join our growing team who can lead on the development of our systems’ Natural Language understanding. You will work with our team to focus on core problems in sentiment analysis, information extraction, clustering, summarisation etc. This role is also open to current MS/PhD students on a part-time/intern basis.

What we are looking for *

  • MSc/PhD in Computational Linguistics/Machine Learning/Artificial Intelligence or similar field
  • Experience working hands-on with large-scale data sets
  • Experience developing NLP grammars, parsers and machine learning algorithms
  • Fluent in a programming language (preferably Python, but R, Haskell, OCaml, LISP etc are fine)
  • Ability to attract additional world-class engineers & data scientists

* We know that great people can adapt and learn, so we’re not too fussed about everything on these lists!

What we offer

  • The opportunity to direct research efforts in new and exciting directions
  • Collaboration with an experienced team breaking new ground in FinTech
  • An intellectually stimulating and relaxed working environment
  • A fantastic central London location
  • Generous pay

Location

Soho, London

When

We can be flexible and work around current commitments but we’re ideally looking for someone to start immediately. You must be willing and able to work in the UK.

No recruiters, please.

Jobs [internship, ? months]: NLP

[Job posted on February 8th 2014 on List LN]

Development of linguistic resources to improve an information extraction tool

Trooclick France is a company that specializes in the development of web
applications for the automatic processing of information. Our goal is to
create services that rebuild the user’s trust in digital content. Until
now, Web players have been able to improve the relevance of this content;
we go a step further and help improve its reliability.

Trooclick was created in November 2012. Just a few months later, in
April 2013, it received financial support from the BPI (French public
investment bank) and in June 2013 the French government granted it the
Status of “Young Innovative Company” (JEI), recognizing its innovative
nature. It now counts twelve committed and passionate members in its
tight-knit team.

The company carries out R&D projects in search of technical solutions in
the Artificial Intelligence field. Due to its growth, Trooclick is now
looking for candidates for a 6-month internship at its office in Paris
(17th arrondissement).

Missions:

As a member of the technical team, you will benefit from ongoing
training and you will help us design and build our information
extraction framework based on advanced NLP technologies.

You will turn ideas into well-documented and reliable linguistic
resources (both dictionaries and extraction rules) to ensure efficiency,
quality, performance and scalability.

A great team player, you will interact with other departments to
understand and fine-tune specifications.
You will carry out unit testing, create and maintain our test
validation corpus and participate in editing technical documents. All
development will be done in English.

Qualifications:

– BSc/MSc
– Experience with NLP tools such as Gate, Treetagger, NooJ, Stanford
for linguistic annotation, named entity recognition, relationship
and fact extraction, sentiment analysis, etc.
– Experience in scripting languages such as Perl or Python as well as
XML format to be autonomous in completing some technical tasks.
– Experience with basic database management operations (SQL language)
– Knowledge of Semantic Web technologies (RDF, OWL, SKOS, etc.) will
be a plus.
– Excellent communication skills in English and French
– We are open to new ideas that will significantly contribute to our
success. Our friendly team will provide the opportunity for
valuable collaboration.

– We offer you career perspectives in a young and dynamic company
with an interesting and diversified scope of duties at the cutting
edge of research. We welcome applications from highly motivated
individuals able to learn new techniques and share knowledge and
experience with the team.

Interested? Then send your application to jobs@trooclick.com!

Jobs [permanent position]: NLP

[Job posted on February 4th 2014 by DFKI GmbH on the Linguist List]

University or Organization: DFKI GmbH
Department: Language Technology Lab
Job Location: Berlin, Germany
Web Address: http://www.dfki.de/lt
Job Rank: Researcher

Researcher in Computational Linguistics

Specialty Areas: Computational Linguistics; Text/Corpus Linguistics

Description:

The Language Technology Lab of the German Research Center for Artificial Intelligence (DFKI) offers a position at its Berlin site as Full-Time Researcher in Language Technology in the area of information extraction, with a strong focus on the detection of relations, events and opinions.

We are looking for people who enjoy working in an innovative and enthusiastic team, love challenges, and are passionate about research and development. We offer an excellent research and working environment with interesting research and development topics in a multidisciplinary and international team at an internationally renowned center of AI research.

Applicants should have a master’s degree in computational linguistics or computer science or an equivalent degree. Candidates with a doctoral degree are also encouraged to apply.

Candidates should bring experience and qualification in the following areas:
– Application of NLP tools to large-scale textual corpora
– Information extraction
– Machine learning techniques
– Programming experience of complex systems in Java
– Experience of development under Linux, Unix and Windows

Experience in Web-based IE and in the application of machine learning techniques to relation or event extraction would be a plus.

General Skills
– Good communication skills
– Excellent problem-solving skills
– Team work capabilities
– High level of motivation and initiative
– Ability to manage own workload and meet deadlines
– Good organizational skills
– Good standard of written and spoken English
– Preferably also a good standard of written and spoken German

Application and Conditions

Successful candidates will be offered a competitive salary based on their qualifications and experience. They will also get opportunities for further qualification and professional development.

DFKI is an equal opportunities employer. Applications of women are thus especially encouraged; applications of disabled persons will be given preferential treatment to those of other candidates with equal qualifications.

To apply for this vacancy, please send a cover letter and copy of a recent CV to Prof. Hans Uszkoreit (uszkoreit@dfki.de) and Dr. Feiyu Xu (feiyu@dfki.de).

Application Deadline:  (Open until filled)

Email Address for Applications: uszkoreit@dfki.de
Contact Information:
Dr. Feiyu Xu
Email: feiyu@dfki.de

Jobs [permanent position]: NLP

[Job posted on December 20th 2013 by NetBase Solutions on the Linguist List]

Computational Linguist

University or Organization: NetBase Solutions
Job Location: California, USA
Web Address: http://www.netbase.com
Job Rank: Computational Linguist

Specialty Areas: Computational Linguistics; NLP

Required Language(s): Spanish (spa)

Description:

We are seeking a Spanish-speaking computational linguist to be part of our NLP development team. The main responsibility will be to further enhance our insight extraction system for social media content in Spanish. This is an integral part of a team effort for our multilingual program, which is primarily based on grammars and rules, to support the extraction of consumer sentiments and opinions about brands and products. The position can be based in Mountain View, California, or Heidelberg, Germany.

Responsibilities:
– Develop and maintain morphosyntactic and semantic lexicons for Spanish NLP, in particular sentiment extraction, with a special focus on internet jargon and slang
– Enhance an FST-based dependency grammar for Spanish as well as modules for the correction of the PoS-tagger/lemmatizer output
– Enhance dependency-based graph grammars for sentiment extraction and other information-extraction tasks
– Design, build and unit-test software in a collaborative environment
– Conduct code review

Requirements:
– MA or PhD in Computational Linguistics or a related field
– Industry experience with Natural Language Processing software and developing computational grammars and/or lexicons
– Native fluency in Mexican Spanish or Spanish spoken in Latin America
– Good grasp of social media culture for that language
– Experience with robust parsing techniques
– Experience using a scripting language such as Python
– Strong problem solving, critical thinking, and algorithm skills
– Authorization to work in Germany or the US and willingness to relocate

Desirable Skills:
– Experience with Java programming, internationalization and localization
– Familiarity with development and debugging in a software development environment (Eclipse is a plus)
– Passion for developing efficient, testable and well-documented code
– Agile development methodologies
– Machine learning on sentiment analysis and topic clustering
– Polyglot a plus

Benefits of working at NetBase include:
– Competitive compensation and benefits package, and equity
– Mac, Windows, or Linux laptop of your choice
– Colleagues who are some of the smartest engineers and tech entrepreneurs
– Open, collaborative environment
– Telecommuting option
– Opportunity to work with what our Fortune 100 customers say is the best, most-scalable natural language platform in existence

Application Deadline:  (Open until filled)

Email Address for Applications: team@netbase.com
Contact Information:
Dr. Masayo Iida
Email: miida@netbase.com

Jobs [internship, 6 months]: NLP

[Job posted on November 30th 2013 by Trooclick on List LN]

Design and build an information extraction framework based on advanced NLP technologies

Trooclick France is a company that specializes in the development of web
applications for the automatic processing of information. Our goal is to
create services that rebuild the user’s trust in digital content. Until
now, Web players have been able to improve the relevance of this content;
we go a step further and help improve its reliability.

Trooclick was created in November 2012. Just a few months later, in
April 2013, it received financial support from the BPI (French public
investment bank) and in June 2013 the French government granted it the
Status of “Young Innovative Company” (JEI), recognizing its innovative
nature. It now counts twelve committed and passionate members in its
tight-knit team.

The company carries out R&D projects in search of technical solutions in
the Artificial Intelligence field. Due to its growth, Trooclick is now
looking for candidates for a 6-month internship at its office in Paris
(17th arrondissement).

Missions:

As a member of the technical team, you will benefit from ongoing
training and you will help us design and build our information
extraction framework based on advanced NLP technologies.

You will turn ideas into well-documented and reliable linguistic
resources (both dictionaries and extraction rules) to ensure efficiency,
quality, performance and scalability.

A great team player, you will interact with other departments to
understand and fine-tune specifications.
You will carry out unit testing, create and maintain our test
validation corpus and participate in editing technical documents. All
development will be done in English.

Qualifications:

– BSc/MSc
– Experience with NLP tools such as Gate, Treetagger, NooJ, Stanford
for linguistic annotation, named entity recognition, relationship
and fact extraction, sentiment analysis, etc.
– Experience in scripting languages such as Perl or Python as well as
XML format to be autonomous in completing some technical tasks.
– Experience with basic database management operations (SQL language)
– Knowledge of Semantic Web technologies (RDF, OWL, SKOS, etc.) will
be a plus.
– Excellent communication skills in English and French
– We are open to new ideas that will significantly contribute to our
success. Our friendly team will provide the opportunity for
valuable collaboration.
– We offer you career perspectives in a young and dynamic company
with an interesting and diversified scope of duties at the cutting
edge of research. We welcome applications from highly motivated
individuals able to learn new techniques and share knowledge and
experience with the team.

Interested? Then send your application to jobs@trooclick.com!