Articles About Natural Language Processing

A Guide to Feature Engineering in NLP

Overview Feature engineering in NLP is understanding the context of the text. In this blog, we will look at some of the common feature engineering in NLP. We will compare the results of a classification task with and without doing feature engineering   Table of Content Introduction NLP task overview List of features with code Implementation Results comparison with and without doing feature engineering Conclusion Introduction   “If 80 percent of our work is data preparation, then ensuring data quality […]

Read more

Issue #126 – Learning Feature Weights for Denoising Parallel Corpora

15 Apr21 Issue #126 – Learning Feature Weights for Denoising Parallel Corpora Author: Dr. Patrik Lambert, Senior Machine Translation Scientist @ Iconic Introduction Large web-crawled parallel corpora constitute a very useful source of data to improve neural machine translation (NMT) engines. However, their effectiveness is reduced by the large amount of noise they usually contain. As early as in issue #2 of this series, we pointed out that NMT is particularly sensitive to noise in the training data. In issue […]

Read more

Score-Based Generative Modeling through Stochastic Differential Equations

score_sde_pytorch PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral). This repo contains a PyTorch implementation for the paper Score-Based Generative Modeling through Stochastic Differential Equations by Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole What does this code do? Aside from the NCSN++ and DDPM++ models in our paper, this codebase also re-implements many previous score-based models in one place, including NCSN from Generative Modeling by Estimating Gradients of […]

Read more

Code for Multiple Instance Active Learning for Object Detection

MI-AOD This is the code for Multiple Instance Active Learning for Object Detection, CVPR 2021. In this paper, we propose Multiple Instance Active Object Detection (MI-AOD), to select the most informative images for detector training by observing instance-level uncertainty. MI-AOD defines an instance uncertainty learning module, which leverages the discrepancy of two adversarial instance classifiers trained on the labeled set to predict instance uncertainty of the unlabeled set. MI-AOD treats unlabeled images as instance bags and feature anchors in images […]

Read more

RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

RfD-Net Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias NießnerIn CVPR, 2021. From an incomplete point cloud of a 3D scene (left), our method learns to jointly understand the 3D objects and reconstruct instance meshes as the output (right). Install This implementation uses Python 3.6, Pytorch1.7.1, cudatoolkit 11.0. We recommend to use conda to deploy the environment. conda env create -f environment.yml conda activate rfdnet pip install -r requirements.txt Next, compile the external libraries by python setup.py build_ext –inplace Install PointNet++ […]

Read more

Live coding in Python with PyCharm, Emacs or even a browser

Live Coding in Python Visualize your Python code while you type it in PyCharm, Emacs, Sublime Text, or even your browser. To see how to use one of the Live Coding in Python plugins, watch the demo video or read the getting started pages for PyCharm, Emacs, or Sublime Text. Want to try it without installing anything? Try the browser version. You can also try Space Tracer, the command-line tool that trades time for space when you debug. You might […]

Read more

A drop in single file solution for templating with Variable Precedence

Jinja2++ Jinja2++ is a drop in single file solution for templating with Variable Precedence. You can use ANY language! How it works Based on the Unix pipeline : JSON – stdin |> stdout – JSON A var_file is any executable that slurps JSON from STDIN, and spits JSON from STD out. We assemble a series of cascading var_files based on the file hierarchy to overwrite proceeding variables. We also do context injection, in the same fashion, ie context as the […]

Read more

Convert markdown to HTML using the GitHub API with Python

github-flavored-markdown-to-html A user-friendly python-module and command-line frontend to convert markdown to html. It uses GitHubs online Markdown-to-html-API by default (which requires internet connection), but comes with an option for offline conversion (which closely imitates GitHubs behavior), and any other python- or commandline tool can be plugged into it as well. Whatever you use it with is automatically extended with a ton of functionality, like more in- and output options, github-flavored CSS, formula support, image downloading, host-ready file- and image-placement, pdf-conversion, […]

Read more

Hugging Face – Issue 8 – Mar 30th 2021

News Hugging Face Raises Series B! 📣 We are so excited to announce our $40M series B led by Lee Fixel at Addition with participation from Lux Capital, A.Capital Ventures, and betaworks! Thank you to all our open source contributors, pull requesters, issue openers, notebook creators, model architects, tweeting supporters & community members all over the world 🌎! We couldn’t do what we do & be where we are – in a field    

Read more
1 23 24 25 26 27 71