Just links – Telegram

Just links

6.57K subscribers

358 photos

39 videos

10 files

7.76K links

That's just link aggregator of everything I consider interesting, especially DL and topological condensed matter physics. @EvgeniyZh

Download Telegram

About

Blog

Apps

Platform

6.57K subscribers

https://media.neurips.cc/Conferences/NIPS2018/Slides/Deep_Unsupervised_Learning.pdf

433 views07:37

https://twitter.com/DeepMindAI/status/1087743023100903426

Join us and @Blizzard_Ent this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by @Artosis and @RotterdaM08! Livestream on YouTube: https://t.co/lQytLEsT0o Read more about #StarCraft2 as an environment for AI research: https://t.co/TSUdS9vttG

459 views16:36

Forwarded from Spark in me (Alexander)

Pre-trained BERT in PyTorch

https://github.com/huggingface/pytorch-pretrained-BERT

(1)
Model code here is just awesome.
Integrated DataParallel / DDP wrappers / FP16 wrappers also are awesome.

FP16 precision training from APEX just works (no idea about convergence though yet).

(2)
As for model weights - I cannot really tell, there is no dedicated Russian model.
The only problem I am facing now - using large embeddings bags batch size is literally 1-4 even for smaller models.

And training models with sentence piece is kind of feasible for rich languages, but you will always worry about generalization.

(3)
Did not try the generative pre-training (and sentence prediction pre-training), I hope that properly initializing embeddings will also work for a closed domain with a smaller model (they pre-train 4 days on 4+ TPUs, lol).

(5)
Why even tackle such models?
Chat / dialogue / machine comprehension models are complex / require one-off feature engineering.
Being able to tune something like BERT on publicly available benchmarks and then on your domain can provide a good way to embed complex situations (like questions in dialogues).

#nlp
#deep_learning

GitHub - huggingface/transformers: 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models…

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training. - GitHub - huggingface/t...

12 views08:13

https://www.reddit.com/r/Coq/comments/aj87a5/so_i_translated_part_of_the_1st_chapter_of_adam/

From the Coq community on Reddit: So I translated part of the 1st chapter of Adam Chlipala's book code from Coq into C++ template…

Explore this post and more from the Coq community

486 views08:44

https://twitter.com/aureliengeron/status/1088358749561999360

Aurélien Geron

Just added code examples and exercises on #autodiff in @TensorFlow 2.0. Covers autodiff basics, computing 2nd order derivatives, how to write a custom training loop to train a Keras model, and how to use running metrics. See notebook #2 in https://t.co/j77gSiJDVt…

427 views17:39

Forwarded from Hacker News

AlphaStar: Mastering the Real-Time Strategy Game StarCraft II (Score: 112+ in 1 hour)

Link: https://readhacker.news/s/3Wamk
Comments: https://readhacker.news/c/3Wamk

8 views07:04

https://www.technologyreview.com/s/612768/we-analyzed-16625-papers-to-figure-out-where-ai-is-headed-next/

MIT Technology Review

We analyzed 16,625 papers to figure out where AI is headed next

Our study of 25 years of artificial-intelligence research suggests the era of deep learning may come to an end.

429 views14:54

https://www.reddit.com/r/MachineLearning/comments/ajgzoc/we_are_oriol_vinyals_and_david_silver_from/

We are Oriol Vinyals and David Silver from DeepMind’s AlphaStar...

Hi there! We are Oriol Vinyals (/u/OriolVinyals) and David Silver (/u/David_Silver), lead researchers on DeepMind’s AlphaStar team, joined by...

424 views15:22

https://github.com/quantumlib/OpenFermion

GitHub - quantumlib/OpenFermion: Python package for compiling and analyzing quantum algorithms to simulate electronic structures.

Python package for compiling and analyzing quantum algorithms to simulate electronic structures. - quantumlib/OpenFermion

448 views16:34

https://arxiv.org/abs/1901.08106

430 views17:45

https://arxiv.org/abs/1901.05003

Quantum Teleportation-Inspired Algorithm for Sampling Large Random...

We show that low-depth random quantum circuits can be efficiently simulated
by a quantum teleportation-inspired algorithm. By using logical qubits to
redirect and teleport the quantum information...

431 views21:57

https://ai.google.com/research/ConceptualCaptions/competition

439 views21:59

https://arxiv.org/abs/1901.03440

503 views22:02

https://arxiv.org/abs/1901.08162

Causal Reasoning from Meta-reinforcement Learning

Discovering and exploiting the causal structure in the environment is a crucial challenge for intelligent agents. Here we explore whether causal reasoning can emerge via meta-reinforcement...

461 views23:32

https://arxiv.org/abs/1901.06140

Backbone Can Not be Trained at Once: Rolling Back to Pre-trained...

In person re-identification (ReID) task, because of its shortage of trainable
dataset, it is common to utilize fine-tuning method using a classification
network pre-trained on a large dataset....

519 views23:38

https://github.com/westerndigitalcorporation/swerv_eh1

GitHub - westerndigitalcorporation/swerv_eh1: A directory of Western Digital’s RISC-V SweRV Cores

A directory of Western Digital’s RISC-V SweRV Cores - westerndigitalcorporation/swerv_eh1

474 views09:43

https://arxiv.org/abs/1811.08027

Neural Lander: Stable Drone Landing Control using Learned Dynamics

Precise near-ground trajectory control is difficult for multi-rotor drones, due to the complex aerodynamic effects caused by interactions between multi-rotor airflow and the environment....

487 views10:37

https://twitter.com/MaureenBug/status/1088999913659105280

Dr Maureen Berg, microboiologist

okay so I gave a 10 min presentation today about not hating your life in grad school, and here are the slides as promised. this covers a VERY small amount of good advice that is out there. i modified it just slightly bc i was giving some advice specific to…

544 views11:54

https://twitter.com/JanelleCShane/status/1089202198125240320

Dogflowers and geyserdogs: exploring the latent space of #BigGAN using https://t.co/SdupSbLVIx https://t.co/dsgTtsvBBu

627 views20:13

https://arxiv.org/abs/1901.08168

https://github.com/danielkunin/Regularized-Linear-Autoencoders

Loss Landscapes of Regularized Linear Autoencoders

Autoencoders are a deep learning model for representation learning. When trained to minimize the distance between the data and its reconstruction, linear autoencoders (LAEs) learn the subspace...

549 viewsedited 08:40

Physics in Two Dimensions https://courses.physics.illinois.edu/phys598PTD/fa2013/

581 viewsedited 13:09