Just links

Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video https://arxiv.org/abs/2203.08534
#cv #3d

👍1

901 views08:03

Just links

A Conversational Paradigm for Program Synthesis https://arxiv.org/abs/2203.13474
#plm

👍1

820 views03:53

Just links

SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation https://arxiv.org/abs/2203.13312
#cv #instance

894 views03:55

Just links

Weak-coupling to strong-coupling quantum criticality crossover in a Kitaev quantum spin liquid α-RuCl3 https://arxiv.org/abs/2203.13407
#physics #ksl #αrucl

954 viewsedited 04:31

Just links

https://twitter.com/kolesnikov/status/1508448580960501760

Twitter

Alexander Kolesnikov

@karpathy The question is partially addressed here (as a by-product of studying the effect of the batch size): jmlr.org/papers/volume2…. For example for ImageNet they show that until the batch size becomes huge, the number of images seen is the only thing…

2.95K views14:48

Just links

https://twitter.com/evgeniyzhe/status/1508833760946671620

Twitter

Evgenii Zheltonozhskii

@n_astrakhantsev @dmitry_grinko Aleksandr Berezutskii and me are 2nd at #QHack2022 @qiskit challenge reproducing @GoogleQuantumAI @PedramRoushan paper on anyons in toric code simul github.com/XanaduAI/QHack… 31 qubit simul on Mac. Hope to release real hardware…

🔥5

1.26K views15:50

Just links

Just links pinned «https://twitter.com/evgeniyzhe/status/1508833760946671620»

15:51

Just links

Training Compute-Optimal Large Language Models https://arxiv.org/abs/2203.15556
#nlp #llm

1.07K views01:20

Just links

Forwarded from Arxiv

- Self-supervised machine learning model for analysis of nanowire morphologies from transmission electron microscopy images. (arXiv:2203.13875v1 [cond-mat.mtrl-sci])
http://arxiv.org/abs/2203.13875

906 views02:37

Just links

Pathways: Asynchronous Distributed Dataflow for ML https://arxiv.org/abs/2203.12533
#ml #large_scale

945 views06:30

Just links

Review of experiments on the chiral anomaly in Dirac-Weyl semimetals https://arxiv.org/abs/2010.08564
#physics #weyl_semimetals

861 views16:54

Just links

Exploring Plain Vision Transformer Backbones for Object Detection https://arxiv.org/abs/2203.16527
#cv #detection

883 views04:26

Just links

Forwarded from Empty Set of Ideas

Если считать гомологии ресурсоёмко, то вот, есть сеточка для подсчета персистентных гомологий на облаках точек

GitHub

GitHub - hensel-f/ripsnet: RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds - GitHub - hensel-f/ripsnet: RipsNet: a general architecture for fast and robust estimation...

837 views12:59

Just links

High-Temperature Majorana Zero Modes https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.128.137002
#physics #mzm

Physical Review Letters

High-Temperature Majorana Zero Modes

A new proposal for generating Majorana zero modes---electronic states with potential for quantum computing---would not require subkelvin temperatures.

870 views14:47

Just links

Contrasting the landscape of contrastive and non-contrastive learning https://arxiv.org/abs/2203.15702
#self_supervised #contrastive

861 views06:54

Just links

Forwarded from Experimental chill

Продолжаем наши пути неисповедимые в сортировке в C++.

Ох, наконец-то мне можно говорить об этом.

Тут наши друзья из DeepMind решили запушить свои находки в сортировках 3, 4 и 5 элементов примитивных типов. https://reviews.llvm.org/D118029

Такой кейс очень интересный, потому что компилируются в машинный код без веток (только с помощью cmov).

Количество инструкций скомпилированного sortN без веток равно 2N + 4M (M -- оптимальное количество сравнений N элементов):

1. N копирований инструкций из памяти
2. N копирований инструкции из регистров
3. 4 инструкции на компаратор
3.1. Переместить во временный регистр
3.2. Сравнить
3.3. 2 условных хода с помощью cmov

Если посчитать количество инструкций, то вы можете увидеть
Sort3 2*3 + 4*3 = 18 (3 элемента за 3 сравнения)
Sort4 2*4 + 4*5 = 28 (4 элемента за 5 сравнений)
Sort5 2*5 + 4*9 = 46 (5 элементов за 9 сравнений)

И компилятор это генерирует на картинке снизу и по ссылке https://gcc.godbolt.org/z/Mdn8WxaMK

Ребята из DeepMind решили применить MuZero (та самая AlphaZero, дада) на то, чтобы она поискала какие-то улучшения в branchless sorting

И она нашла как сделать sort3 за 17 инструкций, sort5 за 43.

Условно когда мы сортируем 3 элемента A, B, C мы делаем

cond_swap(B, C)
cond_swap(A, C)
cond_swap(A, B)

Каждая по 6 инструкций

MuZero нашёл это сделать так:

cond_swap(B, C) // B < C
magic_swap(A, B, C)

magic_swap похож на двойной cond_swap, но с одним отличием:

1. Move C into tmp.
2. Compare A and C.
3. Conditionally move A into C.
4. Conditionally move A into tmp.
// By now C’ = max(A, C), tmp = min(A, C)
~~Move tmp into A~~. !!!, эта была в двойном cond_swap, а теперь ушло
5. Compare tmp and B.
6. Conditionally move B into A.
7. Conditionally move tmp into B.

Это настолько круто, насколько это возможно. Теперь мы с помощью reinforcement learning находим оптимизации в сортировках.

Я пилю просто огромный пост по поводу того, что мы в итоге сделали с сортировками в Google, это будет одна из мелких частей.

gcc.godbolt.org

Compiler Explorer - C++

template <typename _Compare, typename _RandomAccessIterator>
inline void
__magic_swap(_RandomAccessIterator __x, _RandomAccessIterator __y,
_RandomAccessIterator __z, _Compare __c) {
typedef
typename std::iterator_traits<_RandomAcces…

👍13👎1

681 views11:44

Just links

MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation https://arxiv.org/abs/2203.14662
#cv #3d #instance

781 views11:16

Just links

DeepDPM: Deep Clustering With an Unknown Number of Clusters https://arxiv.org/abs/2203.14309
#cv #clustering

arXiv.org

DeepDPM: Deep Clustering With an Unknown Number of Clusters