Zhen Lv's personal website

Hi! I'm Lv Zhen

I'm currently a senior algorithm expert in Computer Science at Alibaba.

My research interests lie in Computer Vision and Deep Learning, focusing on 3D vision, multimodal models mainly.

Education

Ph.D. in Wuhan University
2011 - 2018
Photogrammetry and Remote Sensing

B.S. in Wuhan University
2007 - 2011
Remote Sensing Science and Technology

Publication

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input

An Adaptive Multifeature Method for Semiautomatic Road Extraction From High-Resolution Stereo Mapping Satellite Images

IEEE Geoscience and Remote Sensing Letters (GRSL), 2019

An Adaptive Multifeature Sparsity-Based Model for Semiautomatic Road Extraction From High-Resolution Satellite Images in Urban Areas

IEEE Geoscience and Remote Sensing Letters (GRSL), 2017

Joint image registration and point spread function estimation for the super-resolution of satellite images

Signal Processing: Image Communication (SPIC), 2017

A New Change Detection Method of Remote Sensing Image

Geomatics and Information Science of Wuhan University, 2016

Work Experience

ALIBABA - SENIOR ALGORITHM EXPERT / APR 2021 - NOW

Led the development of speech driven facial animation similar to Lipsync.
Drove and launched the project that realized 3D hand tracking in the wild, as well as virtual scene interaction by Unreal engine on mobile.
Realized the AI music creation application including lyric generation, singing voice synthesis and singing voice conversion.
Realized the application of dance motion creation by diffusion model.
Dominated 3D photo/video project similar to Apple 15 pro's 3D spatial video.

Hangzhou Faceunity Technology - Senior CV Engineer / FEB 2020 - Mar 2021

Developed video processing algorithms and optimized mobile applications.
Duplicated After Effect's plugin LockDown, including 2D mesh tracking (ARAP), mesh rendering on mobile.
Completed face swapping algorithm based on DeepFace.

Vivo Mobile Communication - CV Engineer / Dec 2017 - Jan 2020

Responsible for projects including multi-image registration, denoising, and de- blurring, completing PC and mobiles side algorithm development work.

Engineering Project

4K Spatial Video with SOTA Performance

Realtime LIPSYNC

AIGC-Based Song and Dance Animation Generation

Reimplementation of AE LOCKDOWN

APP:随剪

FACE SWAPING

Italian Trulli

AR 3D Hand Interaction in Real-Time on Mobile