🗣 NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
Neural network-based singing voice synthesis library for research
⭐️ Docs: https://nnsvs.github.io/
🔩 Github: https://github.com/nnsvs/nnsvs
📄 Paper: https://arxiv.org/abs/2210.15987v1
↪️ Samples by r9y9: https://soundcloud.com/r9y9/sets/dnn-based-singing-voice
👣 Demo: https://www.youtube.com/watch?time_continue=1&v=0sSd31TUVCU&feature=emb_logo&ab_channel=DYVAUX
@ai_machinelearning_big_data
⭐️ DiffusionDB
DiffusionDB is the first large-scale text-to-image prompt dataset.
🖥 Github: https://github.com/poloclub/diffusiondb
🗒 Paper: https://arxiv.org/abs/2210.14896v1
➡️ Dataset: https://huggingface.co/datasets/poloclub/diffusiondb
@ai_machinelearning_big_data
➡️ Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study
conda create -n knowprompt python=3.8
conda activate knowprompt
🖥 Github: https://github.com/zjunlp/KnowPrompt
🗒 Paper: https://arxiv.org/abs/2210.10678v1
➡️ Dataset: https://github.com/zjunlp/KnowPrompt/blob/master/dataset/semeval
@ai_machinelearning_big_data
💨 Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
pip3 install spectralcluster==0.1.0
🖥 Github: https://github.com/wq2012/SpectralCluster
🗒 Paper: https://arxiv.org/abs/2210.13690v1
🔩 Speaker Diarization with LSTM: https://google.github.io/speaker-id/publications/LstmDiarization/
@ai_machinelearning_big_data
➡️ MetaFormer Baselines for Vision
🖥 Github: https://github.com/sail-sg/metaformer
🗒 Paper: https://arxiv.org/abs/2210.13452v1
➡️ Dataset: https://paperswithcode.com/dataset/imagenet
@ai_machinelearning_big_data