APEX FusedRMSNorm not available, using native implementation
/home/ubuntu/vibevoice/vibevoice/processor/vibevoice_asr_processor.py:23: UserWarning: audio_utils not available, will fall back to soundfile for audio loading
  warnings.warn("audio_utils not available, will fall back to soundfile for audio loading")
loading file vocab.json from cache at /home/ubuntu/.cache/huggingface/hub/models--Qwen--Qwen2.5-1.5B/snapshots/8faed761d45a263340a0528343f099c05c9a4323/vocab.json
loading file merges.txt from cache at /home/ubuntu/.cache/huggingface/hub/models--Qwen--Qwen2.5-1.5B/snapshots/8faed761d45a263340a0528343f099c05c9a4323/merges.txt
loading file tokenizer.json from cache at /home/ubuntu/.cache/huggingface/hub/models--Qwen--Qwen2.5-1.5B/snapshots/8faed761d45a263340a0528343f099c05c9a4323/tokenizer.json
loading file added_tokens.json from cache at None
loading file special_tokens_map.json from cache at None
loading file tokenizer_config.json from cache at /home/ubuntu/.cache/huggingface/hub/models--Qwen--Qwen2.5-1.5B/snapshots/8faed761d45a263340a0528343f099c05c9a4323/tokenizer_config.json
loading file chat_template.jinja from cache at None
The tokenizer class you load from this checkpoint is not the same type as the class this function is called from. It may result in unexpected tokenization. 
The tokenizer class you load from this checkpoint is 'Qwen2Tokenizer'. 
The class this function is called from is 'VibeVoiceTextTokenizerFast'.
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
loading configuration file config.json from cache at /home/ubuntu/.cache/huggingface/hub/models--microsoft--VibeVoice-1.5B/snapshots/c00898d257e6b46004e3e2866a47534085fb685a/config.json
Model config VibeVoiceConfig {
  "acoustic_tokenizer_config": {
    "causal": true,
    "channels": 1,
    "conv_bias": true,
    "conv_norm": "none",
    "corpus_normalize": 0.0,
    "decoder_depths": null,
    "decoder_n_filters": 32,
    "decoder_ratios": [
      8,
      5,
      5,
      4,
      2,
      2
    ],
    "disable_last_norm": true,
    "encoder_depths": "3-3-3-3-3-3-8",
    "encoder_n_filters": 32,
    "encoder_ratios": [
      8,
      5,
      5,
      4,
      2,
      2
    ],
    "fix_std": 0.5,
    "layer_scale_init_value": 1e-06,
    "layernorm": "RMSNorm",
    "layernorm_elementwise_affine": true,
    "layernorm_eps": 1e-05,
    "mixer_layer": "depthwise_conv",
    "model_type": "vibevoice_acoustic_tokenizer",
    "pad_mode": "constant",
    "std_dist_type": "gaussian",
    "vae_dim": 64,
    "weight_init_value": 0.01
  },
  "acoustic_vae_dim": 64,
  "architectures": [
    "VibeVoiceForConditionalGeneration"
  ],
  "decoder_config": {
    "attention_dropout": 0.0,
    "hidden_act": "silu",
    "hidden_size": 1536,
    "initializer_range": 0.02,
    "intermediate_size": 8960,
    "max_position_embeddings": 65536,
    "max_window_layers": 28,
    "model_type": "qwen2",
    "num_attention_heads": 12,
    "num_hidden_layers": 28,
    "num_key_value_heads": 2,
    "rms_norm_eps": 1e-06,
    "rope_scaling": null,
    "rope_theta": 1000000.0,
    "sliding_window": null,
    "tie_word_embeddings": true,
    "torch_dtype": "bfloat16",
    "use_cache": true,
    "use_sliding_window": false,
    "vocab_size": 151936
  },
  "diffusion_head_config": {
    "ddpm_batch_mul": 4,
    "ddpm_beta_schedule": "cosine",
    "ddpm_num_inference_steps": 20,
    "ddpm_num_steps": 1000,
    "diffusion_type": "ddpm",
    "head_ffn_ratio": 3.0,
    "head_layers": 4,
    "hidden_size": 1536,
    "latent_size": 64,
    "model_type": "vibevoice_diffusion_head",
    "prediction_type": "v_prediction",
    "rms_norm_eps": 1e-05,
    "speech_vae_dim": 64
  },
  "model_type": "vibevoice",
  "semantic_tokenizer_config": {
    "causal": true,
    "channels": 1,
    "conv_bias": true,
    "conv_norm": "none",
    "corpus_normalize": 0.0,
    "disable_last_norm": true,
    "encoder_depths": "3-3-3-3-3-3-8",
    "encoder_n_filters": 32,
    "encoder_ratios": [
      8,
      5,
      5,
      4,
      2,
      2
    ],
    "fix_std": 0,
    "layer_scale_init_value": 1e-06,
    "layernorm": "RMSNorm",
    "layernorm_elementwise_affine": true,
    "layernorm_eps": 1e-05,
    "mixer_layer": "depthwise_conv",
    "model_type": "vibevoice_semantic_tokenizer",
    "pad_mode": "constant",
    "std_dist_type": "none",
    "vae_dim": 128,
    "weight_init_value": 0.01
  },
  "semantic_vae_dim": 128,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.51.3"
}
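A few derived quantities follow from the values in the config dump above. The sketch below works them out as plain arithmetic. The sample rate is not printed anywhere in this log, so the 24 kHz figure is an assumption, and the head width is computed the usual way (hidden size divided by head count) rather than read from the config.

```python
import math

# Values copied from the config dump above.
encoder_ratios = [8, 5, 5, 4, 2, 2]
hidden_size = 1536
num_attention_heads = 12
num_key_value_heads = 2

# ASSUMPTION: the input sample rate is not shown in this log; 24 kHz is assumed.
ASSUMED_SAMPLE_RATE_HZ = 24_000

# Total downsampling factor of the tokenizer encoder = product of the ratios.
hop_samples = math.prod(encoder_ratios)

# Latent frames per second under the assumed sample rate.
frame_rate_hz = ASSUMED_SAMPLE_RATE_HZ / hop_samples

# Per-head width and the number of query heads sharing each key/value head
# in the decoder (grouped-query attention).
head_dim = hidden_size // num_attention_heads
queries_per_kv_head = num_attention_heads // num_key_value_heads

print(hop_samples, frame_rate_hz, head_dim, queries_per_kv_head)
```

The same ratios appear in `decoder_ratios` and in the semantic tokenizer's `encoder_ratios`, so the hop size would be identical for those components.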

loading weights file model.safetensors from cache at /home/ubuntu/.cache/huggingface/hub/models--microsoft--VibeVoice-1.5B/snapshots/c00898d257e6b46004e3e2866a47534085fb685a/model.safetensors.index.json
Using device: cuda
Setting seed: 42
Found 10 voice files in /home/ubuntu/vibevoice/demo/voices
Available voices: en-Alice_woman, en-Carter_man, en-Frank_man, en-Mary_woman_bgm, en-Maya_woman, in-Samuel_man, modi, zh-Anchen_man_bgm, zh-Bowen_man, zh-Xinran_woman
Reading script from: demo/text_examples/modi_hindi.txt
Found 2 speaker segments:
  1. Speaker 1
     Text preview: Speaker 1: Mere pyaare deshvasiyon, aaj main aapke saath kuch bahut zaroori baatein karna chahta hoo...
  2. Speaker 1
     Text preview: Speaker 1: Aaj hum Digital India ki baat karte hain. Gaon gaon mein internet pahunch raha hai. Kisan...

Speaker mapping:
  Speaker 1 -> modi
Speaker 1 ('modi') -> Voice: modi.wav
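The segment listing and speaker mapping above suggest a script format of `Speaker N: text` lines. The following is a minimal sketch of how such a file could be split into segments and mapped to voices. It is not the demo script's actual parser: `parse_segments`, `map_speakers`, and the regular expression are hypothetical names introduced for illustration, and the sample text is shortened from the previews in the log.

```python
import re

# Hypothetical pattern: a line starting with "Speaker <number>:" then text.
SPEAKER_LINE = re.compile(r"^Speaker\s+(\d+)\s*:\s*(.*)$")


def parse_segments(script_text):
    """Return a list of (speaker_label, text) tuples, one per speaker line."""
    segments = []
    for raw_line in script_text.splitlines():
        line = raw_line.strip()
        match = SPEAKER_LINE.match(line)
        if match:
            segments.append((f"Speaker {match.group(1)}", match.group(2)))
        elif segments and line:
            # Continuation line: append it to the previous segment's text.
            label, text = segments[-1]
            segments[-1] = (label, f"{text} {line}")
    return segments


def map_speakers(segments, voice_names):
    """Map each distinct speaker, in order of first appearance, to a voice."""
    mapping = {}
    for label, _ in segments:
        if label not in mapping:
            # Raises IndexError if there are more speakers than voices.
            mapping[label] = voice_names[len(mapping)]
    return mapping


script = (
    "Speaker 1: Mere pyaare deshvasiyon\n"
    "Speaker 1: Aaj hum Digital India ki baat karte hain."
)
segments = parse_segments(script)
mapping = map_speakers(segments, ["modi"])
print(len(segments), mapping)
```

With one speaker and one voice name, both segments resolve to the same voice, which matches the single `Speaker 1 -> modi` entry in the log.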
Loading processor & model from microsoft/VibeVoice-1.5B
Using device: cuda, torch_dtype: torch.bfloat16, attn_implementation: flash_attention_2
Fetching 3 files: 100%|██████████| 3/3 [00:08<00:00,  2.82s/it]
Instantiating VibeVoiceForConditionalGenerationInference model under default dtype torch.bfloat16.
loading configuration file config.json from cache at /home/ubuntu/.cache/huggingface/hub/models--microsoft--VibeVoice-1.5B/snapshots/c00898d257e6b46004e3e2866a47534085fb685a/config.json
loading weights file model.safetensors from cache at /home/ubuntu/.cache/huggingface/hub/models--microsoft--VibeVoice-1.5B/snapshots/c00898d257e6b46004e3e2866a47534085fb685a/model.safetensors.index.json
Instantiating VibeVoiceForConditionalGenerationInference model under default dtype torch.bfloat16.
Generate config GenerationConfig {}

Instantiating Qwen2Model model under default dtype torch.bfloat16.
Instantiating VibeVoiceAcousticTokenizerModel model under default dtype torch.bfloat16.
Instantiating VibeVoiceSemanticTokenizerModel model under default dtype torch.bfloat16.
Instantiating VibeVoiceDiffusionHead model under default dtype torch.bfloat16.
[ERROR] : ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package flash_attn seems to be not installed. Please refer to the documentation of https://huggingface.co/docs/transformers/perf_infer_gpu_one#flashattention-2 to install Flash Attention 2.
Traceback (most recent call last):
  File "/home/ubuntu/vibevoice/demo/inference_from_file.py", line 305, in main
    model = VibeVoiceForConditionalGenerationInference.from_pretrained(
  File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 279, in _wrapper
    return func(*args, **kwargs)
  File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4336, in from_pretrained
    config = cls._autoset_attn_implementation(
  File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2109, in _autoset_attn_implementation
    cls._check_and_enable_flash_attn_2(
  File "/home/ubuntu/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2252, in _check_and_enable_flash_attn_2
    raise ImportError(f"{preface} the package flash_attn seems to be not installed. {install_message}")
ImportError: FlashAttention2 has been toggled on, but it cannot be used due to the following error: the package flash_attn seems to be not installed. Please refer to the documentation of https://huggingface.co/docs/transformers/perf_infer_gpu_one#flashattention-2 to install Flash Attention 2.

Error loading the model. Trying to use SDPA. However, note that only flash_attention_2 has been fully tested, and using SDPA may result in lower audio quality.
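The traceback above is raised because `flash_attn` is not installed, after which the script retries with SDPA. One way to avoid the failed first attempt is to check for the package before choosing an attention backend. The sketch below uses only the standard library; `pick_attn_implementation` is a hypothetical helper, not part of the demo script or of transformers.

```python
import importlib.util


def pick_attn_implementation(preferred="flash_attention_2", fallback="sdpa",
                             required_module="flash_attn"):
    """Return `preferred` if `required_module` is importable, else `fallback`."""
    # find_spec returns None when a top-level module cannot be located.
    if importlib.util.find_spec(required_module) is not None:
        return preferred
    return fallback


attn_implementation = pick_attn_implementation()
print(f"attn_implementation: {attn_implementation}")
```

The resulting string could then be passed as the `attn_implementation` argument to `from_pretrained`. As the log itself notes, only `flash_attention_2` has been fully tested, so the SDPA path may produce lower audio quality.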
Loading checkpoint shards: 100%|██████████| 3/3 [00:01<00:00,  2.02it/s]
All model checkpoint weights were used when initializing VibeVoiceForConditionalGenerationInference.

All the weights of VibeVoiceForConditionalGenerationInference were initialized from the model checkpoint at microsoft/VibeVoice-1.5B.
If your task is similar to the task the model of the checkpoint was trained on, you can already use VibeVoiceForConditionalGenerationInference for predictions without further training.
Generation config file not found, using a generation config created from the model config.
Voice cloning enabled: running generation with is_prefill=True
Language model attention: sdpa
Starting generation with cfg_scale: 1.3
Generating (active: 1/1):   0%|          | 0/576 [00:00<?, ?it/s]
Generating (active: 1/1):  27%|██▋       | 158/576 [00:20<00:52,  8.00it/s]
Generating (active: 1/1):  28%|██▊ 
      | 159/576 [00:20<00:52,  8.00it/s]Generating (active: 1/1):  28%|██▊       | 159/576 [00:20<00:52,  8.00it/s]Generating (active: 1/1):  28%|██▊       | 160/576 [00:20<00:52,  7.99it/s]Generating (active: 1/1):  28%|██▊       | 160/576 [00:20<00:52,  7.99it/s]Generating (active: 1/1):  28%|██▊       | 161/576 [00:20<00:52,  7.96it/s]Generating (active: 1/1):  28%|██▊       | 161/576 [00:20<00:52,  7.96it/s]Generating (active: 1/1):  28%|██▊       | 162/576 [00:20<00:51,  7.96it/s]Generating (active: 1/1):  28%|██▊       | 162/576 [00:20<00:51,  7.96it/s]Generating (active: 1/1):  28%|██▊       | 163/576 [00:21<00:51,  7.97it/s]Generating (active: 1/1):  28%|██▊       | 163/576 [00:21<00:51,  7.97it/s]Generating (active: 1/1):  28%|██▊       | 164/576 [00:21<00:51,  7.98it/s]Generating (active: 1/1):  28%|██▊       | 164/576 [00:21<00:51,  7.98it/s]Generating (active: 1/1):  29%|██▊       | 165/576 [00:21<00:51,  7.98it/s]Generating (active: 1/1):  29%|██▊       | 165/576 [00:21<00:51,  7.98it/s]Generating (active: 1/1):  29%|██▉       | 166/576 [00:21<00:51,  7.99it/s]Generating (active: 1/1):  29%|██▉       | 166/576 [00:21<00:51,  7.99it/s]Generating (active: 1/1):  29%|██▉       | 167/576 [00:21<00:51,  7.99it/s]Generating (active: 1/1):  29%|██▉       | 167/576 [00:21<00:51,  7.99it/s]Generating (active: 1/1):  29%|██▉       | 168/576 [00:21<00:51,  7.93it/s]Generating (active: 1/1):  29%|██▉       | 168/576 [00:21<00:51,  7.93it/s]Generating (active: 1/1):  29%|██▉       | 169/576 [00:21<00:51,  7.95it/s]Generating (active: 1/1):  29%|██▉       | 169/576 [00:21<00:51,  7.95it/s]Generating (active: 1/1):  30%|██▉       | 170/576 [00:21<00:51,  7.96it/s]Generating (active: 1/1):  30%|██▉       | 170/576 [00:21<00:51,  7.96it/s]Generating (active: 1/1):  30%|██▉       | 171/576 [00:22<00:50,  7.97it/s]Generating (active: 1/1):  30%|██▉       | 171/576 [00:22<00:50,  7.97it/s]Generating (active: 1/1):  30%|██▉       | 172/576 
[00:22<00:50,  7.98it/s]Generating (active: 1/1):  30%|██▉       | 172/576 [00:22<00:50,  7.98it/s]Generating (active: 1/1):  30%|███       | 173/576 [00:22<00:50,  7.94it/s]Generating (active: 1/1):  30%|███       | 173/576 [00:22<00:50,  7.94it/s]Generating (active: 1/1):  30%|███       | 174/576 [00:22<00:51,  7.83it/s]Generating (active: 1/1):  30%|███       | 174/576 [00:22<00:51,  7.83it/s]Generating (active: 1/1):  30%|███       | 175/576 [00:22<00:51,  7.81it/s]Generating (active: 1/1):  30%|███       | 175/576 [00:22<00:51,  7.81it/s]Generating (active: 1/1):  31%|███       | 176/576 [00:22<00:51,  7.80it/s]Generating (active: 1/1):  31%|███       | 176/576 [00:22<00:51,  7.80it/s]Generating (active: 1/1):  31%|███       | 177/576 [00:22<00:51,  7.79it/s]Generating (active: 1/1):  31%|███       | 177/576 [00:22<00:51,  7.79it/s]Generating (active: 1/1):  31%|███       | 178/576 [00:22<00:51,  7.76it/s]Generating (active: 1/1):  31%|███       | 178/576 [00:22<00:51,  7.76it/s]Generating (active: 1/1):  31%|███       | 179/576 [00:23<00:51,  7.65it/s]Generating (active: 1/1):  31%|███       | 179/576 [00:23<00:51,  7.65it/s]Generating (active: 1/1):  31%|███▏      | 180/576 [00:23<00:51,  7.63it/s]Generating (active: 1/1):  31%|███▏      | 180/576 [00:23<00:51,  7.63it/s]Generating (active: 1/1):  31%|███▏      | 181/576 [00:23<00:54,  7.23it/s]Generating (active: 1/1):  31%|███▏      | 181/576 [00:23<00:54,  7.23it/s]Generating (active: 1/1):  32%|███▏      | 182/576 [00:23<00:56,  6.92it/s]Generating (active: 1/1):  32%|███▏      | 182/576 [00:23<00:56,  6.92it/s]Generating (active: 1/1):  32%|███▏      | 183/576 [00:23<01:03,  6.18it/s]Generating (active: 1/1):  32%|███▏      | 183/576 [00:23<01:03,  6.18it/s]Generating (active: 1/1):  32%|███▏      | 184/576 [00:23<01:05,  5.96it/s]Generating (active: 1/1):  32%|███▏      | 184/576 [00:23<01:05,  5.96it/s]Generating (active: 1/1):  32%|███▏      | 185/576 [00:24<01:01,  
6.40it/s]Generating (active: 1/1):  32%|███▏      | 185/576 [00:24<01:01,  6.40it/s]Generating (active: 1/1):  32%|███▏      | 186/576 [00:24<00:57,  6.78it/s]Generating (active: 1/1):  32%|███▏      | 186/576 [00:24<00:57,  6.78it/s]Generating (active: 1/1):  32%|███▏      | 187/576 [00:24<00:54,  7.08it/s]Generating (active: 1/1):  32%|███▏      | 187/576 [00:24<00:54,  7.08it/s]Generating (active: 1/1):  33%|███▎      | 188/576 [00:24<00:53,  7.30it/s]Generating (active: 1/1):  33%|███▎      | 188/576 [00:24<00:53,  7.30it/s]Generating (active: 1/1):  33%|███▎      | 189/576 [00:24<00:52,  7.44it/s]Generating (active: 1/1):  33%|███▎      | 189/576 [00:24<00:52,  7.44it/s]Generating (active: 1/1):  33%|███▎      | 190/576 [00:24<00:51,  7.55it/s]Generating (active: 1/1):  33%|███▎      | 190/576 [00:24<00:51,  7.55it/s]Generating (active: 1/1):  33%|███▎      | 191/576 [00:24<00:50,  7.66it/s]Generating (active: 1/1):  33%|███▎      | 191/576 [00:24<00:50,  7.66it/s]Generating (active: 1/1):  33%|███▎      | 192/576 [00:24<00:49,  7.74it/s]Generating (active: 1/1):  33%|███▎      | 192/576 [00:24<00:49,  7.74it/s]Generating (active: 1/1):  34%|███▎      | 193/576 [00:25<00:49,  7.80it/s]Generating (active: 1/1):  34%|███▎      | 193/576 [00:25<00:49,  7.80it/s]Generating (active: 1/1):  34%|███▎      | 194/576 [00:25<00:48,  7.85it/s]Generating (active: 1/1):  34%|███▎      | 194/576 [00:25<00:48,  7.85it/s]Generating (active: 1/1):  34%|███▍      | 195/576 [00:25<00:48,  7.89it/s]Generating (active: 1/1):  34%|███▍      | 195/576 [00:25<00:48,  7.89it/s]Generating (active: 1/1):  34%|███▍      | 196/576 [00:25<00:48,  7.92it/s]Generating (active: 1/1):  34%|███▍      | 196/576 [00:25<00:48,  7.92it/s]Generating (active: 1/1):  34%|███▍      | 197/576 [00:25<00:47,  7.93it/s]Generating (active: 1/1):  34%|███▍      | 197/576 [00:25<00:47,  7.93it/s]Generating (active: 1/1):  34%|███▍      | 198/576 [00:25<00:47,  7.93it/s]Generating 
(active: 1/1):  34%|███▍      | 198/576 [00:25<00:47,  7.93it/s]Generating (active: 1/1):  35%|███▍      | 199/576 [00:25<00:47,  7.95it/s]Generating (active: 1/1):  35%|███▍      | 199/576 [00:25<00:47,  7.95it/s]Generating (active: 1/1):  35%|███▍      | 200/576 [00:25<00:47,  7.96it/s]Generating (active: 1/1):  35%|███▍      | 200/576 [00:25<00:47,  7.96it/s]Generating (active: 1/1):  35%|███▍      | 201/576 [00:26<00:47,  7.95it/s]Generating (active: 1/1):  35%|███▍      | 201/576 [00:26<00:47,  7.95it/s]Generating (active: 1/1):  35%|███▌      | 202/576 [00:26<00:47,  7.89it/s]Generating (active: 1/1):  35%|███▌      | 202/576 [00:26<00:47,  7.89it/s]Generating (active: 1/1):  35%|███▌      | 203/576 [00:26<00:47,  7.90it/s]Generating (active: 1/1):  35%|███▌      | 203/576 [00:26<00:47,  7.90it/s]Generating (active: 1/1):  35%|███▌      | 204/576 [00:26<00:46,  7.92it/s]Generating (active: 1/1):  35%|███▌      | 204/576 [00:26<00:46,  7.92it/s]Generating (active: 1/1):  36%|███▌      | 205/576 [00:26<00:46,  7.93it/s]Generating (active: 1/1):  36%|███▌      | 205/576 [00:26<00:46,  7.93it/s]Generating (active: 1/1):  36%|███▌      | 206/576 [00:26<00:47,  7.86it/s]Generating (active: 1/1):  36%|███▌      | 206/576 [00:26<00:47,  7.86it/s]Generating (active: 1/1):  36%|███▌      | 207/576 [00:26<00:46,  7.87it/s]Generating (active: 1/1):  36%|███▌      | 207/576 [00:26<00:46,  7.87it/s]Generating (active: 1/1):  36%|███▌      | 208/576 [00:26<00:46,  7.89it/s]Generating (active: 1/1):  36%|███▌      | 208/576 [00:26<00:46,  7.89it/s]Generating (active: 1/1):  36%|███▋      | 209/576 [00:27<00:46,  7.90it/s]Generating (active: 1/1):  36%|███▋      | 209/576 [00:27<00:46,  7.90it/s]Generating (active: 1/1):  36%|███▋      | 210/576 [00:27<00:46,  7.91it/s]Generating (active: 1/1):  36%|███▋      | 210/576 [00:27<00:46,  7.91it/s]Generating (active: 1/1):  37%|███▋      | 211/576 [00:27<00:45,  7.94it/s]Generating (active: 1/1):  
37%|███▋      | 211/576 [00:27<00:45,  7.94it/s]Generating (active: 1/1):  37%|███▋      | 212/576 [00:27<00:45,  7.95it/s]Generating (active: 1/1):  37%|███▋      | 212/576 [00:27<00:45,  7.95it/s]Generating (active: 1/1):  37%|███▋      | 213/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  37%|███▋      | 213/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  37%|███▋      | 214/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  37%|███▋      | 214/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  37%|███▋      | 215/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  37%|███▋      | 215/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  38%|███▊      | 216/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  38%|███▊      | 216/576 [00:27<00:45,  7.96it/s]Generating (active: 1/1):  38%|███▊      | 217/576 [00:28<00:45,  7.97it/s]Generating (active: 1/1):  38%|███▊      | 217/576 [00:28<00:45,  7.97it/s]Generating (active: 1/1):  38%|███▊      | 218/576 [00:28<00:44,  7.96it/s]Generating (active: 1/1):  38%|███▊      | 218/576 [00:28<00:44,  7.96it/s]Generating (active: 1/1):  38%|███▊      | 219/576 [00:28<00:44,  7.97it/s]Generating (active: 1/1):  38%|███▊      | 219/576 [00:28<00:44,  7.97it/s]Generating (active: 1/1):  38%|███▊      | 220/576 [00:28<00:44,  7.98it/s]Generating (active: 1/1):  38%|███▊      | 220/576 [00:28<00:44,  7.98it/s]Generating (active: 1/1):  38%|███▊      | 221/576 [00:28<00:44,  7.99it/s]Generating (active: 1/1):  38%|███▊      | 221/576 [00:28<00:44,  7.99it/s]Generating (active: 1/1):  39%|███▊      | 222/576 [00:28<00:44,  7.98it/s]Generating (active: 1/1):  39%|███▊      | 222/576 [00:28<00:44,  7.98it/s]Generating (active: 1/1):  39%|███▊      | 223/576 [00:28<00:44,  7.99it/s]Generating (active: 1/1):  39%|███▊      | 223/576 [00:28<00:44,  7.99it/s]Generating (active: 1/1):  39%|███▉      | 224/576 [00:28<00:44,  7.95it/s]Generating (active: 1/1):  39%|███▉      | 224/576 
[00:28<00:44,  7.95it/s]Generating (active: 1/1):  39%|███▉      | 224/576 [00:29<00:44,  7.95it/s]Samples [0] reached EOS token at step 226.
Generation complete:  39%|███▉      | 224/576 [00:29<00:44,  7.95it/s]
Generation time: 29.11 seconds
Generated audio duration: 29.87 seconds
RTF (Real Time Factor): 0.97x
Prefilling tokens: 288
Generated tokens: 226
Total tokens: 514
Saved output to /home/ubuntu/vibevoice_output/modi_hindi_generated.wav

==================================================
GENERATION SUMMARY
==================================================
Input file: demo/text_examples/modi_hindi.txt
Output file: /home/ubuntu/vibevoice_output/modi_hindi_generated.wav
Speaker names: ['modi']
Number of unique speakers: 1
Number of segments: 2
Prefilling tokens: 288
Generated tokens: 226
Total tokens: 514
Generation time: 29.11 seconds
Audio duration: 29.87 seconds
RTF (Real Time Factor): 0.97x
Seed used: 42
==================================================