Mock Version: 3.5 ENTER ['do_with_status'](['bash', '--login', '-c', '/usr/bin/rpmbuild -bs --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'], chrootPath='/var/lib/mock/openeuler-23.09-x86_64-1713230795.511514/root'env={'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'}shell=Falselogger=timeout=0uid=1000gid=135user='mockbuild'nspawn_args=[]unshare_net=TrueprintOutput=True) Executing command: ['bash', '--login', '-c', '/usr/bin/rpmbuild -bs --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'] with env {'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'} and shell False :1: DeprecationWarning: The distutils package is deprecated and slated for removal in Python 3.12. Use setuptools or check PEP 632 for potential alternatives :1: DeprecationWarning: The distutils.sysconfig module is deprecated, use sysconfig instead Building target platforms: x86_64 Building for target x86_64 Wrote: /builddir/build/SRPMS/deepspeed-0.14.0-1.src.rpm Child return code was: 0 Mock Version: 3.5 ENTER ['do_with_status'](['bash', '--login', '-c', '/usr/bin/rpmbuild -bs --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'], chrootPath='/var/lib/mock/openeuler-23.09-x86_64-1713230795.511514/root'env={'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'}shell=Falselogger=timeout=0uid=1000gid=135user='mockbuild'nspawn_args=[]unshare_net=TrueprintOutput=True) Executing command: ['bash', '--login', '-c', '/usr/bin/rpmbuild -bs --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'] with env {'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'} and shell False :1: DeprecationWarning: The distutils package is deprecated and slated for removal in Python 3.12. Use setuptools or check PEP 632 for potential alternatives :1: DeprecationWarning: The distutils.sysconfig module is deprecated, use sysconfig instead Building target platforms: x86_64 Building for target x86_64 Wrote: /builddir/build/SRPMS/deepspeed-0.14.0-1.src.rpm Child return code was: 0 ENTER ['do_with_status'](['bash', '--login', '-c', '/usr/bin/rpmbuild -bb --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'], chrootPath='/var/lib/mock/openeuler-23.09-x86_64-1713230795.511514/root'env={'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'}shell=Falselogger=timeout=0uid=1000gid=135user='mockbuild'nspawn_args=[]unshare_net=TrueprintOutput=True) Executing command: ['bash', '--login', '-c', '/usr/bin/rpmbuild -bb --target x86_64 --nodeps /builddir/build/SPECS/deepspeed.spec'] with env {'TERM': 'vt100', 'SHELL': '/bin/bash', 'HOME': '/builddir', 'HOSTNAME': 'mock', 'PATH': '/usr/bin:/bin:/usr/sbin:/sbin', 'PROMPT_COMMAND': 'printf "\\033]0;\\007"', 'PS1': ' \\s-\\v\\$ ', 'LANG': 'C.UTF-8'} and shell False Building target platforms: x86_64 Building for target x86_64 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.Kx0bI7 + umask 022 + cd /builddir/build/BUILD + cd /builddir/build/BUILD + rm -rf DeepSpeed-0.14.0 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/DeepSpeed-0.14.0.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd DeepSpeed-0.14.0 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.xeQayP + umask 022 + cd /builddir/build/BUILD + cd DeepSpeed-0.14.0 + CFLAGS='-O2 -g -pipe -Wall -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -Wp,-D_GLIBCXX_ASSERTIONS -fstack-protector-strong -grecord-gcc-switches -specs=/usr/lib/rpm/generic-hardened-cc1 -m64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection ' + LDFLAGS='-Wl,-z,relro -Wl,-z,now -specs=/usr/lib/rpm/generic-hardened-ld' + /usr/bin/python3 -mpip wheel --verbose --progress-bar off --disable-pip-version-check --use-pep517 --no-build-isolation --no-deps --wheel-dir ./build . Processing /builddir/build/BUILD/DeepSpeed-0.14.0 Preparing metadata (pyproject.toml): started Running command Preparing metadata (pyproject.toml) /bin/sh: line 1: type: git: not found [WARNING] Torch did not find cuda available, if cross-compiling or running with cpu only you can ignore this message. Adding compute capability for Pascal, Volta, and Turing (compute capabilities 6.0, 6.1, 6.2) DS_BUILD_OPS=0 [WARNING] async_io requires the dev libaio .so object and headers but these were not found. [WARNING] async_io: please install the libaio-devel package with yum [WARNING] If libaio is already installed (perhaps from source), try setting the CFLAGS and LDFLAGS environment variables to where it can be found. [WARNING] Please specify the CUTLASS repo directory as environment variable $CUTLASS_PATH [WARNING] sparse_attn cuda is not available from torch [WARNING] sparse_attn requires a torch version >= 1.5 and < 2.0 but detected 2.0 [WARNING] please install triton==1.0.0 if you want to use sparse attention Install Ops={'async_io': False, 'fused_adam': False, 'cpu_adam': False, 'cpu_adagrad': False, 'cpu_lion': False, 'evoformer_attn': False, 'fused_lamb': False, 'fused_lion': False, 'inference_core_ops': False, 'cutlass_ops': False, 'transformer_inference': False, 'quantizer': False, 'ragged_device_ops': False, 'ragged_ops': False, 'random_ltd': False, 'sparse_attn': False, 'spatial_inference': False, 'transformer': False, 'stochastic_transformer': False} version=0.14.0+unknown, git_hash=unknown, git_branch=unknown install_requires=['hjson', 'ninja', 'numpy', 'packaging>=20.0', 'psutil', 'py-cpuinfo', 'pydantic', 'pynvml', 'torch', 'tqdm'] compatible_ops={'async_io': False, 'fused_adam': True, 'cpu_adam': True, 'cpu_adagrad': True, 'cpu_lion': True, 'evoformer_attn': False, 'fused_lamb': True, 'fused_lion': True, 'inference_core_ops': True, 'cutlass_ops': True, 'transformer_inference': True, 'quantizer': True, 'ragged_device_ops': True, 'ragged_ops': True, 'random_ltd': True, 'sparse_attn': False, 'spatial_inference': True, 'transformer': True, 'stochastic_transformer': True, 'deepspeed_not_implemented': False} ext_modules=[] running dist_info creating /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info writing /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/PKG-INFO writing dependency_links to /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/dependency_links.txt writing entry points to /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/entry_points.txt writing requirements to /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/requires.txt writing top-level names to /tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/top_level.txt writing manifest file '/tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/SOURCES.txt' reading manifest file '/tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/SOURCES.txt' reading manifest template 'MANIFEST.in' warning: no files found matching 'deepspeed/inference/v2/kernels/ragged_ops/libs/*.so' warning: no files found matching 'deepspeed/inference/v2/kernels/cutlass_ops/libs/*.so' warning: no files found matching '*.hip' under directory 'deepspeed' warning: no files found matching '*.cc' under directory 'deepspeed' warning: no files found matching '*.tr' under directory 'csrc' warning: no files found matching '*.cc' under directory 'csrc' warning: no files found matching '*.py' under directory 'benchmarks' adding license file 'LICENSE' writing manifest file '/tmp/pip-modern-metadata-orq_f374/deepspeed.egg-info/SOURCES.txt' creating '/tmp/pip-modern-metadata-orq_f374/deepspeed-0.14.0+unknown.dist-info' deepspeed build time = 0.2354264259338379 secs Preparing metadata (pyproject.toml): finished with status 'done' Building wheels for collected packages: deepspeed Building wheel for deepspeed (pyproject.toml): started Running command Building wheel for deepspeed (pyproject.toml) /bin/sh: line 1: type: git: not found [WARNING] Torch did not find cuda available, if cross-compiling or running with cpu only you can ignore this message. Adding compute capability for Pascal, Volta, and Turing (compute capabilities 6.0, 6.1, 6.2) DS_BUILD_OPS=0 [WARNING] async_io requires the dev libaio .so object and headers but these were not found. [WARNING] async_io: please install the libaio-devel package with yum [WARNING] If libaio is already installed (perhaps from source), try setting the CFLAGS and LDFLAGS environment variables to where it can be found. [WARNING] Please specify the CUTLASS repo directory as environment variable $CUTLASS_PATH [WARNING] sparse_attn cuda is not available from torch [WARNING] sparse_attn requires a torch version >= 1.5 and < 2.0 but detected 2.0 [WARNING] please install triton==1.0.0 if you want to use sparse attention Install Ops={'async_io': False, 'fused_adam': False, 'cpu_adam': False, 'cpu_adagrad': False, 'cpu_lion': False, 'evoformer_attn': False, 'fused_lamb': False, 'fused_lion': False, 'inference_core_ops': False, 'cutlass_ops': False, 'transformer_inference': False, 'quantizer': False, 'ragged_device_ops': False, 'ragged_ops': False, 'random_ltd': False, 'sparse_attn': False, 'spatial_inference': False, 'transformer': False, 'stochastic_transformer': False} version=0.14.0+unknown, git_hash=unknown, git_branch=unknown install_requires=['hjson', 'ninja', 'numpy', 'packaging>=20.0', 'psutil', 'py-cpuinfo', 'pydantic', 'pynvml', 'torch', 'tqdm'] compatible_ops={'async_io': False, 'fused_adam': True, 'cpu_adam': True, 'cpu_adagrad': True, 'cpu_lion': True, 'evoformer_attn': False, 'fused_lamb': True, 'fused_lion': True, 'inference_core_ops': True, 'cutlass_ops': True, 'transformer_inference': True, 'quantizer': True, 'ragged_device_ops': True, 'ragged_ops': True, 'random_ltd': True, 'sparse_attn': False, 'spatial_inference': True, 'transformer': True, 'stochastic_transformer': True, 'deepspeed_not_implemented': False} ext_modules=[] running bdist_wheel running build running build_py creating build/lib creating build/lib/deepspeed copying deepspeed/__init__.py -> build/lib/deepspeed copying deepspeed/constants.py -> build/lib/deepspeed copying deepspeed/env_report.py -> build/lib/deepspeed copying deepspeed/git_version_info.py -> build/lib/deepspeed copying deepspeed/pydantic_v1.py -> build/lib/deepspeed copying deepspeed/git_version_info_installed.py -> build/lib/deepspeed creating build/lib/deepspeed/autotuning copying deepspeed/autotuning/__init__.py -> build/lib/deepspeed/autotuning copying deepspeed/autotuning/autotuner.py -> build/lib/deepspeed/autotuning copying deepspeed/autotuning/config.py -> build/lib/deepspeed/autotuning copying deepspeed/autotuning/constants.py -> build/lib/deepspeed/autotuning copying deepspeed/autotuning/scheduler.py -> build/lib/deepspeed/autotuning copying deepspeed/autotuning/utils.py -> build/lib/deepspeed/autotuning creating build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/__init__.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/constants.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/deepspeed_checkpoint.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/ds_to_universal.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/reshape_3d_utils.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/reshape_meg_2d.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/reshape_utils.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/universal_checkpoint.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/utils.py -> build/lib/deepspeed/checkpoint copying deepspeed/checkpoint/zero_checkpoint.py -> build/lib/deepspeed/checkpoint creating build/lib/deepspeed/comm copying deepspeed/comm/__init__.py -> build/lib/deepspeed/comm copying deepspeed/comm/backend.py -> build/lib/deepspeed/comm copying deepspeed/comm/ccl.py -> build/lib/deepspeed/comm copying deepspeed/comm/comm.py -> build/lib/deepspeed/comm copying deepspeed/comm/config.py -> build/lib/deepspeed/comm copying deepspeed/comm/constants.py -> build/lib/deepspeed/comm copying deepspeed/comm/reduce_op.py -> build/lib/deepspeed/comm copying deepspeed/comm/torch.py -> build/lib/deepspeed/comm copying deepspeed/comm/utils.py -> build/lib/deepspeed/comm creating build/lib/deepspeed/compression copying deepspeed/compression/__init__.py -> build/lib/deepspeed/compression copying deepspeed/compression/basic_layer.py -> build/lib/deepspeed/compression copying deepspeed/compression/compress.py -> build/lib/deepspeed/compression copying deepspeed/compression/config.py -> build/lib/deepspeed/compression copying deepspeed/compression/constants.py -> build/lib/deepspeed/compression copying deepspeed/compression/helper.py -> build/lib/deepspeed/compression copying deepspeed/compression/scheduler.py -> build/lib/deepspeed/compression copying deepspeed/compression/utils.py -> build/lib/deepspeed/compression creating build/lib/deepspeed/elasticity copying deepspeed/elasticity/__init__.py -> build/lib/deepspeed/elasticity copying deepspeed/elasticity/config.py -> build/lib/deepspeed/elasticity copying deepspeed/elasticity/constants.py -> build/lib/deepspeed/elasticity copying deepspeed/elasticity/elastic_agent.py -> build/lib/deepspeed/elasticity copying deepspeed/elasticity/elasticity.py -> build/lib/deepspeed/elasticity copying deepspeed/elasticity/utils.py -> build/lib/deepspeed/elasticity creating build/lib/deepspeed/inference copying deepspeed/inference/__init__.py -> build/lib/deepspeed/inference copying deepspeed/inference/config.py -> build/lib/deepspeed/inference copying deepspeed/inference/engine.py -> build/lib/deepspeed/inference creating build/lib/deepspeed/launcher copying deepspeed/launcher/__init__.py -> build/lib/deepspeed/launcher copying deepspeed/launcher/constants.py -> build/lib/deepspeed/launcher copying deepspeed/launcher/launch.py -> build/lib/deepspeed/launcher copying deepspeed/launcher/launcher_helper.py -> build/lib/deepspeed/launcher copying deepspeed/launcher/multinode_runner.py -> build/lib/deepspeed/launcher copying deepspeed/launcher/runner.py -> build/lib/deepspeed/launcher creating build/lib/deepspeed/model_implementations copying deepspeed/model_implementations/__init__.py -> build/lib/deepspeed/model_implementations creating build/lib/deepspeed/module_inject copying deepspeed/module_inject/__init__.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/auto_tp.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/auto_tp_model_utils.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/fusedqkv_utils.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/inject.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/layers.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/load_checkpoint.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/module_quantize.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/policy.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/replace_module.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/replace_policy.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/tp_shard.py -> build/lib/deepspeed/module_inject copying deepspeed/module_inject/utils.py -> build/lib/deepspeed/module_inject creating build/lib/deepspeed/moe copying deepspeed/moe/__init__.py -> build/lib/deepspeed/moe copying deepspeed/moe/experts.py -> build/lib/deepspeed/moe copying deepspeed/moe/layer.py -> build/lib/deepspeed/moe copying deepspeed/moe/mappings.py -> build/lib/deepspeed/moe copying deepspeed/moe/sharded_moe.py -> build/lib/deepspeed/moe copying deepspeed/moe/utils.py -> build/lib/deepspeed/moe creating build/lib/deepspeed/monitor copying deepspeed/monitor/__init__.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/config.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/csv_monitor.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/monitor.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/tensorboard.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/utils.py -> build/lib/deepspeed/monitor copying deepspeed/monitor/wandb.py -> build/lib/deepspeed/monitor creating build/lib/deepspeed/nebula copying deepspeed/nebula/__init__.py -> build/lib/deepspeed/nebula copying deepspeed/nebula/config.py -> build/lib/deepspeed/nebula copying deepspeed/nebula/constants.py -> build/lib/deepspeed/nebula creating build/lib/deepspeed/ops copying deepspeed/ops/__init__.py -> build/lib/deepspeed/ops creating build/lib/deepspeed/pipe copying deepspeed/pipe/__init__.py -> build/lib/deepspeed/pipe creating build/lib/deepspeed/profiling copying deepspeed/profiling/__init__.py -> build/lib/deepspeed/profiling copying deepspeed/profiling/config.py -> build/lib/deepspeed/profiling copying deepspeed/profiling/constants.py -> build/lib/deepspeed/profiling creating build/lib/deepspeed/runtime copying deepspeed/runtime/__init__.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/bf16_optimizer.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/compiler.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/config.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/config_utils.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/constants.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/dataloader.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/eigenvalue.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/engine.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/hybrid_engine.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/lr_schedules.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/progressive_layer_drop.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/quantize.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/sparse_tensor.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/state_dict_factory.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/utils.py -> build/lib/deepspeed/runtime copying deepspeed/runtime/weight_quantizer.py -> build/lib/deepspeed/runtime creating build/lib/deepspeed/sequence copying deepspeed/sequence/__init__.py -> build/lib/deepspeed/sequence copying deepspeed/sequence/layer.py -> build/lib/deepspeed/sequence creating build/lib/deepspeed/utils copying deepspeed/utils/__init__.py -> build/lib/deepspeed/utils copying deepspeed/utils/comms_logging.py -> build/lib/deepspeed/utils copying deepspeed/utils/debug.py -> build/lib/deepspeed/utils copying deepspeed/utils/exceptions.py -> build/lib/deepspeed/utils copying deepspeed/utils/groups.py -> build/lib/deepspeed/utils copying deepspeed/utils/init_on_device.py -> build/lib/deepspeed/utils copying deepspeed/utils/logging.py -> build/lib/deepspeed/utils copying deepspeed/utils/mixed_precision_linkage.py -> build/lib/deepspeed/utils copying deepspeed/utils/numa.py -> build/lib/deepspeed/utils copying deepspeed/utils/nvtx.py -> build/lib/deepspeed/utils copying deepspeed/utils/tensor_fragment.py -> build/lib/deepspeed/utils copying deepspeed/utils/timer.py -> build/lib/deepspeed/utils copying deepspeed/utils/types.py -> build/lib/deepspeed/utils copying deepspeed/utils/z3_leaf_module.py -> build/lib/deepspeed/utils copying deepspeed/utils/zero_to_fp32.py -> build/lib/deepspeed/utils creating build/lib/deepspeed/accelerator copying deepspeed/accelerator/__init__.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/abstract_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/cpu_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/cuda_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/hpu_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/mps_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/npu_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/real_accelerator.py -> build/lib/deepspeed/accelerator copying deepspeed/accelerator/xpu_accelerator.py -> build/lib/deepspeed/accelerator creating build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/__init__.py -> build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/base_tuner.py -> build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/cost_model.py -> build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/index_based_tuner.py -> build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/model_based_tuner.py -> build/lib/deepspeed/autotuning/tuner copying deepspeed/autotuning/tuner/utils.py -> build/lib/deepspeed/autotuning/tuner creating build/lib/deepspeed/inference/quantization copying deepspeed/inference/quantization/__init__.py -> build/lib/deepspeed/inference/quantization copying deepspeed/inference/quantization/layers.py -> build/lib/deepspeed/inference/quantization copying deepspeed/inference/quantization/quantization.py -> build/lib/deepspeed/inference/quantization copying deepspeed/inference/quantization/quantization_context.py -> build/lib/deepspeed/inference/quantization copying deepspeed/inference/quantization/utils.py -> build/lib/deepspeed/inference/quantization creating build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/__init__.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/allocator.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/config_v2.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/engine_factory.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/engine_v2.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/inference_parameter.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/inference_utils.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/logging.py -> build/lib/deepspeed/inference/v2 copying deepspeed/inference/v2/scheduling_utils.py -> build/lib/deepspeed/inference/v2 creating build/lib/deepspeed/inference/v2/checkpoint copying deepspeed/inference/v2/checkpoint/__init__.py -> build/lib/deepspeed/inference/v2/checkpoint copying deepspeed/inference/v2/checkpoint/base_engine.py -> build/lib/deepspeed/inference/v2/checkpoint copying deepspeed/inference/v2/checkpoint/huggingface_engine.py -> build/lib/deepspeed/inference/v2/checkpoint copying deepspeed/inference/v2/checkpoint/in_memory_engine.py -> build/lib/deepspeed/inference/v2/checkpoint creating build/lib/deepspeed/inference/v2/kernels copying deepspeed/inference/v2/kernels/__init__.py -> build/lib/deepspeed/inference/v2/kernels copying deepspeed/inference/v2/kernels/ds_kernel.py -> build/lib/deepspeed/inference/v2/kernels creating build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/flat_model_helpers.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/inference_model_base.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/inference_policy_base.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/inference_transformer_base.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/layer_container_base.py -> build/lib/deepspeed/inference/v2/model_implementations copying deepspeed/inference/v2/model_implementations/parameter_base.py -> build/lib/deepspeed/inference/v2/model_implementations creating build/lib/deepspeed/inference/v2/modules copying deepspeed/inference/v2/modules/__init__.py -> build/lib/deepspeed/inference/v2/modules copying deepspeed/inference/v2/modules/ds_module.py -> build/lib/deepspeed/inference/v2/modules copying deepspeed/inference/v2/modules/heuristics.py -> build/lib/deepspeed/inference/v2/modules copying deepspeed/inference/v2/modules/module_registry.py -> build/lib/deepspeed/inference/v2/modules creating build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/__init__.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/blocked_allocator.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/kv_cache.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/manager_configs.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/ragged_manager.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/ragged_wrapper.py -> build/lib/deepspeed/inference/v2/ragged copying deepspeed/inference/v2/ragged/sequence_descriptor.py -> build/lib/deepspeed/inference/v2/ragged creating build/lib/deepspeed/inference/v2/kernels/core_ops copying deepspeed/inference/v2/kernels/core_ops/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops creating build/lib/deepspeed/inference/v2/kernels/cutlass_ops copying deepspeed/inference/v2/kernels/cutlass_ops/__init__.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops creating build/lib/deepspeed/inference/v2/kernels/ragged_ops copying deepspeed/inference/v2/kernels/ragged_ops/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops creating build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations copying deepspeed/inference/v2/kernels/core_ops/bias_activations/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations copying deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations creating build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying deepspeed/inference/v2/kernels/core_ops/blas_kernels/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_linear.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels creating build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_fp_ln_base.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_ln.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_post_ln.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_pre_ln.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm creating build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear creating build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_base.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_pre_norm.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm creating build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations copying deepspeed/inference/v2/kernels/core_ops/gated_activations/__init__.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations copying deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation.py -> build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations creating build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/__init__.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm creating build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/__init__.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/mixed_moe_gemm.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.py -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying deepspeed/inference/v2/kernels/ragged_ops/atom_builder/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/embed/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/embed/embed.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_trained_kv_rotary.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/linear_blocked_kv_copy.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/__init__.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.py -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating creating build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/attn_output_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/embedding_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/invfreq_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/mlp_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/moe_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/norm_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/qkv_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters copying deepspeed/inference/v2/model_implementations/common_parameters/unembed_parameters.py -> build/lib/deepspeed/inference/v2/model_implementations/common_parameters creating build/lib/deepspeed/inference/v2/model_implementations/falcon copying deepspeed/inference/v2/model_implementations/falcon/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/falcon copying deepspeed/inference/v2/model_implementations/falcon/container.py -> build/lib/deepspeed/inference/v2/model_implementations/falcon copying deepspeed/inference/v2/model_implementations/falcon/model.py -> build/lib/deepspeed/inference/v2/model_implementations/falcon copying deepspeed/inference/v2/model_implementations/falcon/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/falcon creating build/lib/deepspeed/inference/v2/model_implementations/llama_v2 copying deepspeed/inference/v2/model_implementations/llama_v2/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/llama_v2 copying deepspeed/inference/v2/model_implementations/llama_v2/container.py -> build/lib/deepspeed/inference/v2/model_implementations/llama_v2 copying deepspeed/inference/v2/model_implementations/llama_v2/model.py -> build/lib/deepspeed/inference/v2/model_implementations/llama_v2 copying deepspeed/inference/v2/model_implementations/llama_v2/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/llama_v2 creating build/lib/deepspeed/inference/v2/model_implementations/mistral copying deepspeed/inference/v2/model_implementations/mistral/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/mistral copying deepspeed/inference/v2/model_implementations/mistral/container.py -> build/lib/deepspeed/inference/v2/model_implementations/mistral copying deepspeed/inference/v2/model_implementations/mistral/model.py -> build/lib/deepspeed/inference/v2/model_implementations/mistral copying deepspeed/inference/v2/model_implementations/mistral/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/mistral creating build/lib/deepspeed/inference/v2/model_implementations/mixtral copying deepspeed/inference/v2/model_implementations/mixtral/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/mixtral copying deepspeed/inference/v2/model_implementations/mixtral/container.py -> build/lib/deepspeed/inference/v2/model_implementations/mixtral copying deepspeed/inference/v2/model_implementations/mixtral/model.py -> build/lib/deepspeed/inference/v2/model_implementations/mixtral copying deepspeed/inference/v2/model_implementations/mixtral/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/mixtral creating build/lib/deepspeed/inference/v2/model_implementations/opt copying deepspeed/inference/v2/model_implementations/opt/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/opt copying deepspeed/inference/v2/model_implementations/opt/container.py -> build/lib/deepspeed/inference/v2/model_implementations/opt copying deepspeed/inference/v2/model_implementations/opt/model.py -> build/lib/deepspeed/inference/v2/model_implementations/opt copying deepspeed/inference/v2/model_implementations/opt/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/opt creating build/lib/deepspeed/inference/v2/model_implementations/phi copying deepspeed/inference/v2/model_implementations/phi/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/phi copying deepspeed/inference/v2/model_implementations/phi/containers.py -> build/lib/deepspeed/inference/v2/model_implementations/phi copying deepspeed/inference/v2/model_implementations/phi/model.py -> build/lib/deepspeed/inference/v2/model_implementations/phi copying deepspeed/inference/v2/model_implementations/phi/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/phi creating build/lib/deepspeed/inference/v2/model_implementations/qwen copying deepspeed/inference/v2/model_implementations/qwen/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen copying deepspeed/inference/v2/model_implementations/qwen/container.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen copying deepspeed/inference/v2/model_implementations/qwen/model.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen copying deepspeed/inference/v2/model_implementations/qwen/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen creating build/lib/deepspeed/inference/v2/model_implementations/qwen_v2 copying deepspeed/inference/v2/model_implementations/qwen_v2/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen_v2 copying deepspeed/inference/v2/model_implementations/qwen_v2/container.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen_v2 copying deepspeed/inference/v2/model_implementations/qwen_v2/model.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen_v2 copying deepspeed/inference/v2/model_implementations/qwen_v2/policy.py -> build/lib/deepspeed/inference/v2/model_implementations/qwen_v2 creating build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/__init__.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/attn.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/attn_out.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/embedding.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/mlp.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/qkv.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/types.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/unembed.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding copying deepspeed/inference/v2/model_implementations/sharding/utils.py -> build/lib/deepspeed/inference/v2/model_implementations/sharding creating build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/__init__.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/attention_configs.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/embedding_config.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/linear_config.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/moe_config.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/norm_config.py -> build/lib/deepspeed/inference/v2/modules/configs copying deepspeed/inference/v2/modules/configs/unembed_config.py -> build/lib/deepspeed/inference/v2/modules/configs creating build/lib/deepspeed/inference/v2/modules/implementations copying deepspeed/inference/v2/modules/implementations/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations creating build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/__init__.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/attention_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/embedding_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/linear_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/moe_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/post_norm_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/pre_norm_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces copying deepspeed/inference/v2/modules/interfaces/unembed_base.py -> build/lib/deepspeed/inference/v2/modules/interfaces creating build/lib/deepspeed/inference/v2/modules/implementations/attention copying deepspeed/inference/v2/modules/implementations/attention/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/attention copying deepspeed/inference/v2/modules/implementations/attention/dense_blocked_attention.py -> build/lib/deepspeed/inference/v2/modules/implementations/attention creating build/lib/deepspeed/inference/v2/modules/implementations/embedding copying deepspeed/inference/v2/modules/implementations/embedding/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/embedding copying deepspeed/inference/v2/modules/implementations/embedding/ragged_embedding.py -> build/lib/deepspeed/inference/v2/modules/implementations/embedding creating build/lib/deepspeed/inference/v2/modules/implementations/linear copying deepspeed/inference/v2/modules/implementations/linear/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/linear copying deepspeed/inference/v2/modules/implementations/linear/blas_fp_linear.py -> build/lib/deepspeed/inference/v2/modules/implementations/linear copying deepspeed/inference/v2/modules/implementations/linear/quantized_linear.py -> build/lib/deepspeed/inference/v2/modules/implementations/linear creating build/lib/deepspeed/inference/v2/modules/implementations/moe copying deepspeed/inference/v2/modules/implementations/moe/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/moe copying deepspeed/inference/v2/modules/implementations/moe/cutlass_multi_gemm.py -> build/lib/deepspeed/inference/v2/modules/implementations/moe creating build/lib/deepspeed/inference/v2/modules/implementations/post_norm copying deepspeed/inference/v2/modules/implementations/post_norm/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/post_norm copying deepspeed/inference/v2/modules/implementations/post_norm/cuda_post_ln.py -> build/lib/deepspeed/inference/v2/modules/implementations/post_norm creating build/lib/deepspeed/inference/v2/modules/implementations/pre_norm copying deepspeed/inference/v2/modules/implementations/pre_norm/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/pre_norm copying deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_ln.py -> build/lib/deepspeed/inference/v2/modules/implementations/pre_norm copying deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_rms.py -> build/lib/deepspeed/inference/v2/modules/implementations/pre_norm creating build/lib/deepspeed/inference/v2/modules/implementations/unembed copying deepspeed/inference/v2/modules/implementations/unembed/__init__.py -> build/lib/deepspeed/inference/v2/modules/implementations/unembed copying deepspeed/inference/v2/modules/implementations/unembed/ragged_unembed.py -> build/lib/deepspeed/inference/v2/modules/implementations/unembed creating build/lib/deepspeed/model_implementations/diffusers copying deepspeed/model_implementations/diffusers/__init__.py -> build/lib/deepspeed/model_implementations/diffusers copying deepspeed/model_implementations/diffusers/unet.py -> build/lib/deepspeed/model_implementations/diffusers copying deepspeed/model_implementations/diffusers/vae.py -> build/lib/deepspeed/model_implementations/diffusers creating build/lib/deepspeed/model_implementations/features copying deepspeed/model_implementations/features/__init__.py -> build/lib/deepspeed/model_implementations/features copying deepspeed/model_implementations/features/cuda_graph.py -> build/lib/deepspeed/model_implementations/features creating build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/__init__.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/clip_encoder.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_base.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_bert.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_bloom.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_gpt.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_llama2.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_megatron_gpt.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_opt.py -> build/lib/deepspeed/model_implementations/transformers copying deepspeed/model_implementations/transformers/ds_transformer.py -> build/lib/deepspeed/model_implementations/transformers creating build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/__init__.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/base.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/base_moe.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/bert.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/bloom.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/clip.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/distil_bert.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/gpt2.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/gptj.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/gptneo.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/gptneox.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/internlm.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/llama.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/llama2.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/megatron_gpt.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/megatron_gpt_moe.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/opt.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/unet.py -> build/lib/deepspeed/module_inject/containers copying deepspeed/module_inject/containers/vae.py -> build/lib/deepspeed/module_inject/containers creating build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/__init__.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/gated_mlp.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/hybrid_engine.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/hybrid_megatron.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/megatron.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/meta_tensor.py -> build/lib/deepspeed/module_inject/containers/features copying deepspeed/module_inject/containers/features/split_qkv.py -> build/lib/deepspeed/module_inject/containers/features creating build/lib/deepspeed/ops/adagrad copying deepspeed/ops/adagrad/__init__.py -> build/lib/deepspeed/ops/adagrad copying deepspeed/ops/adagrad/cpu_adagrad.py -> build/lib/deepspeed/ops/adagrad creating build/lib/deepspeed/ops/adam copying deepspeed/ops/adam/__init__.py -> build/lib/deepspeed/ops/adam copying deepspeed/ops/adam/cpu_adam.py -> build/lib/deepspeed/ops/adam copying deepspeed/ops/adam/fused_adam.py -> build/lib/deepspeed/ops/adam copying deepspeed/ops/adam/multi_tensor_apply.py -> build/lib/deepspeed/ops/adam creating build/lib/deepspeed/ops/aio copying deepspeed/ops/aio/__init__.py -> build/lib/deepspeed/ops/aio creating build/lib/deepspeed/ops/deepspeed4science copying deepspeed/ops/deepspeed4science/__init__.py -> build/lib/deepspeed/ops/deepspeed4science copying deepspeed/ops/deepspeed4science/evoformer_attn.py -> build/lib/deepspeed/ops/deepspeed4science creating build/lib/deepspeed/ops/lamb copying deepspeed/ops/lamb/__init__.py -> build/lib/deepspeed/ops/lamb copying deepspeed/ops/lamb/fused_lamb.py -> build/lib/deepspeed/ops/lamb creating build/lib/deepspeed/ops/lion copying deepspeed/ops/lion/__init__.py -> build/lib/deepspeed/ops/lion copying deepspeed/ops/lion/cpu_lion.py -> build/lib/deepspeed/ops/lion copying deepspeed/ops/lion/fused_lion.py -> build/lib/deepspeed/ops/lion copying deepspeed/ops/lion/multi_tensor_apply.py -> build/lib/deepspeed/ops/lion creating build/lib/deepspeed/ops/quantizer copying deepspeed/ops/quantizer/__init__.py -> build/lib/deepspeed/ops/quantizer copying deepspeed/ops/quantizer/quantizer.py -> build/lib/deepspeed/ops/quantizer creating build/lib/deepspeed/ops/random_ltd copying deepspeed/ops/random_ltd/__init__.py -> build/lib/deepspeed/ops/random_ltd copying deepspeed/ops/random_ltd/dropping_utils.py -> build/lib/deepspeed/ops/random_ltd creating build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/__init__.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/bert_sparse_self_attention.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/matmul.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/softmax.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/sparse_attention_utils.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/sparse_self_attention.py -> build/lib/deepspeed/ops/sparse_attention copying deepspeed/ops/sparse_attention/sparsity_config.py -> build/lib/deepspeed/ops/sparse_attention creating build/lib/deepspeed/ops/transformer copying deepspeed/ops/transformer/__init__.py -> build/lib/deepspeed/ops/transformer copying deepspeed/ops/transformer/transformer.py -> build/lib/deepspeed/ops/transformer creating build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/__init__.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/all_ops.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/async_io.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/builder.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/cpu_adagrad.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/cpu_adam.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/cpu_lion.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/evoformer_attn.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/fused_adam.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/fused_lamb.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/fused_lion.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/inference_core_ops.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/inference_cutlass_builder.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/quantizer.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/ragged_ops.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/ragged_utils.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/random_ltd.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/sparse_attn.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/spatial_inference.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/stochastic_transformer.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/transformer.py -> build/lib/deepspeed/ops/op_builder copying deepspeed/ops/op_builder/transformer_inference.py -> build/lib/deepspeed/ops/op_builder creating build/lib/deepspeed/ops/sparse_attention/trsrc copying deepspeed/ops/sparse_attention/trsrc/__init__.py -> build/lib/deepspeed/ops/sparse_attention/trsrc creating build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/__init__.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/bias_add.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/config.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/diffusers_2d_transformer.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/diffusers_attention.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/diffusers_transformer_block.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/ds_attention.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/ds_mlp.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/moe_inference.py -> build/lib/deepspeed/ops/transformer/inference copying deepspeed/ops/transformer/inference/triton_ops.py -> build/lib/deepspeed/ops/transformer/inference creating build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/__init__.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/base.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/gelu_gemm.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/linear.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/mlp_gemm.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/qkv_gemm.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/residual_add.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/softmax.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/softmax_context.py -> build/lib/deepspeed/ops/transformer/inference/op_binding copying deepspeed/ops/transformer/inference/op_binding/vector_matmul.py -> build/lib/deepspeed/ops/transformer/inference/op_binding creating build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/__init__.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/attention.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/gelu.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/layer_norm.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/matmul_ext.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/mlp.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/ops.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/residual_add.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/softmax.py -> build/lib/deepspeed/ops/transformer/inference/triton copying deepspeed/ops/transformer/inference/triton/triton_matmul_kernel.py -> build/lib/deepspeed/ops/transformer/inference/triton creating build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/__init__.py -> build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/builder.py -> build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/comm.py -> build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/cpu_adam.py -> build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/fused_adam.py -> build/lib/deepspeed/ops/op_builder/cpu copying deepspeed/ops/op_builder/cpu/no_impl.py -> build/lib/deepspeed/ops/op_builder/cpu creating build/lib/deepspeed/ops/op_builder/hpu copying deepspeed/ops/op_builder/hpu/__init__.py -> build/lib/deepspeed/ops/op_builder/hpu copying deepspeed/ops/op_builder/hpu/builder.py -> build/lib/deepspeed/ops/op_builder/hpu copying deepspeed/ops/op_builder/hpu/cpu_adam.py -> build/lib/deepspeed/ops/op_builder/hpu copying deepspeed/ops/op_builder/hpu/fused_adam.py -> build/lib/deepspeed/ops/op_builder/hpu copying deepspeed/ops/op_builder/hpu/no_impl.py -> build/lib/deepspeed/ops/op_builder/hpu creating build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/__init__.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/async_io.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/builder.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/cpu_adagrad.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/cpu_adam.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/cpu_lion.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/fused_adam.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/inference.py -> build/lib/deepspeed/ops/op_builder/npu copying deepspeed/ops/op_builder/npu/no_impl.py -> build/lib/deepspeed/ops/op_builder/npu creating build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/__init__.py -> build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/async_io.py -> build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/builder.py -> build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/cpu_adagrad.py -> build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/cpu_adam.py -> build/lib/deepspeed/ops/op_builder/xpu copying deepspeed/ops/op_builder/xpu/fused_adam.py -> build/lib/deepspeed/ops/op_builder/xpu creating build/lib/deepspeed/profiling/flops_profiler copying deepspeed/profiling/flops_profiler/__init__.py -> build/lib/deepspeed/profiling/flops_profiler copying deepspeed/profiling/flops_profiler/profiler.py -> build/lib/deepspeed/profiling/flops_profiler creating build/lib/deepspeed/runtime/activation_checkpointing copying deepspeed/runtime/activation_checkpointing/__init__.py -> build/lib/deepspeed/runtime/activation_checkpointing copying deepspeed/runtime/activation_checkpointing/checkpointing.py -> build/lib/deepspeed/runtime/activation_checkpointing copying deepspeed/runtime/activation_checkpointing/config.py -> build/lib/deepspeed/runtime/activation_checkpointing creating build/lib/deepspeed/runtime/checkpoint_engine copying deepspeed/runtime/checkpoint_engine/__init__.py -> build/lib/deepspeed/runtime/checkpoint_engine copying deepspeed/runtime/checkpoint_engine/checkpoint_engine.py -> build/lib/deepspeed/runtime/checkpoint_engine copying deepspeed/runtime/checkpoint_engine/nebula_checkpoint_engine.py -> build/lib/deepspeed/runtime/checkpoint_engine copying deepspeed/runtime/checkpoint_engine/torch_checkpoint_engine.py -> build/lib/deepspeed/runtime/checkpoint_engine creating build/lib/deepspeed/runtime/comm copying deepspeed/runtime/comm/__init__.py -> build/lib/deepspeed/runtime/comm copying deepspeed/runtime/comm/coalesced_collectives.py -> build/lib/deepspeed/runtime/comm copying deepspeed/runtime/comm/hccl.py -> build/lib/deepspeed/runtime/comm copying deepspeed/runtime/comm/mpi.py -> build/lib/deepspeed/runtime/comm copying deepspeed/runtime/comm/nccl.py -> build/lib/deepspeed/runtime/comm creating build/lib/deepspeed/runtime/compression copying deepspeed/runtime/compression/__init__.py -> build/lib/deepspeed/runtime/compression copying deepspeed/runtime/compression/cupy.py -> build/lib/deepspeed/runtime/compression creating build/lib/deepspeed/runtime/data_pipeline copying deepspeed/runtime/data_pipeline/__init__.py -> build/lib/deepspeed/runtime/data_pipeline copying deepspeed/runtime/data_pipeline/config.py -> build/lib/deepspeed/runtime/data_pipeline copying deepspeed/runtime/data_pipeline/constants.py -> build/lib/deepspeed/runtime/data_pipeline copying deepspeed/runtime/data_pipeline/curriculum_scheduler.py -> build/lib/deepspeed/runtime/data_pipeline creating build/lib/deepspeed/runtime/fp16 copying deepspeed/runtime/fp16/__init__.py -> build/lib/deepspeed/runtime/fp16 copying deepspeed/runtime/fp16/fused_optimizer.py -> build/lib/deepspeed/runtime/fp16 copying deepspeed/runtime/fp16/loss_scaler.py -> build/lib/deepspeed/runtime/fp16 copying deepspeed/runtime/fp16/unfused_optimizer.py -> build/lib/deepspeed/runtime/fp16 creating build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/__init__.py -> build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/engine.py -> build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/module.py -> build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/p2p.py -> build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/schedule.py -> build/lib/deepspeed/runtime/pipe copying deepspeed/runtime/pipe/topology.py -> build/lib/deepspeed/runtime/pipe creating build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/__init__.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/aio_config.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/async_swapper.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/constants.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/optimizer_utils.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/partitioned_optimizer_swapper.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/partitioned_param_swapper.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/pipelined_optimizer_swapper.py -> build/lib/deepspeed/runtime/swap_tensor copying deepspeed/runtime/swap_tensor/utils.py -> build/lib/deepspeed/runtime/swap_tensor creating build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/__init__.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/config.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/contiguous_memory_allocator.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/linear.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/mics.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/mics_utils.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/offload_config.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/parameter_offload.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/partition_parameters.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/partitioned_param_coordinator.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/partitioned_param_profiler.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/stage3.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/stage_1_and_2.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/test.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/tiling.py -> build/lib/deepspeed/runtime/zero copying deepspeed/runtime/zero/utils.py -> build/lib/deepspeed/runtime/zero creating build/lib/deepspeed/runtime/data_pipeline/data_routing copying deepspeed/runtime/data_pipeline/data_routing/__init__.py -> build/lib/deepspeed/runtime/data_pipeline/data_routing copying deepspeed/runtime/data_pipeline/data_routing/basic_layer.py -> build/lib/deepspeed/runtime/data_pipeline/data_routing copying deepspeed/runtime/data_pipeline/data_routing/helper.py -> build/lib/deepspeed/runtime/data_pipeline/data_routing copying deepspeed/runtime/data_pipeline/data_routing/scheduler.py -> build/lib/deepspeed/runtime/data_pipeline/data_routing copying deepspeed/runtime/data_pipeline/data_routing/utils.py -> build/lib/deepspeed/runtime/data_pipeline/data_routing creating build/lib/deepspeed/runtime/data_pipeline/data_sampling copying deepspeed/runtime/data_pipeline/data_sampling/__init__.py -> build/lib/deepspeed/runtime/data_pipeline/data_sampling copying deepspeed/runtime/data_pipeline/data_sampling/data_analyzer.py -> build/lib/deepspeed/runtime/data_pipeline/data_sampling copying deepspeed/runtime/data_pipeline/data_sampling/data_sampler.py -> build/lib/deepspeed/runtime/data_pipeline/data_sampling copying deepspeed/runtime/data_pipeline/data_sampling/indexed_dataset.py -> build/lib/deepspeed/runtime/data_pipeline/data_sampling copying deepspeed/runtime/data_pipeline/data_sampling/utils.py -> build/lib/deepspeed/runtime/data_pipeline/data_sampling creating build/lib/deepspeed/runtime/fp16/onebit copying deepspeed/runtime/fp16/onebit/__init__.py -> build/lib/deepspeed/runtime/fp16/onebit copying deepspeed/runtime/fp16/onebit/adam.py -> build/lib/deepspeed/runtime/fp16/onebit copying deepspeed/runtime/fp16/onebit/lamb.py -> build/lib/deepspeed/runtime/fp16/onebit copying deepspeed/runtime/fp16/onebit/zoadam.py -> build/lib/deepspeed/runtime/fp16/onebit running egg_info creating deepspeed.egg-info writing deepspeed.egg-info/PKG-INFO writing dependency_links to deepspeed.egg-info/dependency_links.txt writing entry points to deepspeed.egg-info/entry_points.txt writing requirements to deepspeed.egg-info/requires.txt writing top-level names to deepspeed.egg-info/top_level.txt writing manifest file 'deepspeed.egg-info/SOURCES.txt' reading manifest file 'deepspeed.egg-info/SOURCES.txt' reading manifest template 'MANIFEST.in' warning: no files found matching 'deepspeed/inference/v2/kernels/ragged_ops/libs/*.so' warning: no files found matching 'deepspeed/inference/v2/kernels/cutlass_ops/libs/*.so' warning: no files found matching '*.hip' under directory 'deepspeed' warning: no files found matching '*.cc' under directory 'deepspeed' warning: no files found matching '*.tr' under directory 'csrc' warning: no files found matching '*.cc' under directory 'csrc' warning: no files found matching '*.py' under directory 'benchmarks' adding license file 'LICENSE' writing manifest file 'deepspeed.egg-info/SOURCES.txt' /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.autotuning.config_templates' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.autotuning.config_templates' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.autotuning.config_templates' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.autotuning.config_templates' to be distributed and are already explicitly excluding 'deepspeed.autotuning.config_templates' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.kernels.core_ops.cuda_linear.include' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.kernels.core_ops.cuda_linear.include' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.kernels.core_ops.cuda_linear.include' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.kernels.core_ops.cuda_linear.include' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.kernels.core_ops.cuda_linear.include' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.kernels.cutlass_ops.shared_resources' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.kernels.cutlass_ops.shared_resources' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.kernels.cutlass_ops.shared_resources' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.kernels.cutlass_ops.shared_resources' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.kernels.cutlass_ops.shared_resources' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.kernels.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.kernels.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.kernels.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.kernels.includes' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.kernels.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.kernels.ragged_ops.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.kernels.ragged_ops.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.kernels.ragged_ops.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.kernels.ragged_ops.includes' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.kernels.ragged_ops.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.kernels.ragged_ops.ragged_helpers' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.kernels.ragged_ops.ragged_helpers' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.kernels.ragged_ops.ragged_helpers' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.kernels.ragged_ops.ragged_helpers' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.kernels.ragged_ops.ragged_helpers' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.ragged.csrc' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.ragged.csrc' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.ragged.csrc' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.ragged.csrc' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.ragged.csrc' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.inference.v2.ragged.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.inference.v2.ragged.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.inference.v2.ragged.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.inference.v2.ragged.includes' to be distributed and are already explicitly excluding 'deepspeed.inference.v2.ragged.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.adagrad' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.adagrad' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.adagrad' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.adagrad' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.adagrad' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.adam' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.adam' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.adam' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.adam' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.adam' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.aio.common' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.aio.common' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.aio.common' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.aio.common' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.aio.common' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.aio.py_lib' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.aio.py_lib' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.aio.py_lib' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.aio.py_lib' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.aio.py_lib' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.aio.py_test' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.aio.py_test' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.aio.py_test' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.aio.py_test' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.aio.py_test' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.common' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.common' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.common' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.common' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.common' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.cpu.adam' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.cpu.adam' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.cpu.adam' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.cpu.adam' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.cpu.adam' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.cpu.comm' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.cpu.comm' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.cpu.comm' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.cpu.comm' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.cpu.comm' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.cpu.lion' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.cpu.lion' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.cpu.lion' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.cpu.lion' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.cpu.lion' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.epilogue' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.epilogue' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.epilogue' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.epilogue' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.epilogue' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.gemm' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.gemm' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.gemm' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.gemm' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.gemm' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.iterators' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.iterators' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.iterators' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.iterators' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.iterators' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.transform' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.transform' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.transform' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.transform' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.deepspeed4science.evoformer_attn.transform' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.includes' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.lamb' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.lamb' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.lamb' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.lamb' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.lamb' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.lion' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.lion' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.lion' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.lion' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.lion' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.quantization' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.quantization' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.quantization' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.quantization' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.quantization' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.random_ltd' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.random_ltd' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.random_ltd' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.random_ltd' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.random_ltd' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.sparse_attention' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.sparse_attention' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.sparse_attention' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.sparse_attention' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.sparse_attention' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.spatial.csrc' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.spatial.csrc' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.spatial.csrc' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.spatial.csrc' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.spatial.csrc' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.spatial.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.spatial.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.spatial.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.spatial.includes' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.spatial.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.transformer' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.transformer' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.transformer' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.transformer' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.transformer' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.transformer.inference.csrc' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.transformer.inference.csrc' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.transformer.inference.csrc' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.transformer.inference.csrc' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.transformer.inference.csrc' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.transformer.inference.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.transformer.inference.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.transformer.inference.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.transformer.inference.includes' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.transformer.inference.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.utils' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.utils' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.utils' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.utils' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.utils' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.xpu.adagrad' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.xpu.adagrad' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.xpu.adagrad' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.xpu.adagrad' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.xpu.adagrad' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.xpu.adam' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.xpu.adam' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.xpu.adam' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.xpu.adam' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.xpu.adam' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.xpu.common' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.xpu.common' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.xpu.common' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.xpu.common' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.xpu.common' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) /usr/lib/python3.11/site-packages/setuptools/command/build_py.py:201: _Warning: Package 'deepspeed.ops.csrc.xpu.includes' is absent from the `packages` configuration. !! ******************************************************************************** ############################ # Package would be ignored # ############################ Python recognizes 'deepspeed.ops.csrc.xpu.includes' as an importable package[^1], but it is absent from setuptools' `packages` configuration. This leads to an ambiguous overall configuration. If you want to distribute this package, please make sure that 'deepspeed.ops.csrc.xpu.includes' is explicitly added to the `packages` configuration field. Alternatively, you can also rely on setuptools' discovery methods (for example by using `find_namespace_packages(...)`/`find_namespace:` instead of `find_packages(...)`/`find:`). You can read more about "package discovery" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/package_discovery.html If you don't want 'deepspeed.ops.csrc.xpu.includes' to be distributed and are already explicitly excluding 'deepspeed.ops.csrc.xpu.includes' via `find_namespace_packages(...)/find_namespace` or `find_packages(...)/find`, you can try to use `exclude_package_data`, or `include-package-data=False` in combination with a more fine grained `package-data` configuration. You can read more about "package data files" on setuptools documentation page: - https://setuptools.pypa.io/en/latest/userguide/datafiles.html [^1]: For Python, any directory (with suitable naming) can be imported, even if it does not contain any `.py` files. On the other hand, currently there is no concept of package data directory, all directories are treated like packages. ******************************************************************************** !! check.warn(importable) creating build/lib/deepspeed/autotuning/config_templates copying deepspeed/autotuning/config_templates/template_zero0.json -> build/lib/deepspeed/autotuning/config_templates copying deepspeed/autotuning/config_templates/template_zero1.json -> build/lib/deepspeed/autotuning/config_templates copying deepspeed/autotuning/config_templates/template_zero2.json -> build/lib/deepspeed/autotuning/config_templates copying deepspeed/autotuning/config_templates/template_zero3.json -> build/lib/deepspeed/autotuning/config_templates creating build/lib/deepspeed/ops/csrc creating build/lib/deepspeed/ops/csrc/adagrad copying deepspeed/ops/csrc/adagrad/cpu_adagrad.cpp -> build/lib/deepspeed/ops/csrc/adagrad creating build/lib/deepspeed/ops/csrc/adam copying deepspeed/ops/csrc/adam/cpu_adam.cpp -> build/lib/deepspeed/ops/csrc/adam copying deepspeed/ops/csrc/adam/cpu_adam_impl.cpp -> build/lib/deepspeed/ops/csrc/adam copying deepspeed/ops/csrc/adam/fused_adam_frontend.cpp -> build/lib/deepspeed/ops/csrc/adam copying deepspeed/ops/csrc/adam/multi_tensor_adam.cu -> build/lib/deepspeed/ops/csrc/adam copying deepspeed/ops/csrc/adam/multi_tensor_apply.cuh -> build/lib/deepspeed/ops/csrc/adam creating build/lib/deepspeed/ops/csrc/aio creating build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_common.cpp -> build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_common.h -> build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_types.cpp -> build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_types.h -> build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.cpp -> build/lib/deepspeed/ops/csrc/aio/common copying deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.h -> build/lib/deepspeed/ops/csrc/aio/common creating build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.h -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.h -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.h -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.h -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.h -> build/lib/deepspeed/ops/csrc/aio/py_lib copying deepspeed/ops/csrc/aio/py_lib/py_ds_aio.cpp -> build/lib/deepspeed/ops/csrc/aio/py_lib creating build/lib/deepspeed/ops/csrc/aio/py_test copying deepspeed/ops/csrc/aio/py_test/single_process_config.json -> build/lib/deepspeed/ops/csrc/aio/py_test creating build/lib/deepspeed/ops/csrc/common copying deepspeed/ops/csrc/common/custom_cuda_kernel.cu -> build/lib/deepspeed/ops/csrc/common creating build/lib/deepspeed/ops/csrc/cpu creating build/lib/deepspeed/ops/csrc/cpu/adam copying deepspeed/ops/csrc/cpu/adam/fused_adam.cpp -> build/lib/deepspeed/ops/csrc/cpu/adam creating build/lib/deepspeed/ops/csrc/cpu/comm copying deepspeed/ops/csrc/cpu/comm/ccl.cpp -> build/lib/deepspeed/ops/csrc/cpu/comm creating build/lib/deepspeed/ops/csrc/cpu/lion copying deepspeed/ops/csrc/cpu/lion/fused_lion.cpp -> build/lib/deepspeed/ops/csrc/cpu/lion creating build/lib/deepspeed/ops/csrc/deepspeed4science creating build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention.cpp -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_back.cu -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_cu.cu -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm_kernel_utils.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_backward.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_forward.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn creating build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_grad_bias.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_pipelined.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_rescale_output.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_thread_apply_logsumexp.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue creating build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_base.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_multistage.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_pipelined.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/find_default_mma.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_accum_lambda_iterator.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_from_smem.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm creating build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/epilogue_predicated_tile_iterator.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/make_residual_last.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_access_iterator_residual_last.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_atomic.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_residual_last.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/transpose_warp_iterator.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/warp_iterator_from_smem.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators creating build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/bias_broadcast.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform copying deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/tile_smem_loader.h -> build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform creating build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/StopWatch.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/Timer.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/activation_type.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/compat.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/context.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/conversion_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/cpu_adagrad.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/cpu_adam.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/cpu_lion.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/cublas_wrappers.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/custom_cuda_layers.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/dequantization_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/dropout.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/ds_kernel_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/ds_transformer_cuda.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/feed_forward.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/gelu.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/gemm_test.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/general_kernels.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/memory_access_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/normalize_layer.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/quantization.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/quantization_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/quantizer.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/reduction_utils.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/simd.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/softmax.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/strided_batch_gemm.h -> build/lib/deepspeed/ops/csrc/includes copying deepspeed/ops/csrc/includes/type_shim.h -> build/lib/deepspeed/ops/csrc/includes creating build/lib/deepspeed/ops/csrc/lamb copying deepspeed/ops/csrc/lamb/fused_lamb_cuda.cpp -> build/lib/deepspeed/ops/csrc/lamb copying deepspeed/ops/csrc/lamb/fused_lamb_cuda_kernel.cu -> build/lib/deepspeed/ops/csrc/lamb creating build/lib/deepspeed/ops/csrc/lion copying deepspeed/ops/csrc/lion/cpu_lion.cpp -> build/lib/deepspeed/ops/csrc/lion copying deepspeed/ops/csrc/lion/cpu_lion_impl.cpp -> build/lib/deepspeed/ops/csrc/lion copying deepspeed/ops/csrc/lion/fused_lion_frontend.cpp -> build/lib/deepspeed/ops/csrc/lion copying deepspeed/ops/csrc/lion/multi_tensor_apply.cuh -> build/lib/deepspeed/ops/csrc/lion copying deepspeed/ops/csrc/lion/multi_tensor_lion.cu -> build/lib/deepspeed/ops/csrc/lion creating build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/dequantize.cu -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/fake_quantizer.cu -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/pt_binding.cpp -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/quant_reduce.cu -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/quantize.cu -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/quantize_intX.cu -> build/lib/deepspeed/ops/csrc/quantization copying deepspeed/ops/csrc/quantization/swizzled_quantize.cu -> build/lib/deepspeed/ops/csrc/quantization creating build/lib/deepspeed/ops/csrc/random_ltd copying deepspeed/ops/csrc/random_ltd/gather_scatter.cu -> build/lib/deepspeed/ops/csrc/random_ltd copying deepspeed/ops/csrc/random_ltd/pt_binding.cpp -> build/lib/deepspeed/ops/csrc/random_ltd copying deepspeed/ops/csrc/random_ltd/slice_attn_masks.cu -> build/lib/deepspeed/ops/csrc/random_ltd copying deepspeed/ops/csrc/random_ltd/token_sort.cu -> build/lib/deepspeed/ops/csrc/random_ltd creating build/lib/deepspeed/ops/csrc/sparse_attention copying deepspeed/ops/csrc/sparse_attention/utils.cpp -> build/lib/deepspeed/ops/csrc/sparse_attention creating build/lib/deepspeed/ops/csrc/spatial creating build/lib/deepspeed/ops/csrc/spatial/csrc copying deepspeed/ops/csrc/spatial/csrc/opt_bias_add.cu -> build/lib/deepspeed/ops/csrc/spatial/csrc copying deepspeed/ops/csrc/spatial/csrc/pt_binding.cpp -> build/lib/deepspeed/ops/csrc/spatial/csrc creating build/lib/deepspeed/ops/csrc/spatial/includes copying deepspeed/ops/csrc/spatial/includes/spatial_cuda_layers.h -> build/lib/deepspeed/ops/csrc/spatial/includes creating build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/cublas_wrappers.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/dropout_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/ds_transformer_cuda.cpp -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/gelu_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/general_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/normalize_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/softmax_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer copying deepspeed/ops/csrc/transformer/transform_kernels.cu -> build/lib/deepspeed/ops/csrc/transformer creating build/lib/deepspeed/ops/csrc/transformer/inference creating build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/apply_rotary_pos_emb.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/dequantize.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/gelu.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/layer_norm.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/pointwise_ops.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/pt_binding.cpp -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/relu.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/rms_norm.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/softmax.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc copying deepspeed/ops/csrc/transformer/inference/csrc/transform.cu -> build/lib/deepspeed/ops/csrc/transformer/inference/csrc creating build/lib/deepspeed/ops/csrc/transformer/inference/includes copying deepspeed/ops/csrc/transformer/inference/includes/inference_context.h -> build/lib/deepspeed/ops/csrc/transformer/inference/includes copying deepspeed/ops/csrc/transformer/inference/includes/inference_cublas_wrappers.h -> build/lib/deepspeed/ops/csrc/transformer/inference/includes copying deepspeed/ops/csrc/transformer/inference/includes/inference_cuda_layers.h -> build/lib/deepspeed/ops/csrc/transformer/inference/includes creating build/lib/deepspeed/ops/csrc/utils copying deepspeed/ops/csrc/utils/flatten_unflatten.cpp -> build/lib/deepspeed/ops/csrc/utils creating build/lib/deepspeed/ops/csrc/xpu creating build/lib/deepspeed/ops/csrc/xpu/adagrad copying deepspeed/ops/csrc/xpu/adagrad/cpu_adagrad.cpp -> build/lib/deepspeed/ops/csrc/xpu/adagrad creating build/lib/deepspeed/ops/csrc/xpu/adam copying deepspeed/ops/csrc/xpu/adam/cpu_adam.cpp -> build/lib/deepspeed/ops/csrc/xpu/adam copying deepspeed/ops/csrc/xpu/adam/cpu_adam_impl.cpp -> build/lib/deepspeed/ops/csrc/xpu/adam copying deepspeed/ops/csrc/xpu/adam/fused_adam_frontend.cpp -> build/lib/deepspeed/ops/csrc/xpu/adam copying deepspeed/ops/csrc/xpu/adam/multi_tensor_adam.dp.cpp -> build/lib/deepspeed/ops/csrc/xpu/adam creating build/lib/deepspeed/ops/csrc/xpu/common copying deepspeed/ops/csrc/xpu/common/custom_cuda_kernel.dp.cpp -> build/lib/deepspeed/ops/csrc/xpu/common creating build/lib/deepspeed/ops/csrc/xpu/includes copying deepspeed/ops/csrc/xpu/includes/compat.h -> build/lib/deepspeed/ops/csrc/xpu/includes copying deepspeed/ops/csrc/xpu/includes/cpu_adagrad.h -> build/lib/deepspeed/ops/csrc/xpu/includes copying deepspeed/ops/csrc/xpu/includes/cpu_adam.h -> build/lib/deepspeed/ops/csrc/xpu/includes copying deepspeed/ops/csrc/xpu/includes/simd.h -> build/lib/deepspeed/ops/csrc/xpu/includes copying deepspeed/ops/csrc/xpu/includes/type_shim.h -> build/lib/deepspeed/ops/csrc/xpu/includes creating build/lib/deepspeed/inference/v2/kernels/includes copying deepspeed/inference/v2/kernels/includes/activation_type.h -> build/lib/deepspeed/inference/v2/kernels/includes copying deepspeed/inference/v2/kernels/includes/conversion_utils.h -> build/lib/deepspeed/inference/v2/kernels/includes copying deepspeed/inference/v2/kernels/includes/ds_kernel_utils.h -> build/lib/deepspeed/inference/v2/kernels/includes copying deepspeed/inference/v2/kernels/includes/memory_access_utils.h -> build/lib/deepspeed/inference/v2/kernels/includes copying deepspeed/inference/v2/kernels/includes/reduction_utils.h -> build/lib/deepspeed/inference/v2/kernels/includes creating build/lib/deepspeed/inference/v2/ragged/csrc copying deepspeed/inference/v2/ragged/csrc/fast_host_buffer.cu -> build/lib/deepspeed/inference/v2/ragged/csrc copying deepspeed/inference/v2/ragged/csrc/ragged_ops.cpp -> build/lib/deepspeed/inference/v2/ragged/csrc creating build/lib/deepspeed/inference/v2/ragged/includes copying deepspeed/inference/v2/ragged/includes/fast_host_buffer.h -> build/lib/deepspeed/inference/v2/ragged/includes copying deepspeed/inference/v2/kernels/core_ops/core_ops.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops copying deepspeed/inference/v2/kernels/cutlass_ops/cutlass_ops.cpp -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops creating build/lib/deepspeed/inference/v2/kernels/cutlass_ops/shared_resources copying deepspeed/inference/v2/kernels/cutlass_ops/shared_resources/weight_variant.h -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/shared_resources copying deepspeed/inference/v2/kernels/ragged_ops/ragged_ops.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/includes copying deepspeed/inference/v2/kernels/ragged_ops/includes/top_k_utils.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/includes creating build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_dtypes.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations copying deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations copying deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations copying deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_utils.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cu -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear creating build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/configs.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_matmul.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_reduction.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_cp.async.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_mma.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_core.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_gmem.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_paralleldequant.cuh -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/weight_prepacking.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.cpp -> build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations copying deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.h -> build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations copying deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations copying deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.cu -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.h -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm_api.h -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.cu -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.h -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm_api.h -> build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/attention_atom.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/flash.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/embed/embed.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/embed/embed_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cpp -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cuh -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.h -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating_cuda.cu -> build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying deepspeed/ops/sparse_attention/trsrc/matmul.tr -> build/lib/deepspeed/ops/sparse_attention/trsrc copying deepspeed/ops/sparse_attention/trsrc/softmax_bwd.tr -> build/lib/deepspeed/ops/sparse_attention/trsrc copying deepspeed/ops/sparse_attention/trsrc/softmax_fwd.tr -> build/lib/deepspeed/ops/sparse_attention/trsrc running build_scripts creating build/scripts-3.11 copying and adjusting bin/deepspeed -> build/scripts-3.11 copying and adjusting bin/deepspeed.pt -> build/scripts-3.11 copying and adjusting bin/ds -> build/scripts-3.11 copying bin/ds_ssh -> build/scripts-3.11 copying and adjusting bin/ds_report -> build/scripts-3.11 copying and adjusting bin/ds_bench -> build/scripts-3.11 copying and adjusting bin/dsr -> build/scripts-3.11 copying and adjusting bin/ds_elastic -> build/scripts-3.11 changing mode of build/scripts-3.11/deepspeed from 644 to 755 changing mode of build/scripts-3.11/deepspeed.pt from 644 to 755 changing mode of build/scripts-3.11/ds from 644 to 755 changing mode of build/scripts-3.11/ds_report from 644 to 755 changing mode of build/scripts-3.11/ds_bench from 644 to 755 changing mode of build/scripts-3.11/dsr from 644 to 755 changing mode of build/scripts-3.11/ds_elastic from 644 to 755 installing to build/bdist.linux-x86_64/wheel running install running install_lib creating build/bdist.linux-x86_64 creating build/bdist.linux-x86_64/wheel creating build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/env_report.py -> build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/git_version_info.py -> build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/pydantic_v1.py -> build/bdist.linux-x86_64/wheel/deepspeed copying build/lib/deepspeed/git_version_info_installed.py -> build/bdist.linux-x86_64/wheel/deepspeed creating build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/autotuner.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/scheduler.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning copying build/lib/deepspeed/autotuning/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning creating build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/base_tuner.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/cost_model.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/index_based_tuner.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/model_based_tuner.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner copying build/lib/deepspeed/autotuning/tuner/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/tuner creating build/bdist.linux-x86_64/wheel/deepspeed/autotuning/config_templates copying build/lib/deepspeed/autotuning/config_templates/template_zero0.json -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/config_templates copying build/lib/deepspeed/autotuning/config_templates/template_zero1.json -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/config_templates copying build/lib/deepspeed/autotuning/config_templates/template_zero2.json -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/config_templates copying build/lib/deepspeed/autotuning/config_templates/template_zero3.json -> build/bdist.linux-x86_64/wheel/deepspeed/autotuning/config_templates creating build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/deepspeed_checkpoint.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/ds_to_universal.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/reshape_3d_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/reshape_meg_2d.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/reshape_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/universal_checkpoint.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint copying build/lib/deepspeed/checkpoint/zero_checkpoint.py -> build/bdist.linux-x86_64/wheel/deepspeed/checkpoint creating build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/backend.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/ccl.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/comm.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/reduce_op.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/torch.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm copying build/lib/deepspeed/comm/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/comm creating build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/basic_layer.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/compress.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/helper.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/scheduler.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression copying build/lib/deepspeed/compression/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/compression creating build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/elastic_agent.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/elasticity.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity copying build/lib/deepspeed/elasticity/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/elasticity creating build/bdist.linux-x86_64/wheel/deepspeed/inference copying build/lib/deepspeed/inference/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference copying build/lib/deepspeed/inference/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference copying build/lib/deepspeed/inference/engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference creating build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization copying build/lib/deepspeed/inference/quantization/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization copying build/lib/deepspeed/inference/quantization/layers.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization copying build/lib/deepspeed/inference/quantization/quantization.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization copying build/lib/deepspeed/inference/quantization/quantization_context.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization copying build/lib/deepspeed/inference/quantization/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/quantization creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/allocator.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/config_v2.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/engine_factory.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/engine_v2.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/inference_parameter.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/inference_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/logging.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 copying build/lib/deepspeed/inference/v2/scheduling_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2 creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/checkpoint copying build/lib/deepspeed/inference/v2/checkpoint/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/checkpoint copying build/lib/deepspeed/inference/v2/checkpoint/base_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/checkpoint copying build/lib/deepspeed/inference/v2/checkpoint/huggingface_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/checkpoint copying build/lib/deepspeed/inference/v2/checkpoint/in_memory_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/checkpoint creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels copying build/lib/deepspeed/inference/v2/kernels/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels copying build/lib/deepspeed/inference/v2/kernels/ds_kernel.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops copying build/lib/deepspeed/inference/v2/kernels/core_ops/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/bias_activations creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/blas_kernels copying build/lib/deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/blas_kernels creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_fp_ln_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_ln.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_post_ln.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_pre_ln.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/configs.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_matmul.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_reduction.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_cp.async.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_mma.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_core.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_gmem.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_paralleldequant.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/weight_prepacking.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_linear/include creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_pre_norm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm copying build/lib/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops/gated_activations copying build/lib/deepspeed/inference/v2/kernels/core_ops/core_ops.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/core_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm_api.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/mixed_moe_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm_api.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/cutlass_ops.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/shared_resources copying build/lib/deepspeed/inference/v2/kernels/cutlass_ops/shared_resources/weight_variant.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/cutlass_ops/shared_resources creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/atom_builder copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/atom_builder creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/attention_atom.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/flash.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/blocked_flash creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/embed.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/embed.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/embed/embed_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/embed creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_trained_kv_rotary.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/linear_blocked_kv_copy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/logits_gather creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_gather creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/moe_scatter creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating_cuda.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/top_k_gating copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_ops.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/includes copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/includes/top_k_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/includes creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_dtypes.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers copying build/lib/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes copying build/lib/deepspeed/inference/v2/kernels/includes/activation_type.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes copying build/lib/deepspeed/inference/v2/kernels/includes/conversion_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes copying build/lib/deepspeed/inference/v2/kernels/includes/ds_kernel_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes copying build/lib/deepspeed/inference/v2/kernels/includes/memory_access_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes copying build/lib/deepspeed/inference/v2/kernels/includes/reduction_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/kernels/includes creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/flat_model_helpers.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/inference_model_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/inference_policy_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/inference_transformer_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/layer_container_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations copying build/lib/deepspeed/inference/v2/model_implementations/parameter_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/attn_output_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/embedding_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/invfreq_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/mlp_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/moe_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/norm_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/qkv_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters copying build/lib/deepspeed/inference/v2/model_implementations/common_parameters/unembed_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/common_parameters creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/falcon copying build/lib/deepspeed/inference/v2/model_implementations/falcon/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/falcon copying build/lib/deepspeed/inference/v2/model_implementations/falcon/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/falcon copying build/lib/deepspeed/inference/v2/model_implementations/falcon/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/falcon copying build/lib/deepspeed/inference/v2/model_implementations/falcon/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/falcon creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/llama_v2 copying build/lib/deepspeed/inference/v2/model_implementations/llama_v2/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/llama_v2 copying build/lib/deepspeed/inference/v2/model_implementations/llama_v2/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/llama_v2 copying build/lib/deepspeed/inference/v2/model_implementations/llama_v2/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/llama_v2 copying build/lib/deepspeed/inference/v2/model_implementations/llama_v2/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/llama_v2 creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mistral copying build/lib/deepspeed/inference/v2/model_implementations/mistral/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mistral copying build/lib/deepspeed/inference/v2/model_implementations/mistral/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mistral copying build/lib/deepspeed/inference/v2/model_implementations/mistral/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mistral copying build/lib/deepspeed/inference/v2/model_implementations/mistral/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mistral creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mixtral copying build/lib/deepspeed/inference/v2/model_implementations/mixtral/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mixtral copying build/lib/deepspeed/inference/v2/model_implementations/mixtral/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mixtral copying build/lib/deepspeed/inference/v2/model_implementations/mixtral/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mixtral copying build/lib/deepspeed/inference/v2/model_implementations/mixtral/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/mixtral creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/opt copying build/lib/deepspeed/inference/v2/model_implementations/opt/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/opt copying build/lib/deepspeed/inference/v2/model_implementations/opt/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/opt copying build/lib/deepspeed/inference/v2/model_implementations/opt/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/opt copying build/lib/deepspeed/inference/v2/model_implementations/opt/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/opt creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/phi copying build/lib/deepspeed/inference/v2/model_implementations/phi/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/phi copying build/lib/deepspeed/inference/v2/model_implementations/phi/containers.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/phi copying build/lib/deepspeed/inference/v2/model_implementations/phi/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/phi copying build/lib/deepspeed/inference/v2/model_implementations/phi/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/phi creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen copying build/lib/deepspeed/inference/v2/model_implementations/qwen/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen copying build/lib/deepspeed/inference/v2/model_implementations/qwen/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen copying build/lib/deepspeed/inference/v2/model_implementations/qwen/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen copying build/lib/deepspeed/inference/v2/model_implementations/qwen/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen_v2 copying build/lib/deepspeed/inference/v2/model_implementations/qwen_v2/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen_v2 copying build/lib/deepspeed/inference/v2/model_implementations/qwen_v2/container.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen_v2 copying build/lib/deepspeed/inference/v2/model_implementations/qwen_v2/model.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen_v2 copying build/lib/deepspeed/inference/v2/model_implementations/qwen_v2/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/qwen_v2 creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/attn.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/attn_out.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/embedding.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/mlp.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/qkv.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/types.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/unembed.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding copying build/lib/deepspeed/inference/v2/model_implementations/sharding/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/model_implementations/sharding creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules copying build/lib/deepspeed/inference/v2/modules/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules copying build/lib/deepspeed/inference/v2/modules/ds_module.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules copying build/lib/deepspeed/inference/v2/modules/heuristics.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules copying build/lib/deepspeed/inference/v2/modules/module_registry.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/attention_configs.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/embedding_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/linear_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/moe_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/norm_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs copying build/lib/deepspeed/inference/v2/modules/configs/unembed_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/configs creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations copying build/lib/deepspeed/inference/v2/modules/implementations/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/attention copying build/lib/deepspeed/inference/v2/modules/implementations/attention/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/attention copying build/lib/deepspeed/inference/v2/modules/implementations/attention/dense_blocked_attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/attention creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/embedding copying build/lib/deepspeed/inference/v2/modules/implementations/embedding/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/embedding copying build/lib/deepspeed/inference/v2/modules/implementations/embedding/ragged_embedding.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/embedding creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/linear copying build/lib/deepspeed/inference/v2/modules/implementations/linear/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/linear copying build/lib/deepspeed/inference/v2/modules/implementations/linear/blas_fp_linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/linear copying build/lib/deepspeed/inference/v2/modules/implementations/linear/quantized_linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/linear creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/moe copying build/lib/deepspeed/inference/v2/modules/implementations/moe/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/moe copying build/lib/deepspeed/inference/v2/modules/implementations/moe/cutlass_multi_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/moe creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/post_norm copying build/lib/deepspeed/inference/v2/modules/implementations/post_norm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/post_norm copying build/lib/deepspeed/inference/v2/modules/implementations/post_norm/cuda_post_ln.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/post_norm creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/pre_norm copying build/lib/deepspeed/inference/v2/modules/implementations/pre_norm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/pre_norm copying build/lib/deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_ln.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/pre_norm copying build/lib/deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_rms.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/pre_norm creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/unembed copying build/lib/deepspeed/inference/v2/modules/implementations/unembed/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/unembed copying build/lib/deepspeed/inference/v2/modules/implementations/unembed/ragged_unembed.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/implementations/unembed creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/attention_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/embedding_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/linear_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/moe_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/post_norm_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/pre_norm_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces copying build/lib/deepspeed/inference/v2/modules/interfaces/unembed_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/modules/interfaces creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/blocked_allocator.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/kv_cache.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/manager_configs.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/ragged_manager.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/ragged_wrapper.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged copying build/lib/deepspeed/inference/v2/ragged/sequence_descriptor.py -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged/csrc copying build/lib/deepspeed/inference/v2/ragged/csrc/fast_host_buffer.cu -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged/csrc copying build/lib/deepspeed/inference/v2/ragged/csrc/ragged_ops.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged/csrc creating build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged/includes copying build/lib/deepspeed/inference/v2/ragged/includes/fast_host_buffer.h -> build/bdist.linux-x86_64/wheel/deepspeed/inference/v2/ragged/includes creating build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/launch.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/launcher_helper.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/multinode_runner.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher copying build/lib/deepspeed/launcher/runner.py -> build/bdist.linux-x86_64/wheel/deepspeed/launcher creating build/bdist.linux-x86_64/wheel/deepspeed/model_implementations copying build/lib/deepspeed/model_implementations/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations creating build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/diffusers copying build/lib/deepspeed/model_implementations/diffusers/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/diffusers copying build/lib/deepspeed/model_implementations/diffusers/unet.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/diffusers copying build/lib/deepspeed/model_implementations/diffusers/vae.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/diffusers creating build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/features copying build/lib/deepspeed/model_implementations/features/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/features copying build/lib/deepspeed/model_implementations/features/cuda_graph.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/features creating build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/clip_encoder.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_base.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_bert.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_bloom.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_gpt.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_llama2.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_megatron_gpt.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_opt.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers copying build/lib/deepspeed/model_implementations/transformers/ds_transformer.py -> build/bdist.linux-x86_64/wheel/deepspeed/model_implementations/transformers creating build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/auto_tp.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/auto_tp_model_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/fusedqkv_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/inject.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/layers.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/load_checkpoint.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/module_quantize.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/replace_module.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/replace_policy.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/tp_shard.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject copying build/lib/deepspeed/module_inject/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject creating build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/base.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/base_moe.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/bert.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/bloom.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/clip.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/distil_bert.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/gpt2.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/gptj.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/gptneo.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/gptneox.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/internlm.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/llama.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/llama2.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/megatron_gpt.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/megatron_gpt_moe.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/opt.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/unet.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers copying build/lib/deepspeed/module_inject/containers/vae.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers creating build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/gated_mlp.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/hybrid_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/hybrid_megatron.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/megatron.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/meta_tensor.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features copying build/lib/deepspeed/module_inject/containers/features/split_qkv.py -> build/bdist.linux-x86_64/wheel/deepspeed/module_inject/containers/features creating build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/experts.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/layer.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/mappings.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/sharded_moe.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe copying build/lib/deepspeed/moe/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/moe creating build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/csv_monitor.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/monitor.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/tensorboard.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor copying build/lib/deepspeed/monitor/wandb.py -> build/bdist.linux-x86_64/wheel/deepspeed/monitor creating build/bdist.linux-x86_64/wheel/deepspeed/nebula copying build/lib/deepspeed/nebula/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/nebula copying build/lib/deepspeed/nebula/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/nebula copying build/lib/deepspeed/nebula/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/nebula creating build/bdist.linux-x86_64/wheel/deepspeed/ops copying build/lib/deepspeed/ops/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops creating build/bdist.linux-x86_64/wheel/deepspeed/ops/adagrad copying build/lib/deepspeed/ops/adagrad/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adagrad copying build/lib/deepspeed/ops/adagrad/cpu_adagrad.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adagrad creating build/bdist.linux-x86_64/wheel/deepspeed/ops/adam copying build/lib/deepspeed/ops/adam/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adam copying build/lib/deepspeed/ops/adam/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adam copying build/lib/deepspeed/ops/adam/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adam copying build/lib/deepspeed/ops/adam/multi_tensor_apply.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/adam creating build/bdist.linux-x86_64/wheel/deepspeed/ops/aio copying build/lib/deepspeed/ops/aio/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/aio creating build/bdist.linux-x86_64/wheel/deepspeed/ops/deepspeed4science copying build/lib/deepspeed/ops/deepspeed4science/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/deepspeed4science copying build/lib/deepspeed/ops/deepspeed4science/evoformer_attn.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/deepspeed4science creating build/bdist.linux-x86_64/wheel/deepspeed/ops/lamb copying build/lib/deepspeed/ops/lamb/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lamb copying build/lib/deepspeed/ops/lamb/fused_lamb.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lamb creating build/bdist.linux-x86_64/wheel/deepspeed/ops/lion copying build/lib/deepspeed/ops/lion/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lion copying build/lib/deepspeed/ops/lion/cpu_lion.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lion copying build/lib/deepspeed/ops/lion/fused_lion.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lion copying build/lib/deepspeed/ops/lion/multi_tensor_apply.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/lion creating build/bdist.linux-x86_64/wheel/deepspeed/ops/quantizer copying build/lib/deepspeed/ops/quantizer/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/quantizer copying build/lib/deepspeed/ops/quantizer/quantizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/quantizer creating build/bdist.linux-x86_64/wheel/deepspeed/ops/random_ltd copying build/lib/deepspeed/ops/random_ltd/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/random_ltd copying build/lib/deepspeed/ops/random_ltd/dropping_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/random_ltd creating build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/bert_sparse_self_attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/matmul.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/softmax.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/sparse_attention_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/sparse_self_attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention copying build/lib/deepspeed/ops/sparse_attention/sparsity_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention creating build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention/trsrc copying build/lib/deepspeed/ops/sparse_attention/trsrc/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention/trsrc copying build/lib/deepspeed/ops/sparse_attention/trsrc/matmul.tr -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention/trsrc copying build/lib/deepspeed/ops/sparse_attention/trsrc/softmax_bwd.tr -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention/trsrc copying build/lib/deepspeed/ops/sparse_attention/trsrc/softmax_fwd.tr -> build/bdist.linux-x86_64/wheel/deepspeed/ops/sparse_attention/trsrc creating build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer copying build/lib/deepspeed/ops/transformer/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer copying build/lib/deepspeed/ops/transformer/transformer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer creating build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/bias_add.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/diffusers_2d_transformer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/diffusers_attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/diffusers_transformer_block.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/ds_attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/ds_mlp.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/moe_inference.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference copying build/lib/deepspeed/ops/transformer/inference/triton_ops.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference creating build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/base.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/gelu_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/mlp_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/qkv_gemm.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/residual_add.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/softmax.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/softmax_context.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding copying build/lib/deepspeed/ops/transformer/inference/op_binding/vector_matmul.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/op_binding creating build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/attention.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/gelu.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/layer_norm.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/matmul_ext.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/mlp.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/ops.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/residual_add.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/softmax.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton copying build/lib/deepspeed/ops/transformer/inference/triton/triton_matmul_kernel.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/transformer/inference/triton creating build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/all_ops.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/async_io.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/cpu_adagrad.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/cpu_lion.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/evoformer_attn.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/fused_lamb.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/fused_lion.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/inference_core_ops.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/inference_cutlass_builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/quantizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/ragged_ops.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/ragged_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/random_ltd.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/sparse_attn.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/spatial_inference.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/stochastic_transformer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/transformer.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder copying build/lib/deepspeed/ops/op_builder/transformer_inference.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder creating build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/comm.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu copying build/lib/deepspeed/ops/op_builder/cpu/no_impl.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/cpu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu copying build/lib/deepspeed/ops/op_builder/hpu/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu copying build/lib/deepspeed/ops/op_builder/hpu/builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu copying build/lib/deepspeed/ops/op_builder/hpu/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu copying build/lib/deepspeed/ops/op_builder/hpu/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu copying build/lib/deepspeed/ops/op_builder/hpu/no_impl.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/hpu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/async_io.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/cpu_adagrad.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/cpu_lion.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/inference.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu copying build/lib/deepspeed/ops/op_builder/npu/no_impl.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/npu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/async_io.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/builder.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/cpu_adagrad.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/cpu_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu copying build/lib/deepspeed/ops/op_builder/xpu/fused_adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/ops/op_builder/xpu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adagrad copying build/lib/deepspeed/ops/csrc/adagrad/cpu_adagrad.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adagrad creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam copying build/lib/deepspeed/ops/csrc/adam/cpu_adam.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam copying build/lib/deepspeed/ops/csrc/adam/cpu_adam_impl.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam copying build/lib/deepspeed/ops/csrc/adam/fused_adam_frontend.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam copying build/lib/deepspeed/ops/csrc/adam/multi_tensor_adam.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam copying build/lib/deepspeed/ops/csrc/adam/multi_tensor_apply.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/adam creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_common.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_common.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_types.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_types.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common copying build/lib/deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/common creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib copying build/lib/deepspeed/ops/csrc/aio/py_lib/py_ds_aio.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_lib creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_test copying build/lib/deepspeed/ops/csrc/aio/py_test/single_process_config.json -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/aio/py_test creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/common copying build/lib/deepspeed/ops/csrc/common/custom_cuda_kernel.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/common creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/adam copying build/lib/deepspeed/ops/csrc/cpu/adam/fused_adam.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/adam creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/comm copying build/lib/deepspeed/ops/csrc/cpu/comm/ccl.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/comm creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/lion copying build/lib/deepspeed/ops/csrc/cpu/lion/fused_lion.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/cpu/lion creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_back.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_cu.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm_kernel_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_backward.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_forward.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_grad_bias.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_pipelined.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_rescale_output.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_thread_apply_logsumexp.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_base.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_multistage.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_pipelined.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/find_default_mma.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_accum_lambda_iterator.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_from_smem.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/epilogue_predicated_tile_iterator.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/make_residual_last.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_access_iterator_residual_last.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_atomic.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_residual_last.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/transpose_warp_iterator.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/warp_iterator_from_smem.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/bias_broadcast.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform copying build/lib/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/tile_smem_loader.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/StopWatch.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/Timer.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/activation_type.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/compat.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/context.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/conversion_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/cpu_adagrad.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/cpu_adam.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/cpu_lion.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/cublas_wrappers.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/custom_cuda_layers.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/dequantization_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/dropout.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/ds_kernel_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/ds_transformer_cuda.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/feed_forward.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/gelu.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/gemm_test.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/general_kernels.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/memory_access_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/normalize_layer.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/quantization.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/quantization_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/quantizer.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/reduction_utils.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/simd.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/softmax.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/strided_batch_gemm.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes copying build/lib/deepspeed/ops/csrc/includes/type_shim.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/includes creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lamb copying build/lib/deepspeed/ops/csrc/lamb/fused_lamb_cuda.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lamb copying build/lib/deepspeed/ops/csrc/lamb/fused_lamb_cuda_kernel.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lamb creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion copying build/lib/deepspeed/ops/csrc/lion/cpu_lion.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion copying build/lib/deepspeed/ops/csrc/lion/cpu_lion_impl.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion copying build/lib/deepspeed/ops/csrc/lion/fused_lion_frontend.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion copying build/lib/deepspeed/ops/csrc/lion/multi_tensor_apply.cuh -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion copying build/lib/deepspeed/ops/csrc/lion/multi_tensor_lion.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/lion creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/dequantize.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/fake_quantizer.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/pt_binding.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/quant_reduce.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/quantize.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/quantize_intX.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization copying build/lib/deepspeed/ops/csrc/quantization/swizzled_quantize.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/quantization creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/random_ltd copying build/lib/deepspeed/ops/csrc/random_ltd/gather_scatter.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/random_ltd copying build/lib/deepspeed/ops/csrc/random_ltd/pt_binding.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/random_ltd copying build/lib/deepspeed/ops/csrc/random_ltd/slice_attn_masks.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/random_ltd copying build/lib/deepspeed/ops/csrc/random_ltd/token_sort.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/random_ltd creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/sparse_attention copying build/lib/deepspeed/ops/csrc/sparse_attention/utils.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/sparse_attention creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial/csrc copying build/lib/deepspeed/ops/csrc/spatial/csrc/opt_bias_add.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial/csrc copying build/lib/deepspeed/ops/csrc/spatial/csrc/pt_binding.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial/csrc creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial/includes copying build/lib/deepspeed/ops/csrc/spatial/includes/spatial_cuda_layers.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/spatial/includes creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/cublas_wrappers.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/dropout_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/ds_transformer_cuda.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/gelu_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/general_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/normalize_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/softmax_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer copying build/lib/deepspeed/ops/csrc/transformer/transform_kernels.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/apply_rotary_pos_emb.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/dequantize.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/gelu.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/layer_norm.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/pointwise_ops.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/pt_binding.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/relu.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/rms_norm.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/softmax.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc copying build/lib/deepspeed/ops/csrc/transformer/inference/csrc/transform.cu -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/csrc creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/includes copying build/lib/deepspeed/ops/csrc/transformer/inference/includes/inference_context.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/includes copying build/lib/deepspeed/ops/csrc/transformer/inference/includes/inference_cublas_wrappers.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/includes copying build/lib/deepspeed/ops/csrc/transformer/inference/includes/inference_cuda_layers.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/transformer/inference/includes creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/utils copying build/lib/deepspeed/ops/csrc/utils/flatten_unflatten.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/utils creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adagrad copying build/lib/deepspeed/ops/csrc/xpu/adagrad/cpu_adagrad.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adagrad creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adam copying build/lib/deepspeed/ops/csrc/xpu/adam/cpu_adam.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adam copying build/lib/deepspeed/ops/csrc/xpu/adam/cpu_adam_impl.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adam copying build/lib/deepspeed/ops/csrc/xpu/adam/fused_adam_frontend.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adam copying build/lib/deepspeed/ops/csrc/xpu/adam/multi_tensor_adam.dp.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/adam creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/common copying build/lib/deepspeed/ops/csrc/xpu/common/custom_cuda_kernel.dp.cpp -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/common creating build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes copying build/lib/deepspeed/ops/csrc/xpu/includes/compat.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes copying build/lib/deepspeed/ops/csrc/xpu/includes/cpu_adagrad.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes copying build/lib/deepspeed/ops/csrc/xpu/includes/cpu_adam.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes copying build/lib/deepspeed/ops/csrc/xpu/includes/simd.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes copying build/lib/deepspeed/ops/csrc/xpu/includes/type_shim.h -> build/bdist.linux-x86_64/wheel/deepspeed/ops/csrc/xpu/includes creating build/bdist.linux-x86_64/wheel/deepspeed/pipe copying build/lib/deepspeed/pipe/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/pipe creating build/bdist.linux-x86_64/wheel/deepspeed/profiling copying build/lib/deepspeed/profiling/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/profiling copying build/lib/deepspeed/profiling/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/profiling copying build/lib/deepspeed/profiling/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/profiling creating build/bdist.linux-x86_64/wheel/deepspeed/profiling/flops_profiler copying build/lib/deepspeed/profiling/flops_profiler/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/profiling/flops_profiler copying build/lib/deepspeed/profiling/flops_profiler/profiler.py -> build/bdist.linux-x86_64/wheel/deepspeed/profiling/flops_profiler creating build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/bf16_optimizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/compiler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/config_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/dataloader.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/eigenvalue.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/hybrid_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/lr_schedules.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/progressive_layer_drop.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/quantize.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/sparse_tensor.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/state_dict_factory.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime copying build/lib/deepspeed/runtime/weight_quantizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/activation_checkpointing copying build/lib/deepspeed/runtime/activation_checkpointing/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/activation_checkpointing copying build/lib/deepspeed/runtime/activation_checkpointing/checkpointing.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/activation_checkpointing copying build/lib/deepspeed/runtime/activation_checkpointing/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/activation_checkpointing creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/checkpoint_engine copying build/lib/deepspeed/runtime/checkpoint_engine/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/checkpoint_engine copying build/lib/deepspeed/runtime/checkpoint_engine/checkpoint_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/checkpoint_engine copying build/lib/deepspeed/runtime/checkpoint_engine/nebula_checkpoint_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/checkpoint_engine copying build/lib/deepspeed/runtime/checkpoint_engine/torch_checkpoint_engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/checkpoint_engine creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm copying build/lib/deepspeed/runtime/comm/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm copying build/lib/deepspeed/runtime/comm/coalesced_collectives.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm copying build/lib/deepspeed/runtime/comm/hccl.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm copying build/lib/deepspeed/runtime/comm/mpi.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm copying build/lib/deepspeed/runtime/comm/nccl.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/comm creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/compression copying build/lib/deepspeed/runtime/compression/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/compression copying build/lib/deepspeed/runtime/compression/cupy.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/compression creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline copying build/lib/deepspeed/runtime/data_pipeline/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline copying build/lib/deepspeed/runtime/data_pipeline/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline copying build/lib/deepspeed/runtime/data_pipeline/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline copying build/lib/deepspeed/runtime/data_pipeline/curriculum_scheduler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing copying build/lib/deepspeed/runtime/data_pipeline/data_routing/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing copying build/lib/deepspeed/runtime/data_pipeline/data_routing/basic_layer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing copying build/lib/deepspeed/runtime/data_pipeline/data_routing/helper.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing copying build/lib/deepspeed/runtime/data_pipeline/data_routing/scheduler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing copying build/lib/deepspeed/runtime/data_pipeline/data_routing/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_routing creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling copying build/lib/deepspeed/runtime/data_pipeline/data_sampling/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling copying build/lib/deepspeed/runtime/data_pipeline/data_sampling/data_analyzer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling copying build/lib/deepspeed/runtime/data_pipeline/data_sampling/data_sampler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling copying build/lib/deepspeed/runtime/data_pipeline/data_sampling/indexed_dataset.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling copying build/lib/deepspeed/runtime/data_pipeline/data_sampling/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/data_pipeline/data_sampling creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16 copying build/lib/deepspeed/runtime/fp16/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16 copying build/lib/deepspeed/runtime/fp16/fused_optimizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16 copying build/lib/deepspeed/runtime/fp16/loss_scaler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16 copying build/lib/deepspeed/runtime/fp16/unfused_optimizer.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16 creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16/onebit copying build/lib/deepspeed/runtime/fp16/onebit/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16/onebit copying build/lib/deepspeed/runtime/fp16/onebit/adam.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16/onebit copying build/lib/deepspeed/runtime/fp16/onebit/lamb.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16/onebit copying build/lib/deepspeed/runtime/fp16/onebit/zoadam.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/fp16/onebit creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/engine.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/module.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/p2p.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/schedule.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe copying build/lib/deepspeed/runtime/pipe/topology.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/pipe creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/aio_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/async_swapper.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/constants.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/optimizer_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/partitioned_optimizer_swapper.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/partitioned_param_swapper.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/pipelined_optimizer_swapper.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor copying build/lib/deepspeed/runtime/swap_tensor/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/swap_tensor creating build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/contiguous_memory_allocator.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/linear.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/mics.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/mics_utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/offload_config.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/parameter_offload.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/partition_parameters.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/partitioned_param_coordinator.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/partitioned_param_profiler.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/stage3.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/stage_1_and_2.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/test.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/tiling.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero copying build/lib/deepspeed/runtime/zero/utils.py -> build/bdist.linux-x86_64/wheel/deepspeed/runtime/zero creating build/bdist.linux-x86_64/wheel/deepspeed/sequence copying build/lib/deepspeed/sequence/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/sequence copying build/lib/deepspeed/sequence/layer.py -> build/bdist.linux-x86_64/wheel/deepspeed/sequence creating build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/comms_logging.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/debug.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/exceptions.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/groups.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/init_on_device.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/logging.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/mixed_precision_linkage.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/numa.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/nvtx.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/tensor_fragment.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/timer.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/types.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/z3_leaf_module.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils copying build/lib/deepspeed/utils/zero_to_fp32.py -> build/bdist.linux-x86_64/wheel/deepspeed/utils creating build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/__init__.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/abstract_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/cpu_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/cuda_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/hpu_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/mps_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/npu_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/real_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator copying build/lib/deepspeed/accelerator/xpu_accelerator.py -> build/bdist.linux-x86_64/wheel/deepspeed/accelerator running install_egg_info Copying deepspeed.egg-info to build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown-py3.11.egg-info running install_scripts creating build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data creating build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/deepspeed -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/deepspeed.pt -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/ds -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/ds_ssh -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/ds_report -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/ds_bench -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/dsr -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts copying build/scripts-3.11/ds_elastic -> build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/deepspeed to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/deepspeed.pt to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/ds to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/ds_ssh to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/ds_report to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/ds_bench to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/dsr to 755 changing mode of build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.data/scripts/ds_elastic to 755 creating build/bdist.linux-x86_64/wheel/deepspeed-0.14.0+unknown.dist-info/WHEEL creating '/tmp/pip-wheel-mgrq7tmi/.tmp-q5ctv4tz/deepspeed-0.14.0+unknown-py3-none-any.whl' and adding 'build/bdist.linux-x86_64/wheel' to it adding 'deepspeed/__init__.py' adding 'deepspeed/constants.py' adding 'deepspeed/env_report.py' adding 'deepspeed/git_version_info.py' adding 'deepspeed/git_version_info_installed.py' adding 'deepspeed/pydantic_v1.py' adding 'deepspeed/accelerator/__init__.py' adding 'deepspeed/accelerator/abstract_accelerator.py' adding 'deepspeed/accelerator/cpu_accelerator.py' adding 'deepspeed/accelerator/cuda_accelerator.py' adding 'deepspeed/accelerator/hpu_accelerator.py' adding 'deepspeed/accelerator/mps_accelerator.py' adding 'deepspeed/accelerator/npu_accelerator.py' adding 'deepspeed/accelerator/real_accelerator.py' adding 'deepspeed/accelerator/xpu_accelerator.py' adding 'deepspeed/autotuning/__init__.py' adding 'deepspeed/autotuning/autotuner.py' adding 'deepspeed/autotuning/config.py' adding 'deepspeed/autotuning/constants.py' adding 'deepspeed/autotuning/scheduler.py' adding 'deepspeed/autotuning/utils.py' adding 'deepspeed/autotuning/config_templates/template_zero0.json' adding 'deepspeed/autotuning/config_templates/template_zero1.json' adding 'deepspeed/autotuning/config_templates/template_zero2.json' adding 'deepspeed/autotuning/config_templates/template_zero3.json' adding 'deepspeed/autotuning/tuner/__init__.py' adding 'deepspeed/autotuning/tuner/base_tuner.py' adding 'deepspeed/autotuning/tuner/cost_model.py' adding 'deepspeed/autotuning/tuner/index_based_tuner.py' adding 'deepspeed/autotuning/tuner/model_based_tuner.py' adding 'deepspeed/autotuning/tuner/utils.py' adding 'deepspeed/checkpoint/__init__.py' adding 'deepspeed/checkpoint/constants.py' adding 'deepspeed/checkpoint/deepspeed_checkpoint.py' adding 'deepspeed/checkpoint/ds_to_universal.py' adding 'deepspeed/checkpoint/reshape_3d_utils.py' adding 'deepspeed/checkpoint/reshape_meg_2d.py' adding 'deepspeed/checkpoint/reshape_utils.py' adding 'deepspeed/checkpoint/universal_checkpoint.py' adding 'deepspeed/checkpoint/utils.py' adding 'deepspeed/checkpoint/zero_checkpoint.py' adding 'deepspeed/comm/__init__.py' adding 'deepspeed/comm/backend.py' adding 'deepspeed/comm/ccl.py' adding 'deepspeed/comm/comm.py' adding 'deepspeed/comm/config.py' adding 'deepspeed/comm/constants.py' adding 'deepspeed/comm/reduce_op.py' adding 'deepspeed/comm/torch.py' adding 'deepspeed/comm/utils.py' adding 'deepspeed/compression/__init__.py' adding 'deepspeed/compression/basic_layer.py' adding 'deepspeed/compression/compress.py' adding 'deepspeed/compression/config.py' adding 'deepspeed/compression/constants.py' adding 'deepspeed/compression/helper.py' adding 'deepspeed/compression/scheduler.py' adding 'deepspeed/compression/utils.py' adding 'deepspeed/elasticity/__init__.py' adding 'deepspeed/elasticity/config.py' adding 'deepspeed/elasticity/constants.py' adding 'deepspeed/elasticity/elastic_agent.py' adding 'deepspeed/elasticity/elasticity.py' adding 'deepspeed/elasticity/utils.py' adding 'deepspeed/inference/__init__.py' adding 'deepspeed/inference/config.py' adding 'deepspeed/inference/engine.py' adding 'deepspeed/inference/quantization/__init__.py' adding 'deepspeed/inference/quantization/layers.py' adding 'deepspeed/inference/quantization/quantization.py' adding 'deepspeed/inference/quantization/quantization_context.py' adding 'deepspeed/inference/quantization/utils.py' adding 'deepspeed/inference/v2/__init__.py' adding 'deepspeed/inference/v2/allocator.py' adding 'deepspeed/inference/v2/config_v2.py' adding 'deepspeed/inference/v2/engine_factory.py' adding 'deepspeed/inference/v2/engine_v2.py' adding 'deepspeed/inference/v2/inference_parameter.py' adding 'deepspeed/inference/v2/inference_utils.py' adding 'deepspeed/inference/v2/logging.py' adding 'deepspeed/inference/v2/scheduling_utils.py' adding 'deepspeed/inference/v2/checkpoint/__init__.py' adding 'deepspeed/inference/v2/checkpoint/base_engine.py' adding 'deepspeed/inference/v2/checkpoint/huggingface_engine.py' adding 'deepspeed/inference/v2/checkpoint/in_memory_engine.py' adding 'deepspeed/inference/v2/kernels/__init__.py' adding 'deepspeed/inference/v2/kernels/ds_kernel.py' adding 'deepspeed/inference/v2/kernels/core_ops/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/core_ops.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/bias_activations/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.h' adding 'deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation.py' adding 'deepspeed/inference/v2/kernels/core_ops/bias_activations/bias_activation_cuda.cu' adding 'deepspeed/inference/v2/kernels/core_ops/blas_kernels/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas.h' adding 'deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_linear.py' adding 'deepspeed/inference/v2/kernels/core_ops/blas_kernels/blas_utils.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_fp_ln_base.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_ln.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_post_ln.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/cuda_pre_ln.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_layer_norm/layer_norm_cuda.cu' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/cuda_linear_kernels.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cu' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/fp6_linear.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/configs.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_matmul.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/kernel_reduction.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_cp.async.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/ptx_mma.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_core.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_gmem.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/utils_paralleldequant.cuh' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_linear/include/weight_prepacking.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.h' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_base.py' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_norm_cuda.cu' adding 'deepspeed/inference/v2/kernels/core_ops/cuda_rms_norm/rms_pre_norm.py' adding 'deepspeed/inference/v2/kernels/core_ops/gated_activations/__init__.py' adding 'deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation.py' adding 'deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.cpp' adding 'deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels.h' adding 'deepspeed/inference/v2/kernels/core_ops/gated_activations/gated_activation_kernels_cuda.cu' adding 'deepspeed/inference/v2/kernels/cutlass_ops/__init__.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/cutlass_ops.cpp' adding 'deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/__init__.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.cu' adding 'deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.h' adding 'deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/mixed_gemm/mixed_gemm_api.h' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/__init__.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/mixed_moe_gemm.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.cu' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.h' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm.py' adding 'deepspeed/inference/v2/kernels/cutlass_ops/moe_gemm/moe_gemm_api.h' adding 'deepspeed/inference/v2/kernels/cutlass_ops/shared_resources/weight_variant.h' adding 'deepspeed/inference/v2/kernels/includes/activation_type.h' adding 'deepspeed/inference/v2/kernels/includes/conversion_utils.h' adding 'deepspeed/inference/v2/kernels/includes/ds_kernel_utils.h' adding 'deepspeed/inference/v2/kernels/includes/memory_access_utils.h' adding 'deepspeed/inference/v2/kernels/includes/reduction_utils.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/ragged_ops.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/atom_builder/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/atom_builder/atom_builder.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/attention_atom.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/blocked_flash.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/blocked_flash/flash.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/embed.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/embed.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/embed.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/embed/embed_cuda.cu' adding 'deepspeed/inference/v2/kernels/ragged_ops/includes/top_k_utils.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_kv_rotary_cuda.cu' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/blocked_trained_kv_rotary.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/linear_blocked_kv_rotary/linear_blocked_kv_copy.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/logits_gather/logits_gather_cuda.cu' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_gather/moe_gather_cuda.cu' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/moe_scatter/moe_scatter_cuda.cu' adding 'deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_dtypes.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/ragged_helpers/ragged_kernel_helpers.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/__init__.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cpp' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.cuh' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.h' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating.py' adding 'deepspeed/inference/v2/kernels/ragged_ops/top_k_gating/top_k_gating_cuda.cu' adding 'deepspeed/inference/v2/model_implementations/__init__.py' adding 'deepspeed/inference/v2/model_implementations/flat_model_helpers.py' adding 'deepspeed/inference/v2/model_implementations/inference_model_base.py' adding 'deepspeed/inference/v2/model_implementations/inference_policy_base.py' adding 'deepspeed/inference/v2/model_implementations/inference_transformer_base.py' adding 'deepspeed/inference/v2/model_implementations/layer_container_base.py' adding 'deepspeed/inference/v2/model_implementations/parameter_base.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/__init__.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/attn_output_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/embedding_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/invfreq_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/mlp_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/moe_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/norm_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/qkv_parameters.py' adding 'deepspeed/inference/v2/model_implementations/common_parameters/unembed_parameters.py' adding 'deepspeed/inference/v2/model_implementations/falcon/__init__.py' adding 'deepspeed/inference/v2/model_implementations/falcon/container.py' adding 'deepspeed/inference/v2/model_implementations/falcon/model.py' adding 'deepspeed/inference/v2/model_implementations/falcon/policy.py' adding 'deepspeed/inference/v2/model_implementations/llama_v2/__init__.py' adding 'deepspeed/inference/v2/model_implementations/llama_v2/container.py' adding 'deepspeed/inference/v2/model_implementations/llama_v2/model.py' adding 'deepspeed/inference/v2/model_implementations/llama_v2/policy.py' adding 'deepspeed/inference/v2/model_implementations/mistral/__init__.py' adding 'deepspeed/inference/v2/model_implementations/mistral/container.py' adding 'deepspeed/inference/v2/model_implementations/mistral/model.py' adding 'deepspeed/inference/v2/model_implementations/mistral/policy.py' adding 'deepspeed/inference/v2/model_implementations/mixtral/__init__.py' adding 'deepspeed/inference/v2/model_implementations/mixtral/container.py' adding 'deepspeed/inference/v2/model_implementations/mixtral/model.py' adding 'deepspeed/inference/v2/model_implementations/mixtral/policy.py' adding 'deepspeed/inference/v2/model_implementations/opt/__init__.py' adding 'deepspeed/inference/v2/model_implementations/opt/container.py' adding 'deepspeed/inference/v2/model_implementations/opt/model.py' adding 'deepspeed/inference/v2/model_implementations/opt/policy.py' adding 'deepspeed/inference/v2/model_implementations/phi/__init__.py' adding 'deepspeed/inference/v2/model_implementations/phi/containers.py' adding 'deepspeed/inference/v2/model_implementations/phi/model.py' adding 'deepspeed/inference/v2/model_implementations/phi/policy.py' adding 'deepspeed/inference/v2/model_implementations/qwen/__init__.py' adding 'deepspeed/inference/v2/model_implementations/qwen/container.py' adding 'deepspeed/inference/v2/model_implementations/qwen/model.py' adding 'deepspeed/inference/v2/model_implementations/qwen/policy.py' adding 'deepspeed/inference/v2/model_implementations/qwen_v2/__init__.py' adding 'deepspeed/inference/v2/model_implementations/qwen_v2/container.py' adding 'deepspeed/inference/v2/model_implementations/qwen_v2/model.py' adding 'deepspeed/inference/v2/model_implementations/qwen_v2/policy.py' adding 'deepspeed/inference/v2/model_implementations/sharding/__init__.py' adding 'deepspeed/inference/v2/model_implementations/sharding/attn.py' adding 'deepspeed/inference/v2/model_implementations/sharding/attn_out.py' adding 'deepspeed/inference/v2/model_implementations/sharding/embedding.py' adding 'deepspeed/inference/v2/model_implementations/sharding/mlp.py' adding 'deepspeed/inference/v2/model_implementations/sharding/qkv.py' adding 'deepspeed/inference/v2/model_implementations/sharding/types.py' adding 'deepspeed/inference/v2/model_implementations/sharding/unembed.py' adding 'deepspeed/inference/v2/model_implementations/sharding/utils.py' adding 'deepspeed/inference/v2/modules/__init__.py' adding 'deepspeed/inference/v2/modules/ds_module.py' adding 'deepspeed/inference/v2/modules/heuristics.py' adding 'deepspeed/inference/v2/modules/module_registry.py' adding 'deepspeed/inference/v2/modules/configs/__init__.py' adding 'deepspeed/inference/v2/modules/configs/attention_configs.py' adding 'deepspeed/inference/v2/modules/configs/embedding_config.py' adding 'deepspeed/inference/v2/modules/configs/linear_config.py' adding 'deepspeed/inference/v2/modules/configs/moe_config.py' adding 'deepspeed/inference/v2/modules/configs/norm_config.py' adding 'deepspeed/inference/v2/modules/configs/unembed_config.py' adding 'deepspeed/inference/v2/modules/implementations/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/attention/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/attention/dense_blocked_attention.py' adding 'deepspeed/inference/v2/modules/implementations/embedding/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/embedding/ragged_embedding.py' adding 'deepspeed/inference/v2/modules/implementations/linear/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/linear/blas_fp_linear.py' adding 'deepspeed/inference/v2/modules/implementations/linear/quantized_linear.py' adding 'deepspeed/inference/v2/modules/implementations/moe/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/moe/cutlass_multi_gemm.py' adding 'deepspeed/inference/v2/modules/implementations/post_norm/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/post_norm/cuda_post_ln.py' adding 'deepspeed/inference/v2/modules/implementations/pre_norm/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_ln.py' adding 'deepspeed/inference/v2/modules/implementations/pre_norm/cuda_pre_rms.py' adding 'deepspeed/inference/v2/modules/implementations/unembed/__init__.py' adding 'deepspeed/inference/v2/modules/implementations/unembed/ragged_unembed.py' adding 'deepspeed/inference/v2/modules/interfaces/__init__.py' adding 'deepspeed/inference/v2/modules/interfaces/attention_base.py' adding 'deepspeed/inference/v2/modules/interfaces/embedding_base.py' adding 'deepspeed/inference/v2/modules/interfaces/linear_base.py' adding 'deepspeed/inference/v2/modules/interfaces/moe_base.py' adding 'deepspeed/inference/v2/modules/interfaces/post_norm_base.py' adding 'deepspeed/inference/v2/modules/interfaces/pre_norm_base.py' adding 'deepspeed/inference/v2/modules/interfaces/unembed_base.py' adding 'deepspeed/inference/v2/ragged/__init__.py' adding 'deepspeed/inference/v2/ragged/blocked_allocator.py' adding 'deepspeed/inference/v2/ragged/kv_cache.py' adding 'deepspeed/inference/v2/ragged/manager_configs.py' adding 'deepspeed/inference/v2/ragged/ragged_manager.py' adding 'deepspeed/inference/v2/ragged/ragged_wrapper.py' adding 'deepspeed/inference/v2/ragged/sequence_descriptor.py' adding 'deepspeed/inference/v2/ragged/csrc/fast_host_buffer.cu' adding 'deepspeed/inference/v2/ragged/csrc/ragged_ops.cpp' adding 'deepspeed/inference/v2/ragged/includes/fast_host_buffer.h' adding 'deepspeed/launcher/__init__.py' adding 'deepspeed/launcher/constants.py' adding 'deepspeed/launcher/launch.py' adding 'deepspeed/launcher/launcher_helper.py' adding 'deepspeed/launcher/multinode_runner.py' adding 'deepspeed/launcher/runner.py' adding 'deepspeed/model_implementations/__init__.py' adding 'deepspeed/model_implementations/diffusers/__init__.py' adding 'deepspeed/model_implementations/diffusers/unet.py' adding 'deepspeed/model_implementations/diffusers/vae.py' adding 'deepspeed/model_implementations/features/__init__.py' adding 'deepspeed/model_implementations/features/cuda_graph.py' adding 'deepspeed/model_implementations/transformers/__init__.py' adding 'deepspeed/model_implementations/transformers/clip_encoder.py' adding 'deepspeed/model_implementations/transformers/ds_base.py' adding 'deepspeed/model_implementations/transformers/ds_bert.py' adding 'deepspeed/model_implementations/transformers/ds_bloom.py' adding 'deepspeed/model_implementations/transformers/ds_gpt.py' adding 'deepspeed/model_implementations/transformers/ds_llama2.py' adding 'deepspeed/model_implementations/transformers/ds_megatron_gpt.py' adding 'deepspeed/model_implementations/transformers/ds_opt.py' adding 'deepspeed/model_implementations/transformers/ds_transformer.py' adding 'deepspeed/module_inject/__init__.py' adding 'deepspeed/module_inject/auto_tp.py' adding 'deepspeed/module_inject/auto_tp_model_utils.py' adding 'deepspeed/module_inject/fusedqkv_utils.py' adding 'deepspeed/module_inject/inject.py' adding 'deepspeed/module_inject/layers.py' adding 'deepspeed/module_inject/load_checkpoint.py' adding 'deepspeed/module_inject/module_quantize.py' adding 'deepspeed/module_inject/policy.py' adding 'deepspeed/module_inject/replace_module.py' adding 'deepspeed/module_inject/replace_policy.py' adding 'deepspeed/module_inject/tp_shard.py' adding 'deepspeed/module_inject/utils.py' adding 'deepspeed/module_inject/containers/__init__.py' adding 'deepspeed/module_inject/containers/base.py' adding 'deepspeed/module_inject/containers/base_moe.py' adding 'deepspeed/module_inject/containers/bert.py' adding 'deepspeed/module_inject/containers/bloom.py' adding 'deepspeed/module_inject/containers/clip.py' adding 'deepspeed/module_inject/containers/distil_bert.py' adding 'deepspeed/module_inject/containers/gpt2.py' adding 'deepspeed/module_inject/containers/gptj.py' adding 'deepspeed/module_inject/containers/gptneo.py' adding 'deepspeed/module_inject/containers/gptneox.py' adding 'deepspeed/module_inject/containers/internlm.py' adding 'deepspeed/module_inject/containers/llama.py' adding 'deepspeed/module_inject/containers/llama2.py' adding 'deepspeed/module_inject/containers/megatron_gpt.py' adding 'deepspeed/module_inject/containers/megatron_gpt_moe.py' adding 'deepspeed/module_inject/containers/opt.py' adding 'deepspeed/module_inject/containers/unet.py' adding 'deepspeed/module_inject/containers/vae.py' adding 'deepspeed/module_inject/containers/features/__init__.py' adding 'deepspeed/module_inject/containers/features/gated_mlp.py' adding 'deepspeed/module_inject/containers/features/hybrid_engine.py' adding 'deepspeed/module_inject/containers/features/hybrid_megatron.py' adding 'deepspeed/module_inject/containers/features/megatron.py' adding 'deepspeed/module_inject/containers/features/meta_tensor.py' adding 'deepspeed/module_inject/containers/features/split_qkv.py' adding 'deepspeed/moe/__init__.py' adding 'deepspeed/moe/experts.py' adding 'deepspeed/moe/layer.py' adding 'deepspeed/moe/mappings.py' adding 'deepspeed/moe/sharded_moe.py' adding 'deepspeed/moe/utils.py' adding 'deepspeed/monitor/__init__.py' adding 'deepspeed/monitor/config.py' adding 'deepspeed/monitor/csv_monitor.py' adding 'deepspeed/monitor/monitor.py' adding 'deepspeed/monitor/tensorboard.py' adding 'deepspeed/monitor/utils.py' adding 'deepspeed/monitor/wandb.py' adding 'deepspeed/nebula/__init__.py' adding 'deepspeed/nebula/config.py' adding 'deepspeed/nebula/constants.py' adding 'deepspeed/ops/__init__.py' adding 'deepspeed/ops/adagrad/__init__.py' adding 'deepspeed/ops/adagrad/cpu_adagrad.py' adding 'deepspeed/ops/adam/__init__.py' adding 'deepspeed/ops/adam/cpu_adam.py' adding 'deepspeed/ops/adam/fused_adam.py' adding 'deepspeed/ops/adam/multi_tensor_apply.py' adding 'deepspeed/ops/aio/__init__.py' adding 'deepspeed/ops/csrc/adagrad/cpu_adagrad.cpp' adding 'deepspeed/ops/csrc/adam/cpu_adam.cpp' adding 'deepspeed/ops/csrc/adam/cpu_adam_impl.cpp' adding 'deepspeed/ops/csrc/adam/fused_adam_frontend.cpp' adding 'deepspeed/ops/csrc/adam/multi_tensor_adam.cu' adding 'deepspeed/ops/csrc/adam/multi_tensor_apply.cuh' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_common.cpp' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_common.h' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_types.cpp' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_types.h' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.cpp' adding 'deepspeed/ops/csrc/aio/common/deepspeed_aio_utils.h' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.cpp' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_aio_thread.h' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.cpp' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_pin_tensor.h' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.cpp' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio.h' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.cpp' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_aio_handle.h' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.cpp' adding 'deepspeed/ops/csrc/aio/py_lib/deepspeed_py_copy.h' adding 'deepspeed/ops/csrc/aio/py_lib/py_ds_aio.cpp' adding 'deepspeed/ops/csrc/aio/py_test/single_process_config.json' adding 'deepspeed/ops/csrc/common/custom_cuda_kernel.cu' adding 'deepspeed/ops/csrc/cpu/adam/fused_adam.cpp' adding 'deepspeed/ops/csrc/cpu/comm/ccl.cpp' adding 'deepspeed/ops/csrc/cpu/lion/fused_lion.cpp' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention.cpp' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_back.cu' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/attention_cu.cu' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm_kernel_utils.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_backward.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/kernel_forward.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_grad_bias.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_pipelined.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_rescale_output.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/epilogue/epilogue_thread_apply_logsumexp.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_base.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_multistage.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/custom_mma_pipelined.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/find_default_mma.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_accum_lambda_iterator.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/gemm/mma_from_smem.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/epilogue_predicated_tile_iterator.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/make_residual_last.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_access_iterator_residual_last.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_atomic.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/predicated_tile_iterator_residual_last.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/transpose_warp_iterator.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/iterators/warp_iterator_from_smem.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/bias_broadcast.h' adding 'deepspeed/ops/csrc/deepspeed4science/evoformer_attn/transform/tile_smem_loader.h' adding 'deepspeed/ops/csrc/includes/StopWatch.h' adding 'deepspeed/ops/csrc/includes/Timer.h' adding 'deepspeed/ops/csrc/includes/activation_type.h' adding 'deepspeed/ops/csrc/includes/compat.h' adding 'deepspeed/ops/csrc/includes/context.h' adding 'deepspeed/ops/csrc/includes/conversion_utils.h' adding 'deepspeed/ops/csrc/includes/cpu_adagrad.h' adding 'deepspeed/ops/csrc/includes/cpu_adam.h' adding 'deepspeed/ops/csrc/includes/cpu_lion.h' adding 'deepspeed/ops/csrc/includes/cublas_wrappers.h' adding 'deepspeed/ops/csrc/includes/custom_cuda_layers.h' adding 'deepspeed/ops/csrc/includes/dequantization_utils.h' adding 'deepspeed/ops/csrc/includes/dropout.h' adding 'deepspeed/ops/csrc/includes/ds_kernel_utils.h' adding 'deepspeed/ops/csrc/includes/ds_transformer_cuda.h' adding 'deepspeed/ops/csrc/includes/feed_forward.h' adding 'deepspeed/ops/csrc/includes/gelu.h' adding 'deepspeed/ops/csrc/includes/gemm_test.h' adding 'deepspeed/ops/csrc/includes/general_kernels.h' adding 'deepspeed/ops/csrc/includes/memory_access_utils.h' adding 'deepspeed/ops/csrc/includes/normalize_layer.h' adding 'deepspeed/ops/csrc/includes/quantization.h' adding 'deepspeed/ops/csrc/includes/quantization_utils.h' adding 'deepspeed/ops/csrc/includes/quantizer.h' adding 'deepspeed/ops/csrc/includes/reduction_utils.h' adding 'deepspeed/ops/csrc/includes/simd.h' adding 'deepspeed/ops/csrc/includes/softmax.h' adding 'deepspeed/ops/csrc/includes/strided_batch_gemm.h' adding 'deepspeed/ops/csrc/includes/type_shim.h' adding 'deepspeed/ops/csrc/lamb/fused_lamb_cuda.cpp' adding 'deepspeed/ops/csrc/lamb/fused_lamb_cuda_kernel.cu' adding 'deepspeed/ops/csrc/lion/cpu_lion.cpp' adding 'deepspeed/ops/csrc/lion/cpu_lion_impl.cpp' adding 'deepspeed/ops/csrc/lion/fused_lion_frontend.cpp' adding 'deepspeed/ops/csrc/lion/multi_tensor_apply.cuh' adding 'deepspeed/ops/csrc/lion/multi_tensor_lion.cu' adding 'deepspeed/ops/csrc/quantization/dequantize.cu' adding 'deepspeed/ops/csrc/quantization/fake_quantizer.cu' adding 'deepspeed/ops/csrc/quantization/pt_binding.cpp' adding 'deepspeed/ops/csrc/quantization/quant_reduce.cu' adding 'deepspeed/ops/csrc/quantization/quantize.cu' adding 'deepspeed/ops/csrc/quantization/quantize_intX.cu' adding 'deepspeed/ops/csrc/quantization/swizzled_quantize.cu' adding 'deepspeed/ops/csrc/random_ltd/gather_scatter.cu' adding 'deepspeed/ops/csrc/random_ltd/pt_binding.cpp' adding 'deepspeed/ops/csrc/random_ltd/slice_attn_masks.cu' adding 'deepspeed/ops/csrc/random_ltd/token_sort.cu' adding 'deepspeed/ops/csrc/sparse_attention/utils.cpp' adding 'deepspeed/ops/csrc/spatial/csrc/opt_bias_add.cu' adding 'deepspeed/ops/csrc/spatial/csrc/pt_binding.cpp' adding 'deepspeed/ops/csrc/spatial/includes/spatial_cuda_layers.h' adding 'deepspeed/ops/csrc/transformer/cublas_wrappers.cu' adding 'deepspeed/ops/csrc/transformer/dropout_kernels.cu' adding 'deepspeed/ops/csrc/transformer/ds_transformer_cuda.cpp' adding 'deepspeed/ops/csrc/transformer/gelu_kernels.cu' adding 'deepspeed/ops/csrc/transformer/general_kernels.cu' adding 'deepspeed/ops/csrc/transformer/normalize_kernels.cu' adding 'deepspeed/ops/csrc/transformer/softmax_kernels.cu' adding 'deepspeed/ops/csrc/transformer/transform_kernels.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/apply_rotary_pos_emb.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/dequantize.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/gelu.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/layer_norm.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/pointwise_ops.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/pt_binding.cpp' adding 'deepspeed/ops/csrc/transformer/inference/csrc/relu.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/rms_norm.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/softmax.cu' adding 'deepspeed/ops/csrc/transformer/inference/csrc/transform.cu' adding 'deepspeed/ops/csrc/transformer/inference/includes/inference_context.h' adding 'deepspeed/ops/csrc/transformer/inference/includes/inference_cublas_wrappers.h' adding 'deepspeed/ops/csrc/transformer/inference/includes/inference_cuda_layers.h' adding 'deepspeed/ops/csrc/utils/flatten_unflatten.cpp' adding 'deepspeed/ops/csrc/xpu/adagrad/cpu_adagrad.cpp' adding 'deepspeed/ops/csrc/xpu/adam/cpu_adam.cpp' adding 'deepspeed/ops/csrc/xpu/adam/cpu_adam_impl.cpp' adding 'deepspeed/ops/csrc/xpu/adam/fused_adam_frontend.cpp' adding 'deepspeed/ops/csrc/xpu/adam/multi_tensor_adam.dp.cpp' adding 'deepspeed/ops/csrc/xpu/common/custom_cuda_kernel.dp.cpp' adding 'deepspeed/ops/csrc/xpu/includes/compat.h' adding 'deepspeed/ops/csrc/xpu/includes/cpu_adagrad.h' adding 'deepspeed/ops/csrc/xpu/includes/cpu_adam.h' adding 'deepspeed/ops/csrc/xpu/includes/simd.h' adding 'deepspeed/ops/csrc/xpu/includes/type_shim.h' adding 'deepspeed/ops/deepspeed4science/__init__.py' adding 'deepspeed/ops/deepspeed4science/evoformer_attn.py' adding 'deepspeed/ops/lamb/__init__.py' adding 'deepspeed/ops/lamb/fused_lamb.py' adding 'deepspeed/ops/lion/__init__.py' adding 'deepspeed/ops/lion/cpu_lion.py' adding 'deepspeed/ops/lion/fused_lion.py' adding 'deepspeed/ops/lion/multi_tensor_apply.py' adding 'deepspeed/ops/op_builder/__init__.py' adding 'deepspeed/ops/op_builder/all_ops.py' adding 'deepspeed/ops/op_builder/async_io.py' adding 'deepspeed/ops/op_builder/builder.py' adding 'deepspeed/ops/op_builder/cpu_adagrad.py' adding 'deepspeed/ops/op_builder/cpu_adam.py' adding 'deepspeed/ops/op_builder/cpu_lion.py' adding 'deepspeed/ops/op_builder/evoformer_attn.py' adding 'deepspeed/ops/op_builder/fused_adam.py' adding 'deepspeed/ops/op_builder/fused_lamb.py' adding 'deepspeed/ops/op_builder/fused_lion.py' adding 'deepspeed/ops/op_builder/inference_core_ops.py' adding 'deepspeed/ops/op_builder/inference_cutlass_builder.py' adding 'deepspeed/ops/op_builder/quantizer.py' adding 'deepspeed/ops/op_builder/ragged_ops.py' adding 'deepspeed/ops/op_builder/ragged_utils.py' adding 'deepspeed/ops/op_builder/random_ltd.py' adding 'deepspeed/ops/op_builder/sparse_attn.py' adding 'deepspeed/ops/op_builder/spatial_inference.py' adding 'deepspeed/ops/op_builder/stochastic_transformer.py' adding 'deepspeed/ops/op_builder/transformer.py' adding 'deepspeed/ops/op_builder/transformer_inference.py' adding 'deepspeed/ops/op_builder/cpu/__init__.py' adding 'deepspeed/ops/op_builder/cpu/builder.py' adding 'deepspeed/ops/op_builder/cpu/comm.py' adding 'deepspeed/ops/op_builder/cpu/cpu_adam.py' adding 'deepspeed/ops/op_builder/cpu/fused_adam.py' adding 'deepspeed/ops/op_builder/cpu/no_impl.py' adding 'deepspeed/ops/op_builder/hpu/__init__.py' adding 'deepspeed/ops/op_builder/hpu/builder.py' adding 'deepspeed/ops/op_builder/hpu/cpu_adam.py' adding 'deepspeed/ops/op_builder/hpu/fused_adam.py' adding 'deepspeed/ops/op_builder/hpu/no_impl.py' adding 'deepspeed/ops/op_builder/npu/__init__.py' adding 'deepspeed/ops/op_builder/npu/async_io.py' adding 'deepspeed/ops/op_builder/npu/builder.py' adding 'deepspeed/ops/op_builder/npu/cpu_adagrad.py' adding 'deepspeed/ops/op_builder/npu/cpu_adam.py' adding 'deepspeed/ops/op_builder/npu/cpu_lion.py' adding 'deepspeed/ops/op_builder/npu/fused_adam.py' adding 'deepspeed/ops/op_builder/npu/inference.py' adding 'deepspeed/ops/op_builder/npu/no_impl.py' adding 'deepspeed/ops/op_builder/xpu/__init__.py' adding 'deepspeed/ops/op_builder/xpu/async_io.py' adding 'deepspeed/ops/op_builder/xpu/builder.py' adding 'deepspeed/ops/op_builder/xpu/cpu_adagrad.py' adding 'deepspeed/ops/op_builder/xpu/cpu_adam.py' adding 'deepspeed/ops/op_builder/xpu/fused_adam.py' adding 'deepspeed/ops/quantizer/__init__.py' adding 'deepspeed/ops/quantizer/quantizer.py' adding 'deepspeed/ops/random_ltd/__init__.py' adding 'deepspeed/ops/random_ltd/dropping_utils.py' adding 'deepspeed/ops/sparse_attention/__init__.py' adding 'deepspeed/ops/sparse_attention/bert_sparse_self_attention.py' adding 'deepspeed/ops/sparse_attention/matmul.py' adding 'deepspeed/ops/sparse_attention/softmax.py' adding 'deepspeed/ops/sparse_attention/sparse_attention_utils.py' adding 'deepspeed/ops/sparse_attention/sparse_self_attention.py' adding 'deepspeed/ops/sparse_attention/sparsity_config.py' adding 'deepspeed/ops/sparse_attention/trsrc/__init__.py' adding 'deepspeed/ops/sparse_attention/trsrc/matmul.tr' adding 'deepspeed/ops/sparse_attention/trsrc/softmax_bwd.tr' adding 'deepspeed/ops/sparse_attention/trsrc/softmax_fwd.tr' adding 'deepspeed/ops/transformer/__init__.py' adding 'deepspeed/ops/transformer/transformer.py' adding 'deepspeed/ops/transformer/inference/__init__.py' adding 'deepspeed/ops/transformer/inference/bias_add.py' adding 'deepspeed/ops/transformer/inference/config.py' adding 'deepspeed/ops/transformer/inference/diffusers_2d_transformer.py' adding 'deepspeed/ops/transformer/inference/diffusers_attention.py' adding 'deepspeed/ops/transformer/inference/diffusers_transformer_block.py' adding 'deepspeed/ops/transformer/inference/ds_attention.py' adding 'deepspeed/ops/transformer/inference/ds_mlp.py' adding 'deepspeed/ops/transformer/inference/moe_inference.py' adding 'deepspeed/ops/transformer/inference/triton_ops.py' adding 'deepspeed/ops/transformer/inference/op_binding/__init__.py' adding 'deepspeed/ops/transformer/inference/op_binding/base.py' adding 'deepspeed/ops/transformer/inference/op_binding/gelu_gemm.py' adding 'deepspeed/ops/transformer/inference/op_binding/linear.py' adding 'deepspeed/ops/transformer/inference/op_binding/mlp_gemm.py' adding 'deepspeed/ops/transformer/inference/op_binding/qkv_gemm.py' adding 'deepspeed/ops/transformer/inference/op_binding/residual_add.py' adding 'deepspeed/ops/transformer/inference/op_binding/softmax.py' adding 'deepspeed/ops/transformer/inference/op_binding/softmax_context.py' adding 'deepspeed/ops/transformer/inference/op_binding/vector_matmul.py' adding 'deepspeed/ops/transformer/inference/triton/__init__.py' adding 'deepspeed/ops/transformer/inference/triton/attention.py' adding 'deepspeed/ops/transformer/inference/triton/gelu.py' adding 'deepspeed/ops/transformer/inference/triton/layer_norm.py' adding 'deepspeed/ops/transformer/inference/triton/matmul_ext.py' adding 'deepspeed/ops/transformer/inference/triton/mlp.py' adding 'deepspeed/ops/transformer/inference/triton/ops.py' adding 'deepspeed/ops/transformer/inference/triton/residual_add.py' adding 'deepspeed/ops/transformer/inference/triton/softmax.py' adding 'deepspeed/ops/transformer/inference/triton/triton_matmul_kernel.py' adding 'deepspeed/pipe/__init__.py' adding 'deepspeed/profiling/__init__.py' adding 'deepspeed/profiling/config.py' adding 'deepspeed/profiling/constants.py' adding 'deepspeed/profiling/flops_profiler/__init__.py' adding 'deepspeed/profiling/flops_profiler/profiler.py' adding 'deepspeed/runtime/__init__.py' adding 'deepspeed/runtime/bf16_optimizer.py' adding 'deepspeed/runtime/compiler.py' adding 'deepspeed/runtime/config.py' adding 'deepspeed/runtime/config_utils.py' adding 'deepspeed/runtime/constants.py' adding 'deepspeed/runtime/dataloader.py' adding 'deepspeed/runtime/eigenvalue.py' adding 'deepspeed/runtime/engine.py' adding 'deepspeed/runtime/hybrid_engine.py' adding 'deepspeed/runtime/lr_schedules.py' adding 'deepspeed/runtime/progressive_layer_drop.py' adding 'deepspeed/runtime/quantize.py' adding 'deepspeed/runtime/sparse_tensor.py' adding 'deepspeed/runtime/state_dict_factory.py' adding 'deepspeed/runtime/utils.py' adding 'deepspeed/runtime/weight_quantizer.py' adding 'deepspeed/runtime/activation_checkpointing/__init__.py' adding 'deepspeed/runtime/activation_checkpointing/checkpointing.py' adding 'deepspeed/runtime/activation_checkpointing/config.py' adding 'deepspeed/runtime/checkpoint_engine/__init__.py' adding 'deepspeed/runtime/checkpoint_engine/checkpoint_engine.py' adding 'deepspeed/runtime/checkpoint_engine/nebula_checkpoint_engine.py' adding 'deepspeed/runtime/checkpoint_engine/torch_checkpoint_engine.py' adding 'deepspeed/runtime/comm/__init__.py' adding 'deepspeed/runtime/comm/coalesced_collectives.py' adding 'deepspeed/runtime/comm/hccl.py' adding 'deepspeed/runtime/comm/mpi.py' adding 'deepspeed/runtime/comm/nccl.py' adding 'deepspeed/runtime/compression/__init__.py' adding 'deepspeed/runtime/compression/cupy.py' adding 'deepspeed/runtime/data_pipeline/__init__.py' adding 'deepspeed/runtime/data_pipeline/config.py' adding 'deepspeed/runtime/data_pipeline/constants.py' adding 'deepspeed/runtime/data_pipeline/curriculum_scheduler.py' adding 'deepspeed/runtime/data_pipeline/data_routing/__init__.py' adding 'deepspeed/runtime/data_pipeline/data_routing/basic_layer.py' adding 'deepspeed/runtime/data_pipeline/data_routing/helper.py' adding 'deepspeed/runtime/data_pipeline/data_routing/scheduler.py' adding 'deepspeed/runtime/data_pipeline/data_routing/utils.py' adding 'deepspeed/runtime/data_pipeline/data_sampling/__init__.py' adding 'deepspeed/runtime/data_pipeline/data_sampling/data_analyzer.py' adding 'deepspeed/runtime/data_pipeline/data_sampling/data_sampler.py' adding 'deepspeed/runtime/data_pipeline/data_sampling/indexed_dataset.py' adding 'deepspeed/runtime/data_pipeline/data_sampling/utils.py' adding 'deepspeed/runtime/fp16/__init__.py' adding 'deepspeed/runtime/fp16/fused_optimizer.py' adding 'deepspeed/runtime/fp16/loss_scaler.py' adding 'deepspeed/runtime/fp16/unfused_optimizer.py' adding 'deepspeed/runtime/fp16/onebit/__init__.py' adding 'deepspeed/runtime/fp16/onebit/adam.py' adding 'deepspeed/runtime/fp16/onebit/lamb.py' adding 'deepspeed/runtime/fp16/onebit/zoadam.py' adding 'deepspeed/runtime/pipe/__init__.py' adding 'deepspeed/runtime/pipe/engine.py' adding 'deepspeed/runtime/pipe/module.py' adding 'deepspeed/runtime/pipe/p2p.py' adding 'deepspeed/runtime/pipe/schedule.py' adding 'deepspeed/runtime/pipe/topology.py' adding 'deepspeed/runtime/swap_tensor/__init__.py' adding 'deepspeed/runtime/swap_tensor/aio_config.py' adding 'deepspeed/runtime/swap_tensor/async_swapper.py' adding 'deepspeed/runtime/swap_tensor/constants.py' adding 'deepspeed/runtime/swap_tensor/optimizer_utils.py' adding 'deepspeed/runtime/swap_tensor/partitioned_optimizer_swapper.py' adding 'deepspeed/runtime/swap_tensor/partitioned_param_swapper.py' adding 'deepspeed/runtime/swap_tensor/pipelined_optimizer_swapper.py' adding 'deepspeed/runtime/swap_tensor/utils.py' adding 'deepspeed/runtime/zero/__init__.py' adding 'deepspeed/runtime/zero/config.py' adding 'deepspeed/runtime/zero/contiguous_memory_allocator.py' adding 'deepspeed/runtime/zero/linear.py' adding 'deepspeed/runtime/zero/mics.py' adding 'deepspeed/runtime/zero/mics_utils.py' adding 'deepspeed/runtime/zero/offload_config.py' adding 'deepspeed/runtime/zero/parameter_offload.py' adding 'deepspeed/runtime/zero/partition_parameters.py' adding 'deepspeed/runtime/zero/partitioned_param_coordinator.py' adding 'deepspeed/runtime/zero/partitioned_param_profiler.py' adding 'deepspeed/runtime/zero/stage3.py' adding 'deepspeed/runtime/zero/stage_1_and_2.py' adding 'deepspeed/runtime/zero/test.py' adding 'deepspeed/runtime/zero/tiling.py' adding 'deepspeed/runtime/zero/utils.py' adding 'deepspeed/sequence/__init__.py' adding 'deepspeed/sequence/layer.py' adding 'deepspeed/utils/__init__.py' adding 'deepspeed/utils/comms_logging.py' adding 'deepspeed/utils/debug.py' adding 'deepspeed/utils/exceptions.py' adding 'deepspeed/utils/groups.py' adding 'deepspeed/utils/init_on_device.py' adding 'deepspeed/utils/logging.py' adding 'deepspeed/utils/mixed_precision_linkage.py' adding 'deepspeed/utils/numa.py' adding 'deepspeed/utils/nvtx.py' adding 'deepspeed/utils/tensor_fragment.py' adding 'deepspeed/utils/timer.py' adding 'deepspeed/utils/types.py' adding 'deepspeed/utils/z3_leaf_module.py' adding 'deepspeed/utils/zero_to_fp32.py' adding 'deepspeed-0.14.0+unknown.data/scripts/deepspeed' adding 'deepspeed-0.14.0+unknown.data/scripts/deepspeed.pt' adding 'deepspeed-0.14.0+unknown.data/scripts/ds' adding 'deepspeed-0.14.0+unknown.data/scripts/ds_bench' adding 'deepspeed-0.14.0+unknown.data/scripts/ds_elastic' adding 'deepspeed-0.14.0+unknown.data/scripts/ds_report' adding 'deepspeed-0.14.0+unknown.data/scripts/ds_ssh' adding 'deepspeed-0.14.0+unknown.data/scripts/dsr' adding 'deepspeed-0.14.0+unknown.dist-info/LICENSE' adding 'deepspeed-0.14.0+unknown.dist-info/METADATA' adding 'deepspeed-0.14.0+unknown.dist-info/WHEEL' adding 'deepspeed-0.14.0+unknown.dist-info/entry_points.txt' adding 'deepspeed-0.14.0+unknown.dist-info/top_level.txt' adding 'deepspeed-0.14.0+unknown.dist-info/RECORD' removing build/bdist.linux-x86_64/wheel deepspeed build time = 1.0049188137054443 secs Building wheel for deepspeed (pyproject.toml): finished with status 'done' Created wheel for deepspeed: filename=deepspeed-0.14.0+unknown-py3-none-any.whl size=1404777 sha256=0a5cf8c6fb7415d00fa83d5566bb70103d3f2faf1ee28da433b1c52d7ec1de47 Stored in directory: /builddir/.cache/pip/wheels/ef/5c/e9/8715b6a00ce88343e7ff1d08cd0bdf91931827b4ace58cac81 Successfully built deepspeed + sleep 1 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.Yw5sFc + umask 022 + cd /builddir/build/BUILD + '[' /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 '!=' / ']' + rm -rf /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 ++ dirname /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 + mkdir -p /builddir/build/BUILDROOT + mkdir /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 + cd DeepSpeed-0.14.0 ++ ls ./build/deepspeed-0.14.0+unknown-py3-none-any.whl ++ xargs basename --multiple ++ sed -E 's/([^-]+)-([^-]+)-.+\.whl/\1==\2/' + specifier=deepspeed==0.14.0+unknown + CFLAGS='-O2 -g -pipe -Wall -Werror=format-security -Wp,-D_FORTIFY_SOURCE=2 -Wp,-D_GLIBCXX_ASSERTIONS -fstack-protector-strong -grecord-gcc-switches -specs=/usr/lib/rpm/generic-hardened-cc1 -m64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection ' + LDFLAGS='-Wl,-z,relro -Wl,-z,now -specs=/usr/lib/rpm/generic-hardened-ld' + /usr/bin/python3 -mpip install --verbose --progress-bar off --disable-pip-version-check --root /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 --no-compile --ignore-installed --no-deps --no-index --find-links ./build deepspeed==0.14.0+unknown Using pip 23.1.2 from /usr/lib/python3.11/site-packages/pip (python 3.11) Looking in links: ./build Processing ./build/deepspeed-0.14.0+unknown-py3-none-any.whl Installing collected packages: deepspeed Successfully installed deepspeed-0.14.0+unknown + /usr/bin/find-debuginfo -j4 --strict-build-id -i --build-id-seed 0.14.0-1 --unique-debug-suffix -0.14.0-1.x86_64 --unique-debug-src-base deepspeed-0.14.0-1.x86_64 -S debugsourcefiles.list /builddir/build/BUILD/DeepSpeed-0.14.0 find: 'debug': No such file or directory + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/brp-python-bytecompile /usr/bin/python 1 1 Bytecompiling .py files below /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/lib/python3.11 using /usr/bin/python3.11 + /usr/lib/rpm/brp-python-hardlink Processing files: python3-deepspeed-0.14.0-1.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.XEAiR9 + umask 022 + cd /builddir/build/BUILD + cd DeepSpeed-0.14.0 + DOCDIR=/builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/doc/python3-deepspeed + export LC_ALL=C + LC_ALL=C + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/doc/python3-deepspeed + cp -pr CODE_OF_CONDUCT.md CONTRIBUTING.md README.md SECURITY.md /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/doc/python3-deepspeed + RPM_EC=0 ++ jobs -p + exit 0 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.1yFKbY + umask 022 + cd /builddir/build/BUILD + cd DeepSpeed-0.14.0 + LICENSEDIR=/builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/licenses/python3-deepspeed + export LC_ALL=C + LC_ALL=C + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/licenses/python3-deepspeed + cp -pr LICENSE /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64/usr/share/licenses/python3-deepspeed + RPM_EC=0 ++ jobs -p + exit 0 /usr/lib/rpm/pythondistdeps.py:105: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html from pkg_resources import Distribution, FileMetadata, PathMetadata, Requirement /usr/lib/rpm/pythondistdeps.py:105: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html from pkg_resources import Distribution, FileMetadata, PathMetadata, Requirement Provides: python3-deepspeed = 0.14.0-1 python3-deepspeed(x86-64) = 0.14.0-1 python3.11dist(deepspeed) = 0.14.0+unknown python3dist(deepspeed) = 0.14.0+unknown Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PartialHardlinkSets) <= 4.0.4-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: python(abi) = 3.11 python3.11dist(hjson) python3.11dist(ninja) python3.11dist(numpy) python3.11dist(packaging) >= 20 python3.11dist(psutil) python3.11dist(py-cpuinfo) python3.11dist(pydantic) python3.11dist(pynvml) python3.11dist(torch) python3.11dist(tqdm) Processing files: deepspeed-debuginfo-0.14.0-1.x86_64 Processing files: deepspeed-debugsource-0.14.0-1.x86_64 warning: Empty %files file /builddir/build/BUILD/DeepSpeed-0.14.0/debugfiles.list warning: Empty %files file /builddir/build/BUILD/DeepSpeed-0.14.0/debugsourcefiles.list Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 warning: Installed (but unpackaged) file(s) found: /usr/bin/deepspeed /usr/bin/deepspeed.pt /usr/bin/ds /usr/bin/ds_bench /usr/bin/ds_elastic /usr/bin/ds_report /usr/bin/ds_ssh /usr/bin/dsr Wrote: /builddir/build/RPMS/deepspeed-debugsource-0.14.0-1.x86_64.rpm Wrote: /builddir/build/RPMS/deepspeed-debuginfo-0.14.0-1.x86_64.rpm Wrote: /builddir/build/RPMS/python3-deepspeed-0.14.0-1.x86_64.rpm Executing(%clean): /bin/sh -e /var/tmp/rpm-tmp.IFFlQk + umask 022 + cd /builddir/build/BUILD + cd DeepSpeed-0.14.0 + /usr/bin/rm -rf /builddir/build/BUILDROOT/deepspeed-0.14.0-1.x86_64 + RPM_EC=0 ++ jobs -p + exit 0 Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.83DJ2R + umask 022 + cd /builddir/build/BUILD + rm -rf DeepSpeed-0.14.0 DeepSpeed-0.14.0.gemspec + RPM_EC=0 ++ jobs -p + exit 0 RPM build warnings: Empty %files file /builddir/build/BUILD/DeepSpeed-0.14.0/debugfiles.list Empty %files file /builddir/build/BUILD/DeepSpeed-0.14.0/debugsourcefiles.list Installed (but unpackaged) file(s) found: /usr/bin/deepspeed /usr/bin/deepspeed.pt /usr/bin/ds /usr/bin/ds_bench /usr/bin/ds_elastic /usr/bin/ds_report /usr/bin/ds_ssh /usr/bin/dsr Child return code was: 0