mht c0e5aa7d0a Fix ResNet BatchNorm parameter loading and enhance BN1 debugging. Corrected loading of BatchNorm running_mean, running_var, and num_batches_tracked parameters in C++ ResNet BottleneckImpl and ResNetImpl to use direct member assignment instead of named_buffers(). This resolved discrepancies with Python's BatchNorm behavior. Added detailed intermediate output saving for bn1 in both C++ ResNet and Python comparison script to facilitate debugging. Ensured Python comparison script correctly loads and compares these new ResNet intermediate tensors. This series of changes led to numerical equivalence for ResNet conv1, bn1, and subsequently layer1-4 outputs between Python and C++. 2 months ago
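The commit message above attributes the Python/C++ mismatch to BatchNorm running statistics that were not actually loaded. As a rough illustration only (this is not the repository's code, and `BatchNormStats`, `default_stats`, and `batch_norm_eval` are hypothetical names), the sketch below applies the standard inference-mode BatchNorm formula with plain C++ and no LibTorch. It shows why a layer left at its default statistics (mean 0, variance 1) produces different outputs from one whose `running_mean` and `running_var` were loaded from a checkpoint.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical container for per-channel BatchNorm parameters and buffers.
struct BatchNormStats {
    std::vector<float> weight;        // gamma, learned scale
    std::vector<float> bias;          // beta, learned shift
    std::vector<float> running_mean;  // buffer accumulated during training
    std::vector<float> running_var;   // buffer accumulated during training
    float eps = 1e-5f;                // PyTorch's default epsilon
};

// Statistics as a freshly constructed layer has them before any loading:
// weight 1, bias 0, running_mean 0, running_var 1.
inline BatchNormStats default_stats(std::size_t channels) {
    BatchNormStats bn;
    bn.weight.assign(channels, 1.0f);
    bn.bias.assign(channels, 0.0f);
    bn.running_mean.assign(channels, 0.0f);
    bn.running_var.assign(channels, 1.0f);
    return bn;
}

// Inference-mode BatchNorm for a single value x in channel c:
// y = (x - running_mean) / sqrt(running_var + eps) * weight + bias
inline float batch_norm_eval(const BatchNormStats& bn, std::size_t c, float x) {
    const float inv_std = 1.0f / std::sqrt(bn.running_var[c] + bn.eps);
    return (x - bn.running_mean[c]) * inv_std * bn.weight[c] + bn.bias[c];
}
```

With an assumed checkpoint value of `running_mean = 2` and `running_var = 4`, the input `4.0` normalizes to roughly `1.0`, whereas the default statistics leave it at roughly `4.0`. A gap of that kind at `bn1` would propagate through every later layer, which is consistent with the commit's report that fixing the buffer loading restored agreement for `layer1` through `layer4`.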
CompilerIdCUDA Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CompilerIdCXX Initial commit 2 months ago
CMakeCUDACompiler.cmake Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCXXCompiler.cmake Initial commit 2 months ago
CMakeDetermineCompilerABI_CUDA.bin Fix NameError in compare_models; Add ResNet BN1 debug prints; Prepare to address other modules 2 months ago
CMakeDetermineCompilerABI_CXX.bin Initial commit 2 months ago
CMakeSystem.cmake Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago