mht c0e5aa7d0a Fix ResNet BatchNorm parameter loading and enhance BN1 debugging. 2 months ago

Corrected loading of BatchNorm running_mean, running_var, and num_batches_tracked parameters in C++ ResNet BottleneckImpl and ResNetImpl to use direct member assignment instead of named_buffers(). This resolved discrepancies with Python's BatchNorm behavior. Added detailed intermediate output saving for bn1 in both C++ ResNet and Python comparison script to facilitate debugging. Ensured Python comparison script correctly loads and compares these new ResNet intermediate tensors. This series of changes led to numerical equivalence for ResNet conv1, bn1, and subsequently layer1-4 outputs between Python and C++.
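To illustrate why the running statistics named in this commit matter, here is a minimal standard-library sketch of inference-mode batch normalization for one channel. It is not code from this repository: the function names `batchnorm_eval` and `max_abs_diff`, the sample activations, and the "loaded" statistics are all hypothetical. The only assumptions taken from PyTorch are the usual defaults for a freshly constructed BatchNorm layer (running mean 0, running variance 1, eps 1e-5).

```python
import math


def batchnorm_eval(x, running_mean, running_var, weight=1.0, bias=0.0, eps=1e-5):
    """Inference-mode batch norm for one channel, using stored statistics."""
    scale = weight / math.sqrt(running_var + eps)
    return [(v - running_mean) * scale + bias for v in x]


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two equal-length lists."""
    return max(abs(p - q) for p, q in zip(a, b))


# Hypothetical activations for a single channel.
x = [0.5, 1.5, 2.5]

# Statistics left at their initial values, as if loading had silently failed.
y_default = batchnorm_eval(x, running_mean=0.0, running_var=1.0)

# Hypothetical statistics standing in for values read from a checkpoint.
y_loaded = batchnorm_eval(x, running_mean=1.5, running_var=4.0)

# A large difference here is the kind of mismatch a per-layer comparison exposes.
print(max_abs_diff(y_default, y_loaded))
```

Comparing saved intermediate tensors layer by layer, as the commit describes for bn1, narrows a mismatch like this down to the first layer whose statistics or weights differ.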
CMakeCUDACompilerId.cpp1.ii Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.cpp4.ii Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.cudafe1.c Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.cudafe1.cpp Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.cudafe1.gpu Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.cudafe1.stub.c Intial commit 2 months ago
CMakeCUDACompilerId.fatbin Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.fatbin.c Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.module_id Intial commit 2 months ago
CMakeCUDACompilerId.o Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.ptx Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
CMakeCUDACompilerId.sm_52.cubin Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
a_dlink.fatbin Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
a_dlink.fatbin.c Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
a_dlink.o Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago
a_dlink.reg.c Intial commit 2 months ago
a_dlink.sm_52.cubin Feat: Achieved perfect Conv1, good BN1/ReLU1/MaxPool similarity. README updated. 2 months ago