sbuild (Debian sbuild) 0.86.3~bpo12+1 (03 November 2024) on debusine-worker-arm64-demeter-08.freexian.com
+==============================================================================+
| gemmlowp 0.0~git20211220.e844ffd-1 (arm64) Sat, 16 Nov 2024 07:09:16 +0000 |
+==============================================================================+
Package: gemmlowp
Version: 0.0~git20211220.e844ffd-1
Source Version: 0.0~git20211220.e844ffd-1
Distribution: sid
Machine Architecture: arm64
Host Architecture: arm64
Build Architecture: arm64
Build Type: binary
I: No tarballs found in /var/lib/debusine/worker/.cache/sbuild
Unpacking /var/lib/debusine/worker/system-images/957781/system.tar.xz to /tmp/tmp.sbuild.46l2HxOMhW...
I: NOTICE: Log filtering will replace 'sbuild-unshare-dummy-location' with '<<CHROOT>>'
+------------------------------------------------------------------------------+
| Chroot Setup Commands |
+------------------------------------------------------------------------------+
rm -f /etc/resolv.conf
----------------------
I: Finished running 'rm -f /etc/resolv.conf'.
Finished processing commands.
--------------------------------------------------------------------------------
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/dpkg-dbgsym_1.22.12~1.gbp82cafd_arm64.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/dpkg_1.22.12~1.gbp82cafd_arm64.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/dselect-dbgsym_1.22.12~1.gbp82cafd_arm64.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/dselect_1.22.12~1.gbp82cafd_arm64.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/libdpkg-dev_1.22.12~1.gbp82cafd_arm64.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/dpkg-dev_1.22.12~1.gbp82cafd_all.deb to /<<CHROOT>>...
Copying /tmp/debusine-fetch-exec-upload-lnyc4poe/libdpkg-perl_1.22.12~1.gbp82cafd_all.deb to /<<CHROOT>>...
I: NOTICE: Log filtering will replace 'build/gemmlowp-xpTCUd/resolver-NGv2NQ' with '<<RESOLVERDIR>>'
+------------------------------------------------------------------------------+
| Update chroot |
+------------------------------------------------------------------------------+
Get:1 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ InRelease
Ign:1 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ InRelease
Get:2 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:3 http://deb.debian.org/debian sid InRelease [202 kB]
Get:2 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:4 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ Release.gpg
Ign:4 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ Release.gpg
Get:5 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ Packages [9246 B]
Get:6 http://deb.debian.org/debian sid/main arm64 Packages [9952 kB]
Get:7 http://deb.debian.org/debian sid/main arm64 Components [4911 kB]
Fetched 15.1 MB in 3s (5999 kB/s)
Reading package lists...
Reading package lists...
Building dependency tree...
Reading state information...
Calculating upgrade...
The following packages will be upgraded:
apt bsdextrautils bsdutils debianutils dpkg dpkg-dev libapt-pkg6.0t64
libaudit-common libaudit1 libblkid1 libbrotli1 libcap-ng0 libdpkg-perl
libglib2.0-0t64 libmarkdown2 libmount1 libpcre2-8-0 libseccomp2 libselinux1
libsemanage2 libsmartcols1 libuuid1 libxml2 login login.defs mount passwd
util-linux
28 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
Need to get 8719 kB/12.2 MB of archives.
After this operation, 603 kB disk space will be freed.
Get:1 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ dpkg 1.22.12~1.gbp82cafd [1513 kB]
Get:2 http://deb.debian.org/debian sid/main arm64 bsdutils arm64 1:2.40.2-11 [104 kB]
Get:3 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ dpkg-dev 1.22.12~1.gbp82cafd [1337 kB]
Get:4 http://deb.debian.org/debian sid/main arm64 debianutils arm64 5.21 [92.1 kB]
Get:5 http://deb.debian.org/debian sid/main arm64 libapt-pkg6.0t64 arm64 2.9.11 [921 kB]
Get:6 file:/build/gemmlowp-xpTCUd/resolver-DxNUNH/apt_archive ./ libdpkg-perl 1.22.12~1.gbp82cafd [647 kB]
Get:7 http://deb.debian.org/debian sid/main arm64 bsdextrautils arm64 2.40.2-11 [91.2 kB]
Get:8 http://deb.debian.org/debian sid/main arm64 libblkid1 arm64 2.40.2-11 [162 kB]
Get:9 http://deb.debian.org/debian sid/main arm64 libmount1 arm64 2.40.2-11 [190 kB]
Get:10 http://deb.debian.org/debian sid/main arm64 libsmartcols1 arm64 2.40.2-11 [135 kB]
Get:11 http://deb.debian.org/debian sid/main arm64 mount arm64 2.40.2-11 [153 kB]
Get:12 http://deb.debian.org/debian sid/main arm64 libuuid1 arm64 2.40.2-11 [35.7 kB]
Get:13 http://deb.debian.org/debian sid/main arm64 util-linux arm64 2.40.2-11 [1170 kB]
Get:14 http://deb.debian.org/debian sid/main arm64 libpcre2-8-0 arm64 10.44-4 [243 kB]
Get:15 http://deb.debian.org/debian sid/main arm64 libselinux1 arm64 3.7-3+b1 [72.1 kB]
Get:16 http://deb.debian.org/debian sid/main arm64 libseccomp2 arm64 2.5.5-1+b3 [46.8 kB]
Get:17 http://deb.debian.org/debian sid/main arm64 apt arm64 2.9.11 [1287 kB]
Get:18 http://deb.debian.org/debian sid/main arm64 libaudit-common all 1:4.0.2-2 [12.7 kB]
Get:19 http://deb.debian.org/debian sid/main arm64 libcap-ng0 arm64 0.8.5-3+b1 [17.0 kB]
Get:20 http://deb.debian.org/debian sid/main arm64 libaudit1 arm64 1:4.0.2-2 [54.2 kB]
Get:21 http://deb.debian.org/debian sid/main arm64 login arm64 1:4.16.0-2+really2.40.2-11 [80.0 kB]
Get:22 http://deb.debian.org/debian sid/main arm64 login.defs all 1:4.16.0-5 [185 kB]
Get:23 http://deb.debian.org/debian sid/main arm64 libsemanage2 arm64 3.7-2+b1 [84.5 kB]
Get:24 http://deb.debian.org/debian sid/main arm64 passwd arm64 1:4.16.0-5 [1210 kB]
Get:25 http://deb.debian.org/debian sid/main arm64 libbrotli1 arm64 1.1.0-2+b6 [297 kB]
Get:26 http://deb.debian.org/debian sid/main arm64 libglib2.0-0t64 arm64 2.82.2-3 [1411 kB]
Get:27 http://deb.debian.org/debian sid/main arm64 libmarkdown2 arm64 2.2.7-2.1 [33.1 kB]
Get:28 http://deb.debian.org/debian sid/main arm64 libxml2 arm64 2.12.7+dfsg+really2.9.14-0.2+b1 [630 kB]
debconf: delaying package configuration, since apt-utils is not installed
Fetched 8719 kB in 0s (56.1 MB/s)
(Reading database ... 17129 files and directories currently installed.)
Preparing to unpack .../bsdutils_1%3a2.40.2-11_arm64.deb ...
Unpacking bsdutils (1:2.40.2-11) over (1:2.40.2-10) ...
Setting up bsdutils (1:2.40.2-11) ...
(Reading database ... 17129 files and directories currently installed.)
Preparing to unpack .../debianutils_5.21_arm64.deb ...
Unpacking debianutils (5.21) over (5.20+b1) ...
Setting up debianutils (5.21) ...
(Reading database ... 17128 files and directories currently installed.)
Preparing to unpack .../libapt-pkg6.0t64_2.9.11_arm64.deb ...
Unpacking libapt-pkg6.0t64:arm64 (2.9.11) over (2.9.10) ...
Setting up libapt-pkg6.0t64:arm64 (2.9.11) ...
(Reading database ... 17128 files and directories currently installed.)
Preparing to unpack .../dpkg_1.22.12~1.gbp82cafd_arm64.deb ...
Unpacking dpkg (1.22.12~1.gbp82cafd) over (1.22.11) ...
Setting up dpkg (1.22.12~1.gbp82cafd) ...
(Reading database ... 17127 files and directories currently installed.)
Preparing to unpack .../bsdextrautils_2.40.2-11_arm64.deb ...
Unpacking bsdextrautils (2.40.2-11) over (2.40.2-10) ...
Preparing to unpack .../libblkid1_2.40.2-11_arm64.deb ...
Unpacking libblkid1:arm64 (2.40.2-11) over (2.40.2-10) ...
Setting up libblkid1:arm64 (2.40.2-11) ...
(Reading database ... 17123 files and directories currently installed.)
Preparing to unpack .../libmount1_2.40.2-11_arm64.deb ...
Unpacking libmount1:arm64 (2.40.2-11) over (2.40.2-10) ...
Setting up libmount1:arm64 (2.40.2-11) ...
(Reading database ... 17123 files and directories currently installed.)
Preparing to unpack .../libsmartcols1_2.40.2-11_arm64.deb ...
Unpacking libsmartcols1:arm64 (2.40.2-11) over (2.40.2-10) ...
Setting up libsmartcols1:arm64 (2.40.2-11) ...
(Reading database ... 17123 files and directories currently installed.)
Preparing to unpack .../mount_2.40.2-11_arm64.deb ...
Unpacking mount (2.40.2-11) over (2.40.2-10) ...
Preparing to unpack .../libuuid1_2.40.2-11_arm64.deb ...
Unpacking libuuid1:arm64 (2.40.2-11) over (2.40.2-10) ...
Setting up libuuid1:arm64 (2.40.2-11) ...
(Reading database ... 17123 files and directories currently installed.)
Preparing to unpack .../util-linux_2.40.2-11_arm64.deb ...
Unpacking util-linux (2.40.2-11) over (2.40.2-10) ...
Setting up util-linux (2.40.2-11) ...
(Reading database ... 17120 files and directories currently installed.)
Preparing to unpack .../libpcre2-8-0_10.44-4_arm64.deb ...
Unpacking libpcre2-8-0:arm64 (10.44-4) over (10.42-4+b2) ...
Setting up libpcre2-8-0:arm64 (10.44-4) ...
(Reading database ... 17119 files and directories currently installed.)
Preparing to unpack .../libselinux1_3.7-3+b1_arm64.deb ...
Unpacking libselinux1:arm64 (3.7-3+b1) over (3.7-3) ...
Setting up libselinux1:arm64 (3.7-3+b1) ...
(Reading database ... 17120 files and directories currently installed.)
Preparing to unpack .../libseccomp2_2.5.5-1+b3_arm64.deb ...
Unpacking libseccomp2:arm64 (2.5.5-1+b3) over (2.5.5-1+b2) ...
Setting up libseccomp2:arm64 (2.5.5-1+b3) ...
(Reading database ... 17120 files and directories currently installed.)
Preparing to unpack .../archives/apt_2.9.11_arm64.deb ...
Unpacking apt (2.9.11) over (2.9.10) ...
Setting up apt (2.9.11) ...
(Reading database ... 17117 files and directories currently installed.)
Preparing to unpack .../libaudit-common_1%3a4.0.2-2_all.deb ...
Unpacking libaudit-common (1:4.0.2-2) over (1:4.0.1-3) ...
Setting up libaudit-common (1:4.0.2-2) ...
(Reading database ... 17117 files and directories currently installed.)
Preparing to unpack .../libcap-ng0_0.8.5-3+b1_arm64.deb ...
Unpacking libcap-ng0:arm64 (0.8.5-3+b1) over (0.8.5-3) ...
Setting up libcap-ng0:arm64 (0.8.5-3+b1) ...
(Reading database ... 17118 files and directories currently installed.)
Preparing to unpack .../libaudit1_1%3a4.0.2-2_arm64.deb ...
Unpacking libaudit1:arm64 (1:4.0.2-2) over (1:4.0.1-3) ...
Setting up libaudit1:arm64 (1:4.0.2-2) ...
(Reading database ... 17118 files and directories currently installed.)
Preparing to unpack .../0-login_1%3a4.16.0-2+really2.40.2-11_arm64.deb ...
Unpacking login (1:4.16.0-2+really2.40.2-11) over (1:4.16.0-2+really2.40.2-10) ...
Preparing to unpack .../1-login.defs_1%3a4.16.0-5_all.deb ...
Unpacking login.defs (1:4.16.0-5) over (1:4.16.0-4) ...
Preparing to unpack .../2-libsemanage2_3.7-2+b1_arm64.deb ...
Unpacking libsemanage2:arm64 (3.7-2+b1) over (3.7-2) ...
Preparing to unpack .../3-passwd_1%3a4.16.0-5_arm64.deb ...
Unpacking passwd (1:4.16.0-5) over (1:4.16.0-4) ...
Preparing to unpack .../4-dpkg-dev_1.22.12~1.gbp82cafd_all.deb ...
Unpacking dpkg-dev (1.22.12~1.gbp82cafd) over (1.22.11) ...
Preparing to unpack .../5-libdpkg-perl_1.22.12~1.gbp82cafd_all.deb ...
Unpacking libdpkg-perl (1.22.12~1.gbp82cafd) over (1.22.11) ...
Preparing to unpack .../6-libbrotli1_1.1.0-2+b6_arm64.deb ...
Unpacking libbrotli1:arm64 (1.1.0-2+b6) over (1.1.0-2+b5) ...
Preparing to unpack .../7-libglib2.0-0t64_2.82.2-3_arm64.deb ...
Unpacking libglib2.0-0t64:arm64 (2.82.2-3) over (2.82.2-2) ...
Preparing to unpack .../8-libmarkdown2_2.2.7-2.1_arm64.deb ...
Unpacking libmarkdown2:arm64 (2.2.7-2.1) over (2.2.7-2+b1) ...
Preparing to unpack .../9-libxml2_2.12.7+dfsg+really2.9.14-0.2+b1_arm64.deb ...
Unpacking libxml2:arm64 (2.12.7+dfsg+really2.9.14-0.2+b1) over (2.12.7+dfsg+really2.9.14-0.1) ...
Setting up bsdextrautils (2.40.2-11) ...
Setting up login.defs (1:4.16.0-5) ...
Installing new version of config file /etc/login.defs ...
Setting up libbrotli1:arm64 (1.1.0-2+b6) ...
Setting up libglib2.0-0t64:arm64 (2.82.2-3) ...
No schema files found: doing nothing.
Setting up libdpkg-perl (1.22.12~1.gbp82cafd) ...
Setting up mount (2.40.2-11) ...
Setting up libsemanage2:arm64 (3.7-2+b1) ...
Setting up libxml2:arm64 (2.12.7+dfsg+really2.9.14-0.2+b1) ...
Setting up libmarkdown2:arm64 (2.2.7-2.1) ...
Setting up login (1:4.16.0-2+really2.40.2-11) ...
Setting up dpkg-dev (1.22.12~1.gbp82cafd) ...
Setting up passwd (1:4.16.0-5) ...
Processing triggers for man-db (2.13.0-1) ...
Processing triggers for libc-bin (2.40-3) ...
+------------------------------------------------------------------------------+
| Fetch source files |
+------------------------------------------------------------------------------+
Local sources
-------------
/tmp/debusine-fetch-exec-upload-lnyc4poe/gemmlowp_0.0~git20211220.e844ffd-1.dsc exists in /tmp/debusine-fetch-exec-upload-lnyc4poe; copying to chroot
I: NOTICE: Log filtering will replace 'build/gemmlowp-xpTCUd/gemmlowp-0.0~git20211220.e844ffd' with '<<PKGBUILDDIR>>'
I: NOTICE: Log filtering will replace 'build/gemmlowp-xpTCUd' with '<<BUILDDIR>>'
+------------------------------------------------------------------------------+
| Install package build dependencies |
+------------------------------------------------------------------------------+
Setup apt archive
-----------------
Merged Build-Depends: debhelper-compat (= 13), cmake, build-essential, fakeroot
Filtered Build-Depends: debhelper-compat (= 13), cmake, build-essential, fakeroot
dpkg-deb: warning: root directory has unusual owner or group 998:999.
Hint: either pass --root-owner-group, see dpkg-build-api(7) or add an explicit 'Rules-Requires-Root: no' in debian/control.
dpkg-deb: warning: ignoring 1 warning about the control file(s)
dpkg-deb: building package 'sbuild-build-depends-main-dummy' in '/<<RESOLVERDIR>>/apt_archive/sbuild-build-depends-main-dummy.deb'.
Ign:1 copy:/<<RESOLVERDIR>>/apt_archive ./ InRelease
Get:2 copy:/<<RESOLVERDIR>>/apt_archive ./ Release [609 B]
Ign:3 copy:/<<RESOLVERDIR>>/apt_archive ./ Release.gpg
Get:4 copy:/<<RESOLVERDIR>>/apt_archive ./ Sources [624 B]
Get:5 copy:/<<RESOLVERDIR>>/apt_archive ./ Packages [656 B]
Fetched 1889 B in 0s (155 kB/s)
Reading package lists...
Get:1 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ InRelease
Ign:1 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ InRelease
Get:2 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:2 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:3 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release.gpg
Ign:3 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release.gpg
Reading package lists...
Reading package lists...
Install main build dependencies (apt-based resolver)
----------------------------------------------------
Installing build dependencies
Reading package lists...
Building dependency tree...
Reading state information...
The following additional packages will be installed:
autoconf automake autopoint autotools-dev build-essential cmake cmake-data
cpp cpp-14 cpp-14-aarch64-linux-gnu cpp-aarch64-linux-gnu debhelper
dh-autoreconf dh-strip-nondeterminism dwz fakeroot g++ g++-14
g++-14-aarch64-linux-gnu g++-aarch64-linux-gnu gcc gcc-14
gcc-14-aarch64-linux-gnu gcc-aarch64-linux-gnu libarchive13t64 libasan8
libc-dev-bin libc6-dev libcc1-0 libcrypt-dev libcurl4t64 libdebhelper-perl
libelf1t64 libexpat1 libfakeroot libfile-stripnondeterminism-perl
libgcc-14-dev libhwasan0 libisl23 libitm1 libjsoncpp26 liblsan0 libmpc3
libmpfr6 libncursesw6 libproc2-0 librhash1 libstdc++-14-dev libtool libtsan2
libubsan1 libuv1t64 linux-libc-dev m4 po-debconf procps rpcsvc-proto
Suggested packages:
autoconf-archive gnu-standards autoconf-doc cmake-doc cmake-format
elpa-cmake-mode ninja-build cpp-doc gcc-14-locales cpp-14-doc dh-make
gcc-14-doc gcc-multilib manpages-dev flex bison gdb gcc-doc
gdb-aarch64-linux-gnu lrzip libc-devtools glibc-doc libstdc++-14-doc
libtool-doc gfortran | fortran95-compiler gcj-jdk m4-doc libmail-box-perl
Recommended packages:
manpages manpages-dev libarchive-cpio-perl libgpm2 libltdl-dev
libmail-sendmail-perl psmisc linux-sysctl-defaults
The following NEW packages will be installed:
autoconf automake autopoint autotools-dev build-essential cmake cmake-data
cpp cpp-14 cpp-14-aarch64-linux-gnu cpp-aarch64-linux-gnu debhelper
dh-autoreconf dh-strip-nondeterminism dwz fakeroot g++ g++-14
g++-14-aarch64-linux-gnu g++-aarch64-linux-gnu gcc gcc-14
gcc-14-aarch64-linux-gnu gcc-aarch64-linux-gnu libarchive13t64 libasan8
libc-dev-bin libc6-dev libcc1-0 libcrypt-dev libcurl4t64 libdebhelper-perl
libelf1t64 libexpat1 libfakeroot libfile-stripnondeterminism-perl
libgcc-14-dev libhwasan0 libisl23 libitm1 libjsoncpp26 liblsan0 libmpc3
libmpfr6 libncursesw6 libproc2-0 librhash1 libstdc++-14-dev libtool libtsan2
libubsan1 libuv1t64 linux-libc-dev m4 po-debconf procps rpcsvc-proto
sbuild-build-depends-main-dummy
0 upgraded, 58 newly installed, 0 to remove and 0 not upgraded.
Need to get 75.2 MB of archives.
After this operation, 293 MB of additional disk space will be used.
Get:1 copy:/<<RESOLVERDIR>>/apt_archive ./ sbuild-build-depends-main-dummy 0.invalid.0 [888 B]
Get:2 http://deb.debian.org/debian sid/main arm64 libncursesw6 arm64 6.5-2+b1 [125 kB]
Get:3 http://deb.debian.org/debian sid/main arm64 libproc2-0 arm64 2:4.0.4-6 [62.3 kB]
Get:4 http://deb.debian.org/debian sid/main arm64 procps arm64 2:4.0.4-6 [872 kB]
Get:5 http://deb.debian.org/debian sid/main arm64 m4 arm64 1.4.19-4 [277 kB]
Get:6 http://deb.debian.org/debian sid/main arm64 autoconf all 2.72-3 [493 kB]
Get:7 http://deb.debian.org/debian sid/main arm64 autotools-dev all 20220109.1 [51.6 kB]
Get:8 http://deb.debian.org/debian sid/main arm64 automake all 1:1.16.5-1.3 [823 kB]
Get:9 http://deb.debian.org/debian sid/main arm64 autopoint all 0.22.5-2 [723 kB]
Get:10 http://deb.debian.org/debian sid/main arm64 libc-dev-bin arm64 2.40-3 [50.9 kB]
Get:11 http://deb.debian.org/debian sid/main arm64 linux-libc-dev all 6.11.7-1 [2454 kB]
Get:12 http://deb.debian.org/debian sid/main arm64 libcrypt-dev arm64 1:4.4.36-5 [122 kB]
Get:13 http://deb.debian.org/debian sid/main arm64 rpcsvc-proto arm64 1.4.3-1+b1 [60.5 kB]
Get:14 http://deb.debian.org/debian sid/main arm64 libc6-dev arm64 2.40-3 [1591 kB]
Get:15 http://deb.debian.org/debian sid/main arm64 libisl23 arm64 0.27-1 [601 kB]
Get:16 http://deb.debian.org/debian sid/main arm64 libmpfr6 arm64 4.2.1-1+b2 [680 kB]
Get:17 http://deb.debian.org/debian sid/main arm64 libmpc3 arm64 1.3.1-1+b3 [50.5 kB]
Get:18 http://deb.debian.org/debian sid/main arm64 cpp-14-aarch64-linux-gnu arm64 14.2.0-8 [9166 kB]
Get:19 http://deb.debian.org/debian sid/main arm64 cpp-14 arm64 14.2.0-8 [1284 B]
Get:20 http://deb.debian.org/debian sid/main arm64 cpp-aarch64-linux-gnu arm64 4:14.2.0-1 [4832 B]
Get:21 http://deb.debian.org/debian sid/main arm64 cpp arm64 4:14.2.0-1 [1568 B]
Get:22 http://deb.debian.org/debian sid/main arm64 libcc1-0 arm64 14.2.0-8 [42.2 kB]
Get:23 http://deb.debian.org/debian sid/main arm64 libitm1 arm64 14.2.0-8 [24.2 kB]
Get:24 http://deb.debian.org/debian sid/main arm64 libasan8 arm64 14.2.0-8 [2579 kB]
Get:25 http://deb.debian.org/debian sid/main arm64 liblsan0 arm64 14.2.0-8 [1161 kB]
Get:26 http://deb.debian.org/debian sid/main arm64 libtsan2 arm64 14.2.0-8 [2386 kB]
Get:27 http://deb.debian.org/debian sid/main arm64 libubsan1 arm64 14.2.0-8 [1039 kB]
Get:28 http://deb.debian.org/debian sid/main arm64 libhwasan0 arm64 14.2.0-8 [1442 kB]
Get:29 http://deb.debian.org/debian sid/main arm64 libgcc-14-dev arm64 14.2.0-8 [2365 kB]
Get:30 http://deb.debian.org/debian sid/main arm64 gcc-14-aarch64-linux-gnu arm64 14.2.0-8 [17.7 MB]
Get:31 http://deb.debian.org/debian sid/main arm64 gcc-14 arm64 14.2.0-8 [519 kB]
Get:32 http://deb.debian.org/debian sid/main arm64 gcc-aarch64-linux-gnu arm64 4:14.2.0-1 [1440 B]
Get:33 http://deb.debian.org/debian sid/main arm64 gcc arm64 4:14.2.0-1 [5136 B]
Get:34 http://deb.debian.org/debian sid/main arm64 libstdc++-14-dev arm64 14.2.0-8 [2267 kB]
Get:35 http://deb.debian.org/debian sid/main arm64 g++-14-aarch64-linux-gnu arm64 14.2.0-8 [10.1 MB]
Get:36 http://deb.debian.org/debian sid/main arm64 g++-14 arm64 14.2.0-8 [20.2 kB]
Get:37 http://deb.debian.org/debian sid/main arm64 g++-aarch64-linux-gnu arm64 4:14.2.0-1 [1200 B]
Get:38 http://deb.debian.org/debian sid/main arm64 g++ arm64 4:14.2.0-1 [1332 B]
Get:39 http://deb.debian.org/debian sid/main arm64 build-essential arm64 12.12 [4624 B]
Get:40 http://deb.debian.org/debian sid/main arm64 cmake-data all 3.31.0-1 [2265 kB]
Get:41 http://deb.debian.org/debian sid/main arm64 libarchive13t64 arm64 3.7.4-1.1 [323 kB]
Get:42 http://deb.debian.org/debian sid/main arm64 libcurl4t64 arm64 8.11.0-1 [322 kB]
Get:43 http://deb.debian.org/debian sid/main arm64 libexpat1 arm64 2.6.4-1 [90.7 kB]
Get:44 http://deb.debian.org/debian sid/main arm64 libjsoncpp26 arm64 1.9.6-2 [73.1 kB]
Get:45 http://deb.debian.org/debian sid/main arm64 librhash1 arm64 1.4.5-1 [129 kB]
Get:46 http://deb.debian.org/debian sid/main arm64 libuv1t64 arm64 1.48.0-7 [143 kB]
Get:47 http://deb.debian.org/debian sid/main arm64 cmake arm64 3.31.0-1 [9874 kB]
Get:48 http://deb.debian.org/debian sid/main arm64 libdebhelper-perl all 13.20 [89.7 kB]
Get:49 http://deb.debian.org/debian sid/main arm64 libtool all 2.4.7-8 [517 kB]
Get:50 http://deb.debian.org/debian sid/main arm64 dh-autoreconf all 20 [17.1 kB]
Get:51 http://deb.debian.org/debian sid/main arm64 libfile-stripnondeterminism-perl all 1.14.0-1 [19.5 kB]
Get:52 http://deb.debian.org/debian sid/main arm64 dh-strip-nondeterminism all 1.14.0-1 [8448 B]
Get:53 http://deb.debian.org/debian sid/main arm64 libelf1t64 arm64 0.192-4 [189 kB]
Get:54 http://deb.debian.org/debian sid/main arm64 dwz arm64 0.15-1+b1 [102 kB]
Get:55 http://deb.debian.org/debian sid/main arm64 po-debconf all 1.0.21+nmu1 [248 kB]
Get:56 http://deb.debian.org/debian sid/main arm64 debhelper all 13.20 [915 kB]
Get:57 http://deb.debian.org/debian sid/main arm64 libfakeroot arm64 1.36-1 [29.1 kB]
Get:58 http://deb.debian.org/debian sid/main arm64 fakeroot arm64 1.36-1 [74.4 kB]
debconf: delaying package configuration, since apt-utils is not installed
Fetched 75.2 MB in 1s (90.3 MB/s)
Selecting previously unselected package libncursesw6:arm64.
(Reading database ... 17119 files and directories currently installed.)
Preparing to unpack .../00-libncursesw6_6.5-2+b1_arm64.deb ...
Unpacking libncursesw6:arm64 (6.5-2+b1) ...
Selecting previously unselected package libproc2-0:arm64.
Preparing to unpack .../01-libproc2-0_2%3a4.0.4-6_arm64.deb ...
Unpacking libproc2-0:arm64 (2:4.0.4-6) ...
Selecting previously unselected package procps.
Preparing to unpack .../02-procps_2%3a4.0.4-6_arm64.deb ...
Unpacking procps (2:4.0.4-6) ...
Selecting previously unselected package m4.
Preparing to unpack .../03-m4_1.4.19-4_arm64.deb ...
Unpacking m4 (1.4.19-4) ...
Selecting previously unselected package autoconf.
Preparing to unpack .../04-autoconf_2.72-3_all.deb ...
Unpacking autoconf (2.72-3) ...
Selecting previously unselected package autotools-dev.
Preparing to unpack .../05-autotools-dev_20220109.1_all.deb ...
Unpacking autotools-dev (20220109.1) ...
Selecting previously unselected package automake.
Preparing to unpack .../06-automake_1%3a1.16.5-1.3_all.deb ...
Unpacking automake (1:1.16.5-1.3) ...
Selecting previously unselected package autopoint.
Preparing to unpack .../07-autopoint_0.22.5-2_all.deb ...
Unpacking autopoint (0.22.5-2) ...
Selecting previously unselected package libc-dev-bin.
Preparing to unpack .../08-libc-dev-bin_2.40-3_arm64.deb ...
Unpacking libc-dev-bin (2.40-3) ...
Selecting previously unselected package linux-libc-dev.
Preparing to unpack .../09-linux-libc-dev_6.11.7-1_all.deb ...
Unpacking linux-libc-dev (6.11.7-1) ...
Selecting previously unselected package libcrypt-dev:arm64.
Preparing to unpack .../10-libcrypt-dev_1%3a4.4.36-5_arm64.deb ...
Unpacking libcrypt-dev:arm64 (1:4.4.36-5) ...
Selecting previously unselected package rpcsvc-proto.
Preparing to unpack .../11-rpcsvc-proto_1.4.3-1+b1_arm64.deb ...
Unpacking rpcsvc-proto (1.4.3-1+b1) ...
Selecting previously unselected package libc6-dev:arm64.
Preparing to unpack .../12-libc6-dev_2.40-3_arm64.deb ...
Unpacking libc6-dev:arm64 (2.40-3) ...
Selecting previously unselected package libisl23:arm64.
Preparing to unpack .../13-libisl23_0.27-1_arm64.deb ...
Unpacking libisl23:arm64 (0.27-1) ...
Selecting previously unselected package libmpfr6:arm64.
Preparing to unpack .../14-libmpfr6_4.2.1-1+b2_arm64.deb ...
Unpacking libmpfr6:arm64 (4.2.1-1+b2) ...
Selecting previously unselected package libmpc3:arm64.
Preparing to unpack .../15-libmpc3_1.3.1-1+b3_arm64.deb ...
Unpacking libmpc3:arm64 (1.3.1-1+b3) ...
Selecting previously unselected package cpp-14-aarch64-linux-gnu.
Preparing to unpack .../16-cpp-14-aarch64-linux-gnu_14.2.0-8_arm64.deb ...
Unpacking cpp-14-aarch64-linux-gnu (14.2.0-8) ...
Selecting previously unselected package cpp-14.
Preparing to unpack .../17-cpp-14_14.2.0-8_arm64.deb ...
Unpacking cpp-14 (14.2.0-8) ...
Selecting previously unselected package cpp-aarch64-linux-gnu.
Preparing to unpack .../18-cpp-aarch64-linux-gnu_4%3a14.2.0-1_arm64.deb ...
Unpacking cpp-aarch64-linux-gnu (4:14.2.0-1) ...
Selecting previously unselected package cpp.
Preparing to unpack .../19-cpp_4%3a14.2.0-1_arm64.deb ...
Unpacking cpp (4:14.2.0-1) ...
Selecting previously unselected package libcc1-0:arm64.
Preparing to unpack .../20-libcc1-0_14.2.0-8_arm64.deb ...
Unpacking libcc1-0:arm64 (14.2.0-8) ...
Selecting previously unselected package libitm1:arm64.
Preparing to unpack .../21-libitm1_14.2.0-8_arm64.deb ...
Unpacking libitm1:arm64 (14.2.0-8) ...
Selecting previously unselected package libasan8:arm64.
Preparing to unpack .../22-libasan8_14.2.0-8_arm64.deb ...
Unpacking libasan8:arm64 (14.2.0-8) ...
Selecting previously unselected package liblsan0:arm64.
Preparing to unpack .../23-liblsan0_14.2.0-8_arm64.deb ...
Unpacking liblsan0:arm64 (14.2.0-8) ...
Selecting previously unselected package libtsan2:arm64.
Preparing to unpack .../24-libtsan2_14.2.0-8_arm64.deb ...
Unpacking libtsan2:arm64 (14.2.0-8) ...
Selecting previously unselected package libubsan1:arm64.
Preparing to unpack .../25-libubsan1_14.2.0-8_arm64.deb ...
Unpacking libubsan1:arm64 (14.2.0-8) ...
Selecting previously unselected package libhwasan0:arm64.
Preparing to unpack .../26-libhwasan0_14.2.0-8_arm64.deb ...
Unpacking libhwasan0:arm64 (14.2.0-8) ...
Selecting previously unselected package libgcc-14-dev:arm64.
Preparing to unpack .../27-libgcc-14-dev_14.2.0-8_arm64.deb ...
Unpacking libgcc-14-dev:arm64 (14.2.0-8) ...
Selecting previously unselected package gcc-14-aarch64-linux-gnu.
Preparing to unpack .../28-gcc-14-aarch64-linux-gnu_14.2.0-8_arm64.deb ...
Unpacking gcc-14-aarch64-linux-gnu (14.2.0-8) ...
Selecting previously unselected package gcc-14.
Preparing to unpack .../29-gcc-14_14.2.0-8_arm64.deb ...
Unpacking gcc-14 (14.2.0-8) ...
Selecting previously unselected package gcc-aarch64-linux-gnu.
Preparing to unpack .../30-gcc-aarch64-linux-gnu_4%3a14.2.0-1_arm64.deb ...
Unpacking gcc-aarch64-linux-gnu (4:14.2.0-1) ...
Selecting previously unselected package gcc.
Preparing to unpack .../31-gcc_4%3a14.2.0-1_arm64.deb ...
Unpacking gcc (4:14.2.0-1) ...
Selecting previously unselected package libstdc++-14-dev:arm64.
Preparing to unpack .../32-libstdc++-14-dev_14.2.0-8_arm64.deb ...
Unpacking libstdc++-14-dev:arm64 (14.2.0-8) ...
Selecting previously unselected package g++-14-aarch64-linux-gnu.
Preparing to unpack .../33-g++-14-aarch64-linux-gnu_14.2.0-8_arm64.deb ...
Unpacking g++-14-aarch64-linux-gnu (14.2.0-8) ...
Selecting previously unselected package g++-14.
Preparing to unpack .../34-g++-14_14.2.0-8_arm64.deb ...
Unpacking g++-14 (14.2.0-8) ...
Selecting previously unselected package g++-aarch64-linux-gnu.
Preparing to unpack .../35-g++-aarch64-linux-gnu_4%3a14.2.0-1_arm64.deb ...
Unpacking g++-aarch64-linux-gnu (4:14.2.0-1) ...
Selecting previously unselected package g++.
Preparing to unpack .../36-g++_4%3a14.2.0-1_arm64.deb ...
Unpacking g++ (4:14.2.0-1) ...
Selecting previously unselected package build-essential.
Preparing to unpack .../37-build-essential_12.12_arm64.deb ...
Unpacking build-essential (12.12) ...
Selecting previously unselected package cmake-data.
Preparing to unpack .../38-cmake-data_3.31.0-1_all.deb ...
Unpacking cmake-data (3.31.0-1) ...
Selecting previously unselected package libarchive13t64:arm64.
Preparing to unpack .../39-libarchive13t64_3.7.4-1.1_arm64.deb ...
Unpacking libarchive13t64:arm64 (3.7.4-1.1) ...
Selecting previously unselected package libcurl4t64:arm64.
Preparing to unpack .../40-libcurl4t64_8.11.0-1_arm64.deb ...
Unpacking libcurl4t64:arm64 (8.11.0-1) ...
Selecting previously unselected package libexpat1:arm64.
Preparing to unpack .../41-libexpat1_2.6.4-1_arm64.deb ...
Unpacking libexpat1:arm64 (2.6.4-1) ...
Selecting previously unselected package libjsoncpp26:arm64.
Preparing to unpack .../42-libjsoncpp26_1.9.6-2_arm64.deb ...
Unpacking libjsoncpp26:arm64 (1.9.6-2) ...
Selecting previously unselected package librhash1:arm64.
Preparing to unpack .../43-librhash1_1.4.5-1_arm64.deb ...
Unpacking librhash1:arm64 (1.4.5-1) ...
Selecting previously unselected package libuv1t64:arm64.
Preparing to unpack .../44-libuv1t64_1.48.0-7_arm64.deb ...
Unpacking libuv1t64:arm64 (1.48.0-7) ...
Selecting previously unselected package cmake.
Preparing to unpack .../45-cmake_3.31.0-1_arm64.deb ...
Unpacking cmake (3.31.0-1) ...
Selecting previously unselected package libdebhelper-perl.
Preparing to unpack .../46-libdebhelper-perl_13.20_all.deb ...
Unpacking libdebhelper-perl (13.20) ...
Selecting previously unselected package libtool.
Preparing to unpack .../47-libtool_2.4.7-8_all.deb ...
Unpacking libtool (2.4.7-8) ...
Selecting previously unselected package dh-autoreconf.
Preparing to unpack .../48-dh-autoreconf_20_all.deb ...
Unpacking dh-autoreconf (20) ...
Selecting previously unselected package libfile-stripnondeterminism-perl.
Preparing to unpack .../49-libfile-stripnondeterminism-perl_1.14.0-1_all.deb ...
Unpacking libfile-stripnondeterminism-perl (1.14.0-1) ...
Selecting previously unselected package dh-strip-nondeterminism.
Preparing to unpack .../50-dh-strip-nondeterminism_1.14.0-1_all.deb ...
Unpacking dh-strip-nondeterminism (1.14.0-1) ...
Selecting previously unselected package libelf1t64:arm64.
Preparing to unpack .../51-libelf1t64_0.192-4_arm64.deb ...
Unpacking libelf1t64:arm64 (0.192-4) ...
Selecting previously unselected package dwz.
Preparing to unpack .../52-dwz_0.15-1+b1_arm64.deb ...
Unpacking dwz (0.15-1+b1) ...
Selecting previously unselected package po-debconf.
Preparing to unpack .../53-po-debconf_1.0.21+nmu1_all.deb ...
Unpacking po-debconf (1.0.21+nmu1) ...
Selecting previously unselected package debhelper.
Preparing to unpack .../54-debhelper_13.20_all.deb ...
Unpacking debhelper (13.20) ...
Selecting previously unselected package libfakeroot:arm64.
Preparing to unpack .../55-libfakeroot_1.36-1_arm64.deb ...
Unpacking libfakeroot:arm64 (1.36-1) ...
Selecting previously unselected package fakeroot.
Preparing to unpack .../56-fakeroot_1.36-1_arm64.deb ...
Unpacking fakeroot (1.36-1) ...
Selecting previously unselected package sbuild-build-depends-main-dummy.
Preparing to unpack .../57-sbuild-build-depends-main-dummy_0.invalid.0_arm64.deb ...
Unpacking sbuild-build-depends-main-dummy (0.invalid.0) ...
Setting up libexpat1:arm64 (2.6.4-1) ...
Setting up libfile-stripnondeterminism-perl (1.14.0-1) ...
Setting up libcurl4t64:arm64 (8.11.0-1) ...
Setting up po-debconf (1.0.21+nmu1) ...
Setting up libdebhelper-perl (13.20) ...
Setting up libuv1t64:arm64 (1.48.0-7) ...
Setting up linux-libc-dev (6.11.7-1) ...
Setting up m4 (1.4.19-4) ...
Setting up libfakeroot:arm64 (1.36-1) ...
Setting up libelf1t64:arm64 (0.192-4) ...
Setting up fakeroot (1.36-1) ...
update-alternatives: using /usr/bin/fakeroot-sysv to provide /usr/bin/fakeroot (fakeroot) in auto mode
Setting up autotools-dev (20220109.1) ...
Setting up rpcsvc-proto (1.4.3-1+b1) ...
Setting up libmpfr6:arm64 (4.2.1-1+b2) ...
Setting up libjsoncpp26:arm64 (1.9.6-2) ...
Setting up libproc2-0:arm64 (2:4.0.4-6) ...
Setting up libmpc3:arm64 (1.3.1-1+b3) ...
Setting up autopoint (0.22.5-2) ...
Setting up libncursesw6:arm64 (6.5-2+b1) ...
Setting up autoconf (2.72-3) ...
Setting up libubsan1:arm64 (14.2.0-8) ...
Setting up dh-strip-nondeterminism (1.14.0-1) ...
Setting up dwz (0.15-1+b1) ...
Setting up libhwasan0:arm64 (14.2.0-8) ...
Setting up libcrypt-dev:arm64 (1:4.4.36-5) ...
Setting up libasan8:arm64 (14.2.0-8) ...
Setting up procps (2:4.0.4-6) ...
Setting up cmake-data (3.31.0-1) ...
Setting up librhash1:arm64 (1.4.5-1) ...
Setting up libtsan2:arm64 (14.2.0-8) ...
Setting up libisl23:arm64 (0.27-1) ...
Setting up libc-dev-bin (2.40-3) ...
Setting up libarchive13t64:arm64 (3.7.4-1.1) ...
Setting up libcc1-0:arm64 (14.2.0-8) ...
Setting up liblsan0:arm64 (14.2.0-8) ...
Setting up libitm1:arm64 (14.2.0-8) ...
Setting up automake (1:1.16.5-1.3) ...
update-alternatives: using /usr/bin/automake-1.16 to provide /usr/bin/automake (automake) in auto mode
Setting up cpp-14-aarch64-linux-gnu (14.2.0-8) ...
Setting up cmake (3.31.0-1) ...
Setting up libc6-dev:arm64 (2.40-3) ...
Setting up libgcc-14-dev:arm64 (14.2.0-8) ...
Setting up libstdc++-14-dev:arm64 (14.2.0-8) ...
Setting up cpp-aarch64-linux-gnu (4:14.2.0-1) ...
Setting up cpp-14 (14.2.0-8) ...
Setting up cpp (4:14.2.0-1) ...
Setting up gcc-14-aarch64-linux-gnu (14.2.0-8) ...
Setting up gcc-aarch64-linux-gnu (4:14.2.0-1) ...
Setting up g++-14-aarch64-linux-gnu (14.2.0-8) ...
Setting up gcc-14 (14.2.0-8) ...
Setting up g++-aarch64-linux-gnu (4:14.2.0-1) ...
Setting up g++-14 (14.2.0-8) ...
Setting up libtool (2.4.7-8) ...
Setting up gcc (4:14.2.0-1) ...
Setting up dh-autoreconf (20) ...
Setting up g++ (4:14.2.0-1) ...
update-alternatives: using /usr/bin/g++ to provide /usr/bin/c++ (c++) in auto mode
Setting up build-essential (12.12) ...
Setting up debhelper (13.20) ...
Setting up sbuild-build-depends-main-dummy (0.invalid.0) ...
Processing triggers for man-db (2.13.0-1) ...
Processing triggers for libc-bin (2.40-3) ...
+------------------------------------------------------------------------------+
| Check architectures |
+------------------------------------------------------------------------------+
Arch check ok (arm64 included in any)
+------------------------------------------------------------------------------+
| Build environment |
+------------------------------------------------------------------------------+
Kernel: Linux 6.1.0-27-cloud-arm64 #1 SMP Debian 6.1.115-1 (2024-11-01) arm64 (aarch64)
Toolchain package versions: binutils_2.43.1-5 dpkg-dev_1.22.12~1.gbp82cafd g++-14_14.2.0-8 gcc-14_14.2.0-8 libc6-dev_2.40-3 libstdc++-14-dev_14.2.0-8 libstdc++6_14.2.0-8 linux-libc-dev_6.11.7-1
Package versions: appstream_1.0.3-1+b1 apt_2.9.11 autoconf_2.72-3 automake_1:1.16.5-1.3 autopoint_0.22.5-2 autotools-dev_20220109.1 base-files_13.5 base-passwd_3.6.5 bash_5.2.32-1+b2 binutils_2.43.1-5 binutils-aarch64-linux-gnu_2.43.1-5 binutils-common_2.43.1-5 bsdextrautils_2.40.2-11 bsdutils_1:2.40.2-11 build-essential_12.12 bzip2_1.0.8-6 ca-certificates_20240203 cmake_3.31.0-1 cmake-data_3.31.0-1 coreutils_9.5-1+b1 cpp_4:14.2.0-1 cpp-14_14.2.0-8 cpp-14-aarch64-linux-gnu_14.2.0-8 cpp-aarch64-linux-gnu_4:14.2.0-1 dash_0.5.12-9+b1 debconf_1.5.87 debhelper_13.20 debian-archive-keyring_2023.4 debianutils_5.21 dh-autoreconf_20 dh-strip-nondeterminism_1.14.0-1 diffstat_1.66-1+b1 diffutils_1:3.10-1+b1 dpkg_1.22.12~1.gbp82cafd dpkg-dev_1.22.12~1.gbp82cafd dwz_0.15-1+b1 e2fsprogs_1.47.1-1+b1 fakeroot_1.36-1 file_1:5.45-3+b1 findutils_4.10.0-3 g++_4:14.2.0-1 g++-14_14.2.0-8 g++-14-aarch64-linux-gnu_14.2.0-8 g++-aarch64-linux-gnu_4:14.2.0-1 gcc_4:14.2.0-1 gcc-14_14.2.0-8 gcc-14-aarch64-linux-gnu_14.2.0-8 gcc-14-base_14.2.0-8 gcc-aarch64-linux-gnu_4:14.2.0-1 gettext_0.22.5-2 gettext-base_0.22.5-2 gpg_2.2.45-2 gpgconf_2.2.45-2 gpgv_2.2.45-2 grep_3.11-4+b1 groff-base_1.23.0-5 gzip_1.12-1.1+b1 hostname_3.25 init-system-helpers_1.67 intltool-debian_0.35.0+20060710.6 iso-codes_4.17.0-1 libacl1_2.3.2-2+b1 libaliased-perl_0.34-3 libappstream5_1.0.3-1+b1 libapt-pkg-perl_0.1.40+b6 libapt-pkg6.0t64_2.9.11 libarchive-zip-perl_1.68-1 libarchive13t64_3.7.4-1.1 libasan8_14.2.0-8 libassuan9_3.0.1-2 libatomic1_14.2.0-8 libattr1_1:2.5.2-2 libaudit-common_1:4.0.2-2 libaudit1_1:4.0.2-2 libb-hooks-endofscope-perl_0.28-1 libb-hooks-op-check-perl_0.22-3+b2 libberkeleydb-perl_0.66-1 libbinutils_2.43.1-5 libblkid1_2.40.2-11 libbrotli1_1.1.0-2+b6 libbsd0_0.12.2-2 libbz2-1.0_1.0.8-6 libc-bin_2.40-3 libc-dev-bin_2.40-3 libc6_2.40-3 libc6-dev_2.40-3 libcap-ng0_0.8.5-3+b1 libcap2_1:2.66-5+b1 libcapture-tiny-perl_0.48-2 libcc1-0_14.2.0-8 libcgi-pm-perl_4.66-1 libclass-data-inheritable-perl_0.10-1 
libclass-inspector-perl_1.36-3 libclass-method-modifiers-perl_2.15-1 libclass-xsaccessor-perl_1.19-4+b4 libclone-perl_0.47-1+b1 libcom-err2_1.47.1-1+b1 libconfig-tiny-perl_2.30-1 libconst-fast-perl_0.014-2 libcpanel-json-xs-perl_4.38-1+b1 libcrypt-dev_1:4.4.36-5 libcrypt1_1:4.4.36-5 libctf-nobfd0_2.43.1-5 libctf0_2.43.1-5 libcurl3t64-gnutls_8.11.0-1 libcurl4t64_8.11.0-1 libdata-dpath-perl_0.60-1 libdata-messagepack-perl_1.02-1+b4 libdata-optlist-perl_0.114-1 libdata-validate-domain-perl_0.15-1 libdata-validate-ip-perl_0.31-1 libdata-validate-uri-perl_0.07-3 libdb5.3t64_5.3.28+dfsg2-9 libdebconfclient0_0.273 libdebhelper-perl_13.20 libdevel-callchecker-perl_0.009-1+b1 libdevel-size-perl_0.84-1+b1 libdevel-stacktrace-perl_2.0500-1 libdpkg-perl_1.22.12~1.gbp82cafd libdynaloader-functions-perl_0.004-1 libelf1t64_0.192-4 libemail-address-xs-perl_1.05-1+b4 libencode-locale-perl_1.05-3 libexception-class-perl_1.45-1 libexpat1_2.6.4-1 libext2fs2t64_1.47.1-1+b1 libfakeroot_1.36-1 libffi8_3.4.6-1 libfile-basedir-perl_0.09-2 libfile-find-rule-perl_0.34-3 libfile-listing-perl_6.16-1 libfile-sharedir-perl_1.118-3 libfile-stripnondeterminism-perl_1.14.0-1 libfont-ttf-perl_1.06-2 libgcc-14-dev_14.2.0-8 libgcc-s1_14.2.0-8 libgcrypt20_1.11.0-6 libgdbm-compat4t64_1.24-2 libgdbm6t64_1.24-2 libglib2.0-0t64_2.82.2-3 libgmp10_2:6.3.0+dfsg-2+b2 libgnutls30t64_3.8.8-2 libgomp1_14.2.0-8 libgpg-error0_1.50-4 libgprofng0_2.43.1-5 libgssapi-krb5-2_1.21.3-3 libhogweed6t64_3.10-1+b1 libhtml-form-perl_6.12-1 libhtml-html5-entities-perl_0.004-3 libhtml-parser-perl_3.83-1+b1 libhtml-tagset-perl_3.24-1 libhtml-tokeparser-simple-perl_3.16-4 libhtml-tree-perl_5.07-3 libhttp-cookies-perl_6.11-1 libhttp-date-perl_6.06-1 libhttp-message-perl_7.00-2 libhttp-negotiate-perl_6.01-2 libhwasan0_14.2.0-8 libicu72_72.1-5+b1 libidn2-0_2.3.7-2+b1 libimport-into-perl_1.002005-2 libio-html-perl_1.004-3 libio-interactive-perl_1.025-1 libio-socket-ssl-perl_2.089-1 libio-string-perl_1.08-4 libipc-run3-perl_0.049-1 
libipc-system-simple-perl_1.30-2 libisl23_0.27-1 libiterator-perl_0.03+ds1-2 libiterator-util-perl_0.02+ds1-2 libitm1_14.2.0-8 libjansson4_2.14-2+b3 libjson-maybexs-perl_1.004008-1 libjsoncpp26_1.9.6-2 libk5crypto3_1.21.3-3 libkeyutils1_1.6.3-4 libkrb5-3_1.21.3-3 libkrb5support0_1.21.3-3 libldap-2.5-0_2.5.18+dfsg-3+b1 liblist-compare-perl_0.55-2 liblist-someutils-perl_0.59-1 liblist-utilsby-perl_0.12-2 liblsan0_14.2.0-8 liblwp-mediatypes-perl_6.04-2 liblwp-protocol-https-perl_6.14-1 liblz1_1.15~pre2-1 liblz4-1_1.9.4-3+b1 liblzma5_5.6.3-1+b1 liblzo2-2_2.10-3+b1 libmagic-mgc_1:5.45-3+b1 libmagic1t64_1:5.45-3+b1 libmarkdown2_2.2.7-2.1 libmd0_1.1.0-2+b1 libmldbm-perl_2.05-4 libmodule-implementation-perl_0.09-2 libmodule-runtime-perl_0.016-2 libmoo-perl_2.005005-1 libmoox-aliases-perl_0.001006-2 libmount1_2.40.2-11 libmouse-perl_2.5.11-1+b1 libmpc3_1.3.1-1+b3 libmpfr6_4.2.1-1+b2 libnamespace-clean-perl_0.27-2 libncursesw6_6.5-2+b1 libnet-domain-tld-perl_1.75-4 libnet-http-perl_6.23-1 libnet-ipv6addr-perl_1.02-1 libnet-netmask-perl_2.0002-2 libnet-ssleay-perl_1.94-2 libnetaddr-ip-perl_4.079+dfsg-2+b4 libnettle8t64_3.10-1+b1 libnghttp2-14_1.64.0-1 libnghttp3-9_1.4.0-1+b1 libngtcp2-16_1.6.0-1 libngtcp2-crypto-gnutls8_1.6.0-1 libnumber-compare-perl_0.03-3 libp11-kit0_0.25.5-2+b1 libpackage-stash-perl_0.40-1 libpam-modules_1.5.3-7+b1 libpam-modules-bin_1.5.3-7+b1 libpam-runtime_1.5.3-7 libpam0g_1.5.3-7+b1 libparams-classify-perl_0.015-2+b4 libparams-util-perl_1.102-3+b1 libpath-tiny-perl_0.146-1 libpcre2-8-0_10.44-4 libperl5.40_5.40.0-7 libperlio-gzip-perl_0.20-1+b4 libperlio-utf8-strict-perl_0.010-1+b3 libpipeline1_1.5.8-1 libproc-processtable-perl_0.636-1+b3 libproc2-0_2:4.0.4-6 libpsl5t64_0.21.2-1.1+b1 libreadline8t64_8.2-5 libregexp-wildcards-perl_1.05-3 librhash1_1.4.5-1 librole-tiny-perl_2.002004-1 librtmp1_2.4+20151223.gitfa8646d.1-2+b5 libsasl2-2_2.1.28+dfsg1-8 libsasl2-modules-db_2.1.28+dfsg1-8 libseccomp2_2.5.5-1+b3 libselinux1_3.7-3+b1 libsemanage-common_3.7-2 
libsemanage2_3.7-2+b1 libsepol2_3.7-1 libsereal-decoder-perl_5.004+ds-1+b3 libsereal-encoder-perl_5.004+ds-1+b3 libsframe1_2.43.1-5 libsmartcols1_2.40.2-11 libsort-versions-perl_1.62-3 libsqlite3-0_3.46.1-1 libss2_1.47.1-1+b1 libssh2-1t64_1.11.1-1 libssl3t64_3.3.2-2 libstdc++-14-dev_14.2.0-8 libstdc++6_14.2.0-8 libstemmer0d_2.2.0-4+b2 libstrictures-perl_2.000006-1 libsub-exporter-perl_0.990-1 libsub-exporter-progressive-perl_0.001013-3 libsub-identify-perl_0.14-3+b3 libsub-install-perl_0.929-1 libsub-name-perl_0.27-1+b3 libsub-quote-perl_2.006008-1 libsyntax-keyword-try-perl_0.30-1+b1 libsystemd0_257~rc1-4 libtasn1-6_4.19.0-3+b3 libterm-readkey-perl_2.38-2+b4 libtext-glob-perl_0.11-3 libtext-levenshteinxs-perl_0.03-5+b4 libtext-markdown-discount-perl_0.16-1+b3 libtext-xslate-perl_3.5.9-2+b1 libtime-duration-perl_1.21-2 libtime-moment-perl_0.44-2+b4 libtimedate-perl_2.3300-2 libtinfo6_6.5-2+b1 libtool_2.4.7-8 libtry-tiny-perl_0.32-1 libtsan2_14.2.0-8 libubsan1_14.2.0-8 libuchardet0_0.0.8-1+b2 libudev1_257~rc1-4 libunicode-utf8-perl_0.62-2+b3 libunistring5_1.2-1+b1 liburi-perl_5.30-1 libuuid1_2.40.2-11 libuv1t64_1.48.0-7 libvariable-magic-perl_0.64-1+b1 libwww-mechanize-perl_2.19-1 libwww-perl_6.77-1 libwww-robotrules-perl_6.02-1 libxml-libxml-perl_2.0207+dfsg+really+2.0134-5+b1 libxml-namespacesupport-perl_1.12-2 libxml-sax-base-perl_1.09-3 libxml-sax-perl_1.02+dfsg-3 libxml2_2.12.7+dfsg+really2.9.14-0.2+b1 libxmlb2_0.3.21-1 libxs-parse-keyword-perl_0.46-1+b1 libxxhash0_0.8.2-2+b2 libyaml-0-2_0.2.5-1+b2 libyaml-libyaml-perl_0.902.0+ds-2+b1 libzstd1_1.5.6+dfsg-1+b1 lintian_2.120.0 linux-libc-dev_6.11.7-1 login_1:4.16.0-2+really2.40.2-11 login.defs_1:4.16.0-5 logsave_1.47.1-1+b1 lzop_1.04-2+b1 m4_1.4.19-4 make_4.3-4.1+b1 man-db_2.13.0-1 mawk_1.3.4.20240905-1 mount_2.40.2-11 ncurses-base_6.5-2 ncurses-bin_6.5-2+b1 netbase_6.4 openssl_3.3.2-2 openssl-provider-legacy_3.3.2-2 passwd_1:4.16.0-5 patch_2.7.6-7+b1 patchutils_0.4.2-1+b1 perl_5.40.0-7 perl-base_5.40.0-7 
perl-modules-5.40_5.40.0-7 perl-openssl-defaults_7+b2 plzip_1.11-2 po-debconf_1.0.21+nmu1 procps_2:4.0.4-6 readline-common_8.2-5 rpcsvc-proto_1.4.3-1+b1 sbuild-build-depends-main-dummy_0.invalid.0 sed_4.9-2+b1 sensible-utils_0.0.24 shared-mime-info_2.4-5+b1 sysvinit-utils_3.11-1 t1utils_1.41-4+b1 tar_1.35+dfsg-3+b1 tzdata_2024b-3 ucf_3.0043+nmu1 unzip_6.0-28+b1 util-linux_2.40.2-11 xz-utils_5.6.3-1+b1 zlib1g_1:1.3.dfsg+really1.3.1-1+b1
+------------------------------------------------------------------------------+
| Build |
+------------------------------------------------------------------------------+
Unpack source
-------------
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA512
Format: 3.0 (quilt)
Source: gemmlowp
Binary: libgemmlowp-dev
Architecture: any
Version: 0.0~git20211220.e844ffd-1
Maintainer: Debian Science Maintainers <debian-science-maintainers@lists.alioth.debian.org>
Uploaders: Mo Zhou <lumin@debian.org>
Homepage: https://github.com/google/gemmlowp
Standards-Version: 4.5.0
Vcs-Browser: https://salsa.debian.org/science-team/gemmlowp
Vcs-Git: https://salsa.debian.org/science-team/gemmlowp.git
Build-Depends: debhelper-compat (= 13), cmake
Package-List:
libgemmlowp-dev deb libdevel optional arch=any
Checksums-Sha1:
f7b4ad36b046a404ce7f7b0df1b674d747e73430 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz
c31cbd302d97bf431aed316aa8a6d4ce74a11007 3372 gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz
Checksums-Sha256:
b43ad962deaa41ba6727888a48ecf4dad181a70fcdd82505d9dbdcc8388a98e7 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz
a21ebb88383583856d79b7cc2e04e3d2abf162801a749e9c486404088316bfd6 3372 gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz
Files:
abdf0c81faac4c2340657290fb5a9354 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz
ee68dd119e3f3cd7036072821b60c274 3372 gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz
-----BEGIN PGP SIGNATURE-----
iQJFBAEBCgAvFiEEY4vHXsHlxYkGfjXeYmRes19oaooFAmK1UhgRHGx1bWluQGRl
Ymlhbi5vcmcACgkQYmRes19oaopA/Q/+PFnDxewMprNXwTj8nFzpsi/yZ7OjBvXO
y00AxsZD6+7eS039S/VpbjQ43eJM56kT0p2kqZN4bueJOQI2BCcPtGN1k/fHixlv
By+L2BQoXyownTD1WhfylJuySCY1tghN/zLl4G7Xr0daWyrZz9nCBXVx+JzVQg7V
AFjwPDD5ox5ZwoCKlFMOsJr9HA5sEV1AbmCAvJ3lf9qOPuaD03HefGjSRyRhFSR2
zU9yHqYS/DdmyzuqL4TLV5hnmTeS7H+xIaNHW6pN46nlA3ZTGPKc/0Rn5edGhYtS
PrFar/NJ0jri7qFC4ck+kWR/fz6oP9FLmr5E4Da5aGOruqU3g4ByeSmg6z9on5fe
MPJoHrf7kwCPa9VopOmfI7SoI6f7tyJ9JDLe2NprXiLryuFRw3/X01RH9V0Y9BPa
KgOscXbJZwl3XdbtMa2Me3UKkgQQsZB5kRyHBFNhX6a7noxEoFTOMSLhGw3tIwpd
U4CG1ddZqASnWBn7RjQNeLxLMvPo2mK9IkzQm+sHpNONf1GisUAS0IHKARAuxgsk
Na2r8mvIHMv4Y2nmHKUkd3U9OupXusFZOapV2PBgjI2AZ9h9OJwUPVs9SGseogbx
yKF2U3bZqf2f1OJKlw0kjN+x6oO3NtA6+H3fDPczmgQN0UliIOW39dkDtqzE5t8Z
tu4VkVxZLX4=
=AWaR
-----END PGP SIGNATURE-----
gpgv: Signature made Fri Jun 24 05:56:40 2022 UTC
gpgv: using RSA key 638BC75EC1E5C589067E35DE62645EB35F686A8A
gpgv: issuer "lumin@debian.org"
gpgv: Can't check signature: No public key
dpkg-source: warning: cannot verify inline signature for ./gemmlowp_0.0~git20211220.e844ffd-1.dsc: no acceptable signature found
dpkg-source: info: extracting gemmlowp in /<<PKGBUILDDIR>>
dpkg-source: info: unpacking gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz
dpkg-source: info: unpacking gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz
dpkg-source: info: using patch list from debian/patches/series
dpkg-source: info: applying 0001-cmake-build-fix.patch
Check disk space
----------------
Sufficient free space for build
User Environment
----------------
APT_CONFIG=/var/lib/sbuild/apt.conf
HOME=/sbuild-nonexistent
LANG=en_US.UTF-8
LC_ALL=C.UTF-8
LOGNAME=debusine-worker
PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games
SHELL=/bin/sh
USER=debusine-worker
dpkg-buildpackage
-----------------
Command: dpkg-buildpackage -us -uc -b -rfakeroot
dpkg-buildpackage: info: source package gemmlowp
dpkg-buildpackage: info: source version 0.0~git20211220.e844ffd-1
dpkg-buildpackage: info: source distribution unstable
dpkg-buildpackage: info: source changed by Mo Zhou <lumin@debian.org>
dpkg-source --before-build .
dpkg-buildpackage: info: host architecture arm64
debian/rules clean
dh clean -Scmake
debian/rules override_dh_auto_clean
make[1]: Entering directory '/<<PKGBUILDDIR>>'
rm -f CMakeLists.txt
dh_auto_clean
make[1]: Leaving directory '/<<PKGBUILDDIR>>'
dh_clean -O-Scmake
debian/rules binary
dh binary -Scmake
dh_update_autotools_config -O-Scmake
dh_autoreconf -O-Scmake
debian/rules override_dh_auto_configure
make[1]: Entering directory '/<<PKGBUILDDIR>>'
ln -s contrib/CMakeLists.txt .
dh_auto_configure -- \
-DCMAKE_C_FLAGS="-g -O2 -Werror=implicit-function-declaration -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2" \
-DCMAKE_CXX_FLAGS="-g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2"
cd obj-aarch64-linux-gnu && DEB_PYTHON_INSTALL_LAYOUT=deb PKG_CONFIG=/usr/bin/pkg-config cmake -DCMAKE_INSTALL_PREFIX=/usr -DCMAKE_BUILD_TYPE=None -DCMAKE_INSTALL_SYSCONFDIR=/etc -DCMAKE_INSTALL_LOCALSTATEDIR=/var -DCMAKE_EXPORT_NO_PACKAGE_REGISTRY=ON -DCMAKE_FIND_USE_PACKAGE_REGISTRY=OFF -DCMAKE_FIND_PACKAGE_NO_PACKAGE_REGISTRY=ON -DFETCHCONTENT_FULLY_DISCONNECTED=ON -DCMAKE_INSTALL_RUNSTATEDIR=/run -DCMAKE_SKIP_INSTALL_ALL_DEPENDENCY=ON "-GUnix Makefiles" -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_INSTALL_LIBDIR=lib/aarch64-linux-gnu "-DCMAKE_C_FLAGS=-g -O2 -Werror=implicit-function-declaration -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2" "-DCMAKE_CXX_FLAGS=-g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2" ..
CMake Deprecation Warning at CMakeLists.txt:5 (cmake_minimum_required):
  Compatibility with CMake < 3.10 will be removed from a future version of
  CMake.

  Update the VERSION argument <min> value or use a ...<max> suffix to tell
  CMake that the project does not need compatibility with older versions.
-- The C compiler identification is GNU 14.2.0
-- The CXX compiler identification is GNU 14.2.0
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working C compiler: /usr/bin/cc - skipped
-- Detecting C compile features
-- Detecting C compile features - done
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Configuring done (1.0s)
-- Generating done (0.1s)
CMake Warning:
  Manually-specified variables were not used by the project:

    CMAKE_EXPORT_NO_PACKAGE_REGISTRY
    CMAKE_FIND_PACKAGE_NO_PACKAGE_REGISTRY
    CMAKE_FIND_USE_PACKAGE_REGISTRY
    FETCHCONTENT_FULLY_DISCONNECTED
-- Build files have been written to: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
make[1]: Leaving directory '/<<PKGBUILDDIR>>'
dh_auto_build -O-Scmake
cd obj-aarch64-linux-gnu && make -j8 "INSTALL=install --strip-program=true" VERBOSE=1
make[1]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
/usr/bin/cmake -S"/<<PKGBUILDDIR>>" -B"/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" --check-build-system CMakeFiles/Makefile.cmake 0
/usr/bin/cmake -E cmake_progress_start "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu//CMakeFiles/progress.marks"
make -f CMakeFiles/Makefile2 all
make[2]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make -f CMakeFiles/eight_bit_int_gemm.dir/build.make CMakeFiles/eight_bit_int_gemm.dir/depend
make -f CMakeFiles/benchmark.dir/build.make CMakeFiles/benchmark.dir/depend
make -f CMakeFiles/benchmark_all_sizes.dir/build.make CMakeFiles/benchmark_all_sizes.dir/depend
make -f CMakeFiles/test_math_helpers.dir/build.make CMakeFiles/test_math_helpers.dir/depend
make -f CMakeFiles/test_blocking_counter.dir/build.make CMakeFiles/test_blocking_counter.dir/depend
make -f CMakeFiles/test_allocator.dir/build.make CMakeFiles/test_allocator.dir/depend
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/eight_bit_int_gemm.dir/DependInfo.cmake" "--color="
make -f CMakeFiles/test_fixedpoint.dir/build.make CMakeFiles/test_fixedpoint.dir/depend
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/benchmark.dir/DependInfo.cmake" "--color="
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/benchmark_all_sizes.dir/DependInfo.cmake" "--color="
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/test_math_helpers.dir/DependInfo.cmake" "--color="
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/test_blocking_counter.dir/DependInfo.cmake" "--color="
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/test_allocator.dir/DependInfo.cmake" "--color="
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/test_fixedpoint.dir/DependInfo.cmake" "--color="
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make -f CMakeFiles/benchmark.dir/build.make CMakeFiles/benchmark.dir/build
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make -f CMakeFiles/benchmark_all_sizes.dir/build.make CMakeFiles/benchmark_all_sizes.dir/build
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make -f CMakeFiles/eight_bit_int_gemm.dir/build.make CMakeFiles/eight_bit_int_gemm.dir/build
make -f CMakeFiles/test_math_helpers.dir/build.make CMakeFiles/test_math_helpers.dir/build
make -f CMakeFiles/test_blocking_counter.dir/build.make CMakeFiles/test_blocking_counter.dir/build
make -f CMakeFiles/test_allocator.dir/build.make CMakeFiles/test_allocator.dir/build
make -f CMakeFiles/test_fixedpoint.dir/build.make CMakeFiles/test_fixedpoint.dir/build
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 5%] Building CXX object CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -DBENCHMARK_8bit -DBENCHMARK_QUICK -MD -MT CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -MF CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o.d -o CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -c "/<<PKGBUILDDIR>>/test/benchmark_all_sizes.cc"
[ 23%] Building CXX object CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o
[ 23%] Building CXX object CMakeFiles/benchmark.dir/test/benchmark.cc.o
[ 23%] Building CXX object CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o
[ 29%] Building CXX object CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o
[ 35%] Building CXX object CMakeFiles/test_allocator.dir/test/test_allocator.cc.o
[ 41%] Building CXX object CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o -MF CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o.d -o CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o -c "/<<PKGBUILDDIR>>/eight_bit_int_gemm/eight_bit_int_gemm.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -MF CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o.d -o CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -c "/<<PKGBUILDDIR>>/test/test_math_helpers.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/benchmark.dir/test/benchmark.cc.o -MF CMakeFiles/benchmark.dir/test/benchmark.cc.o.d -o CMakeFiles/benchmark.dir/test/benchmark.cc.o -c "/<<PKGBUILDDIR>>/test/benchmark.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -MF CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o.d -o CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -c "/<<PKGBUILDDIR>>/test/test_blocking_counter.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -MF CMakeFiles/test_allocator.dir/test/test_allocator.cc.o.d -o CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -c "/<<PKGBUILDDIR>>/test/test_allocator.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -MF CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o.d -o CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -c "/<<PKGBUILDDIR>>/test/test_fixedpoint.cc"
[ 47%] Linking CXX executable test_blocking_counter
/usr/bin/cmake -E cmake_link_script CMakeFiles/test_blocking_counter.dir/link.txt --verbose=1
[ 52%] Linking CXX executable test_allocator
/usr/bin/cmake -E cmake_link_script CMakeFiles/test_allocator.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/test_blocking_counter.dir/link.d CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -o test_blocking_counter -lpthread
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 52%] Built target test_blocking_counter
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/test_allocator.dir/link.d CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -o test_allocator
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 52%] Built target test_allocator
[ 58%] Linking CXX executable test_math_helpers
/usr/bin/cmake -E cmake_link_script CMakeFiles/test_math_helpers.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/test_math_helpers.dir/link.d CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -o test_math_helpers
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 58%] Built target test_math_helpers
[ 64%] Linking CXX executable benchmark_all_sizes
/usr/bin/cmake -E cmake_link_script CMakeFiles/benchmark_all_sizes.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/benchmark_all_sizes.dir/link.d CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -o benchmark_all_sizes -lpthread
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 64%] Built target benchmark_all_sizes
[ 70%] Linking CXX executable benchmark
/usr/bin/cmake -E cmake_link_script CMakeFiles/benchmark.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/benchmark.dir/link.d CMakeFiles/benchmark.dir/test/benchmark.cc.o -o benchmark -lpthread
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 70%] Built target benchmark
[ 76%] Linking CXX executable test_fixedpoint
/usr/bin/cmake -E cmake_link_script CMakeFiles/test_fixedpoint.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/test_fixedpoint.dir/link.d CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -o test_fixedpoint
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 76%] Built target test_fixedpoint
[ 82%] Linking CXX static library libeight_bit_int_gemm.a
/usr/bin/cmake -P CMakeFiles/eight_bit_int_gemm.dir/cmake_clean_target.cmake
/usr/bin/cmake -E cmake_link_script CMakeFiles/eight_bit_int_gemm.dir/link.txt --verbose=1
/usr/bin/ar qc libeight_bit_int_gemm.a CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o
/usr/bin/ranlib libeight_bit_int_gemm.a
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 82%] Built target eight_bit_int_gemm
make -f CMakeFiles/test_gemmlowp.dir/build.make CMakeFiles/test_gemmlowp.dir/depend
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
cd "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles/test_gemmlowp.dir/DependInfo.cmake" "--color="
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make -f CMakeFiles/test_gemmlowp.dir/build.make CMakeFiles/test_gemmlowp.dir/build
make[3]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[ 88%] Building CXX object CMakeFiles/test_gemmlowp.dir/test/test.cc.o
[ 94%] Building CXX object CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_gemmlowp.dir/test/test.cc.o -MF CMakeFiles/test_gemmlowp.dir/test/test.cc.o.d -o CMakeFiles/test_gemmlowp.dir/test/test.cc.o -c "/<<PKGBUILDDIR>>/test/test.cc"
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -MF CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o.d -o CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -c "/<<PKGBUILDDIR>>/test/test_data.cc"
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 1>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 1>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In file included from /usr/include/stdio.h:970,
from /usr/include/c++/14/cstdio:42,
from /usr/include/c++/14/ext/string_conversions.h:45,
from /usr/include/c++/14/bits/basic_string.h:4154,
from /usr/include/c++/14/string:54,
from /usr/include/c++/14/bits/locale_classes.h:40,
from /usr/include/c++/14/bits/ios_base.h:41,
from /usr/include/c++/14/ios:44,
from /usr/include/c++/14/ostream:40,
from /usr/include/c++/14/iostream:41,
from /<<PKGBUILDDIR>>/test/test.h:26,
from /<<PKGBUILDDIR>>/test/test.cc:15:
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 1>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 1>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 4>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 5> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 4>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 5> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 4, gemmlowp::CellOrder::DepthMajor>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 4, gemmlowp::CellOrder::DepthMajor>, 3> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 4, gemmlowp::CellOrder::DepthMajor>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 4, gemmlowp::CellOrder::DepthMajor>, 3> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 4, gemmlowp::CellOrder::WidthMajor>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 4, gemmlowp::CellOrder::WidthMajor>, 3> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 4, gemmlowp::CellOrder::WidthMajor>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 4, gemmlowp::CellOrder::WidthMajor>, 3> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 2, gemmlowp::CellOrder::WidthMajor>, 3>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 2, gemmlowp::CellOrder::WidthMajor>, 3>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 2, gemmlowp::CellOrder::DepthMajor>, 3>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2, gemmlowp::CellOrder::WidthMajor>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<5, 2, gemmlowp::CellOrder::DepthMajor>, 3>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 2, gemmlowp::CellOrder::WidthMajor>, 2> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<8, 8, gemmlowp::CellOrder::Diagonal>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 8, gemmlowp::CellOrder::WidthMajor>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<8, 8, gemmlowp::CellOrder::Diagonal>, 2>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<3, 8, gemmlowp::CellOrder::WidthMajor>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
/<<PKGBUILDDIR>>/test/test.cc: In static member function ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 4, gemmlowp::CellOrder::DepthMajor>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 4, gemmlowp::CellOrder::Diagonal>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’:
/<<PKGBUILDDIR>>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=]
163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name());
| ^~
In function ‘int snprintf(char*, size_t, const char*, ...)’,
inlined from ‘static const char* gemmlowp::MultiThreadGemmWrapper<Kernel, Scalar, tBitDepthParams>::Name() [with Kernel = gemmlowp::ReferenceKernel<gemmlowp::KernelFormat<gemmlowp::KernelSideFormat<gemmlowp::CellFormat<1, 4, gemmlowp::CellOrder::DepthMajor>, 1>, gemmlowp::KernelSideFormat<gemmlowp::CellFormat<4, 4, gemmlowp::CellOrder::Diagonal>, 1> > >; Scalar = unsigned char; tBitDepthParams = gemmlowp::BitDepthParams<gemmlowp::OperandRange<0, 255>, gemmlowp::OperandRange<0, 255> >]’ at /<<PKGBUILDDIR>>/test/test.cc:163:13:
/usr/include/aarch64-linux-gnu/bits/stdio2.h:68:35: note: ‘__builtin___snprintf_chk’ output between 26 and 281 bytes into a destination of size 256
68 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
| ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
69 | __glibc_objsize (__s), __fmt,
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
70 | __va_arg_pack ());
| ~~~~~~~~~~~~~~~~~
[100%] Linking CXX executable test_gemmlowp
/usr/bin/cmake -E cmake_link_script CMakeFiles/test_gemmlowp.dir/link.txt --verbose=1
/usr/bin/c++ -g -O2 -ffile-prefix-map=/<<PKGBUILDDIR>>=. -fstack-protector-strong -fstack-clash-protection -Wformat -Werror=format-security -mbranch-protection=standard -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-z,relro -Wl,-z,now -Wl,--as-needed -Wl,--dependency-file,CMakeFiles/test_gemmlowp.dir/link.d CMakeFiles/test_gemmlowp.dir/test/test.cc.o CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -o test_gemmlowp libeight_bit_int_gemm.a -lpthread
make[3]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
[100%] Built target test_gemmlowp
make[2]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
/usr/bin/cmake -E cmake_progress_start "/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/CMakeFiles" 0
make[1]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
dh_auto_test -O-Scmake
cd obj-aarch64-linux-gnu && make -j8 test ARGS\+=--verbose ARGS\+=-j8
make[1]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
Running tests...
/usr/bin/ctest --force-new-ctest-process --verbose -j8
UpdateCTestConfiguration from :/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/DartConfiguration.tcl
Parse Config file:/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/DartConfiguration.tcl
UpdateCTestConfiguration from :/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/DartConfiguration.tcl
Parse Config file:/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/DartConfiguration.tcl
Test project /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
Constructing a list of tests
Done constructing a list of tests
Updating test list for fixtures
Added 0 tests to meet fixture requirements
Checking test dependency graph...
Checking test dependency graph end
test 1
Start 1: test_math_helpers
1: Test command: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/test_math_helpers
1: Working Directory: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
1: Test timeout computed to be: 1500
test 2
Start 2: test_blocking_counter
2: Test command: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/test_blocking_counter
2: Working Directory: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
2: Test timeout computed to be: 1500
test 3
Start 3: test_allocator
3: Test command: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/test_allocator
3: Working Directory: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
3: Test timeout computed to be: 1500
test 4
Start 4: test_fixedpoint
4: Test command: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/test_fixedpoint
4: Working Directory: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
4: Test timeout computed to be: 1500
test 5
Start 5: test_gemmlowp
5: Test command: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu/test_gemmlowp
5: Working Directory: /<<PKGBUILDDIR>>/obj-aarch64-linux-gnu
5: Test timeout computed to be: 1500
1/5 Test #3: test_allocator ................... Passed 0.01 sec
2/5 Test #1: test_math_helpers ................ Passed 0.01 sec
5: TestWithSmallData: PASS
5: number of matrix entries: 8
5: median value: 136
5: median unsigned diff: 0 (tolerating 0)
5: max unsigned diff: 0 (tolerating 0)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
3/5 Test #2: test_blocking_counter ............ Passed 0.03 sec
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
4: PASS (Scalar int32)
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
4: PASS (Scalar int16)
4: PASS (NEON int32x4_t)
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
4: PASS (NEON int16x8_t)
4/5 Test #4: test_fixedpoint .................. Passed 0.26 sec
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 8
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 8
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 8
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 8
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 4x4, depth 16, accumulating two within signed int16, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 10
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 6
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: NEON, 12x8, depth 2, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 6
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 10
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16
5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18
5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18
5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 10/0/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/10/0, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/10, mult 1, shift 12
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/0, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 10/10/10, mult 10, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 256/1/17, mult 4, shift 16
5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 24
5: TestWithRealData: PASS with Lhs: 8 bit, Rhs: 8 bit
5: number of matrix entries: 49152
5: median value: 104
5: median unsigned diff: 0 (tolerating 0)
5: max unsigned diff: 0 (tolerating 0)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: TestWithRealData: PASS with (legacy, no longer requantizing) Lhs: 7 bit, Rhs: 5 bit
5: number of matrix entries: 49152
5: median value: 104
5: median unsigned diff: 0 (tolerating 2)
5: max unsigned diff: 0 (tolerating 10)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0.2)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: TestOutputStages: PASS with ResultOrder=RowMajor
5: TestOutputStages: PASS with ResultOrder=ColMajor
5: TestOutputStages: PASS with ResultOrder=RowMajor
5: TestOutputStages: PASS with ResultOrder=ColMajor
5: TestOutputStages: PASS with ResultOrder=RowMajor
5: TestOutputStages: PASS with ResultOrder=ColMajor
5: TestOutputStages: PASS with ResultOrder=RowMajor
5: TestOutputStages: PASS with ResultOrder=ColMajor
5: TestWithSmallDataPerChannelQuantization: PASS
5: number of matrix entries: 18
5: median value: 127
5: median unsigned diff: 0 (tolerating 0)
5: max unsigned diff: 0 (tolerating 0)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: TestWithLargeDataPerChannelQuantization: PASS
5: number of matrix entries: 550
5: median value: 7
5: median unsigned diff: 0 (tolerating 0)
5: max unsigned diff: 0 (tolerating 0)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: TestMultithreadedPerChannelQuantization: PASS
5: number of matrix entries: 1280
5: median value: 0
5: median unsigned diff: 0 (tolerating 0)
5: max unsigned diff: 0 (tolerating 0)
5: median signed diff: 0 (tolerating 0)
5: mean signed diff: 0 (tolerating 0)
5: No error: 100.00 % of entries
5: Error in 1..1 range: 0.00 % of entries
5: Error in 2..3 range: 0.00 % of entries
5: Error in 4..7 range: 0.00 % of entries
5: Error in 8..15 range: 0.00 % of entries
5: Error in 16..31 range: 0.00 % of entries
5: Error in 32..63 range: 0.00 % of entries
5: Error in 64..127 range: 0.00 % of entries
5: Error in 128..255 range: 0.00 % of entries
5: All tests passed.
5/5 Test #5: test_gemmlowp .................... Passed 80.76 sec
100% tests passed, 0 tests failed out of 5
Total Test time (real) = 80.78 sec
make[1]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
create-stamp debian/debhelper-build-stamp
dh_prep -O-Scmake
dh_auto_install --destdir=debian/libgemmlowp-dev/ -O-Scmake
cd obj-aarch64-linux-gnu && make -j8 install DESTDIR=/<<BUILDDIR>>/gemmlowp-0.0\~git20211220.e844ffd/debian/libgemmlowp-dev AM_UPDATE_INFO_DIR=no "INSTALL=install --strip-program=true"
make[1]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
/usr/bin/cmake -S"/<<PKGBUILDDIR>>" -B"/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu" --check-build-system CMakeFiles/Makefile.cmake 0
make -f CMakeFiles/Makefile2 preinstall
make[2]: Entering directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
make[2]: Nothing to be done for 'preinstall'.
make[2]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
Install the project...
/usr/bin/cmake -P cmake_install.cmake
-- Install configuration: "None"
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/eight_bit_int_gemm/eight_bit_int_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/base.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_common.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_gemv.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_operations_common.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_single_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_common.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_transform.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels_arm_32.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels_arm_64.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/single_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/single_thread_transform.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams_arm_32.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams_arm_64.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels_arm_32.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels_arm_64.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/bit_depth.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/gemmlowp.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/map.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/output_stages.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/instrumentation.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/profiler.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/pthread_everywhere.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/allocator.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/block_params.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/common.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/compute.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/detect_platform.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/dispatch_gemm_shape.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_avx.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_default.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_msa.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_neon.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_reference.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/multi_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_avx.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_msa.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_neon.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_avx.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_msa.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_neon.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/platform.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_common_neon_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_msa.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_neon.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/single_thread_gemm.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/unpack.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_avx.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_msa.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_neon.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_sse.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_wasmsimd.h
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/lib/aarch64-linux-gnu/libeight_bit_int_gemm.a
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config.cmake
-- Installing: /<<PKGBUILDDIR>>/debian/libgemmlowp-dev/usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config-none.cmake
make[1]: Leaving directory '/<<PKGBUILDDIR>>/obj-aarch64-linux-gnu'
dh_install -O-Scmake
debian/rules override_dh_installdocs
make[1]: Entering directory '/<<PKGBUILDDIR>>'
mkdir -p debian/libgemmlowp-dev/usr/share/doc/libgemmlowp-dev/meta/
install meta/README debian/libgemmlowp-dev/usr/share/doc/libgemmlowp-dev/meta/
dh_installdocs
make[1]: Leaving directory '/<<PKGBUILDDIR>>'
dh_installchangelogs -O-Scmake
dh_installexamples -O-Scmake
dh_installinit -O-Scmake
dh_perl -O-Scmake
dh_link -O-Scmake
dh_strip_nondeterminism -O-Scmake
dh_compress -O-Scmake
dh_fixperms -O-Scmake
dh_missing -O-Scmake
dh_dwz -a -O-Scmake
dh_strip -a -O-Scmake
dh_makeshlibs -a -O-Scmake
dh_shlibdeps -a -O-Scmake
dh_installdeb -O-Scmake
dh_gencontrol -O-Scmake
dh_md5sums -O-Scmake
dh_builddeb -O-Scmake
dpkg-deb: building package 'libgemmlowp-dev' in '../libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb'.
dpkg-genbuildinfo --build=binary -O../gemmlowp_0.0~git20211220.e844ffd-1_arm64.buildinfo
dpkg-genchanges --build=binary -O../gemmlowp_0.0~git20211220.e844ffd-1_arm64.changes
dpkg-genchanges: info: binary-only upload (no source code included)
dpkg-source --after-build .
dpkg-buildpackage: info: binary-only upload (no source included)
--------------------------------------------------------------------------------
Build finished at 2024-11-16T07:13:26Z
Finished
--------
I: Built successfully
+------------------------------------------------------------------------------+
| Changes |
+------------------------------------------------------------------------------+
gemmlowp_0.0~git20211220.e844ffd-1_arm64.changes:
-------------------------------------------------
Format: 1.8
Date: Thu, 23 Jun 2022 22:56:13 -0700
Source: gemmlowp
Binary: libgemmlowp-dev
Architecture: arm64
Version: 0.0~git20211220.e844ffd-1
Distribution: sid
Urgency: medium
Maintainer: Debian Science Maintainers <debian-science-maintainers@lists.alioth.debian.org>
Changed-By: Mo Zhou <lumin@debian.org>
Description:
libgemmlowp-dev - small self-contained low-precision GEMM library
Changes:
gemmlowp (0.0~git20211220.e844ffd-1) unstable; urgency=medium
.
* New upstream version 0.0~git20211220.e844ffd
Checksums-Sha1:
7de4c6a7d30f3d5072e8d4afdeb03965e83a712d 5678 gemmlowp_0.0~git20211220.e844ffd-1_arm64.buildinfo
46792dde8a77ae79239920dc970060a582b6314c 569380 libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
Checksums-Sha256:
887312de14844cb2a10e3a4549d33bc8787595344e279a6f8b4447829baa6637 5678 gemmlowp_0.0~git20211220.e844ffd-1_arm64.buildinfo
8a6715da5b9b71aad3af8a2f67b96eea13858e155bedd282be4cd9067d3ed3f7 569380 libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
Files:
fde5a6c8d6cb8eb3f54e7ea566c404ea 5678 science optional gemmlowp_0.0~git20211220.e844ffd-1_arm64.buildinfo
0011b646ba3cb426a901d717c27db280 569380 libdevel optional libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
+------------------------------------------------------------------------------+
| Buildinfo |
+------------------------------------------------------------------------------+
Format: 1.0
Source: gemmlowp
Binary: libgemmlowp-dev
Architecture: arm64
Version: 0.0~git20211220.e844ffd-1
Checksums-Md5:
0011b646ba3cb426a901d717c27db280 569380 libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
Checksums-Sha1:
46792dde8a77ae79239920dc970060a582b6314c 569380 libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
Checksums-Sha256:
8a6715da5b9b71aad3af8a2f67b96eea13858e155bedd282be4cd9067d3ed3f7 569380 libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
Build-Origin: Debian
Build-Architecture: arm64
Build-Date: Sat, 16 Nov 2024 07:13:26 +0000
Build-Path: /<<PKGBUILDDIR>>
Build-Tainted-By:
merged-usr-via-aliased-dirs
Installed-Build-Depends:
autoconf (= 2.72-3),
automake (= 1:1.16.5-1.3),
autopoint (= 0.22.5-2),
autotools-dev (= 20220109.1),
base-files (= 13.5),
base-passwd (= 3.6.5),
bash (= 5.2.32-1+b2),
binutils (= 2.43.1-5),
binutils-aarch64-linux-gnu (= 2.43.1-5),
binutils-common (= 2.43.1-5),
bsdextrautils (= 2.40.2-11),
bsdutils (= 1:2.40.2-11),
build-essential (= 12.12),
bzip2 (= 1.0.8-6),
cmake (= 3.31.0-1),
cmake-data (= 3.31.0-1),
coreutils (= 9.5-1+b1),
cpp (= 4:14.2.0-1),
cpp-14 (= 14.2.0-8),
cpp-14-aarch64-linux-gnu (= 14.2.0-8),
cpp-aarch64-linux-gnu (= 4:14.2.0-1),
dash (= 0.5.12-9+b1),
debconf (= 1.5.87),
debhelper (= 13.20),
debianutils (= 5.21),
dh-autoreconf (= 20),
dh-strip-nondeterminism (= 1.14.0-1),
diffutils (= 1:3.10-1+b1),
dpkg (= 1.22.12~1.gbp82cafd),
dpkg-dev (= 1.22.12~1.gbp82cafd),
dwz (= 0.15-1+b1),
file (= 1:5.45-3+b1),
findutils (= 4.10.0-3),
g++ (= 4:14.2.0-1),
g++-14 (= 14.2.0-8),
g++-14-aarch64-linux-gnu (= 14.2.0-8),
g++-aarch64-linux-gnu (= 4:14.2.0-1),
gcc (= 4:14.2.0-1),
gcc-14 (= 14.2.0-8),
gcc-14-aarch64-linux-gnu (= 14.2.0-8),
gcc-14-base (= 14.2.0-8),
gcc-aarch64-linux-gnu (= 4:14.2.0-1),
gettext (= 0.22.5-2),
gettext-base (= 0.22.5-2),
grep (= 3.11-4+b1),
groff-base (= 1.23.0-5),
gzip (= 1.12-1.1+b1),
hostname (= 3.25),
init-system-helpers (= 1.67),
intltool-debian (= 0.35.0+20060710.6),
libacl1 (= 2.3.2-2+b1),
libarchive-zip-perl (= 1.68-1),
libarchive13t64 (= 3.7.4-1.1),
libasan8 (= 14.2.0-8),
libatomic1 (= 14.2.0-8),
libattr1 (= 1:2.5.2-2),
libaudit-common (= 1:4.0.2-2),
libaudit1 (= 1:4.0.2-2),
libbinutils (= 2.43.1-5),
libblkid1 (= 2.40.2-11),
libbrotli1 (= 1.1.0-2+b6),
libbz2-1.0 (= 1.0.8-6),
libc-bin (= 2.40-3),
libc-dev-bin (= 2.40-3),
libc6 (= 2.40-3),
libc6-dev (= 2.40-3),
libcap-ng0 (= 0.8.5-3+b1),
libcap2 (= 1:2.66-5+b1),
libcc1-0 (= 14.2.0-8),
libcom-err2 (= 1.47.1-1+b1),
libcrypt-dev (= 1:4.4.36-5),
libcrypt1 (= 1:4.4.36-5),
libctf-nobfd0 (= 2.43.1-5),
libctf0 (= 2.43.1-5),
libcurl4t64 (= 8.11.0-1),
libdb5.3t64 (= 5.3.28+dfsg2-9),
libdebconfclient0 (= 0.273),
libdebhelper-perl (= 13.20),
libdpkg-perl (= 1.22.12~1.gbp82cafd),
libelf1t64 (= 0.192-4),
libexpat1 (= 2.6.4-1),
libffi8 (= 3.4.6-1),
libfile-stripnondeterminism-perl (= 1.14.0-1),
libgcc-14-dev (= 14.2.0-8),
libgcc-s1 (= 14.2.0-8),
libgdbm-compat4t64 (= 1.24-2),
libgdbm6t64 (= 1.24-2),
libgmp10 (= 2:6.3.0+dfsg-2+b2),
libgnutls30t64 (= 3.8.8-2),
libgomp1 (= 14.2.0-8),
libgprofng0 (= 2.43.1-5),
libgssapi-krb5-2 (= 1.21.3-3),
libhogweed6t64 (= 3.10-1+b1),
libhwasan0 (= 14.2.0-8),
libicu72 (= 72.1-5+b1),
libidn2-0 (= 2.3.7-2+b1),
libisl23 (= 0.27-1),
libitm1 (= 14.2.0-8),
libjansson4 (= 2.14-2+b3),
libjsoncpp26 (= 1.9.6-2),
libk5crypto3 (= 1.21.3-3),
libkeyutils1 (= 1.6.3-4),
libkrb5-3 (= 1.21.3-3),
libkrb5support0 (= 1.21.3-3),
libldap-2.5-0 (= 2.5.18+dfsg-3+b1),
liblsan0 (= 14.2.0-8),
liblz4-1 (= 1.9.4-3+b1),
liblzma5 (= 5.6.3-1+b1),
libmagic-mgc (= 1:5.45-3+b1),
libmagic1t64 (= 1:5.45-3+b1),
libmd0 (= 1.1.0-2+b1),
libmount1 (= 2.40.2-11),
libmpc3 (= 1.3.1-1+b3),
libmpfr6 (= 4.2.1-1+b2),
libncursesw6 (= 6.5-2+b1),
libnettle8t64 (= 3.10-1+b1),
libnghttp2-14 (= 1.64.0-1),
libp11-kit0 (= 0.25.5-2+b1),
libpam-modules (= 1.5.3-7+b1),
libpam-modules-bin (= 1.5.3-7+b1),
libpam-runtime (= 1.5.3-7),
libpam0g (= 1.5.3-7+b1),
libpcre2-8-0 (= 10.44-4),
libperl5.40 (= 5.40.0-7),
libpipeline1 (= 1.5.8-1),
libproc2-0 (= 2:4.0.4-6),
libpsl5t64 (= 0.21.2-1.1+b1),
librhash1 (= 1.4.5-1),
librtmp1 (= 2.4+20151223.gitfa8646d.1-2+b5),
libsasl2-2 (= 2.1.28+dfsg1-8),
libsasl2-modules-db (= 2.1.28+dfsg1-8),
libseccomp2 (= 2.5.5-1+b3),
libselinux1 (= 3.7-3+b1),
libsframe1 (= 2.43.1-5),
libsmartcols1 (= 2.40.2-11),
libssh2-1t64 (= 1.11.1-1),
libssl3t64 (= 3.3.2-2),
libstdc++-14-dev (= 14.2.0-8),
libstdc++6 (= 14.2.0-8),
libsystemd0 (= 257~rc1-4),
libtasn1-6 (= 4.19.0-3+b3),
libtinfo6 (= 6.5-2+b1),
libtool (= 2.4.7-8),
libtsan2 (= 14.2.0-8),
libubsan1 (= 14.2.0-8),
libuchardet0 (= 0.0.8-1+b2),
libudev1 (= 257~rc1-4),
libunistring5 (= 1.2-1+b1),
libuuid1 (= 2.40.2-11),
libuv1t64 (= 1.48.0-7),
libxml2 (= 2.12.7+dfsg+really2.9.14-0.2+b1),
libxxhash0 (= 0.8.2-2+b2),
libzstd1 (= 1.5.6+dfsg-1+b1),
linux-libc-dev (= 6.11.7-1),
m4 (= 1.4.19-4),
make (= 4.3-4.1+b1),
man-db (= 2.13.0-1),
mawk (= 1.3.4.20240905-1),
ncurses-base (= 6.5-2),
ncurses-bin (= 6.5-2+b1),
openssl-provider-legacy (= 3.3.2-2),
patch (= 2.7.6-7+b1),
perl (= 5.40.0-7),
perl-base (= 5.40.0-7),
perl-modules-5.40 (= 5.40.0-7),
po-debconf (= 1.0.21+nmu1),
procps (= 2:4.0.4-6),
rpcsvc-proto (= 1.4.3-1+b1),
sed (= 4.9-2+b1),
sensible-utils (= 0.0.24),
sysvinit-utils (= 3.11-1),
tar (= 1.35+dfsg-3+b1),
util-linux (= 2.40.2-11),
xz-utils (= 5.6.3-1+b1),
zlib1g (= 1:1.3.dfsg+really1.3.1-1+b1)
Environment:
DEB_BUILD_OPTIONS="parallel=8"
LANG="en_US.UTF-8"
LC_ALL="C.UTF-8"
SOURCE_DATE_EPOCH="1656050173"
+------------------------------------------------------------------------------+
| Package contents |
+------------------------------------------------------------------------------+
libgemmlowp-dev_0.0~git20211220.e844ffd-1_arm64.deb
---------------------------------------------------
new Debian package, version 2.0.
size 569380 bytes: control archive=4184 bytes.
1057 bytes, 24 lines control
10120 bytes, 116 lines md5sums
Package: libgemmlowp-dev
Source: gemmlowp
Version: 0.0~git20211220.e844ffd-1
Architecture: arm64
Maintainer: Debian Science Maintainers <debian-science-maintainers@lists.alioth.debian.org>
Installed-Size: 5940
Section: libdevel
Priority: optional
Multi-Arch: foreign
Homepage: https://github.com/google/gemmlowp
Description: small self-contained low-precision GEMM library
This is not a full linear algebra library, only a GEMM library: it only does
general matrix multiplication ("GEMM").
.
Its performance goals differ from typical GEMM performance goals in the
following ways:
1. It cares not only about speed, but also about minimizing power usage.
It specifically cares about charge usage in mobile/embedded devices.
2. Most GEMMs are optimized primarily for large dense matrix sizes (>= 1000).
It does care about large sizes, but it also cares specifically about the
typically smaller matrix sizes encountered in various mobile applications.
.
Keep in mind (previous section) that gemmlowp itself is a pure-headers-only
library.
drwxr-xr-x root/root 0 2022-06-24 05:56 ./
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/eight_bit_int_gemm/
-rw-r--r-- root/root 3418 2021-12-20 17:33 ./usr/include/gemmlowp/eight_bit_int_gemm/eight_bit_int_gemm.h
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/
-rw-r--r-- root/root 35769 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint.h
-rw-r--r-- root/root 11184 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_avx.h
-rw-r--r-- root/root 12541 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_msa.h
-rw-r--r-- root/root 9073 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_neon.h
-rw-r--r-- root/root 11142 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_sse.h
-rw-r--r-- root/root 11257 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_wasmsimd.h
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/internal/
-rw-r--r-- root/root 6328 2021-12-20 17:33 ./usr/include/gemmlowp/internal/allocator.h
-rw-r--r-- root/root 6768 2021-12-20 17:33 ./usr/include/gemmlowp/internal/block_params.h
-rw-r--r-- root/root 6676 2021-12-20 17:33 ./usr/include/gemmlowp/internal/common.h
-rw-r--r-- root/root 4299 2021-12-20 17:33 ./usr/include/gemmlowp/internal/compute.h
-rw-r--r-- root/root 4996 2021-12-20 17:33 ./usr/include/gemmlowp/internal/detect_platform.h
-rw-r--r-- root/root 8036 2021-12-20 17:33 ./usr/include/gemmlowp/internal/dispatch_gemm_shape.h
-rw-r--r-- root/root 9218 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel.h
-rw-r--r-- root/root 19165 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_avx.h
-rw-r--r-- root/root 4847 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_default.h
-rw-r--r-- root/root 23748 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_msa.h
-rw-r--r-- root/root 75739 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_neon.h
-rw-r--r-- root/root 4837 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_reference.h
-rw-r--r-- root/root 18968 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_sse.h
-rw-r--r-- root/root 28223 2021-12-20 17:33 ./usr/include/gemmlowp/internal/multi_thread_gemm.h
-rw-r--r-- root/root 22604 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output.h
-rw-r--r-- root/root 763 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_avx.h
-rw-r--r-- root/root 44686 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_msa.h
-rw-r--r-- root/root 35994 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_neon.h
-rw-r--r-- root/root 20105 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_sse.h
-rw-r--r-- root/root 17975 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack.h
-rw-r--r-- root/root 11519 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_avx.h
-rw-r--r-- root/root 18939 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_msa.h
-rw-r--r-- root/root 15055 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_neon.h
-rw-r--r-- root/root 4972 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_sse.h
-rw-r--r-- root/root 2977 2021-12-20 17:33 ./usr/include/gemmlowp/internal/platform.h
-rw-r--r-- root/root 25588 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers.h
-rw-r--r-- root/root 31387 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_common_neon_sse.h
-rw-r--r-- root/root 5642 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_msa.h
-rw-r--r-- root/root 19188 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_neon.h
-rw-r--r-- root/root 4257 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_sse.h
-rw-r--r-- root/root 5586 2021-12-20 17:33 ./usr/include/gemmlowp/internal/single_thread_gemm.h
-rw-r--r-- root/root 12594 2021-12-20 17:33 ./usr/include/gemmlowp/internal/unpack.h
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/meta/
-rw-r--r-- root/root 3960 2021-12-20 17:33 ./usr/include/gemmlowp/meta/base.h
-rw-r--r-- root/root 5384 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_common.h
-rw-r--r-- root/root 11396 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_gemm.h
-rw-r--r-- root/root 6992 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_gemv.h
-rw-r--r-- root/root 1850 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_operations_common.h
-rw-r--r-- root/root 9600 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_single_thread_gemm.h
-rw-r--r-- root/root 1593 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_common.h
-rw-r--r-- root/root 5253 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_gemm.h
-rw-r--r-- root/root 3519 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_transform.h
-rw-r--r-- root/root 5759 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels.h
-rw-r--r-- root/root 131368 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels_arm_32.h
-rw-r--r-- root/root 130137 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels_arm_64.h
-rw-r--r-- root/root 25668 2021-12-20 17:33 ./usr/include/gemmlowp/meta/single_thread_gemm.h
-rw-r--r-- root/root 2957 2021-12-20 17:33 ./usr/include/gemmlowp/meta/single_thread_transform.h
-rw-r--r-- root/root 11049 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams.h
-rw-r--r-- root/root 390785 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams_arm_32.h
-rw-r--r-- root/root 410715 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams_arm_64.h
-rw-r--r-- root/root 7317 2021-12-20 17:33 ./usr/include/gemmlowp/meta/transform_kernels.h
-rw-r--r-- root/root 247365 2021-12-20 17:33 ./usr/include/gemmlowp/meta/transform_kernels_arm_32.h
-rw-r--r-- root/root 260838 2021-12-20 17:33 ./usr/include/gemmlowp/meta/transform_kernels_arm_64.h
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/
-rw-r--r-- root/root 6543 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/instrumentation.h
-rw-r--r-- root/root 11853 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/profiler.h
-rw-r--r-- root/root 3313 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/pthread_everywhere.h
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/public/
-rw-r--r-- root/root 2616 2021-12-20 17:33 ./usr/include/gemmlowp/public/bit_depth.h
-rw-r--r-- root/root 4314 2021-12-20 17:33 ./usr/include/gemmlowp/public/gemmlowp.h
-rw-r--r-- root/root 4422 2021-12-20 17:33 ./usr/include/gemmlowp/public/map.h
-rw-r--r-- root/root 11283 2021-12-20 17:33 ./usr/include/gemmlowp/public/output_stages.h
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/cmake/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/cmake/gemmlowp/
-rw-r--r-- root/root 958 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config-none.cmake
-rw-r--r-- root/root 4910 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config.cmake
-rw-r--r-- root/root 1076426 2022-06-24 05:56 ./usr/lib/aarch64-linux-gnu/libeight_bit_int_gemm.a
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/
-rw-r--r-- root/root 390 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/AUTHORS
-rw-r--r-- root/root 1977 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/CONTRIBUTING
-rw-r--r-- root/root 1208 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/CONTRIBUTORS
-rw-r--r-- root/root 3774 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/README.md.gz
-rw-r--r-- root/root 463 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/changelog.Debian.gz
-rw-r--r-- root/root 1864 2022-06-24 05:51 ./usr/share/doc/libgemmlowp-dev/copyright
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/doc/
-rw-r--r-- root/root 2479 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/design.md.gz
-rw-r--r-- root/root 2399 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/kernel.md.gz
-rw-r--r-- root/root 5820 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/less-than-8-bit.md.gz
-rw-r--r-- root/root 3106 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/low-precision.md.gz
-rw-r--r-- root/root 2022 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/output.md
-rw-r--r-- root/root 3499 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/packing.md.gz
-rw-r--r-- root/root 2650 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/public.md.gz
-rw-r--r-- root/root 4673 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/quantization.md.gz
-rw-r--r-- root/root 4572 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/quantization_example.cc.gz
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/examples/
-rw-r--r-- root/root 15511 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/eight_bit_int_gemm.cc
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/
-rw-r--r-- root/root 12806 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark.cc
-rw-r--r-- root/root 11382 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark_all_sizes.cc
-rw-r--r-- root/root 10839 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark_meta_gemm.cc
-rw-r--r-- root/root 12064 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/correctness_meta_gemm.cc
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test.xcodeproj/
-rw-r--r-- root/root 29285 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test.xcodeproj/project.pbxproj
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/
-rw-r--r-- root/root 279 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/AppDelegate.h
-rw-r--r-- root/root 2149 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/AppDelegate.mm
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/
-rw-r--r-- root/root 3708 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/LaunchScreen.xib
-rw-r--r-- root/root 1575 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/Main.storyboard
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/AppIcon.appiconset/
-rw-r--r-- root/root 1077 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/AppIcon.appiconset/Contents.json
-rw-r--r-- root/root 1511 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Info.plist
-rw-r--r-- root/root 219 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/ViewController.h
-rw-r--r-- root/root 492 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/ViewController.m
-rw-r--r-- root/root 334 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/main.m
-rw-r--r-- root/root 75539 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test.cc
-rw-r--r-- root/root 4448 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test.h
-rw-r--r-- root/root 2115 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_allocator.cc
-rw-r--r-- root/root 4408 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_blocking_counter.cc
-rw-r--r-- root/root 2296359 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_data.cc
-rw-r--r-- root/root 1288 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_data.h
-rw-r--r-- root/root 22285 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_fixedpoint.cc
-rw-r--r-- root/root 4144 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_math_helpers.cc
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/meta/
-rw-r--r-- root/root 3657 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/meta/README
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/todo/
-rw-r--r-- root/root 1605 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/armv8-64bit-kernel-for-less-than-8-bit.txt
-rw-r--r-- root/root 3277 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/error-diffusion-experiments.txt
-rw-r--r-- root/root 6232 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/fast-gemv.txt.gz
-rw-r--r-- root/root 962 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/less-than-8-bit-without-requantization.txt
-rw-r--r-- root/root 2338 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/multi-threading-experiments.txt.gz
-rw-r--r-- root/root 1114 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/neon-depth-major-sources-packing.txt
-rw-r--r-- root/root 802 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/remove-default-template-param-values.txt
-rw-r--r-- root/root 1666 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/x86-kernels.txt
lintian
-------
Setup apt archive
-----------------
Merged Build-Depends: lintian
Filtered Build-Depends: lintian
dpkg-deb: warning: root directory has unusual owner or group 998:999.
Hint: either pass --root-owner-group, see dpkg-build-api(7) or add an explicit 'Rules-Requires-Root: no' in debian/control.
dpkg-deb: warning: ignoring 1 warning about the control file(s)
dpkg-deb: building package 'sbuild-build-depends-lintian-dummy' in '/<<RESOLVERDIR>>/apt_archive/sbuild-build-depends-lintian-dummy.deb'.
Ign:1 copy:/<<RESOLVERDIR>>/apt_archive ./ InRelease
Get:2 copy:/<<RESOLVERDIR>>/apt_archive ./ Release [615 B]
Ign:3 copy:/<<RESOLVERDIR>>/apt_archive ./ Release.gpg
Get:4 copy:/<<RESOLVERDIR>>/apt_archive ./ Sources [1206 B]
Get:5 copy:/<<RESOLVERDIR>>/apt_archive ./ Packages [1268 B]
Fetched 3089 B in 0s (247 kB/s)
Reading package lists...
Get:1 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ InRelease
Ign:1 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ InRelease
Get:2 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:2 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release [606 B]
Get:3 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release.gpg
Ign:3 file:/<<BUILDDIR>>/resolver-DxNUNH/apt_archive ./ Release.gpg
Reading package lists...
Reading package lists...
Install lintian build dependencies (apt-based resolver)
-------------------------------------------------------
Installing build dependencies
Reading package lists...
Building dependency tree...
Reading state information...
The following NEW packages will be installed:
sbuild-build-depends-lintian-dummy
0 upgraded, 1 newly installed, 0 to remove and 0 not upgraded.
Need to get 852 B of archives.
After this operation, 0 B of additional disk space will be used.
Get:1 copy:/<<RESOLVERDIR>>/apt_archive ./ sbuild-build-depends-lintian-dummy 0.invalid.0 [852 B]
debconf: delaying package configuration, since apt-utils is not installed
Fetched 852 B in 0s (0 B/s)
Selecting previously unselected package sbuild-build-depends-lintian-dummy.
(Reading database ... 26582 files and directories currently installed.)
Preparing to unpack .../sbuild-build-depends-lintian-dummy_0.invalid.0_arm64.deb ...
Unpacking sbuild-build-depends-lintian-dummy (0.invalid.0) ...
Setting up sbuild-build-depends-lintian-dummy (0.invalid.0) ...
Running lintian...
E: libgemmlowp-dev: multiarch-foreign-cmake-file [usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config-none.cmake]
E: libgemmlowp-dev: multiarch-foreign-cmake-file [usr/lib/aarch64-linux-gnu/cmake/gemmlowp/gemmlowp-config.cmake]
E: libgemmlowp-dev: multiarch-foreign-static-library [usr/lib/aarch64-linux-gnu/libeight_bit_int_gemm.a]
W: gemmlowp changes: distribution-and-changes-mismatch sid unstable
E: Lintian run failed (runtime error)
+------------------------------------------------------------------------------+
| Post Build |
+------------------------------------------------------------------------------+
+------------------------------------------------------------------------------+
| Cleanup |
+------------------------------------------------------------------------------+
Purging /<<BUILDDIR>>
Not cleaning session: cloned chroot in use
+------------------------------------------------------------------------------+
| Summary |
+------------------------------------------------------------------------------+
Build Architecture: arm64
Build Type: binary
Build-Space: 114888
Build-Time: 190
Distribution: sid
Host Architecture: arm64
Install-Time: 21
Job: /tmp/debusine-fetch-exec-upload-lnyc4poe/gemmlowp_0.0~git20211220.e844ffd-1.dsc
Lintian: error
Machine Architecture: arm64
Package: gemmlowp
Package-Time: 250
Source-Version: 0.0~git20211220.e844ffd-1
Space: 114888
Status: successful
Version: 0.0~git20211220.e844ffd-1
--------------------------------------------------------------------------------
Finished at 2024-11-16T07:13:26Z
Build needed 00:04:10, 114888k disk space