Mentions légales du service
Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
Chameleon
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Container registry
Operate
Environments
Terraform modules
Monitor
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Admin message
GitLab upgrade completed. Current version is 17.11.3.
Show more breadcrumbs
solverstack
Chameleon
Merge requests
!395
Introduce half-precision conversion and gemm kernels for GPUs
Code
Review changes
Check out branch
Download
Patches
Plain diff
Merged
Introduce half-precision conversion and gemm kernels for GPUs
faverge/chameleon:gpus/half_kernels
into
master
Overview
7
Commits
8
Pipelines
0
Changes
22
Merged
Mathieu Faverge
requested to merge
faverge/chameleon:gpus/half_kernels
into
master
2 years ago
Overview
3
Commits
8
Pipelines
0
Changes
22
Expand
0
0
Merge request reports
Compare
master
version 10
9b63a0bd
1 year ago
version 9
7acc44e7
1 year ago
version 8
1a19cf91
2 years ago
version 7
770ecde3
2 years ago
version 6
ab98af76
2 years ago
version 5
df2c8309
2 years ago
version 4
5efd80ed
2 years ago
version 3
f8542c1f
2 years ago
version 2
9f547b58
2 years ago
version 1
d8cdab92
2 years ago
master (base)
and
latest version
latest version
10020dc3
8 commits,
1 year ago
version 10
9b63a0bd
6 commits,
1 year ago
version 9
7acc44e7
6 commits,
1 year ago
version 8
1a19cf91
7 commits,
2 years ago
version 7
770ecde3
7 commits,
2 years ago
version 6
ab98af76
5 commits,
2 years ago
version 5
df2c8309
6 commits,
2 years ago
version 4
5efd80ed
6 commits,
2 years ago
version 3
f8542c1f
6 commits,
2 years ago
version 2
9f547b58
6 commits,
2 years ago
version 1
d8cdab92
5 commits,
2 years ago
22 files
+
967
−
49
Inline
Compare changes
Side-by-side
Inline
Show whitespace changes
Show one file at a time
Files
22
Search (e.g. *.vue) (Ctrl+P)
cmake_modules/local_subs.py
+
9
−
2
Options
@@ -6,12 +6,12 @@
@copyright 2019-2023 Bordeaux INP, CNRS (LaBRI UMR 5800), Inria,
Univ. Bordeaux. All rights reserved.
@version 1.
2
.0
@version 1.
3
.0
@author Mathieu Faverge
@author Florent Pruvost
@author Nathalie Furmento
@author Alycia Lisito
@date 202
2
-0
2-22
@date 202
3
-0
7-04
"""
_extra_blas
=
[
@@ -39,6 +39,10 @@ _extra_blas = [
(
''
,
'
slatm1
'
,
'
dlatm1
'
,
'
slatm1
'
,
'
dlatm1
'
),
(
''
,
'
sgenm2
'
,
'
dgenm2
'
,
'
cgenm2
'
,
'
zgenm2
'
),
(
''
,
'
slag2c_fake
'
,
'
dlag2z_fake
'
,
'
slag2c
'
,
'
dlag2z
'
),
(
''
,
'
slag2h
'
,
'
dlag2h
'
,
'
slag2h
'
,
'
dlag2h
'
),
(
''
,
'
hlag2s
'
,
'
hlag2d
'
,
'
hlag2s
'
,
'
hlag2d
'
),
(
''
,
'
slag2h
'
,
'
dlag2h
'
,
'
clag2x
'
,
'
zlag2x
'
),
(
''
,
'
hlag2s
'
,
'
hlag2d
'
,
'
xlag2c
'
,
'
xlag2z
'
),
(
''
,
'
sgepdf
'
,
'
dgepdf
'
,
'
cgepdf
'
,
'
zgepdf
'
),
(
''
,
'
scesca
'
,
'
dcesca
'
,
'
ccesca
'
,
'
zcesca
'
),
(
''
,
'
sgesum
'
,
'
dgesum
'
,
'
cgesum
'
,
'
zgesum
'
),
@@ -73,6 +77,9 @@ subs = {
(
'
int
'
,
'
float
'
,
'
double
'
,
'
CHAMELEON_Complex32_t
'
,
r
'
\bCHAMELEON_Complex64_t
'
),
(
'
ChamPattern
'
,
'
ChamRealFloat
'
,
'
ChamRealDouble
'
,
'
ChamComplexFloat
'
,
r
'
\bChamComplexDouble
'
),
(
'
ChamPattern
'
,
'
ChamRealFloat
'
,
'
ChamRealDouble
'
,
'
ChamRealFloat
'
,
r
'
\bChamRealDouble
'
),
(
'
ChamPattern
'
,
'
ChamRealFloat
'
,
'
ChamRealFloat
'
,
'
ChamComplexFloat
'
,
r
'
\bChamComplexFloat
'
),
(
'
ChamPattern
'
,
'
ChamRealFloat
'
,
'
ChamRealFloat
'
,
'
ChamRealFloat
'
,
r
'
\bChamRealFloat
'
),
(
'
ChamPattern
'
,
'
ChamRealHalf
'
,
'
ChamRealHalf
'
,
'
ChamComplexHalf
'
,
r
'
\bChamComplexHalf
'
),
(
'
int
'
,
'
float
'
,
'
double
'
,
'
complex32
'
,
'
complex64
'
),
(
'
Int
'
,
'
Float
'
,
'
Double
'
,
'
Complex32
'
,
'
Complex64
'
),
(
'
Int
'
,
'
HMAT_SIMPLE_PRECISION
'
,
'
HMAT_DOUBLE_PRECISION
'
,
'
HMAT_SIMPLE_COMPLEX
'
,
'
HMAT_DOUBLE_COMPLEX
'
),
Loading