speed optimizations by jgrosseo · Pull Request #2266 · AliceO2Group/O2Physics

jgrosseo · 2023-03-21T16:19:34Z

Factor 4 increase of merging speed on pass 3
However, this needs to be carefully checked first.

jgrosseo · 2023-03-21T16:19:47Z

jgrosseo · 2023-03-21T16:20:13Z

@mpuccio Could you review the code changes? :-)

jgrosseo · 2023-03-22T07:44:40Z

To give some more background. On my local machine, I can achieve on the data provided by @davidrohr a throughput of 18 MB/s. While this may sound little, we do float (or smaller) memcpy operations. This means assuming all float, we do 2 x 4.5 million 4 byte memcpy per second. This is because the in-memory representation is different from the disk-one of course. So we have widely spread memory regions which are all large (baskets) from which we take 4 bytes each and copy it so some other structure. I'd assume this will produce only cache misses...

As the aim of the merger is to provide larger baskets to speed up subsequent reading not much can be done about this strategy.

The optimizations here, skip this part if we already have large enough trees (> 10 MB) and no manipulation of the data needs to be done...

I verified that the data in the output trees of one file is identical.

davidrohr · 2023-03-24T14:45:45Z

Hm, if you have such eratic patterns, perhaps you could use the _mm_prefetch intrinsic, to preload the data in the cache some loop cycles ahead of time? That could help reducing memory latencies.

jgrosseo · 2023-03-30T16:01:29Z

Thanks for the suggestion. I will try this as well. For now, I made this ready for review.
@mpuccio did you have any remarks?

mpuccio · 2023-03-31T13:43:48Z

Hi @jgrosseo, no, the PR looks good to me. I still have to solve the issue with the hypertriton AO2Ds merging, but since this PR does not change things in this respect, it can go on.

speed optimizations

09ef00b

jgrosseo marked this pull request as ready for review March 30, 2023 16:01

jgrosseo requested review from a team, alibuild, ddobrigk, iarsene and ktf as code owners March 30, 2023 16:01

mpuccio approved these changes Mar 31, 2023

View reviewed changes

ddobrigk approved these changes Mar 31, 2023

View reviewed changes

ddobrigk merged commit 1f2ebd1 into AliceO2Group:master Mar 31, 2023

hahassan7 pushed a commit to hahassan7/O2Physics that referenced this pull request Apr 3, 2023

speed optimizations (AliceO2Group#2266)

41399cf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speed optimizations#2266

speed optimizations#2266
ddobrigk merged 1 commit into
AliceO2Group:masterfrom
jgrosseo:master

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 22, 2023

Uh oh!

davidrohr commented Mar 24, 2023

Uh oh!

jgrosseo commented Mar 30, 2023

Uh oh!

mpuccio commented Mar 31, 2023

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

Conversation

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 21, 2023

Uh oh!

jgrosseo commented Mar 22, 2023

Uh oh!

davidrohr commented Mar 24, 2023

Uh oh!

jgrosseo commented Mar 30, 2023

Uh oh!

mpuccio commented Mar 31, 2023

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants