Hi Serg,
Thanks for looking
>> +public:
>> uint32 buffer[(width+31)/32];
> Why is it public?
No reason, an oversight. Leftover from previous testing,
I fixed it.
> + Bitmap():buffer()
zero initialization of member variable ,here of the type array (s
https://stackoverflow.com/a/23987926/547065)
I removed it by replacing with clear_all(), so it is better understandable.
>I wonder whether init() method is still needed.
>You've removed it from the constructor. And if there's any other
>code that uses it, I'd rather it'd be using clear_all().
Done.
>> + void set_all()
>> + {
>> + set_prefix(width);
>
>may be better to optimize it?
> memset(buffer, 0xff, sizeof(buffer) - 1);
> ((uchar*)buffer)[sizeof(buffer)-1] = last_byte_mask(width);
I’m not really convinced this would be an optimization
😊
My claim is that, since “width” is constant at compile time (128), and it is divisible by 32, any decent C++ compiler must be able to figure out, at compile time, that set_all() == set_prefix(width)==set_prefix(128)
trivially reduces to a shorter
memset(buffer, 0xff, 32)
>> + {
>> + compile_time_assert(sizeof(ulonglong) == 8);
>> + uint32 tmp[2];
>> + int8store(tmp, map2buff);
>> + if (array_elements(buffer) > 2 && pad_with_ones)
>> + set_all();
>really? I'd expect that if pad_with_ones==true,
>you simply don't need to do anything to the
>rest of the buffer.
The original implementation does padding with 0xff or 0x00 , depending on the highest order bit of map2buff, with bitmap_set_above(). retained that property
>> + bool is_set_all() const
>> + {
>> + if (width == sizeof(buffer) * 8)
>> + {
>> + for (size_t i = 0; i < array_elements(buffer); i++)
>> + if (buffer[i] != 0xFFFFFFFFU)
>> + return false;
>> + return true;
>> + }
>> + else
>> + return is_prefix(width);
>
>this can also be optimized to not use complex is_prefix(),
>but I don't think is_set_all() is used as often as set_all(), so it
>shouldn't matter as much.
In fact, it is optimized 😊
If this condition if (width == sizeof(buffer) * 8) us true, s optimized version is used. This condition can be evaluated at compile time, and is true for width which is divisible by 32. the code for is_prefix should
not be even generated by the compile
Thanks again for looking,
Wlad
From:
Sergei Golubchik
Sent: Wednesday, 8 May 2019 22:49
Subject: Re: a4be420c920: MDEV-19235 MariaDB Server compiled for 128 Indexes crashes at startup
Hi, Vladislav!
Thanks. That's pretty much the solution I had in mind too :)
A couple of comments on details below.
I've only commented on sql_bitmap.h, everything else is fine.
On May 08, Vladislav Vaintroub wrote:
> revision-id: a4be420c920 (mariadb-10.4.4-25-ga4be420c920)
> parent(s): ee4a2fef181
> author: Vladislav Vaintroub <wlad@mariadb.com>
> committer: Vladislav Vaintroub <wlad@mariadb.com>
> timestamp: 2019-04-17 17:17:06 +0100
> message:
>
> MDEV-19235 MariaDB Server compiled for 128 Indexes crashes at startup
>
> With MAX_INDEXIES=64(default), key_map=Bitmap<64> is just a wrapper around
> ulonglong and thus "trivial" (can be bzero-ed, or memcpy-ed, and stays
> valid still)
>
> With MAX_INDEXES=128, key_map = Bitmap<128> is not a "trivial" type
> anymore. The implementation uses MY_BITMAP, and MY_BITMAP contains pointers
> which make Bitmap invalid, when it is memcpy-ed/bzero-ed.
>
> The problem in 10.4 is that there are many new key_map members, inside TABLE
> or KEY, and those are often memcopied and bzeroed
>
> The fix makes Bitmap "trivial", by inlining most of MY_BITMAP functionality.
> pointers/heap allocations are not used anymore.
> diff --git a/sql/sql_bitmap.h b/sql/sql_bitmap.h
> index 705a8d169e1..74866336a1a 100644
> --- a/sql/sql_bitmap.h
> +++ b/sql/sql_bitmap.h
> @@ -25,73 +25,184 @@
>
> #include <my_sys.h>
> #include <my_bitmap.h>
> +#include <my_bit.h>
>
> template <uint width> class Bitmap
> {
> - MY_BITMAP map;
> +public:
> uint32 buffer[(width+31)/32];
Why is it public?
> public:
> - Bitmap() { init(); }
> + Bitmap():buffer()
What does that do?
> + {
> + }
> - Bitmap(const Bitmap& from) { *this=from; }
> - explicit Bitmap(uint prefix_to_set) { init(prefix_to_set); }
> + explicit Bitmap(uint prefix_to_set):buffer()
> + {
> + set_prefix(prefix_to_set);
> + }
> - void init() { my_bitmap_init(&map, buffer, width, 0); }
> - void init(uint prefix_to_set) { init(); set_prefix(prefix_to_set); }
> uint length() const { return width; }
> - Bitmap& operator=(const Bitmap& map2)
> - {
> + void init()
> + {
> + clear_all();
> + }
> + void init(uint prefix_to_set)
> + {
> init();
> - memcpy(buffer, map2.buffer, sizeof(buffer));
> - return *this;
> + set_prefix(prefix_to_set);
I wonder whether init() method is still needed.
You've removed it from the constructor. And if there's any other
code that uses it, I'd rather it'd be using clear_all().
> }
> - void set_bit(uint n) { bitmap_set_bit(&map, n); }
> + void set_bit(uint n)
> + {
> + DBUG_ASSERT(n < width);
> + ((uchar*)buffer)[n / 8] |= (1 << (n & 7));
> + }
> - void clear_bit(uint n) { bitmap_clear_bit(&map, n); }
> + void clear_bit(uint n)
> + {
> + DBUG_ASSERT(n < width);
> + ((uchar*)buffer)[n / 8] &= ~(1 << (n & 7));
> + }
> - void set_prefix(uint n) { bitmap_set_prefix(&map, n); }
> + void set_prefix(uint prefix_size)
> + {
> + set_if_smaller(prefix_size, width);
> + uint prefix_bytes, prefix_bits, d;
> + uchar* m = (uchar*)buffer;
> +
> + if ((prefix_bytes = prefix_size / 8))
> + memset(m, 0xff, prefix_bytes);
> + m += prefix_bytes;
> + if ((prefix_bits = prefix_size & 7))
> + {
> + *(m++) = (1 << prefix_bits) - 1;
> + // As the prefix bits are set, lets count this byte too as a prefix byte.
> + prefix_bytes++;
> + }
> + if ((d = (width + 7) / 8 - prefix_bytes))
> + memset(m, 0, d);
> + }
> - void set_all() { bitmap_set_all(&map); }
> + void set_all()
> + {
> + set_prefix(width);
may be better to optimize it?
memset(buffer, 0xff, sizeof(buffer) - 1);
((uchar*)buffer)[sizeof(buffer)-1] = last_byte_mask(width);
> + }
> - void clear_all() { bitmap_clear_all(&map); }
> + void clear_all()
> + {
> + memset(buffer, 0x00, sizeof(buffer));
> + }
> - void intersect(Bitmap& map2) { bitmap_intersect(&map, &map2.map); }
> + void intersect(Bitmap & map2)
> + {
> + for (uint i = 0; i < array_elements(buffer); i++)
> + buffer[i] &= map2.buffer[i];
> + }
> - void intersect(ulonglong map2buff)
> - {
> - // Use a spearate temporary buffer, as bitmap_init() clears all the bits.
> - ulonglong buf2;
> - MY_BITMAP map2;
> -
> - my_bitmap_init(&map2, (uint32 *) &buf2, sizeof(ulonglong) * 8, 0);
> -
> - // Store the original bits.
> - if (sizeof(ulonglong) >= 8)
> - {
> - int8store(const_cast<uchar *>(static_cast<uchar *>
> - (static_cast<void *>(&buf2))),
> - map2buff);
> - }
> - else
> - {
> - DBUG_ASSERT(sizeof(buffer) >= 4);
> - int4store(const_cast<uchar *>(static_cast<uchar *>
> - (static_cast<void *>(&buf2))),
> - static_cast<uint32>(map2buff));
> - }
> - bitmap_intersect(&map, &map2);
> - }
> + void intersect_and_pad(ulonglong map2buff, bool pad_with_ones)
make it private?
> + {
> + compile_time_assert(sizeof(ulonglong) == 8);
> + uint32 tmp[2];
> + int8store(tmp, map2buff);
> + if (array_elements(buffer) > 2 && pad_with_ones)
> + set_all();
really? I'd expect that if pad_with_ones==true,
you simply don't need to do anything to the
rest of the buffer.
> +
> + buffer[0] &= tmp[0];
> + if (array_elements(buffer) > 1)
> + buffer[1] &= tmp[1];
> +
> + if (array_elements(buffer) > 2 && !pad_with_ones)
> + memset((char*)buffer + 8, 0 , sizeof(buffer) - 8);
> + }
> + void intersect(ulonglong map2buff)
> + {
> + intersect_and_pad(map2buff, 0);
> + }
> /* Use highest bit for all bits above sizeof(ulonglong)*8. */
> void intersect_extended(ulonglong map2buff)
> {
> - intersect(map2buff);
> - if (map.n_bits > sizeof(ulonglong) * 8)
> - bitmap_set_above(&map, sizeof(ulonglong),
> - MY_TEST(map2buff & (1LL << (sizeof(ulonglong) * 8 - 1))));
> + intersect_and_pad(map2buff, (map2buff & (1ULL << 63)));
> }
> - void subtract(Bitmap& map2) { bitmap_subtract(&map, &map2.map); }
> + void subtract(Bitmap & map2)
> + {
> + for (size_t i = 0; i < array_elements(buffer); i++)
> + buffer[i] &= ~(map2.buffer[i]);
> + }
> - void merge(Bitmap& map2) { bitmap_union(&map, &map2.map); }
> + void merge(Bitmap & map2)
> + {
> + for (size_t i = 0; i < array_elements(buffer); i++)
> + buffer[i] |= map2.buffer[i];
> + }
> - bool is_set(uint n) const { return bitmap_is_set(&map, n); }
> + bool is_set(uint n) const
> + {
> + DBUG_ASSERT(n < width);
> + return ((uchar*)buffer)[n / 8] & (1 << (n & 7));
> + }
> - bool is_prefix(uint n) const { return bitmap_is_prefix(&map, n); }
> + bool is_prefix(uint prefix_size) const
> + {
> + uint prefix_mask = last_byte_mask(prefix_size);
> + uchar* m = (uchar*)buffer;
> + uchar* end_prefix = m + (prefix_size - 1) / 8;
> + uchar* end;
> + DBUG_ASSERT(prefix_size <= width);
> +
> + /* Empty prefix is always true */
> + if (!prefix_size)
> + return true;
> +
> + while (m < end_prefix)
> + if (*m++ != 0xff)
> + return false;
> +
> + end = ((uchar*)buffer) + (width + 7) / 8 - 1;
> + if (m == end)
> + return ((*m & last_byte_mask(width)) == prefix_mask);
> +
> + if (*m != prefix_mask)
> + return false;
> +
> + while (++m < end)
> + if (*m != 0)
> + return false;
> + return ((*m & last_byte_mask(width)) == 0);
> + }
> - bool is_clear_all() const { return bitmap_is_clear_all(&map); }
> + bool is_clear_all() const
> + {
> + for (size_t i= 0; i < array_elements(buffer); i++)
> + if (buffer[i])
> + return false;
> + return true;
> + }
> - bool is_set_all() const { return bitmap_is_set_all(&map); }
> + bool is_set_all() const
> + {
> + if (width == sizeof(buffer) * 8)
> + {
> + for (size_t i = 0; i < array_elements(buffer); i++)
> + if (buffer[i] != 0xFFFFFFFFU)
> + return false;
> + return true;
> + }
> + else
> + return is_prefix(width);
this can also be optimized to not use complex is_prefix(),
but I don't think is_set_all() is used as often as set_all(), so it
shouldn't matter as much.
> + }
> - bool is_subset(const Bitmap& map2) const { return bitmap_is_subset(&map, &map2.map); }
> + bool is_subset(const Bitmap & map2) const
> + {
> + for (size_t i= 0; i < array_elements(buffer); i++)
> + if (buffer[i] & ~(map2.buffer[i]))
> + return false;
> + return true;
> + }
> - bool is_overlapping(const Bitmap& map2) const { return bitmap_is_overlapping(&map, &map2.map); }
> + bool is_overlapping(const Bitmap & map2) const
> + {
> + for (size_t i = 0; i < array_elements(buffer); i++)
> + if (buffer[i] & map2.buffer[i])
> + return true;
> + return false;
> + }
> - bool operator==(const Bitmap& map2) const { return bitmap_cmp(&map, &map2.map); }
> + bool operator==(const Bitmap & map2) const
> + {
> + return memcmp(buffer, map2.buffer, sizeof(buffer)) == 0;
> + }
> - bool operator!=(const Bitmap& map2) const { return !(*this == map2); }
> + bool operator!=(const Bitmap & map2) const
> + {
> + return !(*this == map2);
> + }
> char *print(char *buf) const
> {
> char *s=buf;
Regards,
Sergei
Chief Architect MariaDB
and security@mariadb.org