Hi Serg,

 

Thanks for looking

 

>> +public:
>>   uint32 buffer[(width+31)/32];
> Why is it public?

 

No reason, an oversight. Leftover from previous testing,

I fixed it.

 

> +  Bitmap():buffer()
zero initialization of member variable ,here of the type array (s https://stackoverflow.com/a/23987926/547065)

I removed it by replacing with clear_all(), so it is better understandable.

 

>I wonder whether init() method is still needed.
>You've removed it from the constructor. And if there's any other
>code that uses it, I'd rather it'd be using clear_all().

 

Done.

 

>> +  void set_all()
>> +  {
>> +    set_prefix(width);

>
>may be better to optimize it?
> memset(buffer, 0xff, sizeof(buffer) - 1);
> ((uchar*)buffer)[sizeof(buffer)-1] = last_byte_mask(width);

 

I’m not really convinced this would be an optimization 😊

My claim is that, since “width” is constant at compile time (128), and it is divisible by 32, any decent  C++ compiler must be able to figure out, at compile time, that set_all() == set_prefix(width)==set_prefix(128) trivially reduces to a shorter

 

  memset(buffer, 0xff, 32)

 

>> +  {
>> +    compile_time_assert(sizeof(ulonglong) == 8);
>> +    uint32 tmp[2];
>> +    int8store(tmp, map2buff);
>> +    if (array_elements(buffer) > 2 && pad_with_ones)
>> +      set_all();

>really? I'd expect that if pad_with_ones==true,
>you simply don't need to do anything to the
>rest of the buffer.

 

The original implementation does padding  with 0xff  or 0x00  , depending on the highest order bit of map2buff, with bitmap_set_above(). retained that property

 

>> +  bool is_set_all() const
>> +  {
>> +    if (width == sizeof(buffer) * 8)
>> +    {
>> +      for (size_t i = 0; i < array_elements(buffer); i++)
>> +        if (buffer[i] != 0xFFFFFFFFU)
>> +          return false;
>> +      return true;
>> +    }
>> +    else
>> +      return is_prefix(width);
>
>this can also be optimized to not use complex is_prefix(),
>but I don't think is_set_all() is used as often as set_all(), so it
>shouldn't matter as much.

In fact, it is optimized 😊

If this condition    if (width == sizeof(buffer) * 8) us true, s optimized version is used. This condition can be evaluated at compile time, and is true for width which is divisible by 32. the code for is_prefix should not be even generated by the compile

 

Thanks again for looking,

Wlad

 

From: Sergei Golubchik
Sent: Wednesday, 8 May 2019 22:49
Subject: Re: a4be420c920: MDEV-19235 MariaDB Server compiled for 128 Indexes crashes at startup

 

Hi, Vladislav!

Thanks. That's pretty much the solution I had in mind too :)

A couple of comments on details below.

I've only commented on sql_bitmap.h, everything else is fine.

On May 08, Vladislav Vaintroub wrote:
> revision-id: a4be420c920 (mariadb-10.4.4-25-ga4be420c920)
> parent(s): ee4a2fef181
> author: Vladislav Vaintroub <wlad@mariadb.com>
> committer: Vladislav Vaintroub <wlad@mariadb.com>
> timestamp: 2019-04-17 17:17:06 +0100
> message:
>
> MDEV-19235 MariaDB Server compiled for 128 Indexes crashes at startup
>
> With MAX_INDEXIES=64(default), key_map=Bitmap<64> is just a wrapper around
> ulonglong and thus "trivial" (can be bzero-ed, or memcpy-ed, and stays
> valid still)
>
> With MAX_INDEXES=128, key_map = Bitmap<128> is not a "trivial" type
> anymore. The implementation uses MY_BITMAP, and MY_BITMAP contains pointers
> which make Bitmap invalid, when it is memcpy-ed/bzero-ed.
>
> The problem in 10.4 is that there are many new key_map members, inside TABLE
> or KEY, and those are often memcopied and bzeroed
>
> The fix makes Bitmap "trivial", by inlining most of MY_BITMAP functionality.
> pointers/heap allocations are not used anymore.


> diff --git a/sql/sql_bitmap.h b/sql/sql_bitmap.h
> index 705a8d169e1..74866336a1a 100644
> --- a/sql/sql_bitmap.h
> +++ b/sql/sql_bitmap.h
> @@ -25,73 +25,184 @@

>  #include <my_sys.h>
>  #include <my_bitmap.h>
> +#include <my_bit.h>

>  template <uint width> class Bitmap
>  {
> -  MY_BITMAP map;
> +public:
>    uint32 buffer[(width+31)/32];

Why is it public?

>  public:
> -  Bitmap() { init(); }
> +  Bitmap():buffer()

What does that do?

> +  {
> +  }
> -  Bitmap(const Bitmap& from) { *this=from; }
> -  explicit Bitmap(uint prefix_to_set) { init(prefix_to_set); }
> +  explicit Bitmap(uint prefix_to_set):buffer()
> +  {
> +    set_prefix(prefix_to_set);
> +  }
> -  void init() { my_bitmap_init(&map, buffer, width, 0); }
> -  void init(uint prefix_to_set) { init(); set_prefix(prefix_to_set); }
>    uint length() const { return width; }
> -  Bitmap& operator=(const Bitmap& map2)
> -  {
> +  void init()
> +  {
> +    clear_all();
> +  }
> +  void init(uint prefix_to_set)
> +  {
>      init();
> -    memcpy(buffer, map2.buffer, sizeof(buffer));
> -    return *this;
> +    set_prefix(prefix_to_set);

I wonder whether init() method is still needed.
You've removed it from the constructor. And if there's any other
code that uses it, I'd rather it'd be using clear_all().

>    }
> -  void set_bit(uint n) { bitmap_set_bit(&map, n); }
> +  void set_bit(uint n)
> +  {
> +    DBUG_ASSERT(n < width);
> +    ((uchar*)buffer)[n / 8] |= (1 << (n & 7));
> +  }
> -  void clear_bit(uint n) { bitmap_clear_bit(&map, n); }
> +  void clear_bit(uint n)
> +  {
> +    DBUG_ASSERT(n < width);
> +    ((uchar*)buffer)[n / 8] &= ~(1 << (n & 7));
> +  }
> -  void set_prefix(uint n) { bitmap_set_prefix(&map, n); }
> +  void set_prefix(uint prefix_size)
> +  {
> +    set_if_smaller(prefix_size, width);
> +    uint prefix_bytes, prefix_bits, d;
> +    uchar* m = (uchar*)buffer;
> +
> +    if ((prefix_bytes = prefix_size / 8))
> +      memset(m, 0xff, prefix_bytes);
> +    m += prefix_bytes;
> +    if ((prefix_bits = prefix_size & 7))
> +    {
> +      *(m++) = (1 << prefix_bits) - 1;
> +      // As the prefix bits are set, lets count this byte too as a prefix byte.
> +      prefix_bytes++;
> +    }
> +    if ((d = (width + 7) / 8 - prefix_bytes))
> +      memset(m, 0, d);
> +  }
> -  void set_all() { bitmap_set_all(&map); }
> +  void set_all()
> +  {
> +    set_prefix(width);

may be better to optimize it?

  memset(buffer, 0xff, sizeof(buffer) - 1);
  ((uchar*)buffer)[sizeof(buffer)-1] = last_byte_mask(width);

> +  }
> -  void clear_all() { bitmap_clear_all(&map); }
> +  void clear_all()
> +  {
> +    memset(buffer, 0x00, sizeof(buffer));
> +  }
> -  void intersect(Bitmap& map2) { bitmap_intersect(&map, &map2.map); }
> +  void intersect(Bitmap & map2)
> +  {
> +    for (uint i = 0; i < array_elements(buffer); i++)
> +      buffer[i] &= map2.buffer[i];
> +  }
> -  void intersect(ulonglong map2buff)
> -  {
> -    // Use a spearate temporary buffer, as bitmap_init() clears all the bits.
> -    ulonglong buf2;
> -    MY_BITMAP map2;
> -
> -    my_bitmap_init(&map2, (uint32 *) &buf2, sizeof(ulonglong) * 8, 0);
> -
> -    // Store the original bits.
> -    if (sizeof(ulonglong) >= 8)
> -    {
> -      int8store(const_cast<uchar *>(static_cast<uchar *>
> -                                    (static_cast<void *>(&buf2))),
> -                map2buff);
> -    }
> -    else
> -    {
> -      DBUG_ASSERT(sizeof(buffer) >= 4);
> -      int4store(const_cast<uchar *>(static_cast<uchar *>
> -                                    (static_cast<void *>(&buf2))),
> -                static_cast<uint32>(map2buff));
> -    }
> -    bitmap_intersect(&map, &map2);
> -  }
> +  void intersect_and_pad(ulonglong map2buff, bool pad_with_ones)

make it private?

> +  {
> +    compile_time_assert(sizeof(ulonglong) == 8);
> +    uint32 tmp[2];
> +    int8store(tmp, map2buff);
> +    if (array_elements(buffer) > 2 && pad_with_ones)
> +      set_all();

really? I'd expect that if pad_with_ones==true,
you simply don't need to do anything to the
rest of the buffer.

> +
> +    buffer[0] &= tmp[0];
> +    if (array_elements(buffer) > 1)
> +      buffer[1] &= tmp[1];
> +
> +    if (array_elements(buffer) > 2 && !pad_with_ones)
> +        memset((char*)buffer + 8, 0 , sizeof(buffer) - 8);
> +  }
> +  void intersect(ulonglong map2buff)
> +  {
> +    intersect_and_pad(map2buff, 0);
> +  }
>    /* Use highest bit for all bits above sizeof(ulonglong)*8. */
>    void intersect_extended(ulonglong map2buff)
>    {
> -    intersect(map2buff);
> -    if (map.n_bits > sizeof(ulonglong) * 8)
> -      bitmap_set_above(&map, sizeof(ulonglong),
> -                       MY_TEST(map2buff & (1LL << (sizeof(ulonglong) * 8 - 1))));
> +    intersect_and_pad(map2buff, (map2buff & (1ULL << 63)));
>    }
> -  void subtract(Bitmap& map2) { bitmap_subtract(&map, &map2.map); }
> +  void subtract(Bitmap & map2)
> +  {
> +    for (size_t i = 0; i < array_elements(buffer); i++)
> +      buffer[i] &= ~(map2.buffer[i]);
> +  }
> -  void merge(Bitmap& map2) { bitmap_union(&map, &map2.map); }
> +  void merge(Bitmap & map2)
> +  {
> +    for (size_t i = 0; i < array_elements(buffer); i++)
> +      buffer[i] |= map2.buffer[i];
> +  }
> -  bool is_set(uint n) const { return bitmap_is_set(&map, n); }
> +  bool is_set(uint n) const
> +  {
> +    DBUG_ASSERT(n < width);
> +    return ((uchar*)buffer)[n / 8] & (1 << (n & 7));
> +  }
> -  bool is_prefix(uint n) const { return bitmap_is_prefix(&map, n); }
> +  bool is_prefix(uint prefix_size) const
> +  {
> +    uint prefix_mask = last_byte_mask(prefix_size);
> +    uchar* m = (uchar*)buffer;
> +    uchar* end_prefix = m + (prefix_size - 1) / 8;
> +    uchar* end;
> +    DBUG_ASSERT(prefix_size <= width);
> +
> +    /* Empty prefix is always true */
> +    if (!prefix_size)
> +      return true;
> +
> +    while (m < end_prefix)
> +      if (*m++ != 0xff)
> +        return false;
> +
> +    end = ((uchar*)buffer) + (width + 7) / 8 - 1;
> +    if (m == end)
> +      return ((*m & last_byte_mask(width)) == prefix_mask);
> +
> +    if (*m != prefix_mask)
> +      return false;
> +
> +    while (++m < end)
> +      if (*m != 0)
> +        return false;
> +    return ((*m & last_byte_mask(width)) == 0);
> +  }
> -  bool is_clear_all() const { return bitmap_is_clear_all(&map); }
> +  bool is_clear_all() const
> +  {
> +    for (size_t i= 0; i < array_elements(buffer); i++)
> +      if (buffer[i])
> +        return false;
> +    return true;
> +  }
> -  bool is_set_all() const { return bitmap_is_set_all(&map); }
> +  bool is_set_all() const
> +  {
> +    if (width == sizeof(buffer) * 8)
> +    {
> +      for (size_t i = 0; i < array_elements(buffer); i++)
> +        if (buffer[i] != 0xFFFFFFFFU)
> +          return false;
> +      return true;
> +    }
> +    else
> +      return is_prefix(width);

this can also be optimized to not use complex is_prefix(),
but I don't think is_set_all() is used as often as set_all(), so it
shouldn't matter as much.

> +  }
> -  bool is_subset(const Bitmap& map2) const { return bitmap_is_subset(&map, &map2.map); }
> +  bool is_subset(const Bitmap & map2) const
> +  {
> +    for (size_t i= 0; i < array_elements(buffer); i++)
> +      if (buffer[i] & ~(map2.buffer[i]))
> +        return false;
> +    return true;
> +  }
> -  bool is_overlapping(const Bitmap& map2) const { return bitmap_is_overlapping(&map, &map2.map); }
> +  bool is_overlapping(const Bitmap & map2) const
> +  {
> +    for (size_t i = 0; i < array_elements(buffer); i++)
> +      if (buffer[i] & map2.buffer[i])
> +        return true;
> +    return false;
> +  }
> -  bool operator==(const Bitmap& map2) const { return bitmap_cmp(&map, &map2.map); }
> +  bool operator==(const Bitmap & map2) const
> +  {
> +    return memcmp(buffer, map2.buffer, sizeof(buffer)) == 0;
> +  }
> -  bool operator!=(const Bitmap& map2) const { return !(*this == map2); }
> +  bool operator!=(const Bitmap & map2) const
> +  {
> +    return !(*this == map2);
> +  }
>    char *print(char *buf) const
>    {
>      char *s=buf;
 
Regards,
Sergei
Chief Architect MariaDB
and security@mariadb.org