What is the better practice when designing MQTT topics for thousands of devices of multiple customers?

lmsyx · June 16, 2022, 6:44am

I am developing a MQTT service and I am confused when I design the topics. I think there are two ways to do this:

Topic data/customer-a
Payload {"sensor-id":"23aafff2d23659cc97c557909de12f16","type":"demo","value":"hello world"}
Topic data/customer-a/23aafff2d23659cc97c557909de12f16
Payload {"type":"demo","value":"hello world"}

I wonder which is better. I think the first one may result in complexity when I need to subscribe message from a single sensor. Meanwhile, the topic data/customer-a is always under heavy load because of thousands of messages per second. And the second one has drawbacks too, obviously it makes the mqtt broker maintain a huge topic tree, I don’t know whether it would cause significant performance degradation?

Which is better for thousands of devices of multiple customers?

codeadict · June 16, 2022, 4:54pm

I would design it like /devices/{UUID}/data and check the scope/access to the topic on authentication/authorization hooks without exposing the customer names or ids in the topics. Unless you have a use case to use a wildcard to subscribe to all the devices of a customer at once.

mmzeeman · June 16, 2022, 8:01pm

We do it in a similar way in zotonic. Name from less to more specific. E.g. model/rsc/event/{ ResourceId }/{ insert|update|delete} for inserts, updates and deletion of resources. Similar for edges between resources. model/edge/event/{ SubjectId }/o/{ PredicateName }.

And about the load, this entirely depends on the implementation of the router of course. We use an ets based trie data structure to map message topics to subscribers. So it doesn’t really make a higher level names more loaded or something, it is just an handy way of routing the messages.

lmsyx · June 17, 2022, 3:45am

thanks a lot for your advice and respect for your work. But I am still confused whether too many topics would impact on vernemq service performance? chould you please give me some guidance？

mmzeeman · June 17, 2022, 2:43pm

This probably depends more on the number of subscribers you have than the number of topics you use in your messages.

codeadict · June 24, 2022, 3:37pm

VerneMQ uses a trie. I have personally noticed more performance degradation from having many subscribers than for having many topics. But every use case is different, so load testing with your specific topic structure, number of devices and specific message sizes is what will give you a good idea of the performance. No load test is good for every single case, Load testing VerneMQ, the basics might be of help to you.

lmsyx · June 27, 2022, 12:56am

@codeadict @mmzeeman yes, you are right, the number of subscribers is the real key point. Thank you all guys for giving professional advice and contribution to the open source community!