nng - A mirror of https://github.com/nanomsg/nng

	Commit message (Collapse)	Author	Age
*	fixes #1064 Potential deadlock in statistics code	Garrett D'Amore	2019-12-29
\| \| \| \| \| \| \| \|	fixes #1063 Include sanitizer runs in CI fixes #1068 Wssfile test sometimes fails with wrong error code While here, addressed a number of clang-tidy items, and some light cleanup of code we were already in.
*	fixes #1065 resolver leaks work structures	Garrett D'Amore	2019-12-29
\| \| \| \| \| \|	This includes changes to support setting the sanitizer correctly (the old code CMake stuff didn't quite get it right), and addresses a number of failures in the test code found by the address sanitizer.
*	Brittleness in pair1 mono faithful test.	Garrett D'Amore	2019-12-27
\|
*	fixes #1057 reqpoll test fails (bad test logic) sometimes	Garrett D'Amore	2019-12-27
\| \| \| \|	The reqpoll test is now moved into the common req/rep logic.
*	fixes #1040 Convert rest of the protocols to new CMake infra	Garrett D'Amore	2019-12-25
\|
*	fixes #1032 Figure out Darwin bustedness	Garrett D'Amore	2019-12-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #1035 Convey is awkward -- consider acutest.h This represents a rather large effort towards cleaning up our testing and optional configuration infrastructure. A separate test library is built by default, which is static, and includes some useful utilities design to make it easier to write shorter and more robust (not timing dependent) tests. This also means that we can cover pretty nearly all the tests (protocols etc.) in every case, even if the shipped image will be minimized. Subsystems which are optional can now use a few new macros to configure what they need see nng_sources_if, nng_headers_if, and nng_defines_if. This goes a long way to making the distributed CMakefiles a lot simpler. Additionally, tests for different parts of the tree can now be located outside of the tests/ tree, so that they can be placed next to the code that they are testing. Beyond the enabling work, the work has only begun, but these changes have resolved the most often failing tests for Darwin in the cloud.
*	Add option for preferring new messages on SUB0	Nathan Kent	2019-11-03
\|
*	fixes #923 #935 RECVBUF/SENDBUF has variable type	Nathan Kent	2019-05-19
\|
*	fixes #915 Memory Leak in pub	Garrett D'Amore	2019-04-11
\|
*	fixes #919 Polling on subscriber socket recvfd seems broken	Behrooze Sirang	2019-04-11
\| \| \| \|	sub0_recv_cb was not calling nni_pollable_raise on sock->recvable.
*	fixes #461 Context support for SUB	Garrett D'Amore	2019-02-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #762 Pub/Sub very slow compared with nanomsg This introduces contexts for SUB, and converts both the cooked SUB and PUB protocols to use a new lightweight message queue that has significant performance benefits over the heavy-weight message queue. We've also added a test program, pubdrop, in the perf directory, which can be used for measuring pub/sub message rates and drop rates. Note that its quite easy to overwhelm a subscriber still. The SUB socket performance is still not completely where it needs to be. There are two remainging things to improve. Firsst we need to replace the naive linked list of topics with a proper PATRICIA trie. Second, we need to work on the low level POSIX poller code. (The Windows code is already quite good, and we outperform nanomsg on Windows.)
*	fixes #857 NNG_OPT_REQ_RESENDTIME does not honor NNG_DURATION_INFINITE	Garrett D'Amore	2019-02-17
\|
*	fixes #871 panic when sharing rep between threads	Garrett D'Amore	2019-02-17
\|
*	fixes #831 Unify option structures, o_type is unused	Garrett D'Amore	2018-12-29
\|
*	move all public headers to include/nng/ folder	Gregor Burger	2018-11-22
\| \| \| \| \| \| \| \| \| \|	This change makes embedding nng + nggpp (or other projects depending on nng) in cmake easier. The header files are moved to a separate include directory. This also makes installation of the headers easier, and allows clearer identification of private vs public heade files. Some additional cleanups were performed by @gedamore, but the main credit for this change belongs with @gregorburger.
*	fixes #577 target library dependencies should be public	Garrett D'Amore	2018-11-05
\| \| \| \| \| \| \| \| \| \| \|	This is a significant refactor of the library configuration. We use the modern package configuration helper, with a template script that also does the find_package dance for any of our dependencies. We also have restructured the code so that most protocols and transports have their configuration isolated to their own CMakeLists file, reducing the size of the global CMakeLists file.
*	fixes #4 Statistics support	Garrett D'Amore	2018-09-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This introduces new public APIs for obtaining statistics, and adds some generic stats for dialers, listeners, pipes, and sockets. Also added are stats for inproc and pairv1 protocol. The other protocols and transports will have stats added incrementally as time goes on. A simple test program, and man pages are provided for this. Start by looking at nng_stat(5). Statistics does have some impact, and they can be disabled by using the advanced NNG_ENABLE_STATS (setting it to OFF, it's ON by default) if you need to build a minimized configuration.
*	fixes #664 aio cancellation could be better	Garrett D'Amore	2018-08-20
\| \| \| \| \| \| \| \| \|	This changes the signature of the aio cancellation routines to take the argument for cancellation directly, so we do not need to lookup the argument using the nni_aio_get_prov_data. We should probably consider eliminating nni_aio_get_prov_data, and co, and changing the prov_extra to reflect prov_data. Later.
*	fixes #648 REQ protocol can hang on close	Garrett D'Amore	2018-08-14
\| \| \| \| \| \| \| \| \| \| \|	Actually the problem was in socket core, in particular in the shutdown code. The socket shutdown is supposed to ensure that no pipes were present on the socket, so that protocols need not concern themselves with this. The code unfortunately was busted, due to an ordering problem compounded by a race condition. This fixes that, and changes the REQ protocol to avoid the blocking condition altogether, and sprinkles a few assertions to validate these rules are being adhered to.
*	fixes #628 Hang in closing REQ	Garrett D'Amore	2018-08-07
\| \| \| \| \| \|	This adds a proper boolean condition for the pipe being closed (removing the unused sending flag), and adds checks for both the pipe closed and the socket closed flags at key points.
*	fixes #611 Memory Leaks under Windows	Garrett D'Amore	2018-08-06
\| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #622 incorrect assumptions about malloc(0) Windows actually allocates an object of size zero when calling malloc on size zero. This is unusual behavior, and we just add logic to work more like malloc on POSIX systems. Other systems can return non-NULL objects to fixed pages here. We think the best option here is to uniformly return NULL from our APIs in these circumstances, and to include testing to validate that.
*	fixes #568 Want a single reader/write lock on socket child objects	Garrett D'Amore	2018-07-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #170 Make more use of reaper This is a complete restructure/rethink of how child objects interact with the socket. (This also backs out #576 as it turns out not to be needed.) While 568 says reader/writer lock, for now we have settled for a single writer lock. Its likely that this is sufficient. Essentially we use the single socket lock to guard lists of the socket children. We also use deferred deletion in the idhash to facilitate teardown, which means endpoint closes are no longer synchronous. We use the reaper to clean up objects when the reference count drops to zero. We make a special exception for pipes, since they really are not reference counted by their parents, and they are leaf objects anyway. We believe this addresses the main outstanding race conditions in a much more correct and holistic way. Note that endpoint shutdown is a little tricky, as it makes use of atomic flags to guard against double entry, and against recursive lock entry. This is something that would be nice to make a bit more obvious, but what we have is safe, and the complexity is at least confined to one place.
*	fixes #572 Several locking errors found	Garrett D'Amore	2018-07-03
\| \| \| \| \| \| \| \| \| \|	fixes #573 atomic flags could help This introduces a new atomic flag, and reduces some of the global locking. The lock refactoring work is not yet complete, but this is a positive step forward, and should help with certain things. While here we also fixed a compile warning due to incorrect types.
*	fixes #540 nni_ep_opttype serves no purpose	Garrett D'Amore	2018-06-13
\| \| \| \| \| \| \| \| \| \| \| \|	fixes #538 setopt should have an explicit chkopt routine fixes #537 Internal TCP API needs better name separation fixes #524 Option types should be "typed" This is a rework of the option management code, to make it both clearer and to prepare for further work to break up endpoints. This reduces a certain amount of dead or redundant code, and actually saves cycles when setting options, as some loops were not terminated that should have been.
*	fixes #484 crashes in websocket transport	Garrett D'Amore	2018-05-29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #490 posix_epdesc use-after-free bug fixes #489 Sanitizer based testing would help fixes #492 Numerous memory leaks found with sanitizer This introduces support for compiler-based sanitizers when using clang or gcc (and not on Windows). See NNG_SANITIZER for possible settings such as "thread" or "address". Furthermore, we have fixed the issues we found with both the thread and address sanitizers. We believe that the thread issues pointed to a low frequency use-after-free responsible for rare crashes in some of the tests. The tests generally have their timeouts doubled when running under a sanitizer, to account for the extra long times that the sanitizer can cause these to take. While here, we also changed the compat_ws test to avoid a particularly painful and time consuming DNS lookup, and we made the nngcat_unlimited test a bit more robust by waiting before sending traffic.
*	fixes #459 SUB should be more aggressive about discarding messages	Garrett D'Amore	2018-05-21
\| \| \| \| \| \| \| \| \|	As part of this code fix, we needed to add filtering support to the msgq_tryput code path -- it turns out that code path was bypassing the filterfn altogether. Eventually we'll remove all this filtering stuff from the msgq code and replace it with inline filtering directly in sub.
*	fixes #449 Want more flexible pipe events	Garrett D'Amore	2018-05-17
\| \| \| \| \| \| \| \| \|	This changes the signature of nng_pipe_notify(), and the associated events. The documentation is updated to reflect this. We have also broken the lock up so that we don't hold the master socket lock for some of these things, which may have beneficial impact on performance.
*	fixes #441 Unintentional semantic in bus protocol	Garrett D'Amore	2018-05-17
\|
*	fixes #440 leak in bus protocol	Garrett D'Amore	2018-05-16
\|
*	fixes #419 want to nni_aio_stop without blocking (#428)	Garrett D'Amore	2018-05-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* fixes #419 want to nni_aio_stop without blocking This actually introduces an nni_aio_close() API that causes nni_aio_begin to return NNG_ECLOSED, while scheduling a callback on the AIO to do an NNG_ECLOSED as well. This should be called in non-blocking close() contexts instead of nni_aio_stop(), and the cases where we call nni_aio_fini() multiple times are updated updated to add nni_aio_stop() calls on all "interlinked" aios before finalizing them. Furthermore, we call nni_aio_close() as soon as practical in the close path. This closes an annoying race condition where the callback from a lower subsystem could wind up rescheduling an operation that we wanted to abort.
*	fixes #352 aio lock is burning hot	Garrett D'Amore	2018-05-14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #326 consider nni_taskq_exec_synch() fixes #410 kqueue implementation could be smarter fixes #411 epoll_implementation could be smarter fixes #426 synchronous completion can lead to panic fixes #421 pipe close race condition/duplicate destroy This is a major refactoring of two significant parts of the code base, which are closely interrelated. First the aio and taskq framework have undergone a number of simplifications, and improvements. We have ditched a few parts of the internal API (for example tasks no longer support cancellation) that weren't terribly useful but added a lot of complexity, and we've made aio_schedule something that now checks for cancellation or other "premature" completions. The aio framework now uses the tasks more tightly, so that aio wait can devolve into just nni_task_wait(). We did have to add a "task_prep()" step to prevent race conditions. Second, the entire POSIX poller framework has been simplified, and made more robust, and more scalable. There were some fairly inherent race conditions around the shutdown/close code, where we thought we were synchronizing against the other thread, but weren't doing so adequately. With a cleaner design, we've been able to tighten up the implementation to remove these race conditions, while substantially reducing the chance for lock contention, thereby improving scalability. The illumos poller also got a performance boost by polling for multiple events. In highly "busy" systems, we expect to see vast reductions in lock contention, and therefore greater scalability, in addition to overall improved reliability. One area where we currently can do better is that there is still only a single poller thread run. Scaling this out is a task that has to be done differently for each poller, and carefuly to ensure that close conditions are safe on all pollers, and that no chance for deadlock/livelock waiting for pfd finalizers can occur.
*	fixes #424 reqstress low frequency crash	Garrett D'Amore	2018-05-09
\|
*	fixes #342 Want Surveyor/Respondent context support	Garrett D'Amore	2018-04-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #360 core should nng_aio_begin before nng_aio_finish_error fixes #361 nng_send_aio should check for NULL message fixes #362 nni_msgq does not signal pollable on certain events This adds support for contexts for both sides of the surveyor pattern. Prior to this commit, the raw mode was completely broken, and there were numerous other bugs found and fixed. This integration includes much deeper validation of this pattern. Some changes to the core and other patterns have been made, where it was obvioius that we could make such improvements. (The obviousness stemming from the fact that RESPONDENT in particular is very closely derived from REP.)
*	fixes #368 context options could be empty	Garrett D'Amore	2018-04-24
\|
*	fixes #346 nng_recv() sometimes acts on null `msg` pointer	Garrett D'Amore	2018-04-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This closes a fundamental flaw in the way aio structures were handled. In paticular, aio expiration could race ahead, and fire before the aio was properly registered by the provider. This ultimately led to the possibility of duplicate completions on the same aio. The solution involved breaking up nni_aio_start into two functions. nni_aio_begin (which can be run outside of external locks) simply validates that nni_aio_fini() has not been called, and clears certain fields in the aio to make it ready for use by the provider. nni_aio_schedule does the work to register the aio with the expiration thread, and should only be called when the aio is actually scheduled for asynchronous completion. nni_aio_schedule_verify does the same thing, but returns NNG_ETIMEDOUT if the aio has a zero length timeout. This change has a small negative performance impact. We have plans to rectify that by converting nni_aio_begin to use a locklesss flag for the aio->a_fini bit. While we were here, we fixed some error paths in the POSIX subsystem, which would have returned incorrect error codes, and we made some optmizations in the message queues to reduce conditionals while holding locks in the hot code path.
*	fixes #355 Possible use after free in REP	Garrett D'Amore	2018-04-17
\| \| \| \| \| \| \| \| \|	While here I've added some code that should help us backtrack on a crash, by linking back to the pipe from the context when we are queued on that pipes sendq. I'm not sure if we've ever seen these or not, but it could explain certain infrequent crashes we think we've seen.
*	fixes #334 Separate context for state machines from sockets	Garrett D'Amore	2018-04-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This provides context support for REQ and REP sockets. More discussion around this is in the issue itself. Optionally we would like to extend this to the surveyor pattern. Note that we specifically do not support pollable descriptors for non-default contexts, and the results of using file descriptors for polling (NNG_OPT_SENDFD and NNG_OPT_RECVFD) is undefined. In the future, it might be nice to figure out how to factor in optional use of a message queue for users who want more buffering, but we think there is little need for this with cooked mode.
*	fixes #331 replace NNG_OPT_RAW option with constructor	Garrett D'Amore	2018-04-04
\| \| \| \| \| \| \| \| \| \| \| \| \|	This makes the raw mode something that is immutable, determined at socket construction. This is an enabling change for the separate context support coming soon. As a result, this is an API breaking change for users of the raw mode option (NNG_OPT_RAW). There aren't many of them out there. Cooked mode is entirely unaffected. There are changes to tests and documentation included.
*	fixes #329 type checking not done for setopt	Garrett D'Amore	2018-04-04
\|
*	fixes #301 String option handling for getopt	Garrett D'Amore	2018-03-20
\|
*	fixes #296 Typed options should validate option type	Garrett D'Amore	2018-03-20
\| \| \| \| \| \| \| \| \| \| \| \| \|	fixes #302 nng_dialer/listener/pipe_getopt_sockaddr desired This adds plumbing to pass and check the type of options all the way through. NNG_ZT_OPT_ORBIT is type UINT64, but you can use the untyped form to pass two of them if needed. No typed access for retrieving strings yet. I think this should allocate a pointer and copy that out, but that's for later.
*	fixes #295 boolean options should use C99 bool type	Garrett D'Amore	2018-03-18
\| \| \| \| \| \| \| \| \| \| \|	fixes #275 nng_pipe_getopt_ptr() missing? fixes #285 nng_setopt_ptr MIS fixes #297 nng_listener/dialer_close does not validate mode This change adds some missing APIs, and changes others. In particular, certain options are now of type bool, with size of just one. This is a breaking change for code that uses those options -- NNG_OPT_RAW, NNG_OPT_PAIR1_POLY, NNG_OPT_TLS_VERIFIED.
*	fixes #240 nngcat is MIA	Garrett D'Amore	2018-02-28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is intended to provide compatibility with, and has been tested against, legacy nanocat. There are a few differences though. At this time support for the alias names (where argv[0] is set to something like nngreq or somesuch) is missing. By default this library operations without NNG_FLAG_NONBLOCK on dial and listen, so that failures here are immediately diagnosable. (This behavior can be changed with the --async flag.) By default --pair means PAIRv1, but you can specify --pair0 or --pair1 explicitly. (There is also a --compat mode, and in that mode --pair means PAIRv0. The --compat mode also turns on NNG_FLAG_NONBLOCK by default.) The "quoted" mode also quotes tabs. (Legacy nanocat did not.) It is possible to connect to multiple peers by using the --dial or --listen (or similar) options multiple times. Shorthands can be used for long options that are not ambiguous. For example, --surv can be used to mean surveyor, but --re is invalid because it can mean req, rep, or respondent. We assume you have a reasonable standard C environment. This won't work in embedded environments without support for FILE *. TLS options are missing but to be added soon. A man page is still to be written.
*	CMake & CPack improvements.	Garrett D'Amore	2018-02-21
\| \| \| \| \| \| \| \| \|	These are incremental updates... we avoid using install() in the subdirectories, so that we can adapt properly to them in the single parent directory. We have started some of the work to improve support for CPack. This is still not yet done, but work in progress.
*	fixes #234 Investigate enabling more verbose compiler warnings	Garrett D'Amore	2018-02-14
\| \| \| \| \| \| \|	We enabled verbose compiler warnings, and found a lot of issues. Some of these were even real bugs. As a bonus, we actually save some initialization steps in the compat layer, and avoid passing some variables we don't need.
*	fixes #173 Define public HTTP server API	Garrett D'Amore	2018-02-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This introduces enough of the HTTP API to support fully server applications, including creation of websocket style protocols, pluggable handlers, and so forth. We have also introduced scatter/gather I/O (rudimentary) for aios, and made other enhancements to the AIO framework. The internals of the AIOs themselves are now fully private, and we have eliminated the aio->a_addr member, with plans to remove the pipe and possibly message members as well. A few other minor issues were found and fixed as well. The HTTP API includes request, response, and connection objects, which can be used with both servers and clients. It also defines the HTTP server and handler objects, which support server applications. Support for client applications will require a client object to be exposed, and that should be happening shortly. None of this is "documented" yet, bug again, we will follow up shortly.
*	fixes #196 surveyor pattern hangs after second survey	Garrett D'Amore	2018-01-09
\|
*	fixes #147 surveyor protocol needs NNG_OPT_MAXTTL	Garrett D'Amore	2017-11-03
\|
*	fixes #143 Protocols and transports should be "configurable"	Garrett D'Amore	2017-11-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This makes all the protocols and transports optional. All of them except ZeroTier are enabled by default, but you can now disable them (remove from the build) with cmake options. The test suite is modified so that tests still run as much as they can, but skip over things caused by missing functionality from the library (due to configuration). Further, the constant definitions and prototypes for functions that are specific to transports or protocols are moved into appropriate headers, which should be included directly by applications wishing to use these. We have also added and improved documentation -- all of the transports are documented, and several more man pages for protocols have been added. (Req/Rep and Surveyor are still missing.)
*	fixes #137 Remove public access to numeric protocols	Garrett D'Amore	2017-10-31
\|