Async CosmosDB client raises: 'Got more than 8190 bytes (11994) when reading Header value is too long.' #27625

ghost · 2022-11-21T13:23:25Z

Describe the bug
We recently switched some of our applications to the new async CosmosDB client that was released in azure-cosmos 4.3.0. Since then we started seeing this error when iterating over large result sets:

Got more than 8190 bytes (10904) when reading Header value is too long.

The error is thrown by aiohttp while fetching/parsing the next result on the AsyncItemPaged returned by a query

The unpredictable occurrence of this error makes the new async client more or less unusable

Exception or Stack Trace

Traceback (most recent call last):
File " ./aiohttp/client_reqrep.py", line 899, in start message, payload = await protocol.read() # type: ignore[union-attr]
File " ./aiohttp/streams.py", line 616, in read await self._waiter
File " ./aiohttp/client_proto.py", line 213, in data_received messages, upgraded, tail = self._parser.feed_data(data)
File "aiohttp/_http_parser.pyx", line 551, in aiohttp._http_parser.HttpParser.feed_data
File "aiohttp/_http_parser.pyx", line 721, in aiohttp._http_parser.cb_on_header_value aiohttp.http_exceptions.LineTooLong: 400, message='Got more than 8190 bytes (11994) when reading Header value is too long.'
The above exception was the direct cause of the following exception: Traceback (most recent call last):
File " ./azure/core/pipeline/transport/_aiohttp.py", line 229, in send result = await self.session.request( # type: ignore
File " ./aiohttp/client.py", line 560, in _request await resp.start(conn)
File " ./aiohttp/client_reqrep.py", line 901, in start raise ClientResponseError( aiohttp.client_exceptions.ClientResponseError: 400, message='Got more than 8190 bytes (11994) when reading Header value is too long.', url=URL('...')
(The above exception was the direct cause of the following exception: Traceback (most recent call last):
File "/home/site/wwwroot/common/az_functions/events_to_aggregate_container.py", line 80, in transform_and_send async for event in events_for_agg:
File " ./hermes/common/aio/wrapped_iterator.py", line 13, in anext return self.mapper(await self.delegate.anext())
File " ./azure/core/async_paging.py", line 154, in anext return await self.anext()
File " ./azure/core/async_paging.py", line 157, in anext self._page = await self._page_iterator.anext()
File " ./azure/core/async_paging.py", line 99, in anext self._response = await self._get_next(self.continuation_token)
File " ./azure/cosmos/aio/_query_iterable_async.py", line 102, in _fetch_next block = await self._ex_context.fetch_next_block()
File " ./azure/cosmos/_execution_context/aio/execution_dispatcher.py", line 89, in fetch_next_block return await self._execution_context.fetch_next_block()
File " ./azure/cosmos/_execution_context/aio/base_execution_context.py", line 82, in fetch_next_block return await self._fetch_next_block()
File " ./azure/cosmos/_execution_context/aio/base_execution_context.py", line 170, in _fetch_next_block return await self._fetch_items_helper_with_retries(self._fetch_function)
File " ./azure/cosmos/_execution_context/aio/base_execution_context.py", line 144, in _fetch_items_helper_with_retries return await _retry_utility_async.ExecuteAsync(self._client, self._client._global_endpoint_manager, callback)
File " ./azure/cosmos/aio/_retry_utility_async.py", line 81, in ExecuteAsync result = await ExecuteFunctionAsync(function, *args, **kwargs)
File " ./azure/cosmos/aio/_retry_utility_async.py", line 138, in ExecuteFunctionAsync return await function(*args, **kwargs)
File " ./azure/cosmos/_execution_context/aio/base_execution_context.py", line 142, in callback return await self._fetch_items_helper_no_retries(fetch_function)
File " ./azure/cosmos/_execution_context/aio/base_execution_context.py", line 125, in _fetch_items_helper_no_retries (fetched_items, response_headers) = await fetch_function(new_options)
File " ./azure/cosmos/aio/_cosmos_client_connection_async.py", line 1722, in fetch_fn await self.__QueryFeed(
File " ./azure/cosmos/aio/_cosmos_client_connection_async.py", line 2291, in __QueryFeed result, self.last_response_headers = await self.__Post(path, request_params, query, req_headers, **kwargs)
File " ./azure/cosmos/aio/_cosmos_client_connection_async.py", line 751, in __Post return await asynchronous_request.AsynchronousRequest(
File " ./azure/cosmos/aio/_asynchronous_request.py", line 175, in AsynchronousRequest return await _retry_utility_async.ExecuteAsync(
File " ./azure/cosmos/aio/_retry_utility_async.py", line 79, in ExecuteAsync result = await ExecuteFunctionAsync(function, global_endpoint_manager, *args, **kwargs)
File " ./azure/cosmos/aio/_retry_utility_async.py", line 138, in ExecuteFunctionAsync return await function(*args, **kwargs)
File " ./azure/cosmos/aio/_asynchronous_request.py", line 100, in _Request response = await _PipelineRunFunction(
File " ./azure/cosmos/aio/_asynchronous_request.py", line 141, in _PipelineRunFunction return await pipeline_client._pipeline.run(request, **kwargs)
File " ./azure/core/pipeline/_base_async.py", line 215, in run return await first_node.send(pipeline_request)
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore [Previous line repeated 1 more time]
File " ./azure/cosmos/aio/_retry_utility_async.py", line 194, in send raise err
File " ./azure/cosmos/aio/_retry_utility_async.py", line 171, in send response = await self.next.send(request)
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore
File " ./azure/core/pipeline/_base_async.py", line 83, in send response = await self.next.send(request) # type: ignore [Previous line repeated 1 more time]
File " ./azure/core/pipeline/_base_async.py", line 116, in send await self._sender.send(request.http_request, **request.context.options),
File " ./azure/core/pipeline/transport/_aiohttp.py", line 255, in send raise ServiceResponseError(err, error=err) from err azure.core.exceptions.ServiceResponseError: 400, message='Got more than 8190 bytes (11994) when reading Header value is too long.', url=URL('...')

To Reproduce
Reproducing the error is not straightforward. We have seen cases where we can easily and without errors iterate over a result set returning 100k+ items while adding or changing a single constraint in the where clause can break the iteration after the first 1k+ results:

SELECT * FROM c
runs

SELECT * FROM c WHERE c.some_field = 'SomeValue'
fails

SELECT * FROM c WHERE lower(c.some_field) = 'somevalue'
runs

SELECT * FROM c WHERE StringEquals(c.some_field, 'SomeValue', true)
fails

Code Snippet
result = [item async for item in container.query_items(query="SELECT * FROM c", max_item_count=100000, enable_cross_partition_query=True)]

Additional info
Two years ago an apparently similar issue was raised for the Java SDK:

Azure/azure-sdk-for-java#6069

Setup:

Python 3.9
azure-cosmos 4.3.0/4.3.1b

The text was updated successfully, but these errors were encountered:

zachschillaci27 · 2022-11-28T08:35:49Z

Seems related to aio-libs/aiohttp#2304

xiangyan99 · 2022-11-28T17:33:49Z

Thanks for the feedback, we’ll investigate asap.

simorenoh · 2022-12-16T21:50:11Z

Hi @roekoe-loterij, thank you for using our SDK and opening this issue. I do believe this might have to do with the issue that was linked above, definitely doesn't seem like something we touch directly in the SDK.

I'm wondering, this is not an issue you have encountered with the sync client at all right? Trying to see if this is something that extends beyond aiohttp.

ghost · 2023-03-11T08:03:08Z

Hi, we're sending this friendly reminder because we haven't heard back from you in a while. We need more information about this issue to help address it. Please be sure to give us your input within the next 7 days. If we don't hear back from you within 14 days of this comment the issue will be automatically closed. Thank you!

antoinegaston · 2023-03-27T14:29:02Z

Hello guys I encounter the same issue as the author, with a growing header size with a growing request length.

ghost · 2023-04-25T14:17:06Z

Bit late with my feedback, but the error only occurs when using the aiohttp library as stated. It is directly related to header config options aiohttp sets by default. I'm pretty sure this error still occurs. We worked around it by patching the aiohttp.client_proto.ResponseHandler.set_response_params method that by default hard codes the configuration of the HttpResponseParser.

kushagraThapar · 2023-05-11T16:24:13Z

Re-opening this issue, as one way to fix this on Cosmos DB side is with the feature which allows users to set continuation token size limit.

@xiangyan99 did we make any progress on this from aiohttp client params?

kushagraThapar · 2023-05-11T16:24:42Z

@bambriz can you please take a look at this, thanks!

xiangyan99 · 2023-05-12T17:47:32Z

Re-opening this issue, as one way to fix this on Cosmos DB side is with the feature which allows users to set continuation token size limit.

@xiangyan99 did we make any progress on this from aiohttp client params?

We allow users/SDK developers to provide custom transport.

You can customize aiohttp and use it when creating clients.

github-actions · 2023-05-19T21:33:57Z

Hi @roekoe-loterij, we're sending this friendly reminder because we haven't heard back from you in 7 days. We need more information about this issue to help address it. Please be sure to give us your input. If we don't hear back from you within 14 days of this comment the issue will be automatically closed. Thank you!

bambriz · 2023-06-01T22:33:01Z

There is an open PR at the moment that will fix this issue by implementing continuation token limits to the python sdk. By setting the limit to be 8KB or under it will prevent this issue from occurring. You can check the progress of PR #30731

simorenoh · 2023-07-24T16:13:27Z

Leaving this open until the changes are available with this month's release.

simorenoh · 2023-07-26T16:29:16Z

Changes are now available in version 4.4.1b1: https://pypi.org/project/azure-cosmos/4.4.1b1/

ivandigiusto · 2023-10-10T00:29:00Z

Just wanted to leave a note here that the name of the parameter for the continuation toke limit has changed. The value to use at this time seems to be:
continuation_token_limit

Setting it to 1 or 2 seems to work.

ghost added the Mgmt This issue is related to a management-plane library. label Nov 21, 2022

github-actions bot added the needs-triage This is a new issue that needs to be triaged to the appropriate team. label Nov 21, 2022

azure-sdk added Azure.Core Client This issue points to a problem in the data-plane of the library. needs-team-triage This issue needs the team to triage. labels Nov 21, 2022

xiangyan99 added Cosmos CXP Attention and removed Mgmt This issue is related to a management-plane library. Azure.Core needs-triage This is a new issue that needs to be triaged to the appropriate team. needs-team-triage This issue needs the team to triage. labels Nov 28, 2022

xiangyan99 assigned simorenoh Nov 28, 2022

simorenoh added the needs-author-feedback More information is needed from author to address the issue. label Mar 3, 2023

ghost added the no-recent-activity There has been no recent activity on this issue. label Mar 11, 2023

ghost closed this as completed Mar 26, 2023

ghost removed the no-recent-activity There has been no recent activity on this issue. label Mar 27, 2023

kushagraThapar reopened this May 11, 2023

kushagraThapar assigned bambriz and unassigned simorenoh May 11, 2023

github-actions bot added the no-recent-activity There has been no recent activity on this issue. label May 19, 2023

bambriz mentioned this issue Jun 1, 2023

Implement continuation token size limit #30600

Merged

6 tasks

github-actions bot removed the no-recent-activity There has been no recent activity on this issue. label Jun 1, 2023

bambriz closed this as completed in #30600 Jun 8, 2023

bambriz mentioned this issue Jun 13, 2023

Implement Response Continuation Token Size Limit when querying items #30731

Merged

6 tasks

simorenoh reopened this Jul 24, 2023

simorenoh closed this as completed Jul 26, 2023

github-actions bot locked and limited conversation to collaborators Jan 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Async CosmosDB client raises: 'Got more than 8190 bytes (11994) when reading Header value is too long.' #27625

Async CosmosDB client raises: 'Got more than 8190 bytes (11994) when reading Header value is too long.' #27625

ghost commented Nov 21, 2022

zachschillaci27 commented Nov 28, 2022

xiangyan99 commented Nov 28, 2022

simorenoh commented Dec 16, 2022

ghost commented Mar 11, 2023

antoinegaston commented Mar 27, 2023

ghost commented Apr 25, 2023

kushagraThapar commented May 11, 2023

kushagraThapar commented May 11, 2023

xiangyan99 commented May 12, 2023

github-actions bot commented May 19, 2023

bambriz commented Jun 1, 2023 •

edited

simorenoh commented Jul 24, 2023

simorenoh commented Jul 26, 2023

ivandigiusto commented Oct 10, 2023

Async CosmosDB client raises: 'Got more than 8190 bytes (11994) when reading Header value is too long.' #27625

Async CosmosDB client raises: 'Got more than 8190 bytes (11994) when reading Header value is too long.' #27625

Comments

ghost commented Nov 21, 2022

zachschillaci27 commented Nov 28, 2022

xiangyan99 commented Nov 28, 2022

simorenoh commented Dec 16, 2022

ghost commented Mar 11, 2023

antoinegaston commented Mar 27, 2023

ghost commented Apr 25, 2023

kushagraThapar commented May 11, 2023

kushagraThapar commented May 11, 2023

xiangyan99 commented May 12, 2023

github-actions bot commented May 19, 2023

bambriz commented Jun 1, 2023 • edited

simorenoh commented Jul 24, 2023

simorenoh commented Jul 26, 2023

ivandigiusto commented Oct 10, 2023

bambriz commented Jun 1, 2023 •

edited