1.12 Upgrade #5062
Comments
@sba30 Thanks for reporting the issue. I guess the error message is related to an issue when decoding a PacketIn2 message with a fragmented DNS packet. Could you help provide the OVS logs?
Can you also share the full antrea-agent log (starting from the beginning)? Even if antrea-agent receives a fragmented DNS packet, it should keep running. It seems to be killed by kubelet because of a failed liveness probe, likely due to the "ovs" check: antrea/pkg/agent/apiserver/apiserver.go Lines 186 to 191 in 141224a
The early logs may help us understand why the ovs check is not ready.
After an offline sync with @sba30, I confirmed that these OVS error logs are printed on the affected setup,
The errors show that an issue exists when encoding an OpenFlow message. Apart from the resume message encoding error, I also observed that the flow entry for packet_in on TCP DNS responses is not installed as expected: the failed flow would send all TCP packets marked with flags "+ack+psh" to antrea-agent, which may indirectly lead to this failure. I opened another issue, #5077, to track it.
Describe the bug
To Reproduce
Upgrade from 1.8 to 1.12
Expected
Upgrade to be successful and pods running.
Actual behavior
Agent logs are flooded with this error:
E0530 15:53:46.969895 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:46.980139 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=26529 actualLength=1396
I0530 15:53:46.980159 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
E0530 15:53:46.980372 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:46.989486 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=43364 actualLength=1396
I0530 15:53:46.989513 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
E0530 15:53:46.989734 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:47.000031 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=14966 actualLength=1396
I0530 15:53:47.000060 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
E0530 15:53:47.000267 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:47.010512 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=5891 actualLength=22
I0530 15:53:47.010551 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: buffer size too small"
E0530 15:53:47.010772 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:47.019994 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=28525 actualLength=1396
I0530 15:53:47.020027 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
I0530 15:53:47.030227 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=25398 actualLength=1396
I0530 15:53:47.030247 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
I0530 15:53:47.039496 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=11572 actualLength=1396
I0530 15:53:47.039522 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
I0530 15:53:47.049930 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=12465 actualLength=1396
I0530 15:53:47.049956 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
I0530 15:53:47.059833 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=26990 actualLength=1158
I0530 15:53:47.059850 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: bad rdata"
E0530 15:53:47.060144 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:47.070512 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=43713 actualLength=1190
I0530 15:53:47.070533 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: buffer size too small"
E0530 15:53:47.070773 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:47.080502 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=5891 actualLength=445
I0530 15:53:47.080532 1 fqdn.go:758] "Unable to unpack the DNS response partially, skipping it" err="dns: buffer size too small"
I0530 15:53:47.082991 1 traceflow_controller.go:188] Shutting down AntreaAgentTraceflowController
I0530 15:53:47.083009 1 agent.go:867] Stopping Antrea agent
I0530 15:53:47.083021 1 controller.go:188] Shutting down ExternalIPPoolController
I0530 15:53:47.083031 1 node_route_controller.go:365] Shutting down AntreaAgentNodeRouteController
I0530 15:53:47.083036 1 requestheader_controller.go:183] Shutting down RequestHeaderAuthRequestController
I0530 15:53:47.083053 1 ip_scheduler.go:241] "Shutting down Egress IP scheduler"
I0530 15:53:47.083023 1 configmap_cafile_content.go:223] "Shutting down controller" name="client-ca::kube-system::extension-apiserver-authentication::client-ca-file"
I0530 15:53:47.083055 1 cluster.go:352] "Shutting down" controllerName="MemberListCluster"
I0530 15:53:47.083163 1 secure_serving.go:255] Stopped listening on [::]:10350
I0530 15:53:47.083058 1 server.go:701] Shutting down CNI server
I0530 15:53:47.083064 1 discoverer.go:105] Stopping ServiceCIDRDiscoverer
I0530 15:53:47.083001 1 status_controller.go:217] Shutting down NetworkPolicy StatusController
I0530 15:53:47.083091 1 object_count_tracker.go:151] "StorageObjectCountTracker pruner is exiting"
I0530 15:53:47.083093 1 channel.go:87] "Stopping SubscribableChannel" name="PodUpdate"
I0530 15:53:47.083010 1 egress_controller.go:330] Shutting down AntreaAgentEgressController
I0530 15:53:47.083105 1 configmap_cafile_content.go:223] "Shutting down controller" name="client-ca::kube-system::extension-apiserver-authentication::requestheader-client-ca-file"
I0530 15:53:47.083104 1 configmap_cafile_content.go:223] "Shutting down controller" name="antrea-ca::kube-system::antrea-ca::ca.crt"
I0530 15:53:47.083119 1 tlsconfig.go:255] "Shutting down DynamicServingCertificateController"
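When a log is flooded like the one above, a quick tally by severity and source location can show which messages dominate before sharing the full file. The following is a minimal sketch in Go that assumes the klog header layout visible in the excerpt (severity letter, `mmdd`, time, thread ID, `file:line]`). The function name `tally` and the sample input are illustrative only.

```go
package main

import (
	"bufio"
	"fmt"
	"regexp"
	"sort"
	"strings"
)

// klogLine matches the header of a klog-formatted line, for example
// `E0530 15:53:46.969895 1 ofSwitch.go:267] ...`, capturing the severity
// letter and the file:line location.
var klogLine = regexp.MustCompile(`^([IWEF])\d{4} \d{2}:\d{2}:\d{2}\.\d{6}\s+\d+ ([^\s:]+:\d+)\] `)

// tally counts log lines per "severity file:line" key. Lines that do not
// match the klog header are ignored.
func tally(log string) map[string]int {
	counts := make(map[string]int)
	sc := bufio.NewScanner(strings.NewReader(log))
	for sc.Scan() {
		if m := klogLine.FindStringSubmatch(sc.Text()); m != nil {
			counts[m[1]+" "+m[2]]++
		}
	}
	return counts
}

func main() {
	// Three lines copied from the agent log in this issue.
	sample := `E0530 15:53:46.969895 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER
I0530 15:53:46.980139 1 fqdn.go:756] "Received a fragmented DNS response, partially unpacking it" lengthField=26529 actualLength=1396
E0530 15:53:46.980372 1 ofSwitch.go:267] Received OpenFlow1.5 error: unknown error with type 14, code 1, vendor 0 on message OFPT_EXPERIMENTER`

	counts := tally(sample)
	keys := make([]string, 0, len(counts))
	for k := range counts {
		keys = append(keys, k)
	}
	sort.Strings(keys) // stable output order
	for _, k := range keys {
		fmt.Printf("%-24s %d\n", k, counts[k])
	}
}
```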
Versions:
Additional context