2018-03-29 16:27:26 +08:00
|
|
|
|
## 部署集群 DNS
|
2017-12-08 16:17:16 +08:00
|
|
|
|
|
2019-04-11 22:17:25 +08:00
|
|
|
|
DNS 是 k8s 集群首先需要部署的,集群中的其他 pods 使用它提供域名解析服务;主要可以解析 `集群服务名 SVC` 和 `Pod hostname`;目前 k8s v1.9+ 版本可以有两个选择:`kube-dns` 和 `coredns`(推荐),可以选择其中一个部署安装。
|
2017-12-08 16:17:16 +08:00
|
|
|
|
|
2018-03-29 16:27:26 +08:00
|
|
|
|
### 部署 dns
|
2017-12-08 16:17:16 +08:00
|
|
|
|
|
2018-03-29 16:27:26 +08:00
|
|
|
|
配置文件参考 `https://github.com/kubernetes/kubernetes` 项目目录 `kubernetes/cluster/addons/dns`
|
2018-01-27 13:54:12 +08:00
|
|
|
|
|
2019-04-11 22:17:25 +08:00
|
|
|
|
+ 安装
|
2018-01-27 13:54:12 +08:00
|
|
|
|
|
2019-06-10 22:47:20 +08:00
|
|
|
|
目前 kubeasz 已经自动集成安装 dns 组件,配置模板位于`roles/cluster-addon/templates/`目录
|
2017-12-08 17:43:51 +08:00
|
|
|
|
|
|
|
|
|
+ 集群 pod默认继承 node的dns 解析,修改 kubelet服务启动参数 --resolv-conf="",可以更改这个特性,详见 kubelet 启动参数
|
|
|
|
|
|
2018-03-29 16:27:26 +08:00
|
|
|
|
### 验证 dns服务
|
2017-12-08 17:43:51 +08:00
|
|
|
|
|
|
|
|
|
新建一个测试nginx服务
|
|
|
|
|
|
|
|
|
|
`kubectl run nginx --image=nginx --expose --port=80`
|
|
|
|
|
|
|
|
|
|
确认nginx服务
|
|
|
|
|
|
|
|
|
|
``` bash
|
|
|
|
|
kubectl get pod|grep nginx
|
|
|
|
|
nginx-7cbc4b4d9c-fl46v 1/1 Running 0 1m
|
|
|
|
|
kubectl get svc|grep nginx
|
|
|
|
|
nginx ClusterIP 10.68.33.167 <none> 80/TCP 1m
|
|
|
|
|
```
|
|
|
|
|
|
2019-02-15 17:31:31 +08:00
|
|
|
|
测试pod alpine
|
2017-12-08 17:43:51 +08:00
|
|
|
|
|
|
|
|
|
``` bash
|
2019-02-15 17:31:31 +08:00
|
|
|
|
kubectl run test --rm -it --image=alpine /bin/sh
|
2017-12-08 17:43:51 +08:00
|
|
|
|
If you don't see a command prompt, try pressing enter.
|
|
|
|
|
/ # cat /etc/resolv.conf
|
|
|
|
|
nameserver 10.68.0.2
|
|
|
|
|
search default.svc.cluster.local. svc.cluster.local. cluster.local.
|
|
|
|
|
options ndots:5
|
|
|
|
|
# 测试集群内部服务解析
|
2018-07-19 15:07:05 +08:00
|
|
|
|
/ # nslookup nginx.default.svc.cluster.local
|
2017-12-08 17:43:51 +08:00
|
|
|
|
Server: 10.68.0.2
|
|
|
|
|
Address 1: 10.68.0.2 kube-dns.kube-system.svc.cluster.local
|
|
|
|
|
|
|
|
|
|
Name: nginx
|
|
|
|
|
Address 1: 10.68.33.167 nginx.default.svc.cluster.local
|
2018-07-19 15:07:05 +08:00
|
|
|
|
/ # nslookup kubernetes.default.svc.cluster.local
|
2017-12-08 17:43:51 +08:00
|
|
|
|
Server: 10.68.0.2
|
|
|
|
|
Address 1: 10.68.0.2 kube-dns.kube-system.svc.cluster.local
|
|
|
|
|
|
|
|
|
|
Name: kubernetes
|
|
|
|
|
Address 1: 10.68.0.1 kubernetes.default.svc.cluster.local
|
|
|
|
|
# 测试外部域名的解析,默认集成node的dns解析
|
|
|
|
|
/ # nslookup www.baidu.com
|
|
|
|
|
Server: 10.68.0.2
|
|
|
|
|
Address 1: 10.68.0.2 kube-dns.kube-system.svc.cluster.local
|
|
|
|
|
|
|
|
|
|
Name: www.baidu.com
|
|
|
|
|
Address 1: 180.97.33.108
|
|
|
|
|
Address 2: 180.97.33.107
|
|
|
|
|
/ #
|
|
|
|
|
```
|
2017-12-08 16:17:16 +08:00
|
|
|
|
|
2019-02-15 17:31:31 +08:00
|
|
|
|
- Note1: 如果你使用`calico`网络组件,通过命令`ansible-playbook 90.setup.yml`安装完集群后,直接安装dns组件,可能会出现如下BUG,分析是因为calico分配pod地址时候会从网段的第一个地址(网络地址)开始,详见提交的 [ISSUE #1710](https://github.com/projectcalico/calico/issues/1710),临时解决办法为手动删除POD,重新创建后获取后面的IP地址
|
|
|
|
|
|
|
|
|
|
```
|
|
|
|
|
# BUG出现现象
|
|
|
|
|
$ kubectl get pod --all-namespaces -o wide
|
|
|
|
|
NAMESPACE NAME READY STATUS RESTARTS AGE IP NODE
|
|
|
|
|
default busy-5cc98488d4-s894w 1/1 Running 0 28m 172.20.24.193 192.168.97.24
|
|
|
|
|
kube-system calico-kube-controllers-6597d9c664-nq9hn 1/1 Running 0 1h 192.168.97.24 192.168.97.24
|
|
|
|
|
kube-system calico-node-f8gnf 2/2 Running 0 1h 192.168.97.24 192.168.97.24
|
|
|
|
|
kube-system kube-dns-69bf9d5cc9-c68mw 0/3 CrashLoopBackOff 27 31m 172.20.24.192 192.168.97.24
|
|
|
|
|
|
|
|
|
|
# 解决办法,删除pod,自动重建
|
|
|
|
|
$ kubectl delete pod -n kube-system kube-dns-69bf9d5cc9-c68mw
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
- Note2: 使用``` kubectl run test -it --rm --image=busybox /bin/sh``` 进行解析测试可能会失败, busybox内的nslookup程序有bug, 详见 https://github.com/kubernetes/dns/issues/109
|