CKA Certification

CKA is the Kubernetes administrator certification; I earned one myself (CKA certificate).

Scale the deployment named my-nginx to 10 pods.

Environment setup:

kubectl create deployment my-nginx --image=nginx

Answer:

kubectl scale deployment my-nginx --replicas=10
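
To confirm the scale-out, check the ready replica count, or list the pods via the app=my-nginx label that kubectl create deployment sets by default:

kubectl get deployment my-nginx
kubectl get pods -l app=my-nginx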

Create a multi-container Pod object:

  • an nginx container using the nginx image
  • a redis container using the redis image
  • a tomcat container using the tomcat image
  • a mysql container using the mysql image

Answer:

apiVersion: v1
kind: Pod
metadata:
  name: multi
spec:
  containers:
    - name: nginx
      image: nginx
    - name: redis
      image: redis
    - name: tomcat
      image: tomcat
    - name: mysql
      image: mysql:5.7
      env:
        - name: MYSQL_ROOT_PASSWORD
          value: "mima"
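
A quick sanity check, assuming the manifest above is saved as multi.yml (an assumed file name), is to apply it and wait for all four containers to become ready:

kubectl apply -f multi.yml
kubectl get pod multi   # READY should reach 4/4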

Add a port specification to the nginx container of the front-end deployment, then expose it as a service:

# add a ports section under the corresponding image entry in the container spec
containers:
 - image: nginx
   imagePullPolicy: Always
   name: nginx
   ports:
   - name: http
     protocol: TCP
     containerPort: 80

Expose port 80

kubectl expose deployment front-end --name=front-end-svc --port=80 --target-port=80 --type=NodePort 
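
To confirm the front-end-svc service actually picks up the pods, the endpoints list should not be empty:

kubectl get svc front-end-svc
kubectl get endpoints front-end-svc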

Question:

  1. Add a sidecar based on busybox to the pod legacy-app
  2. The new sidecar runs /bin/sh -c 'tail -n+1 -f /var/log/legacy-app.log'
  3. Use a volume mounted at /var/log to make the log legacy-app.log available to the sidecar container

Answer:

  1. Export the pod to a YAML file: kubectl get pod legacy-app -o yaml > 1.yml

  2. Edit the file: vim 1.yml

    apiVersion: v1
    kind: Pod
    metadata:
      name: legacy-app
    spec:
      containers:
      - name: count
        image: busybox
        args:
        - /bin/sh
        - -c
        - >
          i=0;
          while true;
          do
          echo "$i: $(date)" >> /var/log/legacy-app.log;
          sleep 1;
          done      
        volumeMounts:
        - name: varlog
          mountPath: /var/log
      - name: sidecar
        image: busybox
        args: [/bin/sh, -c, 'tail -n+1 -f /var/log/legacy-app.log']
        volumeMounts:
        - name: varlog
          mountPath: /var/log
      volumes: # the exported yaml already has a volumes block; just add the two lines below to it
        - name: varlog
          emptyDir: {}
    
  3. Apply the change with kubectl apply -f 1.yml. If that fails, first delete the pod with kubectl delete pod legacy-app, then apply again (a one-step alternative is sketched below).
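
Because containers cannot be added to a running pod in place, kubectl replace --force does the delete-and-recreate in a single step; afterwards the sidecar's stream can be checked directly:

kubectl replace --force -f 1.yml
kubectl logs legacy-app -c sidecar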

Create a PV named app-data with a capacity of 2Gi, access mode ReadWriteOnce, using a hostPath located at /srv/app-data.

apiVersion: v1
kind: PersistentVolume
metadata:
  name: app-data
spec:
  capacity:
    storage: 2Gi
  accessModes:
    - ReadWriteOnce
  hostPath:
    path: /srv/app-data
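
Applying the manifest and checking that the PV shows up as Available takes only a moment (pv.yml is an assumed file name):

kubectl apply -f pv.yml
kubectl get pv app-data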

Create a PVC resource object named app-pvc:

  • a capacity of 50Mi
  • access mode ReadWriteOnce
  • created from an existing StorageClass (SC) resource object

Create a Pod resource object named app-pod:

  • uses the nginx image
  • mount path /usr/share/nginx/html

Resize the PVC resource to 100Mi.

Answer:

First, check which StorageClasses (sc) exist:

kubectl get sc

Create the PVC and the Pod:

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: app-pvc
spec:
  storageClassName: standard # put the StorageClass name from kubectl get sc here
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 50Mi
---
apiVersion: v1
kind: Pod
metadata:
  name: app-pod
spec:
  containers:
    - name: app-pod
      image: nginx
      volumeMounts:
        - mountPath: /usr/share/nginx/html
          name: app-data
  volumes:
    - name: app-data
      persistentVolumeClaim:
        claimName: app-pvc

Resize the PVC with kubectl edit pvc/app-pvc --save-config, then save and quit. If it errors out and suggests running kubectl replace -f /tmp/kubectl-edit-1602427990.yaml, the change did not go through; try that command, and if it still fails:

delete the resources you created, edit the YAML file, and recreate the PVC and the Pod.
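
Whether the resize is accepted also depends on the StorageClass having allowVolumeExpansion enabled; if it does, a patch is an equivalent way to request the new size (a sketch):

kubectl patch pvc app-pvc -p '{"spec":{"resources":{"requests":{"storage":"100Mi"}}}}'
kubectl get pvc app-pvc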

Reference: Configure a Pod to Use a PersistentVolume for Storage | Kubernetes

Create a ClusterRole named deployment-clusterrole that allows creating deployments, statefulsets, and daemonsets, and bind it to the serviceaccount cicd-token in the app-team1 namespace.

# create the clusterrole with the name from the question
kubectl create clusterrole deployment-clusterrole --verb=create --resource=deployment,statefulset,daemonset
# create the namespace
kubectl create namespace app-team1
# create the serviceaccount in that namespace
kubectl create serviceaccount cicd-token -n app-team1
# bind the clusterrole to the serviceaccount to grant the permissions
kubectl create clusterrolebinding rb --clusterrole=deployment-clusterrole --serviceaccount=app-team1:cicd-token -n app-team1
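
A quick way to confirm the grant took effect is to impersonate the serviceaccount:

kubectl auth can-i create deployments --as=system:serviceaccount:app-team1:cicd-token -n app-team1
# expected output: yes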

Create a policy named allow-port-from-namespace that allows my-app to connect to big-corp on port 8080:

  1. Access to anything not listening on port 8080 is not allowed
  2. Only pods in the my-app namespace are allowed access

apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-port-from-namespace
  namespace: big-corp
spec:
  podSelector: {} # apply to all pods in the big-corp namespace
  policyTypes:
    - Ingress
  ingress:
    - from:
        - namespaceSelector:
            matchLabels:
              project: my-app
      ports:
        - protocol: TCP
          port: 8080
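
The namespaceSelector above only matches if the my-app namespace actually carries the project=my-app label, so it may need to be added first:

kubectl label namespace my-app project=my-app
kubectl describe networkpolicy allow-port-from-namespace -n big-corp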

Create an ingress named my-ingress:

  • the ingress lives in the app-team namespace
  • the svc named django serves on port 8000
  • the ingress exposes a /django URL path that routes to the django svc

Environment setup:

# create the namespace
kubectl create ns app-team
# create an application for the svc to select
kubectl create deployment django-deployment --image=nginx -n app-team

Answer:

apiVersion: v1
kind: Service
metadata:
  name: django
  namespace: app-team
spec:
  selector:
    app: django-deployment
  ports:
    - protocol: TCP
      port: 8000
      targetPort: 80
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-ingress
  namespace: app-team
  annotations:
    nginx.ingress.kubernetes.io/rewrite-target: /
spec:
  ingressClassName: nginx
  rules:
  - host: "django-ingress.kentxxq.com"
    http:
      paths:
      - path: /django
        pathType: Prefix
        backend:
          service:
            name: django
            port:
              number: 8000
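
Once the ingress controller has reconciled it, a curl against the /django path should return the nginx welcome page (the address below is a placeholder for the controller's IP):

kubectl get ingress -n app-team
curl --resolve django-ingress.kentxxq.com:80:<INGRESS_IP> http://django-ingress.kentxxq.com/django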

Back up the etcd data of the current kubernetes cluster:

  • the etcd endpoint is https://127.0.0.1:2379
  • save it under the /data/backup/ directory with the file name snapshot-etcd.db
  • restore the previously saved /data/backup/snapshot-etcd-previous.db

Switch to the context for this question, then ssh to the master node. Consider sudo -i to switch to the root user.

  1. Run systemctl cat etcd to check whether etcd is deployed as a service, then read the certificate and data-dir locations from its start command
  2. If it is deployed as a container, we have to find those paths ourselves, as shown below

# find the etcd pod
kubectl get pods -n kube-system
...
etcd 1/1     Running   0          8m51s
...

# read the paths from the pod spec
kubectl describe pod/etcd -n kube-system
...
command:
  etcd
  --data-dir=<path>
  --key-file=<path>
  --cert-file=<path>
  --trusted-ca-file=<path>
...

Verify connectivity with the certificate paths you found:

etcdctl member list --cacert=/etc/ssl/etcd/ssl/ca.pem --cert=/etc/ssl/etcd/ssl/admin-master1.pem --key=/etc/ssl/etcd/ssl/admin-master1-key.pem
# or
etcdctl endpoint status --cacert=/etc/ssl/etcd/ssl/ca.pem --cert=/etc/ssl/etcd/ssl/admin-master1.pem --key=/etc/ssl/etcd/ssl/admin-master1-key.pem

Answer:

# the backup only needs to be taken on one node
etcdctl snapshot save /data/backup/snapshot-etcd.db \
--endpoints=https://127.0.0.1:2379 \
--cacert=/xxx/ca.crt \
--cert=/xxx/etcd-client.crt \
--key=/xxx/etcd-client.key
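
etcdctl can also confirm the snapshot file is readable before moving on:

ETCDCTL_API=3 etcdctl snapshot status /data/backup/snapshot-etcd.db --write-out=table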

etcd running as a service (independent of k8s):

# stop the services first to limit the impact
systemctl stop kube-apiserver etcd
# move the old data-dir aside as a backup
mv /var/lib/etcd /var/lib/etcd.bak

# restore the data
ETCDCTL_API=3 etcdctl snapshot restore /data/backup/snapshot-etcd-previous.db --data-dir=/var/lib/etcd ...endpoint...ca...cert...key...
# in production there may be several etcd nodes; in that case add the parameters below
# name of the current node
# --name etcd-0 \
# all cluster members
# --initial-cluster etcd-0=http://host1:2380,etcd-1=http://host2:2380,etcd-2=http://host3:2380  \
# token used when bootstrapping the cluster
# --initial-cluster-token etcd-cluster \
# advertise this node's own peer URL
# --initial-advertise-peer-urls https://host1:2380

# fix ownership
chown -R etcd:etcd /var/lib/etcd
# start etcd
systemctl start etcd
# check cluster health
ETCDCTL_API=3 etcdctl --cacert=/opt/kubernetes/ssl/ca.pem --cert=/opt/kubernetes/ssl/server.pem --key=/opt/kubernetes/ssl/server-key.pem --endpoints=https://host1:2379,https://host2:2379,https://host3:2379 endpoint health
# start the apiserver only after etcd reports healthy
systemctl start kube-apiserver

etcd running as a pod (inside k8s):

# with a kubeadm-installed cluster the static pod manifests live in /etc/kubernetes/manifests, which contains etcd.yaml
# see https://kubernetes.io/zh-cn/docs/reference/setup-tools/kubeadm/implementation-details/
mv /etc/kubernetes/manifests /etc/kubernetes/manifests-bak
# the etcd container will disappear
crictl ps
# back up the old etcd data
mv /var/lib/etcd /var/lib/etcd-bak
# restore the data; adjust the parameters for each master node and run it there
ETCDCTL_API=3 etcdctl snapshot restore /data/backup/snapshot-etcd-previous.db \
--name=etcd-0 \
--data-dir=/var/lib/etcd \
--initial-cluster="kubernetes-master1=https://10.0.0.12:2380,kubernetes-master2=https://10.0.0.13:2380,kubernetes-master3=https://10.0.0.14:2380" \
--initial-cluster-token=etcd-cluster \
--initial-advertise-peer-urls=https://10.0.0.12:2380 \
...endpoint...ca...cert...key...
# move the manifests back to restore the static pods
mv /etc/kubernetes/manifests-bak /etc/kubernetes/manifests
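
After the manifests are moved back, the kubelet recreates the static pods; give it a minute and then confirm etcd and the apiserver answer again:

crictl ps | grep etcd
kubectl get pods -n kube-system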

Schedule the pod nginx-kusc00401 onto a node labeled disk: ssd.

kubectl edit pod nginx-kusc00401

Add or modify the nodeSelector field:

apiVersion: v1
kind: Pod
metadata:
  name: nginx-kusc00401
spec:
  containers:
    - name: nginx
      image: nginx
  nodeSelector:
    disk: ssd
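
The pod only schedules once some node actually carries that label; labeling one and checking the placement covers it (node1 is a placeholder name):

kubectl label node node1 disk=ssd
kubectl get pod nginx-kusc00401 -o wide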

Find the file-not-found log entries of pod bar and record them to a file

kubectl logs bar | grep "file-not-found" > 1.txt

Drain the node node1, ignoring DaemonSets:

kubectl drain node1 --ignore-daemonsets
# to force eviction: --delete-local-data --force (newer kubectl uses --delete-emptydir-data)

# bring the node back
kubectl uncordon node1

Upgrade kubeadm, kubelet, and kubectl on the node to 1.20.1 (skipping the etcd upgrade):

# drain the node
kubectl drain node1 --ignore-daemonsets

# install the target kubeadm version
sudo -i
apt install kubeadm=1.20.1-00 -y

kubeadm upgrade plan
kubeadm upgrade apply v1.20.1 --etcd-upgrade=false

# upgrade kubelet and kubectl
apt install kubelet=1.20.1-00 kubectl=1.20.1-00 -y
# restart kubelet
systemctl restart kubelet
# make the node schedulable again
kubectl uncordon node1
# verify the upgrade
kubectl get nodes

Find, across all namespaces, the pods labeled name=cpu-utilizer sorted by CPU usage:

kubectl top pod -l name=cpu-utilizer --sort-by="cpu" -A

Count the worker nodes that are Ready:

kubectl get nodes | grep worker | grep -w Ready | wc -l

Everything below specifies a namespace explicitly; create it first with kubectl create namespace kentxxq.

Create an NFS PV and PVC. Note that a PV is cluster-scoped, while a PVC is namespaced.

04_kubernetes-storage_pv_pvc.yml

apiVersion: v1
kind: PersistentVolume
metadata:
  name: kentxxq-pv
spec:
  capacity:
    storage: 3Gi
  accessModes:
    - ReadWriteMany
  nfs:
    # the path configured in the NFS server's exports
    path: /data/nfs
    server: 10.0.1.157
---
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: kentxxq-pvc
  # defaults to the default namespace
  namespace: default
spec:
  accessModes:
    - ReadWriteMany
  # if storageClassName is omitted, the default StorageClass may kick in and prevent this PVC from matching the PV
  # check the default StorageClass with: kubectl get sc
  storageClassName: ""
  resources:
    requests:
      storage: 1Gi
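
Applying the file and watching the claim bind confirms the manual PV/PVC match:

kubectl apply -f 04_kubernetes-storage_pv_pvc.yml
kubectl get pv kentxxq-pv
kubectl get pvc kentxxq-pvc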

Related RBAC permissions: 02_kubernetes_sc_rbac.yaml

apiVersion: v1
kind: Namespace
metadata:
  name: kentxxq
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: nfs-provisioner
  namespace: kentxxq
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
   name: nfs-provisioner
   namespace: kentxxq
rules:
#  - apiGroups: ["*"] # "" denotes the core API group
#    resources: ["*"]
#    verbs: ["*"]
  - apiGroups: [""]
    resources: ["persistentvolumes"]
    verbs: ["get", "list", "watch", "create", "delete"]
  - apiGroups: [""]
    resources: ["persistentvolumeclaims"]
    verbs: ["get", "list", "watch", "update"]
  - apiGroups: ["storage.k8s.io"]
    resources: ["storageclasses"]
    verbs: ["get", "list", "watch"]
  - apiGroups: [""]
    resources: ["events"]
    verbs: ["watch", "create", "update", "patch"]
  - apiGroups: [""]
    resources: ["services", "endpoints"]
    verbs: ["get","create","patch","list", "watch","update"]
  - apiGroups: ["extensions"]
    resources: ["podsecuritypolicies"]
    resourceNames: ["nfs-provisioner"]
    verbs: ["use"]
---
kind: ClusterRoleBinding
apiVersion: rbac.authorization.k8s.io/v1
metadata:
  name: nfs-provisioner
  namespace: kentxxq
subjects:
  - kind: ServiceAccount
    name: nfs-provisioner
    namespace: kentxxq
roleRef:
  kind: ClusterRole
  name: nfs-provisioner
  apiGroup: rbac.authorization.k8s.io

01_kubernetes_sc_provisioner.yaml

apiVersion: apps/v1
kind: Deployment
metadata:
  name: nfs-client-provisioner
  # the namespace must match the RBAC objects above
  namespace: kentxxq
spec:
  replicas: 1
  selector:
    matchLabels:
      app: nfs-client-provisioner
  strategy:
    type: Recreate
  template:
    metadata:
      labels:
        app: nfs-client-provisioner
    spec:
      serviceAccountName: nfs-provisioner
      containers:
      - name: nfs-client-provisioner
        # image: kubernetes-register.superopsmsb.com/google_containers/nfs-client-provisioner:latest
        image: registry.cn-beijing.aliyuncs.com/pylixm/nfs-subdir-external-provisioner:v4.0.0
        volumeMounts:
        - name: nfs-client-root
          mountPath: /persistentvolumes
        env:
        - name: PROVISIONER_NAME
          # must match the provisioner field of the NFS StorageClass
          value: "nfsprovisioner"
        - name: NFS_SERVER
          # IP address of the NFS server
          value: "10.0.1.157"
        - name: NFS_PATH
          # directory exported by the NFS server
          value: "/data/nfs"
      volumes:
      - name: nfs-client-root
        nfs:
          # mount the NFS export directly so this pod can manage PVs and PVCs for the StorageClass
          server: "10.0.1.157"
          path: "/data/nfs"

03_kubernetes_sc_pv.yaml

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: storageclass
  namespace: kentxxq
# every StorageClass has provisioner, parameters, and reclaimPolicy fields
# provisioner selects the volume plugin that provisions PVs; it must match the PROVISIONER_NAME variable inside the nfs-client container
provisioner: "nfsprovisioner"
# reclaimPolicy sets the reclaim policy of the PersistentVolumes it creates
reclaimPolicy: Retain
parameters:
  # archiveOnDelete: "false" means data is not archived on delete; set to "true" to archive it instead
  archiveOnDelete: "false"

08_kubernetes_storage_sc_pvc.yaml

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: sc-pvc
  namespace: kentxxq
  labels:
    role: pvc
spec:
  storageClassName: storageclass
  accessModes:
    - ReadWriteOnce
  # the resource request a PV must satisfy for this PVC to bind
  resources:
    requests:
      storage: 1Gi
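
With the provisioner, RBAC, and StorageClass from the files above applied, this claim should end up Bound to a dynamically created PV:

kubectl apply -f 08_kubernetes_storage_sc_pvc.yaml
kubectl get pvc sc-pvc -n kentxxq
kubectl get pv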

A PVC searches for a suitable PV, but that PV might also be claimed by another PVC.

So it is best to set both volumeName on the PVC and claimRef on the PV. For details on automatic binding, see here.

apiVersion: v1
kind: PersistentVolume
metadata:
  name: pv0003
spec:
  storageClassName: ""
  capacity:
    storage: 5Gi
  accessModes:
    - ReadWriteOnce
  persistentVolumeReclaimPolicy: Retain
  claimRef:
    namespace: default
    name: myclaim
  nfs:
    path: /tmp
    server: 172.17.0.2
---
kind: PersistentVolumeClaim
apiVersion: v1
metadata:
  name: myclaim
spec:
  # keep the default StorageClass from provisioning this claim
  storageClassName: ""
  volumeName: "pv0003"
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 5Gi