百度飞桨PaddlePaddle深度学习心得分享

Jade ·

更新时间:2024-09-21

· 955 次阅读

能参加到这次的百度飞桨七日“Python小白逆袭大神”打卡营是一次偶然的了解得知的这个机会，之前因为课程和基础不够扎实参加到AI实战营没有顺利跟完全程，这一次很开心可以顺利跟上并且结营。下面的内容就简要的总结一下这几天的学习内容和踩过的小坑吧。

本次课程的课程目标：

1.掌握Python的基础语言、进阶知识和常用的深度学习库，能够利用Python爬取数据并进行可视化分析
2.掌握人工智能基础知识、应用，体验人工智能的前沿技术
3.了解飞桨平台及百度AI技术、应用，掌握AI Studio的使用方法

本次课程的安排：Day1-人工智能概述与python入门基础python基础、人工智能概述Day2-Python进阶python进阶学习和深度学习实践Day3-人工智能常用Python库深度学习常用Python库介绍Day4-PaddleHub体验与应用PaddleHub体验与应用——《青春有你2》五人识别Day5-EasyDL体验与作业发布EasyDL体验与综合大作业Day6-PaddleHub创意赛发布Day7-课程结营 Day1-人工智能概述与python入门基础 python基础、人工智能概述

第一天的学习是对于python语言和人工智能的一个简单介绍，这部分对于没有什么基础的小白来说还是很友好，实验上应该也比较简单，没有遇到什么大问题，就直接上代码：

输出 9*9 乘法口诀表：


def table():
    for i in range(1,10):
        for j in range(1,i+1):
            print("{}*{}={:2} ".format(j,i,i*j), end='')
        print('')
if __name__ == '__main__':
    table()

在这里插入图片描述

查找特定文件名称：

#导入OS模块
import os
#待搜索的目录路径
path = "Day1-homework"
#待搜索的名称
filename = "2020"
#定义保存结果的数组
result = []
def findfiles(path):
    #在这里写下您的查找文件代码吧！
    global index
    li=os.listdir(path)
    for p in li:
        tempPathName=p
        pathname=os.path.join(path,p)
        if os.path.isdir(pathname):
            findfiles(pathname)
        elif os.path.isfile(pathname):
         if(tempPathName.find(filename)!=-1):
            index=0
            index=index+1
            result=[]
            result.append(index)
            result.append(pathname)
            if(len(result)>0):
                print(result)
if __name__ == '__main__':
    findfiles(path)

Day2-Python进阶 python进阶学习和深度学习实践

这次实践使用Python来爬取百度百科中《青春有你2》所有参赛选手的信息，对于小白来说可以上手稍微有一点点难度，但跟着课程老师的思路和讲解还是可以克服困难的

爬虫的过程：
1.发送请求（requests模块）
2.获取响应数据（服务器返回）
3.解析并提取数据（BeautifulSoup查找或者re正则）
4.保存数据


一、爬取百度百科中《青春有你2》中所有参赛选手信息，返回页面数据
import json
import re
import requests
import datetime
from bs4 import BeautifulSoup
import os
#获取当天的日期,并进行格式化,用于后面文件命名，格式:20200420
today = datetime.date.today().strftime('%Y%m%d')    
def crawl_wiki_data():
    """
    爬取百度百科中《青春有你2》中参赛选手信息，返回html
    """
    headers = { 
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.99 Safari/537.36'
    }
    url='https://baike.baidu.com/item/青春有你第二季'                         
    try:
        response = requests.get(url,headers=headers)
        print(response.status_code)
        #将一段文档传入BeautifulSoup的构造方法,就能得到一个文档的对象, 可以传入一段字符串
        soup = BeautifulSoup(response.text,'lxml')
        #返回的是class为table-view log-set-param的所有标签
        tables = soup.find_all('table',{'class':'table-view log-set-param'})
        crawl_table_title = "参赛学员"
        for table in  tables:           
            #对当前节点前面的标签和字符串进行查找
            table_titles = table.find_previous('div').find_all('h3')
            for title in table_titles:
                if(crawl_table_title in title):
                    return table       
    except Exception as e:
        print(e)
二、对爬取的页面数据进行解析，并保存为JSON文件
def parse_wiki_data(table_html):
    '''
    从百度百科返回的html中解析得到选手信息，以当前日期作为文件名，存JSON文件,保存到work目录下
    '''
    bs = BeautifulSoup(str(table_html),'lxml')
    all_trs = bs.find_all('tr')
    error_list = ['\'','\"']
    stars = []
    for tr in all_trs[1:]:
         all_tds = tr.find_all('td')
         star = {}
         #姓名
         star["name"]=all_tds[0].text
         #个人百度百科链接
         star["link"]= 'https://baike.baidu.com' + all_tds[0].find('a').get('href')
         #籍贯
         star["zone"]=all_tds[1].text
         #星座
         star["constellation"]=all_tds[2].text
         #身高
         star["height"]=all_tds[3].text
         #体重
         star["weight"]= all_tds[4].text
         #花语,去除掉花语中的单引号或双引号
         flower_word = all_tds[5].text
         for c in flower_word:
             if  c in error_list:
                 flower_word=flower_word.replace(c,'')
         star["flower_word"]=flower_word 
         #公司
         if not all_tds[6].find('a') is  None:
             star["company"]= all_tds[6].find('a').text
         else:
             star["company"]= all_tds[6].text  
         stars.append(star)
    json_data = json.loads(str(stars).replace("\'","\""))   
    with open('work/' + today + '.json', 'w', encoding='UTF-8') as f:
        json.dump(json_data, f, ensure_ascii=False)
三、爬取每个选手的百度百科图片，并进行保存
def crawl_pic_urls():
    '''
    爬取每个选手的百度百科图片，并保存
    ''' 
    with open('work/'+ today + '.json', 'r', encoding='UTF-8') as file:
         json_array = json.loads(file.read())
    headers = { 
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.99 Safari/537.36' 
     }
    count=0
    pic_urls=[]
    for star in json_array:
        name = star['name']
        link = star['link']
        #！！！请在以下完成对每个选手图片的爬取，将所有图片url存储在一个列表pic_urls中！！！
        response=requests.get(link,headers=headers)
        soup=BeautifulSoup(response.text,'lxml')
        tables=soup.find_all('div',{'class':'summary-pic'})
        for table in tables:
            aHref=table.a['href']
            album_url="https://baike.baidu.com"+aHref
            response=requests.get(album_url,headers=headers)
            soup=BeautifulSoup(response.text,'lxml')
            pic_list_div=soup.find('div',{'class':'pic-list'})
            albumTmages=pic_list_div.find_all('img')
            for album_image in albumTmages:
                album_image_url=album_image["src"]
                pic_urls.append(album_image_url)
                count=count+1
                print(count)
        #！！！根据图片链接列表pic_urls, 下载所有图片，保存在以name命名的文件夹中！！！
    down_pic(name,pic_urls)
def down_pic(name,pic_urls):
    '''
    根据图片链接列表pic_urls, 下载所有图片，保存在以name命名的文件夹中,
    '''
    path = 'work/'+'pics/'+name+'/'
    if not os.path.exists(path):
      os.makedirs(path)
    for i, pic_url in enumerate(pic_urls):
        try:
            pic = requests.get(pic_url, timeout=15)
            string = str(i + 1) + '.jpg'
            with open(path+string, 'wb') as f:
                f.write(pic.content)
                print('成功下载第%s张图片: %s' % (str(i + 1), str(pic_url)))
        except Exception as e:
            print('下载第%s张图片时失败: %s' % (str(i + 1), str(pic_url)))
            print(e)
            continue
四、打印爬取的所有图片的路径
def show_pic_path(path):
    '''
    遍历所爬取的每张图片，并打印所有图片的绝对路径
    '''
    pic_num = 0
    for (dirpath,dirnames,filenames) in os.walk(path):
        for filename in filenames:
           pic_num += 1
           print("第%d张照片：%s" % (pic_num,os.path.join(dirpath,filename)))           
    print("共爬取《青春有你2》选手的%d照片" % pic_num)
In[61]
if __name__ == '__main__':
     #爬取百度百科中《青春有你2》中参赛选手信息，返回html
     html = crawl_wiki_data()
     #解析html,得到选手信息，保存为json文件
     parse_wiki_data(html)
     #从每个选手的百度百科页面上爬取图片,并保存
     crawl_pic_urls()
     #打印所爬取的选手图片路径
     show_pic_path('/home/aistudio/work/pics/')
     print("所有信息爬取完成！")
Day3-人工智能常用Python库深度学习常用Python库介绍这个模块利用了python对day2的数据进行绘制饼图和柱状图，我觉得特别实用也是能感受到python方便之处的一个重要方面
import matplotlib.pyplot as plt
import numpy as np 
import json
import matplotlib.font_manager as font_manager
%matplotlib inline
with open('data/data31557/20200422.json', 'r', encoding='UTF-8') as file:
         json_array = json.loads(file.read())
weight_list= []
for star in json_array:
    weight = star['weight'].strip('kg')
    weight_list.append(weight)
# print(len(weight_list))
# print(weight_list)
weight_label = ['55kg']
count_list = [0,0,0,0]
for weight in weight_list:
    weight=float(weight)
    if weight 45 and weight 50 and weight 55:
        count_list[3]=count_list[3]+1
        continue
print(weight_label)
print(count_list)
plt.figure(figsize=(20,15))
explode = (0, 0.1, 0, 0)
plt.pie(x=count_list ,explode=explode, labels=weight_label,autopct='%1.1f%%', shadow=False, startangle=90)
plt.axis('equal')
plt.legend()
plt.title("The percentage of weight",fontsize = 24)
plt.savefig('/home/aistudio/work/result/pie_result.jpg')
plt.show()
                                            
                    原创文章 1获赞 1访问量 47
                                            关注
                                                                私信
    
                展开阅读全文


作者：熊和敲不完的代码
                    

 


                            paddlepaddle
                            学习心得
                            学习
                    

                    
                         1024 个赞
                        
                            
                                
                                    
                                
                                
                                    编辑
                                    举报
                                
                            
                        
                    
                

    
    

                
                    
                
            

                
    
        
            需要 登录 后方可回复, 如果你还没有账号请 注册新账号
        
    
                
            

                    
                        相关文章

    
        
            Shell中去除字符串前后空格的方法
        
        
            Iris
            2021-08-03
        
    
    
        567
    


    
        
            ADO.NET基础知识汇总
        
        
            Rose
            2021-04-30
        
    
    
        873
    


    
        
            jsp实现购物程序
        
        
            Echo
            2020-02-05
        
    
    
        768
    


    
        
            关于MongoDB数据库学习路线指南
        
        
            Kande
            2023-05-13
        
    
    
        295
    


    
        
            SQL注入之sqlmap入门学习
        
        
            Ula
            2023-05-13
        
    
    
        1477
    


    
        
            Docker AIGC等大模型深度学习环境搭建步骤最新详细版
        
        
            Jacinda
            2023-05-13
        
    
    
        1807
    


    
        
            一文带你学习一下C++中的构造函数
        
        
            Winona
            2023-05-13
        
    
    
        520
    


    
        
            C++中set的用法学习
        
        
            Fawn
            2023-05-13
        
    
    
        226
    


    
        
            Three.js概述和基础知识学习
        
        
            Echo
            2023-05-13
        
    
    
        1103
    


    
        
            Vue自定义指令学习及应用详解
        
        
            Maha
            2023-05-13
        
    
    
        1234
    


    
        
            写给小白学习的地理信息的表示法GeoJSON
        
        
            Kande
            2023-05-15
        
    
    
        702
    


    
        
            从错误中学习改正Go语言五个坏习惯提高编程技巧
        
        
            Viridis
            2023-05-17
        
    
    
        1281
    


    
        
    
    
        
            Java NIO下ByteBuffer的常用方法学习
        
        
            Maleah
            2023-05-17
        
    
    
        473
    


    
        
    
    
        
            MySQL基础学习之字符集的应用
        
        
            Nafisa
            2023-05-17
        
    
    
        1150
    


    
        
            C++函数模板学习示例教程指南
        
        
            Pandora
            2023-07-07
        
    
    
        25
    


    
        
            go语言定义零值可用的类型学习教程
        
        
            Tallulah
            2023-07-17
        
    
    
        216
    


    
        
            Redis数据结构之intset整数集合使用学习
        
        
            Janna
            2023-07-20
        
    
    
        1721
    


    
        
            Redis数据结构之跳跃表使用学习
        
        
            Ophelia
            2023-07-20
        
    
    
        644
    


    
        
            Redis数据结构之listpack和quicklist使用学习
        
        
            Natalia
            2023-07-20
        
    
    
        1968
    


    
        
            大语言模型的开发利器langchainan安装使用快速入门学习
        
        
            Irma
            2023-07-20
        
    
    
        829
    

                    
                

    
        
            我要提问
        
    
    
        
        
    
        致谢
        
            帮助他人，成就自己。
            人生最大成功就是伸出热情而温暖的双手，尽自己所能去帮助身边的每一个人，只要无私的奉献，就会收获到美好的生活。
            1024问感谢每一位朋友的帮助和支持。
            软件开发网提供编程的基础软件技术培训教程,软件开发编程实例讲解Go,Node,HTML,CSS,Javascript,Python,Java,Ruby,C,PHP,MySQL等软件开发编程语言以及数据开发的基础知识，也提供大量的软件开发在线实例、从入门到精通就在1024问。
        
    
    
        
            
    育儿网
    微养生
    全球行
    美食街
    育儿
    菜谱大全
    海南旅游
    女性
    养狗百科
    星座


            
        
    

    
        
            
                
                    © 2021 1024问 MIT 京ICP备11031391号-29