当前位置: 首页 > news >正文

网站右侧信息跟随左侧菜单栏变化电影手机网站建设

网站右侧信息跟随左侧菜单栏变化,电影手机网站建设,哪家网站推广做的好,自己做网站需要学些什么GPT vs BERT 终极选择指南#xff1a;从架构差异到企业级落地策略 引言#xff1a;两大巨头的分道扬镳 2018年#xff0c;BERT和GPT系列同时引爆NLP领域#xff0c;却在架构选择上走向截然不同的道路#xff1a; BERT采用双向Transformer Encoder#xff0c;在11项NLP…GPT vs BERT 终极选择指南从架构差异到企业级落地策略 引言两大巨头的分道扬镳 2018年BERT和GPT系列同时引爆NLP领域却在架构选择上走向截然不同的道路 BERT采用双向Transformer Encoder在11项NLP任务中刷新记录GPT坚持单向Transformer Decoder开创生成式AI新纪元 截至2024年两者衍生出**300企业级应用方案正确选型可降低60%**研发成本。 一、核心架构差异可视化解析 1.1 模型架构对比Mermaid实现 #mermaid-svg-3RWuWB49Qp31NN1e {font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;fill:#333;}#mermaid-svg-3RWuWB49Qp31NN1e .error-icon{fill:#552222;}#mermaid-svg-3RWuWB49Qp31NN1e .error-text{fill:#552222;stroke:#552222;}#mermaid-svg-3RWuWB49Qp31NN1e .edge-thickness-normal{stroke-width:2px;}#mermaid-svg-3RWuWB49Qp31NN1e .edge-thickness-thick{stroke-width:3.5px;}#mermaid-svg-3RWuWB49Qp31NN1e .edge-pattern-solid{stroke-dasharray:0;}#mermaid-svg-3RWuWB49Qp31NN1e .edge-pattern-dashed{stroke-dasharray:3;}#mermaid-svg-3RWuWB49Qp31NN1e .edge-pattern-dotted{stroke-dasharray:2;}#mermaid-svg-3RWuWB49Qp31NN1e .marker{fill:#333333;stroke:#333333;}#mermaid-svg-3RWuWB49Qp31NN1e .marker.cross{stroke:#333333;}#mermaid-svg-3RWuWB49Qp31NN1e svg{font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;}#mermaid-svg-3RWuWB49Qp31NN1e g.classGroup text{fill:#9370DB;fill:#131300;stroke:none;font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:10px;}#mermaid-svg-3RWuWB49Qp31NN1e g.classGroup text .title{font-weight:bolder;}#mermaid-svg-3RWuWB49Qp31NN1e .nodeLabel,#mermaid-svg-3RWuWB49Qp31NN1e .edgeLabel{color:#131300;}#mermaid-svg-3RWuWB49Qp31NN1e .edgeLabel .label rect{fill:#ECECFF;}#mermaid-svg-3RWuWB49Qp31NN1e .label text{fill:#131300;}#mermaid-svg-3RWuWB49Qp31NN1e .edgeLabel .label span{background:#ECECFF;}#mermaid-svg-3RWuWB49Qp31NN1e .classTitle{font-weight:bolder;}#mermaid-svg-3RWuWB49Qp31NN1e .node rect,#mermaid-svg-3RWuWB49Qp31NN1e .node circle,#mermaid-svg-3RWuWB49Qp31NN1e .node ellipse,#mermaid-svg-3RWuWB49Qp31NN1e .node polygon,#mermaid-svg-3RWuWB49Qp31NN1e .node path{fill:#ECECFF;stroke:#9370DB;stroke-width:1px;}#mermaid-svg-3RWuWB49Qp31NN1e .divider{stroke:#9370DB;stroke:1;}#mermaid-svg-3RWuWB49Qp31NN1e g.clickable{cursor:pointer;}#mermaid-svg-3RWuWB49Qp31NN1e g.classGroup rect{fill:#ECECFF;stroke:#9370DB;}#mermaid-svg-3RWuWB49Qp31NN1e g.classGroup line{stroke:#9370DB;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e .classLabel .box{stroke:none;stroke-width:0;fill:#ECECFF;opacity:0.5;}#mermaid-svg-3RWuWB49Qp31NN1e .classLabel .label{fill:#9370DB;font-size:10px;}#mermaid-svg-3RWuWB49Qp31NN1e .relation{stroke:#333333;stroke-width:1;fill:none;}#mermaid-svg-3RWuWB49Qp31NN1e .dashed-line{stroke-dasharray:3;}#mermaid-svg-3RWuWB49Qp31NN1e #compositionStart,#mermaid-svg-3RWuWB49Qp31NN1e .composition{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #compositionEnd,#mermaid-svg-3RWuWB49Qp31NN1e .composition{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #dependencyStart,#mermaid-svg-3RWuWB49Qp31NN1e .dependency{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #dependencyStart,#mermaid-svg-3RWuWB49Qp31NN1e .dependency{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #extensionStart,#mermaid-svg-3RWuWB49Qp31NN1e .extension{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #extensionEnd,#mermaid-svg-3RWuWB49Qp31NN1e .extension{fill:#333333!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #aggregationStart,#mermaid-svg-3RWuWB49Qp31NN1e .aggregation{fill:#ECECFF!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e #aggregationEnd,#mermaid-svg-3RWuWB49Qp31NN1e .aggregation{fill:#ECECFF!important;stroke:#333333!important;stroke-width:1;}#mermaid-svg-3RWuWB49Qp31NN1e .edgeTerminals{font-size:11px;}#mermaid-svg-3RWuWB49Qp31NN1e :root{--mermaid-font-family:"trebuchet ms",verdana,arial,sans-serif;} Transformer ListLayer encoder ListLayer decoder BERT ListEncoderLayer encoders masked_language_modeling() GPT ListDecoderLayer decoders next_token_prediction() 关键区别 BERT12层Encoder堆叠base版GPT-396层Decoder堆叠参数量差异BERT-base110M vs GPT-3175B 1.2 数据处理流程对比 #mermaid-svg-4Q5ESuxJlzjEtl1o {font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;fill:#ccc;}#mermaid-svg-4Q5ESuxJlzjEtl1o .error-icon{fill:#a44141;}#mermaid-svg-4Q5ESuxJlzjEtl1o .error-text{fill:#ddd;stroke:#ddd;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edge-thickness-normal{stroke-width:2px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edge-thickness-thick{stroke-width:3.5px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edge-pattern-solid{stroke-dasharray:0;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edge-pattern-dashed{stroke-dasharray:3;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edge-pattern-dotted{stroke-dasharray:2;}#mermaid-svg-4Q5ESuxJlzjEtl1o .marker{fill:lightgrey;stroke:lightgrey;}#mermaid-svg-4Q5ESuxJlzjEtl1o .marker.cross{stroke:lightgrey;}#mermaid-svg-4Q5ESuxJlzjEtl1o svg{font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .label{font-family:"trebuchet ms",verdana,arial,sans-serif;color:#ccc;}#mermaid-svg-4Q5ESuxJlzjEtl1o .cluster-label text{fill:#F9FFFE;}#mermaid-svg-4Q5ESuxJlzjEtl1o .cluster-label span{color:#F9FFFE;}#mermaid-svg-4Q5ESuxJlzjEtl1o .label text,#mermaid-svg-4Q5ESuxJlzjEtl1o span{fill:#ccc;color:#ccc;}#mermaid-svg-4Q5ESuxJlzjEtl1o .node rect,#mermaid-svg-4Q5ESuxJlzjEtl1o .node circle,#mermaid-svg-4Q5ESuxJlzjEtl1o .node ellipse,#mermaid-svg-4Q5ESuxJlzjEtl1o .node polygon,#mermaid-svg-4Q5ESuxJlzjEtl1o .node path{fill:#1f2020;stroke:#81B1DB;stroke-width:1px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .node .label{text-align:center;}#mermaid-svg-4Q5ESuxJlzjEtl1o .node.clickable{cursor:pointer;}#mermaid-svg-4Q5ESuxJlzjEtl1o .arrowheadPath{fill:lightgrey;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edgePath .path{stroke:lightgrey;stroke-width:2.0px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .flowchart-link{stroke:lightgrey;fill:none;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edgeLabel{background-color:hsl(0, 0%, 34.4117647059%);text-align:center;}#mermaid-svg-4Q5ESuxJlzjEtl1o .edgeLabel rect{opacity:0.5;background-color:hsl(0, 0%, 34.4117647059%);fill:hsl(0, 0%, 34.4117647059%);}#mermaid-svg-4Q5ESuxJlzjEtl1o .cluster rect{fill:hsl(180, 1.5873015873%, 28.3529411765%);stroke:rgba(255, 255, 255, 0.25);stroke-width:1px;}#mermaid-svg-4Q5ESuxJlzjEtl1o .cluster text{fill:#F9FFFE;}#mermaid-svg-4Q5ESuxJlzjEtl1o .cluster span{color:#F9FFFE;}#mermaid-svg-4Q5ESuxJlzjEtl1o div.mermaidTooltip{position:absolute;text-align:center;max-width:200px;padding:2px;font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:12px;background:hsl(20, 1.5873015873%, 12.3529411765%);border:1px solid rgba(255, 255, 255, 0.25);border-radius:2px;pointer-events:none;z-index:100;}#mermaid-svg-4Q5ESuxJlzjEtl1o :root{--mermaid-font-family:"trebuchet ms",verdana,arial,sans-serif;} GPT处理 仅左向可见 输入文本 预测下一个token 递归生成 BERT处理 双向可见 输入文本 Mask部分token 预测被mask内容 企业级影响 BERT适合文本分类、实体识别、语义理解GPT适合文本生成、对话系统、代码补全 二、训练目标与数学本质差异 2.1 BERT的Masked Language Modeling (MLM) L M L M − ∑ i ∈ M log ⁡ P ( x i ∣ x \ M ) \mathcal{L}_{MLM} -\sum_{i \in M} \log P(x_i | x_{\backslash M}) LMLM​−i∈M∑​logP(xi​∣x\M​) 其中 M M M是被mask的token集合模型需根据上下文 x \ M x_{\backslash M} x\M​预测被遮盖内容 2.2 GPT的自回归语言建模 L A R − ∑ t 1 T log ⁡ P ( x t ∣ x t ) \mathcal{L}_{AR} -\sum_{t1}^T \log P(x_t | x_{t}) LAR​−t1∑T​logP(xt​∣xt​) 模型只能根据历史信息 x t x_{t} xt​预测当前token x t x_t xt​ 实验数据 任务类型BERT准确率GPT准确率文本分类92.3%85.7%文本生成68.5%94.2%问答系统89.1%76.8% 三、企业级选型决策树 #mermaid-svg-sIfvGfHZ1PeTjCmZ {font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;fill:#000000;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .error-icon{fill:#552222;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .error-text{fill:#552222;stroke:#552222;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edge-thickness-normal{stroke-width:2px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edge-thickness-thick{stroke-width:3.5px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edge-pattern-solid{stroke-dasharray:0;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edge-pattern-dashed{stroke-dasharray:3;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edge-pattern-dotted{stroke-dasharray:2;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .marker{fill:#666;stroke:#666;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .marker.cross{stroke:#666;}#mermaid-svg-sIfvGfHZ1PeTjCmZ svg{font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .label{font-family:"trebuchet ms",verdana,arial,sans-serif;color:#000000;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .cluster-label text{fill:#333;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .cluster-label span{color:#333;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .label text,#mermaid-svg-sIfvGfHZ1PeTjCmZ span{fill:#000000;color:#000000;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .node rect,#mermaid-svg-sIfvGfHZ1PeTjCmZ .node circle,#mermaid-svg-sIfvGfHZ1PeTjCmZ .node ellipse,#mermaid-svg-sIfvGfHZ1PeTjCmZ .node polygon,#mermaid-svg-sIfvGfHZ1PeTjCmZ .node path{fill:#eee;stroke:#999;stroke-width:1px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .node .label{text-align:center;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .node.clickable{cursor:pointer;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .arrowheadPath{fill:#333333;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edgePath .path{stroke:#666;stroke-width:2.0px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .flowchart-link{stroke:#666;fill:none;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edgeLabel{background-color:white;text-align:center;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .edgeLabel rect{opacity:0.5;background-color:white;fill:white;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .cluster rect{fill:hsl(0, 0%, 98.9215686275%);stroke:#707070;stroke-width:1px;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .cluster text{fill:#333;}#mermaid-svg-sIfvGfHZ1PeTjCmZ .cluster span{color:#333;}#mermaid-svg-sIfvGfHZ1PeTjCmZ div.mermaidTooltip{position:absolute;text-align:center;max-width:200px;padding:2px;font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:12px;background:hsl(-160, 0%, 93.3333333333%);border:1px solid #707070;border-radius:2px;pointer-events:none;z-index:100;}#mermaid-svg-sIfvGfHZ1PeTjCmZ :root{--mermaid-font-family:"trebuchet ms",verdana,arial,sans-serif;} 理解任务 生成任务 10万条 1-10万 1万 高延迟容忍 低延迟需求 需求类型 BERT系列 GPT系列 数据量 微调BERT-base PromptBERT-large Zero-shot BERT 实时性要求 GPT-4 API 蒸馏版GPT-3 决策因子 任务类型理解/生成可用训练数据量级推理延迟要求GPT需考虑生成长度硬件预算BERT推理成本比GPT低40% 四、典型企业场景实战案例 4.1 GitHub Sentinel中的BERT应用 # 使用BERT进行Issue分类 from transformers import BertTokenizer, BertForSequenceClassificationtokenizer BertTokenizer.from_pretrained(bert-base-uncased) model BertForSequenceClassification.from_pretrained(bert-base-uncased)issues [Fix memory leak in module X, Add new feature Y] inputs tokenizer(issues, paddingTrue, return_tensorspt) outputs model(**inputs) # 输出分类标签bug/feature等4.2 LanguageMentor中的GPT应用 # 使用GPT生成对话练习 from transformers import GPT2LMHeadModel, GPT2Tokenizertokenizer GPT2Tokenizer.from_pretrained(gpt2-medium) model GPT2LMHeadModel.from_pretrained(gpt2-medium)input_text Travel scenario: Ordering coffee at Starbucks output model.generate(tokenizer.encode(input_text), max_length100, temperature0.7 ) print(tokenizer.decode(output))五、混合架构创新方案 5.1 BERTGPT联合架构 #mermaid-svg-7RneHaGiq4GxOpIu {font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;fill:#000000;}#mermaid-svg-7RneHaGiq4GxOpIu .error-icon{fill:#552222;}#mermaid-svg-7RneHaGiq4GxOpIu .error-text{fill:#552222;stroke:#552222;}#mermaid-svg-7RneHaGiq4GxOpIu .edge-thickness-normal{stroke-width:2px;}#mermaid-svg-7RneHaGiq4GxOpIu .edge-thickness-thick{stroke-width:3.5px;}#mermaid-svg-7RneHaGiq4GxOpIu .edge-pattern-solid{stroke-dasharray:0;}#mermaid-svg-7RneHaGiq4GxOpIu .edge-pattern-dashed{stroke-dasharray:3;}#mermaid-svg-7RneHaGiq4GxOpIu .edge-pattern-dotted{stroke-dasharray:2;}#mermaid-svg-7RneHaGiq4GxOpIu .marker{fill:#000000;stroke:#000000;}#mermaid-svg-7RneHaGiq4GxOpIu .marker.cross{stroke:#000000;}#mermaid-svg-7RneHaGiq4GxOpIu svg{font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:16px;}#mermaid-svg-7RneHaGiq4GxOpIu .label{font-family:"trebuchet ms",verdana,arial,sans-serif;color:#000000;}#mermaid-svg-7RneHaGiq4GxOpIu .cluster-label text{fill:#333;}#mermaid-svg-7RneHaGiq4GxOpIu .cluster-label span{color:#333;}#mermaid-svg-7RneHaGiq4GxOpIu .label text,#mermaid-svg-7RneHaGiq4GxOpIu span{fill:#000000;color:#000000;}#mermaid-svg-7RneHaGiq4GxOpIu .node rect,#mermaid-svg-7RneHaGiq4GxOpIu .node circle,#mermaid-svg-7RneHaGiq4GxOpIu .node ellipse,#mermaid-svg-7RneHaGiq4GxOpIu .node polygon,#mermaid-svg-7RneHaGiq4GxOpIu .node path{fill:#cde498;stroke:#13540c;stroke-width:1px;}#mermaid-svg-7RneHaGiq4GxOpIu .node .label{text-align:center;}#mermaid-svg-7RneHaGiq4GxOpIu .node.clickable{cursor:pointer;}#mermaid-svg-7RneHaGiq4GxOpIu .arrowheadPath{fill:green;}#mermaid-svg-7RneHaGiq4GxOpIu .edgePath .path{stroke:#000000;stroke-width:2.0px;}#mermaid-svg-7RneHaGiq4GxOpIu .flowchart-link{stroke:#000000;fill:none;}#mermaid-svg-7RneHaGiq4GxOpIu .edgeLabel{background-color:#e8e8e8;text-align:center;}#mermaid-svg-7RneHaGiq4GxOpIu .edgeLabel rect{opacity:0.5;background-color:#e8e8e8;fill:#e8e8e8;}#mermaid-svg-7RneHaGiq4GxOpIu .cluster rect{fill:#cdffb2;stroke:#6eaa49;stroke-width:1px;}#mermaid-svg-7RneHaGiq4GxOpIu .cluster text{fill:#333;}#mermaid-svg-7RneHaGiq4GxOpIu .cluster span{color:#333;}#mermaid-svg-7RneHaGiq4GxOpIu div.mermaidTooltip{position:absolute;text-align:center;max-width:200px;padding:2px;font-family:"trebuchet ms",verdana,arial,sans-serif;font-size:12px;background:hsl(78.1578947368, 58.4615384615%, 84.5098039216%);border:1px solid #6eaa49;border-radius:2px;pointer-events:none;z-index:100;}#mermaid-svg-7RneHaGiq4GxOpIu :root{--mermaid-font-family:"trebuchet ms",verdana,arial,sans-serif;} 查询类 生成类 用户输入 BERT语义理解 意图识别 意图类型 BERT生成响应 GPT生成响应 输出结果 某电商客服系统效果 准确率提升32%响应速度提升25% 5.2 参数高效微调方案对比 微调方法训练参数量准确率显存占用全参数微调100%92.1%16GBLoRA0.5%91.3%8GBPrefix Tuning0.1%89.7%6GBPrompt Tuning0.01%85.2%5GB 结语没有最好只有最合适 在《企业级Agents开发实战营》中我们将看到 GitHub Sentinel如何用BERT实现代码变更语义分析LanguageMentor如何用GPT打造拟真对话系统ChatPPT如何融合两者实现多模态理解与生成
http://www.hkea.cn/news/14502083/

相关文章:

  • 石家庄招聘哪个网站做的好平台设计标准
  • 帝国做的网站删除域名后缀海南免费做网站
  • 微信营销软件网站建设优秀网站建设哪家专业
  • 做网站备案成功后怎么办知东莞app下载
  • 一建 建设网站wordpress主题the7中文汉化版
  • 网络优化网站 site网页设计版权信息代码
  • 什么是工具型网站商业网站建设的目的
  • 东莞微信网站开发洛阳洛龙区网络营销公司
  • 手机网站设计规范杭州公司网站建设
  • 河南省建设厅网站154号文件福州 网站制作 外贸
  • 学做网站什么语言合适seo sem sns的区别
  • 宁波制作企业网站江苏电信网站备案
  • 做校服的网站腾讯企点官网下载
  • 盐城网站建设jsxmt上饶网站建设哪家好
  • 雄安建设工程信息网站帝国cms下载类网站怎么做
  • wordpress4.9下载优化方案英语答案
  • 类似12306网站开发tamed wordpress插件
  • 绿色模板网站东营网红桥
  • 手机端购物网站模板下载智能网站推广优化
  • 宜宾网站建设网站wordpress 地图创建
  • 清远医院网站建设方案大连建设网煤气查询
  • 做网站一般图片多大专业个人网站建设
  • 秦皇岛网站制作服务网店代运营收费标准
  • 宝安沙井房价seo网站建设时文章频率
  • 苏州公司网站建设报价微信做的团购网站
  • 苏州网络自学网站建设wordpress瀑布流主题
  • 做网站设计的广告公司泰国男女做那个视频网站
  • 公司建设一个网站首页wordpress mip主题
  • 佛山个人制作网站公司网站建设单位哪家好
  • 免费个人网站模板黄页网站大全在线看免费