I had previously used the word-segmentation feature in dedecms, but testing showed the results were not ideal; only after some extra processing did they become acceptable. Today I came across this segmentation approach and am sharing it here. Example code follows:
```php
<?php
class NLP {
    private static $cmd_path;

    // Path to the directory containing the ictclas binary (no trailing '/').
    static function set_cmd_path($path){
        self::$cmd_path = $path;
    }

    private static function cmd($str){
        $output = '';
        $descriptorspec = array(
            0 => array("pipe", "r"),  // child's stdin
            1 => array("pipe", "w"),  // child's stdout
        );
        $cmd = self::$cmd_path . "/ictclas";
        $process = proc_open($cmd, $descriptorspec, $pipes);
        if (is_resource($process)) {
            // ICTCLAS expects GBK input; convert from UTF-8 before writing.
            $str = iconv('utf-8', 'gbk', $str);
            fwrite($pipes[0], $str);
            fclose($pipes[0]);  // close stdin first so the child sees EOF
            $output = stream_get_contents($pipes[1]);
            fclose($pipes[1]);
            proc_close($process);
        }
        /*
        // Alternative using exec():
        $cmd = "printf '$str' | " . self::$cmd_path . "/ictclas";
        exec($cmd, $output, $ret);
        $output = join("\n", $output);
        */
        $output = trim($output);
        $output = iconv('gbk', 'utf-8', $output);
        return $output;
    }

    /**
     * Segment a string and return the list of tokens.
     */
    static function tokenize($str){
        $tokens = array();
        $output = self::cmd($str);
        if ($output) {
            $ps = preg_split('/\s+/', $output);
            foreach ($ps as $p) {
                // Each token comes back as "word/POS-tag".
                list($seg, $tag) = explode('/', $p);
                $tokens[] = array(
                    'seg' => $seg,
                    'tag' => $tag,
                );
            }
        }
        return $tokens;
    }
}
NLP::set_cmd_path(dirname(__FILE__));
?>
```
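The heart of the class is the `proc_open()` pipe pattern: open the child with stdin and stdout pipes, write the input, close stdin so the child sees end-of-file, then read everything from stdout. Here is a minimal sketch of that lifecycle, using `cat` in place of the `ictclas` binary so it runs anywhere:

```php
<?php
// Minimal proc_open() pipe demo: `cat` stands in for the ictclas binary.
$descriptorspec = array(
    0 => array("pipe", "r"), // child's stdin: we write to it
    1 => array("pipe", "w"), // child's stdout: we read from it
);
$process = proc_open('cat', $descriptorspec, $pipes);
if (is_resource($process)) {
    fwrite($pipes[0], "hello pipe");
    fclose($pipes[0]);                      // close stdin so the child sees EOF
    $output = stream_get_contents($pipes[1]);
    fclose($pipes[1]);
    proc_close($process);
    echo $output, "\n";                     // prints "hello pipe"
}
```

Closing the child's stdin before calling `stream_get_contents()` matters: a filter like `cat` (or a segmenter reading until EOF) may not produce its full output until its input is closed, so reading first can deadlock.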
It is simple to use; just make sure the compiled ICTCLAS executable and its dictionary files are in the current directory:
```php
<?php
require_once('NLP.php');
var_dump(NLP::tokenize('Hello, World!'));
?>
```
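The parsing step in `tokenize()` assumes ICTCLAS prints whitespace-separated `word/POS-tag` pairs. That parsing can be exercised on its own with a hard-coded sample line (the sample text and tags here are illustrative, not real ICTCLAS output):

```php
<?php
// Parse a sample "word/POS-tag" line the way tokenize() does.
$output = "中国/ns 人民/n 站/v 起来/v 了/y";  // illustrative sample
$tokens = array();
foreach (preg_split('/\s+/', trim($output)) as $p) {
    list($seg, $tag) = explode('/', $p);   // split "word/tag" into its parts
    $tokens[] = array('seg' => $seg, 'tag' => $tag);
}
print_r($tokens);
```

Each element of `$tokens` is an associative array with a `seg` (the word) and a `tag` (its part-of-speech label), matching what `NLP::tokenize()` returns.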
The PHP class above drives the segmentation program with proc_open() and communicates with it over pipes: it writes the text to be segmented to the child process's stdin and reads the segmentation result back from its stdout.
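Because this build of ICTCLAS works in GBK while the surrounding application uses UTF-8, the class converts the text with `iconv()` in both directions around the pipe. A quick round-trip check shows the conversion is lossless for ordinary Chinese text:

```php
<?php
// Round-trip the UTF-8 <-> GBK conversion used around the pipe.
$utf8 = "中文分词";
$gbk  = iconv('utf-8', 'gbk', $utf8);  // bytes sent to the child process
$back = iconv('gbk', 'utf-8', $gbk);   // bytes read back, restored to UTF-8
var_dump($back === $utf8);             // bool(true)
```

Note that `iconv()` returns `false` for characters outside the target charset, so in practice input containing symbols GBK cannot represent would need the `//IGNORE` or `//TRANSLIT` suffix on the target encoding.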