-
大小: 5.87KB文件類型: .py金幣: 1下載: 0 次發(fā)布日期: 2021-03-03
- 語(yǔ)言: Python
- 標(biāo)簽: tensorflow??Word??實(shí)戰(zhàn)??
資源簡(jiǎn)介
word2vec的tensorflow實(shí)現(xiàn),來(lái)自黃文堅(jiān)的“tensorflow實(shí)戰(zhàn)”
代碼片段和文件信息
#!/usr/bin/env?python
#*-?coding:?utf-8?-*-
from?__future__?import?print_function
import?os
import?math
import?urllib.request
import?zipfile
import?random
import?collections
import?numpy?as?np
import?tensorflow?as?tf
url?=?‘http://mattmahoney.net/dc/‘
‘‘‘Step1:??download?dataset‘‘‘
def?may_download(filename?expected_bytes):
if?not?os.path.exists(filename):
filename?_?=?urllib.request.urlretrieve(url?+?filename?filename)
statinfo?=?os.stat(filename)
if?statinfo.st_size?==?expected_bytes:
print(‘Found?and?verified‘?filename)
else:
print(statinfo.st_size)
raise?Exception(‘Failed?to?verify?‘?+?filename)
return?filename
filename?=?may_download(‘text8.zip‘?31344016)
‘‘‘Step2:?data?transformation‘‘‘
def?read_data(filename):
with?zipfile.ZipFile(filename)?as?f:
data?=?tf.compat.as_str_any(f.read(f.namelist()[0])).split()
return?data
“““Test“““
words?=?read_data(filename)
print(‘Datas?size‘?len(words))
‘‘‘Step3:?make?dataset‘‘‘
vocabulary_size?=?50000
def?build_data
- 上一篇:python多渠道打包android App(資料+源碼)
- 下一篇:ATM 取款
評(píng)論
共有 條評(píng)論