TensorFlow / Python: Generación de Texto con Redes Neuronales Long Short-Term Memory (LSTM)

Marzo 20, 2022Octubre 15, 2023 by Beastieux, posted in Redes Neuronales, TensorFlow

La Generación de Texto con Redes Neuronales LSTM, hacen referencia a “Long Short-Term Memory” (Memoria a Largo Plazo y Corto Plazo). Es un tipo de red neuronal recurrente (RNN) que se utiliza comúnmente para tareas relacionadas con el procesamiento del lenguaje natural, como la generación de texto, la traducción automática y el análisis de sentimientos. A diferencia de las RNN tradicionales, las LSTM pueden aprender y recordar dependencias a largo plazo en secuencias de datos, lo que las hace especialmente útiles para tareas que involucran datos secuenciales.

import tensorflow as tf
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.layers import Embedding, LSTM, Dense
import numpy as np

# Leer el archivo de texto
with open('tu_texto.txt', 'r', encoding='utf-8') as file:
    text = file.read()

# Preprocesar el texto y crear secuencias
tokenizer = Tokenizer()
tokenizer.fit_on_texts([text])
total_words = len(tokenizer.word_index) + 1

# Crear secuencias de entrada y objetivo
input_sequences = []
for i in range(1, len(text)):
    n_gram_sequence = text[:i+1]
    input_sequences.append(n_gram_sequence)

# Tokenizar las secuencias y rellenarlas
input_sequences = tokenizer.texts_to_sequences(input_sequences)
input_sequences = pad_sequences(input_sequences)

# Crear datos de entrada y objetivo
X = input_sequences[:, :-1]
y = input_sequences[:, -1]

# Modelo LSTM
model = tf.keras.Sequential([
    Embedding(total_words, 64, input_length=X.shape[1]),
    LSTM(100),
    Dense(total_words, activation='softmax')
])

model.compile(loss='sparse_categorical_crossentropy', optimizer='adam')

# Entrenar el modelo
model.fit(X, y, epochs=5, verbose=1)

Probar Modelo:

# Generar texto de prueba
seed_text = "El sol brilla en"
next_words = 10

for _ in range(next_words):
    token_list = tokenizer.texts_to_sequences([seed_text])[0]
    token_list = pad_sequences([token_list], maxlen=X.shape[1], padding='pre')
    predicted = model.predict(token_list, verbose=0)
    predicted_word = tokenizer.index_word[np.argmax(predicted)]
    seed_text += " " + predicted_word

print(seed_text)

El resultado que arrojará podría ser parecido al que resultó con el modelo:
El sol brilla en la nieve

El código anterior carga un archivo de texto, lo pre-procesa, construye un modelo LSTM simple, lo entrena y genera texto a partir de una semilla inicial. A medida que se ejecuta el bucle final, el modelo genera texto caracter por caracter.

El archivo tu_texto.txt mencionado en el código debe contener el texto de entrenamiento en el que deseas que el modelo LSTM se base para generar texto. Puede ser cualquier archivo de texto que desees, como una novela, un conjunto de artículos, poesía o cualquier otro texto. El modelo aprenderá de ese texto y generará secuencias de caracteres similares.

El código carga este archivo, lo preprocesa dividiéndolo en fragmentos más cortos, mapea los caracteres a números para que la red neuronal pueda entenderlos y, finalmente, utiliza esos fragmentos para entrenar el modelo LSTM. Luego, el modelo genera texto basado en la semilla inicial proporcionada en generated_text.

Es importante mencionar que, para obtener resultados más largos, significativos y coherentes, se necesita una cantidad suficientemente grande de texto de entrenamiento y una arquitectura de modelo LSTM más compleja. El ejemplo proporcionado es simplificado y se puede utilizar como punto de partida para comprender cómo funcionan las LSTM en la generación de texto.

Relacionado

17 thoughts on “TensorFlow / Python: Generación de Texto con Redes Neuronales Long Short-Term Memory (LSTM)”

wiresuk dice:

Julio 22, 2024 a las 10:57 am

I do not even know how I ended up here but I thought this post was great I do not know who you are but certainly youre going to a famous blogger if you are not already Cheers

Responder
bestiptv-smarters dice:

Julio 20, 2024 a las 5:17 am

Excellent blog here Also your website loads up very fast What web host are you using Can I get your affiliate link to your host I wish my web site loaded up as quickly as yours lol

Responder
mycroxyproxy dice:

Julio 19, 2024 a las 5:29 am

I just could not depart your web site prior to suggesting that I really loved the usual info an individual supply in your visitors Is gonna be back regularly to check up on new posts

Responder
top888casino dice:

Julio 15, 2024 a las 12:33 pm

I was recommended this website by my cousin I am not sure whether this post is written by him as nobody else know such detailed about my trouble You are amazing Thanks

Responder
bestiptv-smarters dice:

Julio 15, 2024 a las 3:18 am

I just could not depart your web site prior to suggesting that I really loved the usual info an individual supply in your visitors Is gonna be back regularly to check up on new posts

Responder
Eliminate Skin Tags Remover dice:

Junio 8, 2024 a las 2:38 am

What i do not realize is in fact how you are no longer actually much more wellfavored than you might be right now Youre very intelligent You recognize thus considerably in relation to this topic made me in my view believe it from numerous numerous angles Its like men and women are not fascinated until it is one thing to do with Lady gaga Your own stuffs excellent All the time handle it up

Responder
Eliminate Skin Tags Remover dice:

Junio 2, 2024 a las 8:05 pm

Thank you for the auspicious writeup It in fact was a amusement account it Look advanced to more added agreeable from you By the way how could we communicate

Responder
tvbrackets dice:

Mayo 4, 2024 a las 3:16 am

My brother suggested I might like this blog He was totally right This post actually made my day You can not imagine simply how much time I had spent for this info Thanks

Responder
tempmail dice:

Abril 14, 2024 a las 10:11 am

I thought you did a great job on this. Although your language is excellent and the picture is enticing, you come across as nervous about what you might be giving next. If you save this walk, I hope you will come back here often.

Responder
Clear Factor Keto dice:

Enero 19, 2024 a las 11:11 am

I do believe all the ideas youve presented for your post They are really convincing and will certainly work Nonetheless the posts are too short for novices May just you please lengthen them a little from subsequent time Thanks for the post

Responder
Puravive dice:

Enero 14, 2024 a las 5:12 am

helloI really like your writing so a lot share we keep up a correspondence extra approximately your post on AOL I need an expert in this house to unravel my problem May be that is you Taking a look ahead to see you

Responder
WonderFix Glycogen Ingredient dice:

Enero 2, 2024 a las 12:26 pm

Hi i think that i saw you visited my web site thus i came to Return the favore I am attempting to find things to improve my web siteI suppose its ok to use some of your ideas

Responder
Disposable Temporary Email dice:

Enero 1, 2024 a las 4:49 pm

What i do not realize is in fact how you are no longer actually much more wellfavored than you might be right now Youre very intelligent You recognize thus considerably in relation to this topic made me in my view believe it from numerous numerous angles Its like men and women are not fascinated until it is one thing to do with Lady gaga Your own stuffs excellent All the time handle it up

Responder
Renew Calm CBD Gummies dice:

Diciembre 24, 2023 a las 9:05 pm

Awesome! Its genuinely remarkable post, I have got much clear idea regarding from this post

Responder
Fresh Luxe Serum dice:

Diciembre 19, 2023 a las 5:19 am

I like the efforts you have put in this, regards for all the great content.

Responder
weight loss gummies work dice:

Diciembre 11, 2023 a las 12:36 am

I like the efforts you have put in this, regards for all the great content.

Responder
Fake Temp Mail dice:

Diciembre 5, 2023 a las 5:39 pm

This was beautiful Admin. Thank you for your reflections.

Responder

Relacionado

17 thoughts on “TensorFlow / Python: Generación de Texto con Redes Neuronales Long Short-Term Memory (LSTM)”

Deja un comentarioCancelar respuesta