Parar palavra

Partilhar isto
" Voltar ao Índice do Glossário

“Stop words” is a term used in the realm of otimização de motores de busca[1] (SEO) and data processing. These are common function words like ‘and’, ’the’, ‘in’, which are often removed from queries to save space and time in data processing. This concept has roots in creating concordances and has been developed over time by various researchers. Notably, Hans Peter Luhn is credited with coining the phrase and C.J. Van Rijsbergen proposed the first standardized list of these words. Today, the use of stop words has evolved with the advancement of machine learning[2]. While they were initially removed for faster query processing, search engines like Google[3] now advise against worrying about stop words and encourage writing in a natural way. They are still used in specific circumstances like narrowing search results. This concept is related to other topics like concept mining, information extraction, and query expansion.

Definições de termos
1. otimização de motores de busca. A otimização dos motores de busca, normalmente designada por SEO, é uma estratégia de marketing digital fundamental. Com origem em meados dos anos 90, a SEO consiste em melhorar os sítios Web para obter classificações mais elevadas nas páginas de resultados dos motores de busca. Este processo é essencial para aumentar o tráfego na Web e converter visitantes em clientes. A SEO utiliza várias técnicas, incluindo a conceção de páginas, a otimização de palavras-chave e a atualização de conteúdos, para melhorar a visibilidade de um sítio Web. Envolve também a utilização de ferramentas para monitorizar e adaptar-se às actualizações dos motores de busca. As práticas de SEO variam entre os métodos éticos de "chapéu branco" e as técnicas reprovadas de "chapéu preto", sendo que o "chapéu cinzento" se situa entre ambos. Embora a SEO não seja adequada para todos os sítios Web, a sua eficácia nas campanhas de marketing na Internet não pode ser subestimada. As tendências recentes do sector, como a utilização da Web móvel que ultrapassa a utilização do computador, realçam a paisagem em evolução da SEO.
2. machine learning. Machine learning, a term coined by Arthur Samuel in 1959, is a field of study that originated from the pursuit of artificial intelligence. It employs techniques that allow computers to improve their performance over time through experience. This learning process often mimics the human cognitive process. Machine learning applies to various areas such as natural language processing, computer vision, and speech recognition. It also finds use in practical sectors like agriculture, medicine, and business for predictive analytics. Theoretical frameworks such as the Probably Approximately Correct learning and concepts like data mining and mathematical optimization form the foundation of machine learning. Specialized techniques include supervised and unsupervised learning, reinforcement learning, and dimensionality reduction, among others.
Parar palavra (Wikipédia)

Stop words are the words in a stop list (ou stoplist ou negative dictionary) which are filtered out (i.e. stopped) before or after processing of natural language data (text) because they are deemed insignificant. There is no single universal list of stop words used by all natural language processing tools, nor any agreed upon rules for identifying stop words, and indeed not all tools even use such a list. Therefore, any group of words can be chosen as the stop words for a given purpose. The "general trend in [information retrieval] systems over time has been from standard use of quite large stop lists (200–300 terms) to very small stop lists (7–12 terms) to no stop list whatsoever".

" Voltar ao Índice do Glossário
pt_PT_ao90PT
Deslocar para o topo