Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics

Altmetric Attention Score

This badge shows attention from news, blogs, social media, policy documents, and more. View details

๐Ÿ“ˆ Dimensions Citation Metrics

Dimensions tracks citations across scholarly literature, patents, clinical trials, and policy documents. View full metrics โ†’

In Plain Terms

This paper introduces Hack2Vec, a specialized way of converting hacker-forum text into numerical representations that machine-learning models can use. It transfers knowledge from the powerful BERT language model into a lightweight, forum-specific word embedding tailored to hacker language. On a standard forum classification task, Hack2Vec outperforms other widely used embedding methods across accuracy and related metrics.

Key Contributions

Key contributions will be added soon.

Artifacts

Citation

Benjamin M. Ampel & Hsinchun Chen (2021). Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics. IEEE ISI https://doi.org/10.1109/ISI53945.2021.9624848
Benjamin M. Ampel
Benjamin M. Ampel
Assistant Professor in Computer Information Systems and Director, Center for CyberAI Research (CCAIR)

My research focuses on AI-enabled Cybersecurity, including Cyber Threat Intelligence, Large Language Models, and Phishing Detection.

Loading stats...