Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics

Benjamin M. Ampel; Hsinchun Chen

doi:10.1109/ISI53945.2021.9624848

Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics

Benjamin M. Ampel, Hsinchun Chen

Last updated on Jun 9, 2026

Altmetric Attention Score

This badge shows attention from news, blogs, social media, policy documents, and more. View details

📈 Dimensions Citation Metrics

Dimensions tracks citations across scholarly literature, patents, clinical trials, and policy documents. View full metrics →

In Plain Terms

This paper introduces Hack2Vec, a specialized way of converting hacker-forum text into numerical representations that machine-learning models can use. It transfers knowledge from the powerful BERT language model into a lightweight, forum-specific word embedding tailored to hacker language. On a standard forum classification task, Hack2Vec outperforms other widely used embedding methods across accuracy and related metrics.

Key Contributions

Key contributions will be added soon.

Artifacts

PDF

Related Papers

Citation

Benjamin M. Ampel & Hsinchun Chen (2021). Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics. IEEE ISI https://doi.org/10.1109/ISI53945.2021.9624848

Hacker Forums Cyber Threat Intelligence Deep Learning Exploit Labeling

Distilling Contextual Embeddings Into A Static Word Embedding For Improving Hacker Forum Analytics

Altmetric Attention Score

📈 Dimensions Citation Metrics

Key Contributions

Artifacts

Related Papers

Citation

Benjamin M. Ampel

Assistant Professor in Computer Information Systems and Director, CyberAI Research and Education Center (CARE)