DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How to Enhance Model Performance with Effective Feature Engineering

How to Enhance Model Performance with Effective Feature Engineering

Comments
4 min read
Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Machine Psychology: Investigating Emergent Capabilities and Behavior in Large Language Models Using Psychological Methods

Comments
4 min read
Simulacra as Conscious Exotica

Simulacra as Conscious Exotica

Comments
4 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Comments
3 min read
Which algorithm to select in sports timetabling?

Which algorithm to select in sports timetabling?

Comments
4 min read
Delving into ChatGPT usage in academic writing through excess vocabulary

Delving into ChatGPT usage in academic writing through excess vocabulary

Comments
3 min read
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Comments
4 min read
AI Agents That Matter

AI Agents That Matter

Comments
3 min read
Mixture of A Million Experts

Mixture of A Million Experts

Comments
3 min read
Distilling System 2 into System 1

Distilling System 2 into System 1

Comments
4 min read
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Comments
4 min read
SmartChoices: Augmenting Software with Learned Implementations

SmartChoices: Augmenting Software with Learned Implementations

Comments
4 min read
Personalized Language Modeling from Personalized Human Feedback

Personalized Language Modeling from Personalized Human Feedback

Comments
4 min read
LLMs can learn self-restraint through iterative self-reflection

LLMs can learn self-restraint through iterative self-reflection

Comments
5 min read
Shadows of quantum machine learning

Shadows of quantum machine learning

Comments
4 min read
Vulnerability Detection with Code Language Models: How Far Are We?

Vulnerability Detection with Code Language Models: How Far Are We?

Comments
5 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Comments
4 min read
When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

Comments
4 min read
PaliGemma: A versatile 3B VLM for transfer

PaliGemma: A versatile 3B VLM for transfer

Comments
4 min read
Tour the WayveScenes101 Autonomous Driving Dataset 03:18

Tour the WayveScenes101 Autonomous Driving Dataset

Comments
1 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Comments
3 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Comments
4 min read
Volumetric Rendering with Baked Quadrature Fields

Volumetric Rendering with Baked Quadrature Fields

Comments
3 min read
Memory, Consciousness and Large Language Model

Memory, Consciousness and Large Language Model

Comments
4 min read
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Comments
4 min read
A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections

A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections

Comments
3 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

Comments
4 min read
The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

Comments
4 min read
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Comments
4 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Comments
3 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

Comments
4 min read
Reasoning in Large Language Models: A Geometric Perspective

Reasoning in Large Language Models: A Geometric Perspective

Comments
4 min read
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Comments
4 min read
ColPali: Efficient Document Retrieval with Vision Language Models

ColPali: Efficient Document Retrieval with Vision Language Models

Comments
4 min read
FACTS About Building Retrieval Augmented Generation-based Chatbots

FACTS About Building Retrieval Augmented Generation-based Chatbots

Comments
4 min read
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Comments
4 min read
Toto: Time Series Optimized Transformer for Observability

Toto: Time Series Optimized Transformer for Observability

Comments
5 min read
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Comments
4 min read
Exploring the Latest LLMs for Leaderboard Extraction

Exploring the Latest LLMs for Leaderboard Extraction

Comments
4 min read
There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

Comments
4 min read
What's the Magic Word? A Control Theory of LLM Prompting

What's the Magic Word? A Control Theory of LLM Prompting

Comments
4 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Comments
4 min read
Databases Deconstructed: The Value of Data Lakehouses and Table Formats

Databases Deconstructed: The Value of Data Lakehouses and Table Formats

3
Comments
8 min read
Data Science? Never Heard Of It.

Data Science? Never Heard Of It.

Comments
2 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

1
Comments
3 min read
Voxel51 Filtered Views Newsletter - July 12, 2024

Voxel51 Filtered Views Newsletter - July 12, 2024

1
Comments
11 min read
Optimizing ETL Processes for Efficient Data Loading in EDWs

Optimizing ETL Processes for Efficient Data Loading in EDWs

Comments
4 min read
Patient-Centered Care and Data Integration in Population Health Management

Patient-Centered Care and Data Integration in Population Health Management

Comments
4 min read
Want to get started as a Data Engineer

Want to get started as a Data Engineer

Comments
1 min read
What is GitHub Copilot: detailed overview

What is GitHub Copilot: detailed overview

12
Comments
4 min read
Useful datasets for AI/ML

Useful datasets for AI/ML

Comments
1 min read
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

1
Comments
4 min read
Now I know why NVIDIA stocks are high

Now I know why NVIDIA stocks are high

Comments
2 min read
The Data Understanding Phase: The Key to a Successful Machine Learning Project

The Data Understanding Phase: The Key to a Successful Machine Learning Project

Comments
5 min read
ACID: O Pilar dos Bancos de Dados Relacionais

ACID: O Pilar dos Bancos de Dados Relacionais

2
Comments
2 min read
How to Handle Secrets in Jupyter Notebooks

How to Handle Secrets in Jupyter Notebooks

2
Comments
8 min read
I made simple binary translator support binary, text, hex, octal, decimal

I made simple binary translator support binary, text, hex, octal, decimal

2
Comments
1 min read
How to Build a Data Entry System (Quick & Easy Guide)

How to Build a Data Entry System (Quick & Easy Guide)

2
Comments 1
15 min read
Modelos Generativos y su Aplicación en Datos Sintéticos

Modelos Generativos y su Aplicación en Datos Sintéticos

Comments
3 min read
Machine Learning for Predictive Maintenance

Machine Learning for Predictive Maintenance

Comments
3 min read
loading...