An AI To Identify The Environment A Grain Of Sand Came From

Ripples in the sand on a dry and rocky landscape. — The SandAI neural network was trained using modern quartz sand and can help unravel the histories encoded in ancient rocks. Shown here are ancient ripples formed by water currents being reworked by modern wind-blown sediment in Oman. Credit: Mathieu Lapôtre, Stanford University

If you were given a bucket of sand and asked to determine where it came from, you’d probably have a hard time guessing if it was from a beach, a riverbank, the playground down the street, or a Saharan sand dune.

There are experts who can make a guess at that sort of ID, using a categorization process that takes skill, a scanning electron microscope, and hours of time. Now, however, researchers report in the Proceedings of the National Academy of Sciences that they’ve developed an AI model that can quickly judge whether a sample of sand came from a beach, a river, a glacial deposit, or a wind-blown dune.

A very enlarged photo of a grain of sand. — Scanning electron microscopy reveals the shape and texture of a quartz sand grain from the Mississippi River. The pictured sand grain is about 200 micrometers in length. Credit: Michael Hasson, Stanford University

That type of identification isn’t just of interest to geologists. Sand is one of the world’s most in-demand resources, second only to water in use. And different applications need different types of sand—for instance, making concrete and mortar requires angular sand for good adhesion and stability. These kinds of needs have given rise to illicit sand mining, sand theft, and sand smuggling. A way of rapidly identifying the origins of a sample of sand could be useful to investigators, or to companies seeking to ensure sustainability goals.

Michael Hasson, a PhD candidate in Stanford’s Department of Earth and Planetary Sciences, joins SciFri’s Charles Bergquist to discuss the new SandAI, and the challenges of tracking grains of sand.

Sign Up For The Week In Science Newsletter

Keep up with the week’s essential science news headlines, plus stories that offer extra joy and awe.

Segment Guests

Michael Hasson

Michael Hasson is a Ph.D. candidate in the Department of Earth and Planetary Sciences at Stanford University in Stanford, California.

Segment Transcript

SPEAKER: For the rest of the hour, Sci Fri’s Charles Bergquist is here with a story about sand. Hi, Charles.

CHARLES BERGQUIST: Hey, Ira. If I gave you a bucket of sand and asked you to tell me where it came from, I’m guessing you’d have a hard time telling me if it was from a Saharan sand dune, an ocean beach or a riverbank. There are experts who could make a guess at that sort of ID, based on a process that takes skill, a scanning electron microscope and hours of time. But of course, now, there’s an AI for that.

Joining me now is Michael Hasson. He’s a PhD candidate in Stanford’s Department of Earth and Planetary Sciences and lead author on a paper describing this new AI for sand that was published this week in the proceedings of the National Academy of Sciences. Welcome to Science Friday.

MICHAEL HASSON: Hey, thank you so much for having me. It’s great to be here.

CHARLES BERGQUIST: Great to have you. Walk me through this challenge I mentioned. I give you a bucket of sand, how would you start trying to ID it without your tool?

MICHAEL HASSON: Yeah, so in the past, it’s basically amounted to filling out a spreadsheet with about 30-35 different features. And a human would go through each photo and make a check mark next to each feature that was present. Things like whether the grain is angular or subangular or rounded, the presence of arcuate steps, parallel striations or elongated depressions. And so at each step, the human is making this judgment call about very specific terms that have really big implications for where the sand actually comes from. So it’s a hugely subjective process and it takes a really long time.

CHARLES BERGQUIST: Is this something that a regular person could pull out a magnifying glass and see some of this, or do you really need your scanning electron microscope to make this determination?

MICHAEL HASSON: Yeah, we really need a microscope to do this, and particularly, the scanning electron microscope that is able to resolve these features that are less than a micron.

CHARLES BERGQUIST: Is there any kind of standard for categorizing sand? Do sand people look at a sample and say, “oh, that’s clearly a Josephson type 2A”?

MICHAEL HASSON: No, it really all exists along a spectrum. And so using some statistical analysis, we can tell you if it’s more like one environment or another, but we can’t really give you hard, specific answers about where it exactly comes from.

CHARLES BERGQUIST: So forgive me for asking this, but is there really a huge need for rapid, accurate sand identification?

MICHAEL HASSON: Yeah, that’s a totally fair question. Outside of fundamental sciences like geology, geoarchaeology, something that the world is going to start having to grapple with more and more is where we get our sand from. Sand sourcing is this really critical problem because after water, it’s the second most important raw material in the world. And anything that we build that’s made out of concrete is reliant on sand, and not just any sand. It has to be angular sand so that the grains lock together and form a structurally sound material.

And so what that’s led to is a lot of illegal mining of sand, especially in places like southeast Asia that are developing very rapidly. They’re often companies that set up on a riverbank and they just start removing sediment. And so if you’re a construction company that’s trying to make sure you’re sourcing your sand responsibly, using our tool is a way that you can confirm whether your supplier is being accurate about what they’re telling you and where that sand is coming from.

CHARLES BERGQUIST: Yeah. So break it down for me. How structurally is sand from, say, a beach dune different from sand on a Saharan desert dune to an expert?

MICHAEL HASSON: Different environments can produce very similar textures. So it’s really helpful to look at these in ensemble. And so when a human has been doing this in the past, they’ve taken the sample of, say, 100 grains and they say, OK, on average, this sample is a little bit more round than another. And if it’s a little bit more round, maybe that means it was moved by wind instead of water.

Because when grains collide in the air, there’s less of a viscous fluid to soften the impact between them. And so when they collide in the air, they just break off and the grains become more round. But it’s a non-unique process. And so that’s why we were really excited to apply machine learning to this, to try to capture these patterns that are really difficult to get at from just a human analysis.

CHARLES BERGQUIST: So now you have this AI tool that you’ve trained on thousands of known samples of sand. Can it do this task better?

MICHAEL HASSON: Yeah. So this tool that we’ve developed doesn’t just take one sample and condense it down to a single result. So historically, we’ve taken this sample of, say, 100 sand grains, and that sample is more or less beach. That would be the answer that you could get from it. With this new tool, we can get an answer for every single sand grain, which lets us do a much better job of fingerprinting individual sites and telling more complex stories of whole samples. And so one really neat example of this was from a sample that we used to test the model from the bottom of the Mississippi river.

So the Mississippi river is about 2,000 miles long, and we sampled from very end of it. And rather than showing an entirely river dominated signature, the sample was about 50% river and 50% windblown sand. And so at first, that might seem a little bit puzzling, but it’s telling this really interesting story.

And the story is that way upstream, the banks of the Mississippi river are made out of windblown sediment that was deposited around the last ice age. And so just using our tool, we can start to piece together the story of a modern river moving across this bed of ancient windblown sand and get this really interesting picture of how that part of the world has evolved just by looking at a few grains of sand.

CHARLES BERGQUIST: So if you come up with this answer that’s, like, this is 50% windblown, 50% river, does that give you a fingerprint of the Mississippi? If you come up with those numbers, do you say, this is probably Mississippi river sand, as opposed to some other place?

MICHAEL HASSON: Right now, we’re working with probably. We haven’t compiled that whole data set of what different major rivers, let’s say, around the world would produce in terms of the distribution of different environments that our model would predict. But that’s likely a really good signature for individual sites that we could use going forward.

CHARLES BERGQUIST: I mean, if you were able to give it enough samples of very specific rivers, could you fine tune this model to be even more specific, like picking out one river environment versus another?

MICHAEL HASSON: Yeah, absolutely. So that brings up this really important point within the broader machine learning world, which is the idea of transfer learning. And transfer learning is this idea that a model that’s trained to do one task well can likely be fine tuned to do another similar task even better than it would be able to do if it were just starting from scratch. And so what we’ve achieved now is we have a model that sort of knows how to work with sand.

And if we were to take this model and compile a different set of data where we know exactly what site the sample comes from rather than what environment it is made up of, we could train the model to distinguish between different rivers, let’s say, and it would be expected to perform much better than if we were just training the model from scratch.

CHARLES BERGQUIST: I mean, you mentioned sand forensics in the context of construction companies. But is there any possible use here for regular police forensics or, I don’t know, archeological exploration or something like that?

MICHAEL HASSON: It’s something that people have looked into for regular police forensics in terms of using it to test a sample that was on a murder suspect shoe and seeing if it confirms or refutes their alibi. Geoarchaeology is a context that we’re really excited for this to be used in too because this is a new tool for contextualizing environments when we have really limited data available.

One really neat example that we found had to do with taking a core through an archeological site with the goal of contextualizing where artifacts were coming from. So where were people living? Was this a river? Where they were living in a desert? And that’s really hard to do, especially when there’s no real rock outcrop that geologists could look at. And so when you take a core to minimize the destruction of the site, you also lose all the context to where that sediment is coming from. And so having a tool like this that can provide some environmental information without any context can be really critical in some situations.

CHARLES BERGQUIST: So where do you go from here with this? What would be the next step in this research?

MICHAEL HASSON: So we’ve publicized the model so that it’s online for anybody to use freely. And we’re hoping that as more and more people use it, we will understand better the instances where it works and the instances where it doesn’t. We’ve tested it to know that it works quite well in as much data as we have available. But given the natural variability of the world, there will almost certainly be places where it fails. And knowing about those will help us tweak the model and come up with a new version of it that will work in an even wider variety of settings.

CHARLES BERGQUIST: Very cool. Michael Hasson is a PhD candidate in Stanford’s department of Earth and Planetary Sciences. Thanks so much for taking time to talk with me today.

Copyright © 2024 Science Friday Initiative. All rights reserved. Science Friday transcripts are produced on a tight deadline by 3Play Media. Fidelity to the original aired/published audio or video file might vary, and text might be updated or amended in the future. For the authoritative record of Science Friday’s programming, please visit the original aired/published recording. For terms of use and more information, visit our policies pages at http://www.sciencefriday.com/about/policies/

Meet the Producers and Host

About Charles Bergquist

As Science Friday’s director and senior producer, Charles Bergquist channels the chaos of a live production studio into something sounding like a radio program. Favorite topics include planetary sciences, chemistry, materials, and shiny things with blinking lights.

About Ira Flatow

Ira Flatow is the founder and host of Science Friday. His green thumb has revived many an office plant at death’s door.

Cookie	Duration	Description
_abck	1 year	This cookie is used to detect and defend when a client attempt to replay a cookie.This cookie manages the interaction with online bots and takes the appropriate actions.
ASP.NET_SessionId	session	Issued by Microsoft's ASP.NET Application, this cookie stores session data during a user's website visit.
AWSALBCORS	7 days	This cookie is managed by Amazon Web Services and is used for load balancing.
bm_sz	4 hours	This cookie is set by the provider Akamai Bot Manager. This cookie is used to manage the interaction with the online bots. It also helps in fraud preventions
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
csrftoken	past	This cookie is associated with Django web development platform for python. Used to help protect the website against Cross-Site Request Forgery attacks
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
nlbi_972453	session	A load balancing cookie set to ensure requests by a client are sent to the same origin server.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
TiPMix	1 hour	The TiPMix cookie is set by Azure to determine which web server the users must be directed to.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
visid_incap_972453	1 year	SiteLock sets this cookie to provide cloud-based website security services.
X-Mapping-fjhppofk	session	This cookie is used for load balancing purposes. The cookie does not store any personally identifiable data.
x-ms-routing-name	1 hour	Azure sets this cookie for routing production traffic by specifying the production slot.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
S	1 hour	Used by Yahoo to provide ads, content or analytics.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Duration	Description
__jid	30 minutes	Cookie used to remember the user's Disqus login credentials across websites that use Disqus.
_gat	1 minute	This cookie is installed by Google Universal Analytics to restrain request rate and thus limit the collection of data on high traffic sites.
_gat_UA-28243511-22	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
AWSALB	7 days	AWSALB is an application load balancer cookie set by Amazon Web Services to map the session to the target.
countryCode	session	This cookie is used for storing country code selected from country selector.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
NID	6 months	NID cookie, set by Google, is used for advertising purposes; to limit the number of times the user sees an ad, to mute unwanted ads, and to measure the effectiveness of ads.
personalization_id	2 years	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
vglnk.Agent.p	1 year	VigLink sets this cookie to track the user behaviour and also limit the ads displayed, in order to ensure relevant advertising.
vglnk.PartnerRfsh.p	1 year	VigLink sets this cookie to show users relevant advertisements and also limit the number of adverts that are shown to them.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_dc_gtm_UA-28243511-20	1 minute	No description
abtest-identifier	1 year	No description
AnalyticsSyncHistory	1 month	No description
ARRAffinityCU	session	No description available.
ccc	1 month	No description
COMPASS	1 hour	No description
cookies.js_dtest	session	No description
debug	never	No description available.
donation-identifier	1 year	No description
f	never	No description available.
GFE_RTT	5 minutes	No description available.
incap_ses_1185_2233503	session	No description
incap_ses_1185_823975	session	No description
incap_ses_1185_972453	session	No description
incap_ses_1319_2233503	session	No description
incap_ses_1319_823975	session	No description
incap_ses_1319_972453	session	No description
incap_ses_1364_2233503	session	No description
incap_ses_1364_823975	session	No description
incap_ses_1364_972453	session	No description
incap_ses_1580_2233503	session	No description
incap_ses_1580_823975	session	No description
incap_ses_1580_972453	session	No description
incap_ses_198_2233503	session	No description
incap_ses_198_823975	session	No description
incap_ses_198_972453	session	No description
incap_ses_340_2233503	session	No description
incap_ses_340_823975	session	No description
incap_ses_340_972453	session	No description
incap_ses_374_2233503	session	No description
incap_ses_374_823975	session	No description
incap_ses_374_972453	session	No description
incap_ses_375_2233503	session	No description
incap_ses_375_823975	session	No description
incap_ses_375_972453	session	No description
incap_ses_455_2233503	session	No description
incap_ses_455_823975	session	No description
incap_ses_455_972453	session	No description
incap_ses_8076_2233503	session	No description
incap_ses_8076_823975	session	No description
incap_ses_8076_972453	session	No description
incap_ses_867_2233503	session	No description
incap_ses_867_823975	session	No description
incap_ses_867_972453	session	No description
incap_ses_9117_2233503	session	No description
incap_ses_9117_823975	session	No description
incap_ses_9117_972453	session	No description
li_gc	2 years	No description
loglevel	never	No description available.
msToken	10 days	No description

An AI To Identify The Environment A Grain Of Sand Came From

Further Reading

Sign Up For The Week In Science Newsletter

Segment Guests

Segment Transcript

Meet the Producers and Host

About Charles Bergquist

About Ira Flatow

Explore More

Further Reading

Sign Up For The Week In Science Newsletter

Segment Guests

Segment Transcript

Meet the Producers and Host

About Charles Bergquist

About Ira Flatow

Explore More

From Skyscrapers to Sand Thieves—Digging Into The World Of Sand

Does More Sand Always Mean A Better Beach?