{"id":19047,"date":"2014-08-28T14:52:45","date_gmt":"2014-08-28T14:52:45","guid":{"rendered":"http:\/\/d45h139.public.uconn.edu\/sites\/news\/?p=19047"},"modified":"2025-01-28T21:57:56","modified_gmt":"2025-01-29T02:57:56","slug":"making-sense-of-big-data-project-to-draw-insights-from-genome","status":"publish","type":"post","link":"https:\/\/today.uconn.edu\/2014\/08\/making-sense-of-big-data-project-to-draw-insights-from-genome\/","title":{"rendered":"Making Sense of Big Data: Project To Draw Insights From Genome"},"content":{"rendered":"<figure id=\"attachment_19092\" aria-describedby=\"caption-attachment-19092\" style=\"width: 200px\" class=\"wp-caption alignleft\"><a href=\"http:\/\/d45h139.public.uconn.edu\/sites\/news\/wp-content\/uploads\/Raj.jpg\"><img decoding=\"async\" class=\"wp-image-19092 img-responsive lazyload\" data-src=\"http:\/\/d45h139.public.uconn.edu\/sites\/news\/wp-content\/uploads\/Raj-300x300.jpg\" alt=\"Raj\" width=\"200\" height=\"200\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 200px; --smush-placeholder-aspect-ratio: 200\/200;\" \/><\/a><figcaption id=\"caption-attachment-19092\" class=\"wp-caption-text\">Sanguthevar Rajasekaran<\/figcaption><\/figure>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">Advances in technology have allowed researchers to produce a huge amount of data in a short amount of time. Making any meaningful sense of this information in a timely manner is the next step in \u201cbig data.\u201d<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">With a $1.2 million NSF grant, Sanguthevar Rajasekaran, Director of the Booth Engineering Center for Advanced Technologies, and a team of researchers will devise new algorithms that can efficiently make use of the almost inconceivably large set of information. It is the first NSF-funded grant\u00a0for big data research project in the state.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cPeople end up generating so much data, and it\u2019s a big challenge to process the humongous data sets,\u201d said Rajasekaran, who will work on the project with researchers from the University of Florida and the Jackson Laboratory for Genomic Medicine on the UConn Health campus in Farmington, CT. The team includes Reda Ammar (CSE), Jinbo Bi (CSE), Joerg Graf (MCB), Sartaj Sahni (University of Florida), George Weinstock (JAX), and Yufeng Wu (CSE).<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">In biology, he said, researchers can generate terabytes of data on a daily basis, but the algorithms they currently have to process this data can\u2019t keep up. \u201cThe algorithms take up too much space, as well as too much time.\u201d<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cIf we want to advance science, we have to have that information quickly,\u201d he said.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">If a dataset is too large to fit onto the core memory of a computer, it must be placed in a secondary storage, such as a disc or a solid state drive. That greatly increases the time it takes to gain access to the data.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">Rajasekaran said his project is to develop more efficient algorithms to process these data sets by developing out-of-core algorithms as well as parallel algorithms. Out-of-core algorithms process data too large for a computer\u2019s main memory and are designed to efficiently retrieve information stored in hard drives or tape drives. Work has been done in this area, but very little has been done in regard to biological big data.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cNot many people are doing the parallel algorithms and even fewer of the out-of-core ones,\u201d he said.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">The project will also include the work of George Weinstock, associate director for microbial genomics at Jackson Labs, who will supply Rajasekaran with datasets from their research to test the algorithms.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cIt\u2019s now possible to produce amazing amounts of data,\u201d Weinstock said. \u201cBut the data doesn\u2019t do you any good unless you can manage it very efficiently and extract actual results from it. This is one of the very large and\u00a0unmet needs right now in research.\u201d<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">Weinstock\u2019s work on the genome of the African green monkey will figure into the project.<\/span><\/p>\n<figure id=\"attachment_19053\" aria-describedby=\"caption-attachment-19053\" style=\"width: 150px\" class=\"wp-caption alignright\"><a href=\"http:\/\/d45h139.public.uconn.edu\/sites\/news\/wp-content\/uploads\/weinstock-enews-wide1.png\"><img decoding=\"async\" class=\"wp-image-19053 size-thumbnail img-responsive lazyload\" data-src=\"http:\/\/d45h139.public.uconn.edu\/sites\/news\/wp-content\/uploads\/weinstock-enews-wide1-150x150.png\" alt=\"weinstock-enews-wide[1]\" width=\"150\" height=\"150\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 150px; --smush-placeholder-aspect-ratio: 150\/150;\" \/><\/a><figcaption id=\"caption-attachment-19053\" class=\"wp-caption-text\">George Weinstock<\/figcaption><\/figure>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cWe\u2019re very interested in what quantitative traits we can find \u2013 those\u00a0are things like height, or blood pressure, or more complex things like the concentration of neurotransmitters in blood,\u201d he said. \u201cTo figure out what the genes are that might have different mutations in them in high blood pressure, or height, or bad behavior, you have to do a genetic analysis of the entire genome in many subjects \u2013 sometimes thousands of them. For example, some place in the genome there are particular variants in the genomic sequence that only tall people have.\u201d<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">Rooting out that particular variant, though, means finding an extremely small deviation in a mountain of data \u2013 a single DNA letter difference among 6 billion.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">It\u2019s possible to do that today, but it can take weeks or months. Getting that time down to a day or two would make a huge difference. It wouldn\u2019t just make researchers\u2019 lives easier \u2013 it could revolutionize science and medicine.<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cThe Holy Grail in all this is to apply it clinically in medicine,\u201d he said. Go to the doctor now and you can get the results of a blood test back in a day or so. Having the technology to analyze a genomic sequence in the same time could greatly advance treatment for cancer and personalized medicine.\u00a0<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u201cIf you could do those genetic computations in just days, now it becomes a clinical tool that helps in medical care, and that would be huge.\u201d<\/span><\/p>\n<p><span style=\"font-family: arial,helvetica,sans-serif;font-size: 10pt\">\u00a0As Rajasekaran envisions it, the results will have a far-reaching impact. The algorithms will be disseminated widely as a software library, and incorporated in undergraduate and graduate courses.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>UConn&#8217;s Sanguthevar Rajasekaran is part of a research team developing ways to make sense of biological big data.<\/p>\n","protected":false},"author":122,"featured_media":224171,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_crdt_document":"","wds_primary_category":0,"wds_primary_series":0,"wds_primary_attribution":0,"footnotes":""},"categories":[1866],"tags":[],"magazine-issues":[],"coauthors":[43],"class_list":["post-19047","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-engr"],"pp_statuses_selecting_workflow":false,"pp_workflow_action":"current","pp_status_selection":"publish","acf":[],"publishpress_future_action":{"enabled":false,"date":"2026-04-29 05:26:49","action":"change-status","newStatus":"draft","terms":[],"taxonomy":"category","extraData":[]},"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"_links":{"self":[{"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/posts\/19047","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/users\/122"}],"replies":[{"embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/comments?post=19047"}],"version-history":[{"count":1,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/posts\/19047\/revisions"}],"predecessor-version":[{"id":224178,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/posts\/19047\/revisions\/224178"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/media\/224171"}],"wp:attachment":[{"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/media?parent=19047"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/categories?post=19047"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/tags?post=19047"},{"taxonomy":"magazine-issue","embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/magazine-issues?post=19047"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/today.uconn.edu\/wp-rest\/wp\/v2\/coauthors?post=19047"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}