logo
  • About Us
    • Who We Are
    • Mission & Core Values
    • Equity, Diversity & Inclusion
    • Meet the Team
    • Board of Directors
    • Scientific Advisory Committee
    • EDI Advisory Group
  • Services
    • Is SOSCIP for you?
    • SOSCIP’s Industry Overview
    • Advanced Computing Platforms
    • SOSCIP Project Guide
    • Fee for Services Program
  • Initiatives
    • Community Fellowship
    • COVID-19 Response: A Curated List
  • Projects
    • Collaboration Opportunities
    • Research Projects Archive
  • Impact
    • Spotlight Homepage
    • Impact Stories
    • SOSCIP By the Numbers
  • News
    • Platforms Newsletter
    • COVID-19 Update: Operations
    • SOSCIP COVID-19 FAQ
  • Search

Research Projects

Focus Area
  • Digital Media
    • 5G/NextGen Networks
    • Advanced ManufacturingAdvanced Manufacturing
    • Aerospace & DefenceAerospace & Defence
    • AgricultureAgriculture
    • AIAI
    • Blockchain
    • Business AnalyticsBusiness Analytics
    • CitiesCities
    • Clean TechClean Tech
    • COVID-19COVID-19
    • CybersecurityCybersecurity
    • Digital MediaDigital Media
    • EnergyEnergy
    • Environment & ClimateEnvironment & Climate
    • FinTech
    • HealthHealth
    • ICTICT
    • MiningMining
    • Quantum
    • Supply ChainSupply Chain
    • TransportationTransportation
    • WaterWater
    • All
Platform(s)
  • All
    • Cloud
    • GPU
    • Parallel CPU
    • All
Academic Institution
  • All
    • Carleton University
    • McMaster University
    • Ontario Tech University
    • Queen's University
    • Seneca College
    • Toronto Metropolitan University
    • University of Guelph
    • University of Ottawa
    • University of Toronto
    • University of Waterloo
    • University of Windsor
    • Western University
    • Wilfrid Laurier University
    • York University
    • All
(FoRCE): Powering clinical trials research through a secure and integrated data management platform
Collaborators: Queen's University & Indoc Research
Cybersecurity Digital Media Health

(FoRCE): Powering clinical trials research through a secure and integrated data management platform

Critical care units are one of the most data-rich environments in clinical settings, with data being generated by advanced patient monitoring, frequent laboratory and radiologic tests, and around-the-clock evaluation. There are substantial opportunities in linking data that are collected as a part of such clinical practice with data collected in a research setting, such as genome wide studies or comprehensive imaging protocols. However, security and privacy issues have historically been a significant barrier to the storage, analysis, and linkage of such biomedical data. Further, disparate technologies hinder collaboration across teams, most of which lack the secure systems required to enable federation and sharing of these data. This is particularly true when clinical practice or research designs require close to real time analysis and timely feedback, such as when dealing with streamed medical data or output from clinical laboratories. Current commercial and research solutions often fail to integrate different data types, are incapable of handling streaming data, and rely solely on the security measures put in place by the organizations that deploy them.

This proposal seeks to build FoRCE (Focus on Research and Clinical Evaluation), a scalable and adaptable add-on module to the existing Indoc Informatics platform that will address the critical gaps in cybersecurity and privacy infrastructure within shared clinical and research settings, while fulfilling important unmet needs for both the clinical and research communities. FoRCE will provide the secure architecture and processes to support the collection, federation and sharing of data from distributed clinical settings, including critical care units, clinical laboratories, and imaging facilities. The proposed platform will address several key issues including security considerations, infrastructure and software requirements for linkage, and solutions for handling streaming real time medical data, and ensuring regulatory and ethics compliance when linking diverse medical data modalities in a clinical setting.

FoRCE will be designed and developed with broad applicability in mind, and will therefore allow the different data types from numerous technologies and across multiple disease states to utilize the platform. The long term impact of FoRCE on improving the health of Ontarians is of course dependent on its utilization within research and clinical settings. An initial project which will utilize the platform as part of the testing and validation of FoRCE includes Dr. Maslove’s integrated approach to merging genomic and physiologic data streams from the ICU in the context of clinical research. FoRCE will enable Dr. Maslove’s team of critical care researchers to move beyond predictors of survival to focus on predictors of response to therapy, so that clinical trials in the ICU can be optimized to produce actionable evidence and personalized results. This will lead to better allocation of ICU resources, which in Canada cost nearly $3,000 per patient per day – $3.72 billion per year.

Industry Partner(s): Indoc Research

Academic Institution: Queen's University

Academic Researcher: David Maslove

Platform: Cloud, Parallel CPU

Focus Areas: Cybersecurity, Digital Media, Health

A cloud‐based, multi‐modal, cognitive ophthalmic imaging platform for enhanced clinical trial design and personalized medicine in blinding eye disease
Collaborators: Western University & Tracery Ophthalmics
Digital Media Health

A cloud‐based, multi‐modal, cognitive ophthalmic imaging platform for enhanced clinical trial design and personalized medicine in blinding eye disease

Age Related Macular Degeneration is the leading cause of irreversible blindness in Canada and the industrialized world, yet there are no treatments for the vast majority of patients. Led by Tracery Ophthalmics inc, and working with Translatum Medicus inc (TMi) and academic partners at the Robarts Research Institute, Western University, and the “High Risk Dry AMD Clinic” of St Michael’s Hospital, we will engage SOSCIP’s Cloud Analytics platform, including servers, software and human resources, to accelerate the search for new treatments.

Specifically, Tracery has developed a novel functional imaging method, “AMD Imaging” (AMDI) that has already generated unprecedented pictures of the retina (the film of the eye) that include both known and unknown “flavours” of disease (the phenotype). These complex images will be compared against an individual’s genetic makeup (their genotype) and their concurrent illnesses, medications, and lifestyle history (their epigenetics). Further, Tracery’s imaging will help identify particular patients that will benefit from TMi’s drug development program, and ultimately help doctors choose which treatment will work best. Over the course of two years, we will involve increasing numbers of medical experts and their patients to generate and amass AMDI images, evaluating them over time and against other modalities.

Ultimately, through the “I3” program, we will work with IBM to train Watson and the Medical Sieve to recognize and co‐analyse complex disease patterns in the context of the ever‐expanding scientific literature. In short, we will leverage cloud‐based computing, to integrate image‐based and structured data, genomics and large data analytic to unite global users. We anticipate that this approach will significantly accelerate drug development, providing personalized treatment for the right patient at the right time.

Industry Partner(s): Tracery Ophthalmics

Academic Institution: Western University

Academic Researcher: Ali Khan

Co-PI Names: Filiberto Altomare, Louis Giavedoni & Steven Scherer

Platform: Cloud

Focus Areas: Digital Media, Health

A dynamic and scalable data cleaning system for Watson analytics
Collaborators: McMaster University & IBM Canada Ltd.
Cybersecurity Digital Media

A dynamic and scalable data cleaning system for Watson analytics

Poor data quality is a serious and costly problem affecting organizations across all industries. Real data is often dirty, containing missing, erroneous, incomplete, and duplicate values. It is estimated that poor data quality cost organizations between 15% and 25% of their operating budget. Existing data cleaning solutions focus on identifying inconsistencies that do not conform to prescribed data formats assuming the data remains relatively static. As modern applications move towards more dynamic search analytics and visualization, new data quality solutions that support dynamic data cleaning are needed. An increasing number of data analysis tools, such as Watson Analytics, provide flexible data browsing and querying abilities. In order to ensure reliable, trusted and relevant data analysis, dynamic data cleaning solutions are required. In particular, current data quality tools fail to adapt to: (1) fast changing data and data quality rules (for example as new datasets are integrated); (2) new data governance rules that may be imposed for a particular industry; and (3) utilize industry specific terminology and concepts that can refine data quality recommendations for greater accuracy and relevance. In this project, we will develop a system for dynamic data cleaning that adapts to changing data and rules, and considers industry specific models for improved data quality.

Industry Partner(s): IBM Canada Ltd.

Academic Institution: McMaster University

Academic Researcher: Fei Chiang

Platform: Cloud

Focus Areas: Cybersecurity, Digital Media

Active learning for automatic generation of narratives from numeric financial and supply chain data
Collaborators: Ryerson University & Unilever Canada Inc.
Advanced Manufacturing Digital Media

Active learning for automatic generation of narratives from numeric financial and supply chain data

Large enterprises compile and analyze large amounts of data on a daily basis. Typically the collected raw data is processed by financial analysts to produce reports. Executive personnel use these reports to oversee the operations and make decisions based on the data. Some of the processing performed by financial analysts can be easily automated by currently available computational tools. These tasks mostly make use of standard transformations on the raw data including visualizations and aggregate summaries. On the other hand automating some of the manual processing requires more involved artificial intelligence techniques.

In our project we aim to solve one of these harder to automate tasks. In fact analyzing textual data using Natural Language Processing (NLP) techniques is one of the standardized methods of data processing in modern software tools. However the vast majority of NLP methods primarily aim to analyze textual data, rather than generate meaningful narratives.

Since the generation of text is a domain-dependent and non-trivial task, the automated generation of narratives requires novel research to be useful in an enterprise environment. In this project we focus on using numerical financial and supply chain data to generate useful textual reports that can be used in the executive level of companies. Upon successful completion of the project, financial analysts will spend less time on repetitive tasks and have more time to focus on reporting tasks requiring higher-level data fusion skills.

Industry Partner(s): Unilever Canada Inc.

Academic Institution: Ryerson University

Academic Researcher: Ayse Bener

Co-PI Names: John Maidens

Platform: Cloud, GPU

Focus Areas: Advanced Manufacturing, Digital Media

Advancing the CANWET watershed model and decision support system by utilizing high performance parallel computing functionality
Collaborators: University of Guelph & Greenland International Consulting
Cities Clean Tech Digital Media Water

Advancing the CANWET watershed model and decision support system by utilizing high performance parallel computing functionality

Watershed modeling is widely used to better understand processes and help inform planning and watershed management decisions. Examples include identifying impacts associated with land use change; investigating outcomes of infrastructure development, predicting effects of climate change. The proposed project will see the evolution of a desktop based watershed modeling and decision support system to a web based tool that will allow greater access by decision makers and stakeholders. By this means we will advance the idea of evaluating cumulative effects in the watershed decision making process rather than the current practice of assessing proposed changes in isolation.

The proposed software evolution will take advantage of high performance computing by porting existing code to a higher performing language and restructuring to operate using parallel or multi-core processing. The result is expected to be a dramatic reduction in simulation run times. Reduced run times will facilitate the use of automatic calibration routines used to conduct model setup, reducing costs. It will also enable rapid response if the simulation were to be re-run by a request through the web-based user interface. The designed web-based tool will be used by decision and policy makers in the watersheds that contribute to Lake Erie to understand the sources of pollution especially phosphorus which is a major contributor to Lake Erie eutrophication problems and develop policies in supporting a wide variety of watershed planning and ultimately help achieve the Federal and Ontario government commitments to reduce 40% phosphorus entering Lake Erie by 2025.

Industry Partner(s): Greenland International Consulting

Academic Institution: University of Guelph

Academic Researcher: Prasad Daggupati

Platform: Cloud

Focus Areas: Cities, Clean Tech, Digital Media, Water

Advancing video categorization
Collaborators: Seneca College & Vubble Inc.
Digital Media

Advancing video categorization

Vubble is a media tech company that builds solutions for trustworthy digital video distribution and curation. Using a combination of algorithms and human curators, Vubble searches the internet to locate video content of interest to its users. Vubble is collaborating with Dr. Vida Movahedi from Seneca’s School of Information and Communication Technology to develop a machine-learning algorithm that will automatically output highly probable categories for videos. With this algorithm implemented into the Vubble workflow to assist in automated video identification, Vubble will be able to better address their existing, and emerging, customer demands, while increasing their productivity and competitiveness. This video identification research project will be Vubble’s first step in understanding how to automate the identification of accurate video. The need for automation of videos curation is prevalent, as video is quickly becoming the world’s dominant form of media consumption, particularly for digital native younger audiences. Furthermore, the results of the applied research will aid Vubble in moving forward in addressing what they believe is a looming problem facing all media consumers, and society, the rising of fake news video created from archival footage.

Industry Partner(s): Vubble Inc.

Academic Institution: Seneca College

Academic Researcher: Vida Movahedi

Platform: Cloud

Focus Areas: Digital Media

Agile real time radio signal processing
Collaborators: University of Toronto & Thoth Technology Inc.
Digital Media

Agile real time radio signal processing

Canadian VLBI capability has been missing for a decade. Jointly with Thoth Technology Inc we propose to restore domestic and international VLBI infrastructure that will be commercialized by Thoth Technology Inc. This project will implement and optimize multi-telescope correlation and analysis software on the SOSCIP BGQ, Agile and LMS platforms. The resulting pipeline package will allow commercial turnkey VLBI delivery by Thoth Technology Inc to domestic and international customers into a market of about $10 million/year

Industry Partner(s): Thoth Technology Inc.

Academic Institution: University of Toronto

Academic Researcher: Ue-Li Pen

Platform: Cloud, Parallel CPU

Focus Areas: Digital Media

An economics-aware autonomic management system for big data applications
Collaborators: York University & IBM Canada Inc.
Cities Digital Media

An economics-aware autonomic management system for big data applications

Recent advancements in software technology, including virtualization, microservices, and cloud computing, have created novel challenges and opportunities on developing and delivering software. Additionally, it has given rise to DevOps, a hybrid team responsible for both developing and managing the software system, and has led to the development of tools that take advantage of the enhanced flexibility and enable the automation of the software management cycle. In this new world characterized by volatility and speed, the Business Operations (BizOps) team is lagging behind and still remains disconnected from the DevOps team. BizOps views software as a product and is responsible for defining the business and economic strategy around it.

The goal of the proposed project is to imbue DevOps tools and processes with BizOps knowledge and metrics through formal models and methods. Currently, BizOps receives the software system or service as a finished product, a black box, on which a price has to be put and be offered to clients. The price and the marketing strategy are usually defined at the beginning of a sales cycle (e.g. a year) and remain the same for the entirety of the cycle. However, this is in contrast to the great volatility of the service itself. In most cases, the strategies are based on the instinct of managers with high acumen and experience and broad marketing surveys or one-to-one negotiations with clients, information that can easily change and may remain disconnected from the software development. The end product of this project is a set of economic and performance models to connect the DevOps and BizOps processes during the software’s life cycle and eventually incorporate them in automated tools to adapt and scale the system in production and enable continuous development, integration and delivery.

Industry Partner(s): IBM Canada Inc.

Academic Institution: York University

Academic Researcher: Marin Litoiu

Platform: Cloud

Focus Areas: Cities, Digital Media

An intelligent immersive content creation platform for the non-programmer for training, maintenance and assembly using AR and VR.
Collaborators: Bombardier Aerospace; OVA Inc & Ryerson University
5G/NextGen Networks Aerospace & Defence Business Analytics Digital Media Transportation

An intelligent immersive content creation platform for the non-programmer for training, maintenance and assembly using AR and VR.

OVA’s StellarX is an uncompromisingly good virtual and augmented reality software. Purpose-built for teams that want to collaborate and create in those new paradigms.

Industry Partner(s): Bombardier Aerospace , OVA Inc

Academic Institution: Ryerson University

Academic Researcher: Chung, Joon

Platform: GPU, Parallel CPU

Focus Areas: 5G/NextGen Networks, Aerospace & Defence, Business Analytics, Digital Media, Transportation

Analyzing geospatial patterns in the cloud: application to the mineral exploration and mining in Canada
Collaborators: Western University & Osisko Mining Corporation
Digital Media Mining

Analyzing geospatial patterns in the cloud: application to the mineral exploration and mining in Canada

Industry Partner(s): Osisko Mining Corporation

Academic Institution: Western University

Academic Researcher: Neil Banerjee

Co-PI Names: Leonardo Feltrin

Platform: Cloud

Focus Areas: Digital Media, Mining

Big data analysis and optimization of rural and community broadband wireless networks
Collaborators: University of Ottawa & EION Inc.
Cities Digital Media Energy

Big data analysis and optimization of rural and community broadband wireless networks

Rural broadband initiative is happening in a big wave across the world. Canada, being a diverse country has a specific Internet reachability problem due to population being sparse. It is economically not viable to bring fiber to each and every house in Canada. It is not economically viable to connect every household through satellites either. Broadband Internet over wireless networks is a good option where Internet is brought over fiber to a point of presence and moved to houses over wireless.

EION is actively working in Ontario and Newfoundland to make rural broadband a possibility. Wireless networking in rural areas in Canada is a challenge in itself due to weather, terrain and accessibility. Real-time constraints such as weather, water and foliage do alter the maximum capacity of the wireless pipe. In addition the usage pattern of the houses, especially real-time video that require fast response time, require adequate planning.

This is becoming very critical as almost 80% of the traffic seems to be video related due to popularity of applications such as Netflix, Youtube and Shomi.  Intelligence in wireless rural broadband networks are a necessity to bring good quality voice, video and data reliably. Optimization in system and network level using heuristics and artificial intelligence techniques based on big data analysis of video packets is paramount to enable smooth performing broadband rural networks.

In this project, we will be analyzing the big data of video packets in rural broadband networks in Ontario and Newfoundland and design optimized network design and architecture to bring reliable video services over constrained rural broadband wireless networks.

Industry Partner(s): EION Inc.

Academic Institution: University of Ottawa

Academic Researcher: Amiya Nayak

Co-PI Names: Octavia Dobre

Platform: Cloud

Focus Areas: Cities, Digital Media, Energy

Computational support for big data analytics, information extraction and visualization
Collaborators: York University & IBM Spectrum Computing
Cities Digital Media Energy Water

Computational support for big data analytics, information extraction and visualization

The Centre for Innovation in Visualization and Data Driven Design (CIVDDD), an Ontario ORF-RE project performs research for which SOSCIP resources are needed and they were awarded NSERC CRD funding with IBM Platform [Applications of IBM Platform Computing solutions for solving Data Analytics and 3D Scalable Video Cloud Transcoder Problems] beginning in July 2015. This project involves Big Data, Visualization and Transcoding and will train many HQP. We require access to equipment capable of running a multi-core cluster using IBM Symphony and Big Insights software with IBM Platform on data analytics, visualization and transcoding. Our objectives include:

IBM Platform:

  • Test the applicability of Platform Symphony to Data Analytics problems to produce demonstrations of Symphony on application domains (we started by exploring streaming traffic analysis datasets) and identify improvements to Symphony to gain IBM advantage in the marketplace.
  • Design and implement methods to greatly speed-up the search for high utility frequent itemsets in big data using Symphony in a parallel distributed environment.
  • Design algorithms to determine which are suitable in such an environment.
  • Identify commercialization venues in application domains.
  • Exploration of a Scalable Video Cloud Transcoder for Wireless Multicasts

Industry Partner(s): IBM Spectrum Computing

Academic Institution: York University

Academic Researcher: Aijun An

Co-PI Names: Amir Asif

Platform: Cloud

Focus Areas: Cities, Digital Media, Energy, Water

Detailed computational fluid dynamics modeling of UV-AOPs photoreactors for micropollutants oxidation in water and wastewater
Collaborators: Western University & Trojan Technologies
Advanced Manufacturing Clean Tech Digital Media Water

Detailed computational fluid dynamics modeling of UV-AOPs photoreactors for micropollutants oxidation in water and wastewater

Micropollutants such as bisphenol-A and N-nitrosodimethylamine pose a significant threat to aquatic life, animals, and humans beings due to their persistent and potentially carcinogenic nature. While most conventional water treatment methods cannot remove these contaminants, ultraviolet-driven (UV) advanced oxidation processes (AOPs) are effective in degrading micropollutants. As UV-AOPs require electrical energy to enable the treatment, energy costs present a barrier to the widespread adoption of this technology. In this project, we focus on the optimization of UV-AOPs-based reactors to enhance their degradation performance while reducing their energy consumption. In this respect, we will develop a detailed numerical model that integrates hydraulics, optics and chemistry to investigate UV-AOP photoreactors in a comprehensive manner.

The resulting information will then be utilized to design the next-generation of UV-AOP photoreactors commercialized by Trojan Technologies. The design space will be explored by high-performance computer simulations of full-scale photoreactors rather than simplified or scaled-down models. This will be accomplished by leveraging opensource software, artificial-intelligence optimization techniques and the second-to-none parallel-computing capabilities offered by Blue Gene/Q. Once the optimization of UVAOPs-based reactors is complete, the advanced modeling results generated using Blue Gene/Q will be utilized in the development of a simplified model for sizing purposes. This will be accomplished through combined use of metamodeling techniques and cloud computing. In brief, the concept is to simplify the detailed model developed earlier so that it can be simulated using hand-held mobile devices, which will allow the company’s sales personnel to market the optimized reactors. Consequently, it will allow the company to increase its competitiveness on global scale as well as to increase the rate of adoption of advanced water treatment technologies by water utilities and end-users.

Industry Partner(s): Trojan Technologies

Academic Institution: Western University

Academic Researcher: Anthony G. Straatman

Platform: Cloud, Parallel CPU

Focus Areas: Advanced Manufacturing, Clean Tech, Digital Media, Water

Detecting and Responding to Hostile Information Activities: unsupervised methods for measuring the quality of graph embeddings
Collaborators: Patagona Techologies & Ryerson University
Business Analytics Cybersecurity Digital Media

Detecting and Responding to Hostile Information Activities: unsupervised methods for measuring the quality of graph embeddings

The rise in online organized disinformation campaigns presents a significant challenge to Canadian national security. State and non-state hostile actors manipulate users on social media platforms to advance their interests. Patagona Technologies is a Toronto-based software development company started by two Ryerson alumni. The project with Ryerson University is a larger initiative with the Canadian Department of National Defense to address the challenges posed by online hostile actors by analyzing the structure and content of social networks.

Industry Partner(s): Patagona Techologies

Academic Institution: Ryerson University

Academic Researcher: Pralat, Pawel

Platform: Cloud

Focus Areas: Business Analytics, Cybersecurity, Digital Media

Developing Efficient Machine Learning Models for Price Bidding
Collaborators: Curate Mobile Ltd & Ryerson University
Business Analytics Digital Media

Developing Efficient Machine Learning Models for Price Bidding

Curate Mobile operates a demand site platform (DSP), which is an advertising platform responsible forbidding in real time ad placements from various publishers. This process is a blind auction, happeningover 50,000 times a second, and during this bidding process we have less then 100ms to determinewhich of our clients should bid for this ad placement, how much it might be worth to them, and whatprice we believe we can win this auction for. During this project, we will add machine learning modelsto our DSP to provide fast decisions in real time to maximize the return on ad spend of our clients’campaigns. The main goal for this project is to add a proof-of-concept machine learning model to CurateMobile’sDSP, with a pipeline that will continually update the models with new data as it is ingested. Wewill also design a validation module to monitor and validate the performance of the developed models.

Industry Partner(s): Curate Mobile Ltd

Academic Institution: Ryerson University

Academic Researcher: Kashef, Rasha

Platform: Cloud, GPU

Focus Areas: Business Analytics, Digital Media

Developing real-time hyper-resolution simulation capability for the HydroGeoSphere (HGS) integrated groundwater – surface water modelling platform
Collaborators: University of Waterloo & Aquanty Inc.
Digital Media Water

Developing real-time hyper-resolution simulation capability for the HydroGeoSphere (HGS) integrated groundwater – surface water modelling platform

Industry Partner(s): Aquanty Inc.

Academic Institution: University of Waterloo

Academic Researcher: Ed Sudicky

Co-PI Names: David Lapen

Platform: Cloud

Focus Areas: Digital Media, Water

Development of cardiac specific machine learning infrastructure
Collaborators: Analytics 4 Life
Digital Media Health

Development of cardiac specific machine learning infrastructure

Analytics for Life, Inc. (A4L) is an early stage medical device company that specializes in the development of technologies to analyze patient physiological signals in order to evaluate cardiac performance, status and risk. A4L’s core competencies include identifying and developing mathematical features from physiological signals and assembling these features into clinically informative formulae using machine learning techniques. A4L has used third party machine learning tools (open source and licensed products) for the formula generation aspect of the product development cycle.

Specifically, A4L has used these tools to demonstrate the feasibility of computing left ventricular ejection fraction, cardiac ischemic burden and other cardiac performance/status parameters for simple to collect, non-invasive physiological signals (surface voltage gradients, SPO2, Impedance etc.). As a result of this experience, A4L have learned the benefits and insufficiencies of these tools for A4L’s specific purposes. A4L plans to file with the U.S. Food and Drug Administration (FDA) an application for approval of a physiological signal collection device and will soon afterwards be seeking market clearance for products assessing cardiac health emanating from the machine learning process. A4L believes it can build a machine learning tool specifically tailored for cardiac evaluation based on experience with the tools used to date.

This A4L-specific machine learning paradigm will search only relevant mathematical spaces, cutting down on time and CPU power needed to iterate to solutions and will allow for an assessment of a much wider array of potential solutions. Furthermore, this A4L-specific machine learning paradigm will provide a controlled and validated system that can be audited and evaluated by regulatory bodies, something that is not possible with the current machine learning tool(s). A4L proposes a hybridization of paradigms within a set mathematical space. This will create efficiency in the search, and therefore more searches can be performed in the same period of time. This will lead to more solutions being available for evaluation, resulting in more accurate and efficiently produced end solutions. If successful, this new paradigm will allow for simple, non-invasive, rapid and relatively inexpensive cardiac diagnostic capabilities, bringing tertiary care diagnostics to primary care settings and disrupting the current infrastructure and capital cost-centric model of diagnostic delivery.

Industry Partner(s): Analytics 4 Life

Platform: Cloud, Parallel CPU

Focus Areas: Digital Media, Health

Distributed and scalable search in enterprise databases
Collaborators: University of Waterloo & IBM Canada Ltd.
Digital Media

Distributed and scalable search in enterprise databases

Google search, and other search engines such as Bing and Yahoo!, provide a convenient way to find Webpages that contain various keywords or are related to particular topics. For the purposes of searching, Webpages are essentially loosely structured paragraphs of text. However, much of the world’s high-quality enterprise data are structured into well defined tables containing sets of well-defined columns.

One consequence of structured database design is that information about a single entity may be scattered across many columns in many tables, and must be stitched together in a meaningful way when answering user queries. This turns out to be significantly more difficult than finding Webpages or text documents containing various keywords.

As Dr. Surajit Chadhuri (a Distinguished Scientist at Microsoft Research) recently argued in a keynote talk at the IEEE Data Engineering conference, search over structured databases has fallen behind search over unstructured data. In the proposed research, we will develop a powerful and intuitive search system, akin to Web keyword search, for structured enterprise data. Our system will empower nontechnical users to explore enterprise databases and turn big data into actionable insight, just as Google search has empowered society to explore the Web.

Industry Partner(s): IBM Canada Ltd.

Academic Institution: University of Waterloo

Academic Researcher: Lukasz Golab

Co-PI Names: Mehdi Kargar, Jaroslaw Szlichta

Platform: Cloud

Focus Areas: Digital Media

Distributed Deep Learning and Graph Analytics Using IBM Spectrum Computing Solutions
Collaborators: York University & IBM Canada Ltd.
Digital Media

Distributed Deep Learning and Graph Analytics Using IBM Spectrum Computing Solutions

Deep learning is a popular machine learning technique and has been applied to many real-world problems, ranging from computer vision to natural language processing. In most cases deep learning outperformed previous work. However, training a deep neural network is very time-consuming, especially on big data. A popular solution is to distribute and parallel the training process across multiple machines. Indeed, the race is on to parallelize deep learning! Industry and academic research teams around the world are trying to make deep neural networks train as fast as possible on farms of GPU capable servers. We are working with our IBM partners to help develop advanced scheduling and messaging techniques for distributed deep learning. In addition, we will develop two real-world applications of distributed deep learning to demonstrate the efficiency and effectiveness of distributed deep learning. In one application, we address the video surveillance problem of tracking a moving target over a network of video cameras with partial or no overlaps in their coverage. We will use a deep learning approach to identify multiple pedestrians in each video frame, and a particle filter to track moving pedestrians. In the second application, we address the problem of fraud/intrusion detection. We will use graph-based detection that considers relationships between objects or individuals. Graph-based approaches are powerful because they do not operate on objects or individuals in isolation, but also consider their network information. We will emphasize on graph-based fraud detection methods that have a number of applications and potentially large impacts.

Industry Partner(s): IBM Canada Ltd.

Academic Institution: York University

Academic Researcher: Aijun An

Co-PI Names: Amir Asif

Platform: Cloud, GPU

Focus Areas: Digital Media

Efficient deep learning for real-time traffic event detection
Collaborators: University of Waterloo & Miovision
Cities Digital Media

Efficient deep learning for real-time traffic event detection

Miovision is interested in designing the first affordable, low-power, energy efficient real time traffic event detection system that can be installed without the need to be powered by the grid nor the need to be connected directly to city installed infrastructure. Deep learning for traffic event detection can provide overwhelmingly superior accuracy and addresses most of the real-world scenarios that make competing detectors unsuitable for customer adoption. The challenge with deep learning is its complexity, which is currently infeasible for a self-powered real-world embedded detection system. Working with Dr. Alexander Wong and the Vision and Image Processing Lab at the University of Waterloo, the goal of this project is to develop technologies that can significantly reduce the complexity of deep learning for traffic event detection, while maintaining its accuracy and market fit, so that it can be deployed on a low-cost and low-powered hardware platform.

Industry Partner(s): Miovision

Academic Institution: University of Waterloo

Academic Researcher: Alex Wong

Platform: Cloud, GPU

Focus Areas: Cities, Digital Media

  • 1
  • 2
  • 3

Need more information?

SOSCIP Consortium
1 King's College Circle,

Toronto, ON, M5S 1A8

info@soscip.org

Follow Us

Subscribe to Platforms

By subscribing, you are consenting to receiving news, events and updates related to advanced computing in Ontario from SOSCIP.