Friday, March 20, 2020

Benchmark Datasets for Research

Research in data science and bioinformatics depends on datasets for prediction and analytics. Benchmark datasets are needed so that machine learning, deep learning and data mining algorithms can be implemented and evaluated.


Research scholars, academicians and practitioners can use the following URLs to work with bioinformatics datasets:


Coronavirus (COVID-19)
  • https://www.ncbi.nlm.nih.gov/genbank/2019-ncov-seqs/
  • https://www.worldometers.info/coronavirus/
  • https://data.humdata.org/dataset/novel-coronavirus-2019-ncov-cases
  • https://www.tableau.com/covid-19-coronavirus-data-resources
  • https://covidout.in/

Bacterial Data
  • https://www.ncbi.nlm.nih.gov/genome/?term=Klebsiella%20pneumoniae[Organism]&cmd=DetailsSearch

HIV
  • https://www.hiv.lanl.gov/content/sequence/HIV/USER_ALIGNMENTS/keele.html
  • https://www.rcsb.org/pages/help/advancedsearch/sequence

Viruses
  • https://www.ncbi.nlm.nih.gov/genome/viruses/variation/

Dengue
  • https://www.ncbi.nlm.nih.gov/nuccore/LC379197.1?report=fasta
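
Several of the links above serve sequences in FASTA format (a ">" header line followed by sequence lines). As a minimal sketch of how such a file can be read before any algorithm is applied, the following Python parses FASTA text and computes GC content. The function names and the sample record are hypothetical and made up for illustration; the sequence is not real data from any of the listed sources.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of {header: sequence}."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; drop the leading ">" from the header.
            header = line[1:]
            records[header] = []
        elif header is not None:
            # Sequence lines are collected and joined later.
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# Hypothetical sample record, not taken from GenBank.
sample = """>demo_seq hypothetical record
ATGCGC
GGAT
"""

records = parse_fasta(sample)
for name, seq in records.items():
    print(name, len(seq), gc_content(seq))
```

For real work, a maintained library such as Biopython is usually a better choice than a hand-written parser.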

Tuesday, January 7, 2020

AquaSim: An Effectual Simulation Platform for Underwater Acoustic Networks

Wireless networking and its cloud-based integration form a large domain. Several tools and frameworks are available for simulating wireless scenarios, including the following:
  • ns2
  • ns3
  • OPNET
  • OMNeT++
  • MATLAB
  • WILANTA
  • QualNet

Within wireless networks, underwater networks form a specialized domain in which communication in the underwater acoustic environment is analyzed using advanced, performance-aware algorithms.

AquaSim is a free and open source platform that researchers and practitioners use to work on the underwater environment, so that corporate, social and related dynamic applications can be programmed and real-time outcomes can be obtained.

Wednesday, January 1, 2020

Grafana: Open Source Platform for Data Visualization and Ad-Hoc Query Processing

Data scientists and researchers working on big data projects generally need visualization and plotting libraries. A number of such tools are available, and they integrate with data mining tools, data analysis frameworks and many others.

Grafana is a tool for plotting and visualizing data held in multiple database engines. It is used as an observability and monitoring platform for data in different formats.

Grafana is available at https://grafana.com and can be downloaded for different operating systems.


In addition to the downloadable version, it is available in the cloud with a free account.


You can log in to the cloud version of Grafana with an existing Google account and start using the data analytics features directly.


Additional plugins and modules are available for Grafana and can be installed for multiple applications and research domains.



Saturday, December 21, 2019

MQTT Protocol Implementation using Cloud Environment

Message Queuing Telemetry Transport (MQTT) is a lightweight publish/subscribe messaging protocol used in Internet of Things (IoT) and cloud-based environments.

Assorted cloud platforms and brokers are available for implementing MQTT under different configurations and parameters:
  • https://www.cloudmqtt.com/
  • https://myqtthub.com/en
  • http://www.dioty.co/
  • http://www.maqiatto.com/
  • https://www.hivemq.com/
  • http://mosquitto.org
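
MQTT is built around publishers, subscribers and a broker that routes messages by topic. The following is a minimal in-memory sketch of that idea, including the "+" (single-level) and "#" (multi-level) topic wildcards. It is not a real MQTT client and does not contact any of the services listed above; `topic_matches`, `MiniBroker` and the topic names are hypothetical, and details such as "$"-prefixed topics, QoS and filter validation are left out.

```python
def topic_matches(topic_filter, topic):
    """Return True if an MQTT-style topic filter matches a topic name."""
    filter_parts = topic_filter.split("/")
    topic_parts = topic.split("/")
    for i, part in enumerate(filter_parts):
        if part == "#":
            # Multi-level wildcard: matches this level and everything below.
            return True
        if i >= len(topic_parts):
            return False
        if part != "+" and part != topic_parts[i]:
            # "+" matches exactly one level; anything else must be equal.
            return False
    return len(filter_parts) == len(topic_parts)


class MiniBroker:
    """Hypothetical in-memory stand-in for a broker (illustration only)."""

    def __init__(self):
        self.subscriptions = []

    def subscribe(self, topic_filter, callback):
        self.subscriptions.append((topic_filter, callback))

    def publish(self, topic, payload):
        # Deliver the message to every subscriber whose filter matches.
        for topic_filter, callback in self.subscriptions:
            if topic_matches(topic_filter, topic):
                callback(topic, payload)


received = []
broker = MiniBroker()
broker.subscribe("home/+/temperature", lambda t, p: received.append((t, p)))
broker.publish("home/kitchen/temperature", "22.5")
broker.publish("home/kitchen/humidity", "40")
print(received)
```

With a hosted broker, the same subscribe and publish steps would go through an MQTT client library instead of this in-memory class.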

Thursday, November 21, 2019

Using IFTTT for IoT Implementation

If This Then That (IFTTT), https://ifttt.com, is one of the most prominent platforms for implementing IoT-based environments. It can be integrated with smart devices and wireless applications so that real-time communication can take place.
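
The "if this then that" idea can be sketched as a trigger-action rule. The Python below is only an illustration of the pattern: the `Applet` class, the event fields and the fan example are hypothetical and do not use or represent the IFTTT service's own API.

```python
class Applet:
    """Hypothetical trigger-action rule: if trigger(event) then action(event)."""

    def __init__(self, trigger, action):
        self.trigger = trigger
        self.action = action

    def handle(self, event):
        # Run the action only when the trigger condition holds.
        if self.trigger(event):
            return self.action(event)
        return None


# Made-up rule: if a temperature reading exceeds 30, switch the fan on.
fan_rule = Applet(
    trigger=lambda e: e["sensor"] == "temperature" and e["value"] > 30,
    action=lambda e: f"turn fan on in {e['room']}",
)

hot = fan_rule.handle({"sensor": "temperature", "value": 34, "room": "lab"})
mild = fan_rule.handle({"sensor": "temperature", "value": 22, "room": "lab"})
print(hot, mild)
```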


Tuesday, November 5, 2019

Random Forest Approach


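  • Many decision trees are built, each from a different random combination of the data and features
  • The final output is the one that receives the most votes across the trees
  • Combining many trees reduces the effect of noise in the data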

DOCTORS / MEDICAL PRACTITIONERS
After prescribing medicine, a doctor usually asks the patient to come back after 2-3 days. In this way the doctor tries different combinations by trial and error, which is analogous to the Random Forest approach.

Medicine-1:        X    Y    Z    Come after 2 days
Medicine-2:        X    W    Z    Come after 2 days
Patient OK
Another Patient:   X    W    Z
Patient OK

Finally, the doctor counts which medicines occur most often across the combinations (here X and Z), just as a random forest accepts the output that receives the most votes.
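
The idea of many trees plus voting can be sketched in plain Python. This is a simplified illustration under stated assumptions, not a full Random Forest implementation: each "tree" is a one-split decision stump, the data points are made up, and all function names are hypothetical. In practice a library such as scikit-learn would normally be used.

```python
import random
from collections import Counter


def majority(labels):
    """Return the most frequent label in a non-empty list."""
    return Counter(labels).most_common(1)[0][0]


def train_stump(rows, feature):
    """Fit a one-split decision tree (a stump) on a single feature."""
    values = sorted({x[feature] for x, _ in rows})
    # Candidate thresholds are midpoints between neighbouring values.
    thresholds = [(a + b) / 2 for a, b in zip(values, values[1:])]
    fallback = majority([label for _, label in rows])
    best = (len(rows) + 1, values[0], fallback, fallback)
    for t in thresholds:
        left = [label for x, label in rows if x[feature] <= t]
        right = [label for x, label in rows if x[feature] > t]
        left_label, right_label = majority(left), majority(right)
        errors = sum(lab != left_label for lab in left)
        errors += sum(lab != right_label for lab in right)
        if errors < best[0]:
            best = (errors, t, left_label, right_label)
    _, t, left_label, right_label = best
    return feature, t, left_label, right_label


def predict_stump(stump, x):
    feature, t, left_label, right_label = stump
    return left_label if x[feature] <= t else right_label


def train_forest(rows, n_trees=25, seed=0):
    """Train stumps on bootstrap samples, each using one random feature."""
    rng = random.Random(seed)
    n_features = len(rows[0][0])
    forest = []
    for _ in range(n_trees):
        sample = [rng.choice(rows) for _ in rows]  # sampling with replacement
        feature = rng.randrange(n_features)
        forest.append(train_stump(sample, feature))
    return forest


def predict_forest(forest, x):
    """Every tree votes; the label with the most votes is accepted."""
    return majority([predict_stump(stump, x) for stump in forest])


# Made-up, well-separated toy data: (features, label).
data = [
    ((1.0, 1.0), "A"), ((1.5, 2.0), "A"), ((2.0, 1.5), "A"),
    ((8.0, 8.0), "B"), ((8.5, 9.0), "B"), ((9.0, 8.5), "B"),
]
forest = train_forest(data)
print(predict_forest(forest, (1.2, 1.2)), predict_forest(forest, (8.8, 8.8)))
```

An individual stump trained on an unlucky sample can be wrong, but the vote across all stumps smooths such cases out, which is the noise-reduction effect described above.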



Monday, November 4, 2019

Using Tabula for Extraction of Tables from PDF Files

Data tables or comparison tables in PDF files can be extracted using Tabula. It is free and open source software available at http://tabula.technology/

Data tables in text-based PDF files can be extracted with Tabula; scanned, image-only PDFs are not supported.

https://youtu.be/NvcTIZ2Je50

https://youtu.be/tJNUx1ecDqk