Saturday, April 18, 2020

Conversion of Own Collected Dataset to Benchmark Dataset

Researchers in data science, machine learning, deep learning and related fields work with many types of datasets. Results are traditionally validated on a benchmark dataset so that they can be compared with other work and accepted by the community.

Often, researchers collect their own datasets and use them to implement and evaluate an algorithm.

To convert self-collected data into a benchmark dataset, the dataset should have the following properties:
  • The dataset should be focused on a specific type of machine learning task
  • The dataset should be open, without any restrictions on download by other researchers
  • The dataset should have sufficient features so that training, testing and validation can be done
  • The dataset should be accessible to other researchers and practitioners so that they can validate the outcomes
  • The dataset should have labels that identify its attributes
  • The dataset should be free from mismatches and missing values
  • The dataset should not be excessively large in size
  • The dataset should be properly documented, including the details of its attributes
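
Some of the properties above (labels present, no missing values, manageable size) can be checked automatically before a dataset is published. The following is a minimal sketch in Python using only the standard library. The function names `load_csv` and `check_dataset`, the `label` column name, the sample data and the `max_rows` limit are all assumptions made for illustration and are not part of any standard.

```python
import csv
import io


def load_csv(text):
    """Parse CSV text into a list of dictionaries, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def check_dataset(rows, label_column, max_rows=100_000):
    """Return a list of human-readable problems found in the dataset.

    label_column and max_rows are illustrative assumptions; adjust them
    to the task and to the size that is practical to share.
    """
    if not rows:
        return ["dataset is empty"]

    issues = []

    # A benchmark dataset needs a label column.
    if label_column not in rows[0]:
        issues.append(f"label column '{label_column}' not found")

    # Report every row that has empty or absent values.
    for index, row in enumerate(rows):
        missing = [
            name
            for name, value in row.items()
            if value is None or str(value).strip() == ""
        ]
        if missing:
            issues.append(f"row {index}: missing values in {missing}")

    # Very large datasets are hard for others to download and reuse.
    if len(rows) > max_rows:
        issues.append(f"dataset has {len(rows)} rows, more than {max_rows}")

    return issues


# Hypothetical sample with one missing value in the second data row.
sample = "age,income,label\n34,52000,yes\n29,,no\n"
for problem in check_dataset(load_csv(sample), label_column="label"):
    print(problem)
```

An empty result means none of these particular checks failed; openness, documentation and task focus still have to be reviewed by hand.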

Thursday, April 9, 2020

Work From Home (WFH) in Software Testing and Software Quality Assurance

With the impact of the coronavirus (COVID-19), lockdowns are in place in many locations. Software testing is one of the key domains in which Work From Home (WFH) is possible. In this segment, software applications, including mobile apps, are tested remotely, and companies pay remote testers well.

The following portals offer such work, with payments made in dollars:

1. UserTesting

2. Respondent

3. Userbrain

4. TestingTime

5. TryMyUI

6. Userfeel

7. Userlytics

8. IntelliZoom

9. Validately

10. uTest

11. Loop11

12. Enroll

13. Test IO

14. Intuit User Research

15. Checkealos

16. PingPong

17. Ferpection

18. UserCrowd

Friday, April 3, 2020

Creative Commons License: Open Source License for Authors and Content Creators

When open sourcing a project, a license should be chosen so that the permissions for distribution, linking, modification, private use, etc. are clearly assigned.

Open source licenses help others to contribute to your work or project without seeking special individual permission to do so
  • Not Fully Open Source
  • MIT Open Courseware Material is released under CC

Four Parts of CC
  1. Attribution (BY): The author must be credited as the creator of the work. The work can then be modified, distributed, copied and used.
  2. Share Alike (SA): The work can be modified and distributed, but only under the same CC license.
  3. Non-Commercial (NC): The work can be modified and distributed, but not for commercial purposes [as MIT OpenCourseWare material is].
  4. No Derivative Works (ND): The licensed work can be copied and distributed, but it cannot be modified in any way, and no work based on the original can be created.

Attribution CC BY
This license lets others distribute, remix, adapt, and build upon your work, even commercially, as long as they credit you for the original creation. This is the most accommodating of licenses offered. Recommended for maximum dissemination and use of licensed materials.

Attribution-ShareAlike CC BY-SA
This license lets others remix, adapt, and build upon your work even for commercial purposes, as long as they credit you and license their new creations under the identical terms. This license is often compared to “copyleft” free and open source software licenses. All new works based on yours will carry the same license, so any derivatives will also allow commercial use. This is the license used by Wikipedia, and is recommended for materials that would benefit from incorporating content from Wikipedia and similarly licensed projects.

Attribution-NoDerivs CC BY-ND
This license lets others reuse the work for any purpose, including commercially; however, it cannot be shared with others in adapted form, and credit must be provided to you.

Attribution-NonCommercial CC BY-NC
This license lets others remix, adapt, and build upon your work non-commercially, and although their new works must also acknowledge you and be non-commercial, they don’t have to license their derivative works on the same terms.

Attribution-NonCommercial-ShareAlike CC BY-NC-SA
This license lets others remix, adapt, and build upon your work non-commercially, as long as they credit you and license their new creations under the identical terms.

Attribution-NonCommercial-NoDerivs CC BY-NC-ND
This license is the most restrictive of the six main licenses, only allowing others to download your works and share them with others as long as they credit you, but they can’t change them in any way or use them commercially.
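
The six licenses are combinations of the four parts listed earlier, so the permissions of each one can be derived from its code. The following Python sketch illustrates that idea. The function name `license_permissions` and the keys of the returned dictionary are hypothetical names chosen for this example, and the output is a simplified summary, not legal advice.

```python
def license_permissions(code):
    """Summarize a CC license code such as "CC BY-NC-SA".

    The dictionary keys are illustrative names, not official terms.
    """
    name = code.strip().upper()
    if name.startswith("CC "):
        name = name[3:]
    elements = set(name.split("-"))

    unknown = elements - {"BY", "SA", "NC", "ND"}
    if unknown:
        raise ValueError(f"unknown license elements: {sorted(unknown)}")
    if "BY" not in elements:
        raise ValueError("all six main licenses require attribution (BY)")
    if "SA" in elements and "ND" in elements:
        # Share Alike governs adaptations, which No Derivatives forbids sharing.
        raise ValueError("SA and ND cannot be combined")

    return {
        "attribution_required": True,
        "commercial_use_allowed": "NC" not in elements,
        "adaptations_can_be_shared": "ND" not in elements,
        "same_license_required": "SA" in elements,
    }


for code in ["CC BY", "CC BY-SA", "CC BY-ND", "CC BY-NC", "CC BY-NC-SA", "CC BY-NC-ND"]:
    print(code, license_permissions(code))
```

Reading the output from top to bottom shows the progression from the most accommodating license (CC BY) to the most restrictive one (CC BY-NC-ND).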


Friday, March 20, 2020

Benchmark Datasets for Research

Research work in data science and bioinformatics needs datasets so that effective predictions and analytics can be performed. These datasets are required to implement and evaluate suitable machine learning, deep learning and data mining algorithms.

The following URLs can be used by research scholars, academicians and practitioners to work on bioinformatics datasets:

CoronaVirus (COVID-19)

Bacteria Viral Data





Tuesday, January 7, 2020

AquaSim: An Effectual Simulation Platform for Underwater Acoustic Networks

The domain of wireless networks and cloud-based integration is vast. A number of tools and frameworks are available for implementing wireless scenarios, including the following:
  • ns2
  • ns3
  • QualNet

Within wireless networks, underwater networks form a specialized domain in which the underwater acoustic environment is analyzed using advanced, performance-aware algorithms.

AquaSim is a free and open source platform used by researchers and practitioners to work on the underwater environment, so that corporate, social and related dynamic applications can be programmed and real-time outcomes can be obtained.

Wednesday, January 1, 2020

Grafana: Open Source Platform for Data Visualization and Ad-Hoc Query Processing

Data scientists and researchers working on big data projects generally need visualization and plotting libraries. A number of tools and technologies are available for this purpose, and they integrate with data mining tools, data analysis frameworks and many others.

Grafana is a tool for plotting and visualizing data from multiple database engines. It is used as an observability and monitoring platform for data in different formats.

Grafana can be downloaded for different operating systems.

In addition to the downloadable version, it is available in the cloud with a free account.

An existing Google account can be used to log in to the cloud version of Grafana and start using its data analytics features directly.

Additional plugins and modules are available for Grafana, and they can be installed for multiple applications and domains of research.

Saturday, December 21, 2019

MQTT Protocol Implementation using Cloud Environment

Message Queuing Telemetry Transport (MQTT) is a lightweight, high-performance publish/subscribe messaging protocol for the Internet of Things (IoT) and cloud-based environments.

Assorted cloud platforms are available for implementing MQTT across a wide range of scenarios and configurations.
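
To give a feel for how MQTT routes messages, the following Python sketch imitates the publish/subscribe model in memory. It does not connect to any cloud platform or real broker. The class `InMemoryBroker`, the function `topic_matches` and the topic names are hypothetical and exist only for this illustration. The matching follows the usual MQTT wildcard rules (`+` for exactly one topic level, `#` for all remaining levels) in simplified form and ignores details such as `$`-prefixed system topics.

```python
from collections import defaultdict


def topic_matches(topic_filter, topic):
    """Return True if an MQTT-style topic filter matches a topic name."""
    filter_levels = topic_filter.split("/")
    topic_levels = topic.split("/")

    for index, level in enumerate(filter_levels):
        if level == "#":
            # The multi-level wildcard is only valid as the last level.
            return index == len(filter_levels) - 1
        if index >= len(topic_levels):
            return False
        if level != "+" and level != topic_levels[index]:
            return False

    return len(filter_levels) == len(topic_levels)


class InMemoryBroker:
    """Toy stand-in for an MQTT broker; everything stays in one process."""

    def __init__(self):
        self._subscriptions = defaultdict(list)

    def subscribe(self, topic_filter, callback):
        self._subscriptions[topic_filter].append(callback)

    def publish(self, topic, payload):
        """Deliver payload to every matching subscriber; return the count."""
        delivered = 0
        for topic_filter, callbacks in self._subscriptions.items():
            if topic_matches(topic_filter, topic):
                for callback in callbacks:
                    callback(topic, payload)
                    delivered += 1
        return delivered


broker = InMemoryBroker()
broker.subscribe("sensors/+/temperature", lambda t, p: print("temperature:", t, p))
broker.subscribe("sensors/#", lambda t, p: print("any sensor:", t, p))
broker.publish("sensors/room1/temperature", "21.5")
broker.publish("sensors/room1/humidity", "40")
```

With a real cloud-hosted broker, a client library would replace `InMemoryBroker`, but the subscribe-then-publish flow and the topic filters work the same way.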