{mlbench}
A bit of gymnastics in RStudio, R Markdown and knitr.
This example also lives in an open repository on GitHub: https://github.com/Eclectikus/playingsonar
knitr quick reference: knitr in a knutshell - Author’s site
mlbench: Machine Learning Benchmark Problems (CRAN). A collection of artificial and real-world machine learning benchmark problems, including several data sets from the UCI repository.
Usage (remember to install the package first: install.packages("mlbench")):
library('mlbench')
data("Sonar")
Format:
A data frame with 208 observations on 61 variables: 60 numerical and one nominal (the Class).
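These claims are easy to verify interactively; a minimal sketch (the class counts, 111 mines and 97 rocks, come from the dataset documentation and reference [1] below):

```r
library(mlbench)
data("Sonar")

dim(Sonar)                      # 208 observations, 61 variables
table(Sonar$Class)              # M: 111 mine returns, R: 97 rock returns
range(as.matrix(Sonar[, 1:60])) # the 60 signal values lie in [0, 1]
```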
Below, a couple of samples of this data frame:
library(knitr) # for kable()
b <- sample(1:208, size = 9, replace = FALSE)
kable(head(Sonar[b, 1:10]), format = "markdown") # head() shows the first six of the nine sampled rows
|     | V1     | V2     | V3     | V4     | V5     | V6     | V7     | V8     | V9     | V10    |
|-----|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|
| 78  | 0.0336 | 0.0294 | 0.0476 | 0.0539 | 0.0794 | 0.0804 | 0.1136 | 0.1228 | 0.1235 | 0.0842 |
| 175 | 0.0191 | 0.0173 | 0.0291 | 0.0301 | 0.0463 | 0.0690 | 0.0576 | 0.1103 | 0.2423 | 0.3134 |
| 155 | 0.0117 | 0.0069 | 0.0279 | 0.0583 | 0.0915 | 0.1267 | 0.1577 | 0.1927 | 0.2361 | 0.2169 |
| 162 | 0.0305 | 0.0363 | 0.0214 | 0.0227 | 0.0456 | 0.0665 | 0.0939 | 0.0972 | 0.2535 | 0.3127 |
| 12  | 0.0123 | 0.0309 | 0.0169 | 0.0313 | 0.0358 | 0.0102 | 0.0182 | 0.0579 | 0.1122 | 0.0835 |
| 98  | 0.0491 | 0.0279 | 0.0592 | 0.1270 | 0.1772 | 0.1908 | 0.2217 | 0.0768 | 0.1246 | 0.2028 |
kable(head(Sonar[b, 52:61]), format = "markdown")
|     | V52    | V53    | V54    | V55    | V56    | V57    | V58    | V59    | V60    | Class |
|-----|--------|--------|--------|--------|--------|--------|--------|--------|--------|-------|
| 78  | 0.0150 | 0.0111 | 0.0032 | 0.0035 | 0.0169 | 0.0137 | 0.0015 | 0.0069 | 0.0051 | R     |
| 175 | 0.0040 | 0.0136 | 0.0137 | 0.0172 | 0.0132 | 0.0110 | 0.0122 | 0.0114 | 0.0068 | M     |
| 155 | 0.0053 | 0.0029 | 0.0020 | 0.0013 | 0.0029 | 0.0020 | 0.0062 | 0.0026 | 0.0052 | M     |
| 162 | 0.0200 | 0.0070 | 0.0070 | 0.0086 | 0.0089 | 0.0074 | 0.0042 | 0.0055 | 0.0021 | M     |
| 12  | 0.0133 | 0.0265 | 0.0224 | 0.0074 | 0.0118 | 0.0026 | 0.0092 | 0.0009 | 0.0044 | R     |
| 98  | 0.0081 | 0.0129 | 0.0161 | 0.0063 | 0.0119 | 0.0194 | 0.0140 | 0.0332 | 0.0439 | M     |
This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network to discriminate between sonar signals bounced off a metal cylinder and those bounced off a roughly cylindrical rock.
Each pattern is a set of 60 numbers in the range 0.0 to 1.0. Each number represents the energy within a particular frequency band, integrated over a certain period of time. The integration aperture for higher frequencies occurs later in time, since these frequencies are transmitted later during the chirp.
The label associated with each record contains the letter “R” if the object is a rock and “M” if it is a mine (metal cylinder). The numbers in the labels are in increasing order of aspect angle, but they do not encode the angle directly.
[1] Gorman, R. P., and Sejnowski, T. J. (1988). “Analysis of Hidden Units in a Layered Network Trained to Classify Sonar Targets.” Neural Networks, Vol. 1, pp. 75-89. About a year later, the same authors published “Learned Classification of Sonar Targets Using a Massively Parallel Network.”
From reference [1]:
The data used for the network experiments were sonar returns collected from a metal cylinder and a cylindrically shaped rock positioned on a sandy ocean floor. Both targets were approximately 5 ft in length and the impinging pulse was a wide-band linear FM chirp (ka = 55.6). Returns were collected at a range of 10 meters and obtained from the cylinder at aspect angles spanning 90° and from the rock at aspect angles spanning 180°.
A set of 208 returns (111 cylinder returns and 97 rock returns) were selected from a total set of 1200 returns on the basis of the strength of the specular return (4.0 to 15.0 dB signal-to-noise ratio). An average of 5 returns were selected from each aspect angle. Figure 2 shows a sample return from the rock and the cylinder. The preprocessing of the raw signal was based on experiments with human listeners (Gorman & Sawatari, 1987). The temporal signal was first filtered and spectral information was extracted and used to represent the signal on the input layer.
The preprocessing used to obtain the spectral envelope is indicated schematically in Figure 3 where a set of sampling apertures (Figure 3a) are superimposed over the 2D display of a short-term Fourier Transform spectrogram of the sonar return. As shown in Figure 3b and c, the spectral envelope was obtained by integrating over each aperture. The spectral envelope was composed of 60 spectral samples, normalized to take on values between 0.0 and 1.0. (See Gorman & Sejnowski, 1987 for a detailed treatment of the preprocessing).
After loading libraries and data,
## Libraries&data
library(mlbench)
library(ggplot2)
library(ggthemes)
library(caret)
library(knitr)
data("Sonar")
and creating training and testing sets,
## Train&test
set.seed(107)
inTrain <- createDataPartition(y = Sonar$Class, p = .75, list = FALSE)
training <- Sonar[ inTrain,]
testing <- Sonar[-inTrain,]
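Because createDataPartition samples within each class, the M/R balance of the full data carries over to both splits. A quick sanity check (repeating the split above so the chunk stands on its own):

```r
library(caret)
library(mlbench)
data("Sonar")

# same split as above
set.seed(107)
inTrain  <- createDataPartition(y = Sonar$Class, p = .75, list = FALSE)
training <- Sonar[ inTrain, ]
testing  <- Sonar[-inTrain, ]

c(train = nrow(training), test = nrow(testing)) # 157 and 51 rows
round(prop.table(table(training$Class)), 2)     # close to the full-data M/R ratio
round(prop.table(table(testing$Class)), 2)
```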
I made a quick first pass at the data by running a few graphical routines on the training set defined above.
for (i in names(training)[1:60]) { # loop over the 60 spectral columns
a <- ggplot(data = training, aes(x = .data[[i]], y = Class, col = Class))
a <- a + geom_point(size=3)
a <- a + xlab("Signal") + ylab("Mine or Rock")
a <- a + ggtitle("Signal by spectral sample and outcome (mine or rock)")
a <- a + theme_solarized(light = FALSE)+scale_colour_solarized("blue")
a <- a + xlim(0,1)
a <- a + annotate("text", x = 0.50, y = 1.2, label = "Mine", colour = "white")
a <- a + annotate("text", x = 0.50, y = 2.2, label = "Rock", colour = "white")
a <- a + annotate("text", x = 0.94, y = 2.4, label = i, colour = "yellow", size = 7.5)
bold.white.text <- element_text(face = "bold", color = "white")
a <- a + theme(axis.title = bold.white.text, legend.position = "none")
## plot to file
WD0 <- "C:/MyDATA/Routput/SONAR1" # (use your local disk)
filename <- paste0(i, ".png")
ggsave(filename, plot = a, path = WD0, width = 16, height = 9, dpi = 120)
print(a)
}
And also,
for (j in 1:nrow(training)) { # loop over the 157 training observations
Xvalues <- 1:60 # spectral sample index
Yvalues <- as.numeric(training[j, 1:60]) # row 'j' transposed to a numeric vector
Signal <- data.frame(Sample = Xvalues, Energy = Yvalues) # two-column data for observation 'j'
a <- ggplot(data = Signal, aes(x = Sample, y = Energy))
a <- a + geom_line(lty = 1, color = "gray", size = 0.3, na.rm = TRUE)
a <- a + xlab("Spectral sample") + ylab("Normalized energy")
a <- a + ggtitle("Signal by spectral sample")
a <- a + theme_solarized(light = FALSE) + scale_colour_solarized("blue")
# a <- a + theme_base() + scale_colour_few()
a <- a + xlim (0,60) + ylim(0,1)
## outcomes by color
## outcome by color
if (training[j, 61] == "M") {
ColorCode <- "blue"
outcome <- "Mine"
} else {
ColorCode <- "orange"
outcome <- "Rock"
}
a <- a + annotate("text", x = 5, y = 0.92, label = outcome , colour = ColorCode, size = 5.4)
a <- a + annotate("text", x = 55, y = 0.92, label = paste("Return", j),
colour = "gray", size = 4.5)
bold.white.text <- element_text(face = "bold", color = "white")
a <- a + theme(axis.title = bold.white.text, legend.position = "none")
a <- a + geom_point(size = 1.5, color = ColorCode)
a <- a + geom_smooth(fill = ColorCode, color = ColorCode, size = 0.75,
alpha = 0.30, na.rm = TRUE)
## plot to file
WD0 <- "C:/MyDATA/Routput/SONAR2" # (use your local disk)
filename <- paste0("S", j, ".png")
ggsave(filename, plot = a, path = WD0, width = 16, height = 9, dpi = 120)
print(a)
}
I first tried to build the above animations directly from R using the animation package (by the same author as knitr), but got nothing; I surely need more time. Its GitHub repository and the package site are certainly the places to start. After that I tried GIMP, but the poor quality made me desist. So in the end I happily used ezgif.com.