All metallo-proteins need post-translation metal incorporation. In fact, the isotope ratio of Fe, Cu, and Zn in physiology and oncology have emerged as an important tool. The nickel containing F430 is the prosthetic group of the enzyme methyl coenzyme M reductase which catalyzes the release of methane in the final step of methano-genesis, a prime energy metabolism candidate for life exploration space mission in the solar system. The 3.5 Gyr early life sulfite reductase as a life switch energy metabolism had Fe-Mo clusters. The nitrogenase for nitrogen fixation 3 billion years ago had Mo. The early life arsenite oxidase needed for anoxygenic photosynthesis energy metabolism 2.8 billion years ago had Mo and Fe. The selection pressure in metal incorporation inside a protein would be quantifiable in terms of the related nucleotide sequence complexity with fractal dimension and entropy values. Simulation model showed that the studied metal-required energy metabolism sequences had at least ten times more selection pressure relatively in comparison to the horizontal transferred sequences in Mealybug, guided by the outcome histogram of the correlation R-sq values. The metal energy metabolism sequence group was compared to the circadian clock KaiC sequence group using magnesium atomic level bond shifting mechanism in the protein, and the simulation model would suggest a much higher selection pressure for the energy life switch sequence group. The possibility of using Kepler 444 as an example of ancient life in Galaxy with the associated exoplanets has been proposed and is further discussed in this report. Examples of arsenic metal bonding shift probed by Synchrotron-based X-ray spectroscopy data and Zn controlled FOXP2 regulated pathways in human and chimp brain studied tissue samples are studied in relationship to the sequence bioinformatics. The analysis results suggest that relatively large metal bonding shift amount is associated with low probability correlation R-sq outcome in the bioinformatics simulation.
The sulfate-reducing bacteria can be traced back to 3.5 billion years ago. The thermodynamics details of the sulfur cycle have been well documented. A recent sulfate-reducing bacteria report (Robator, Jungbluth, et al , 2015 Jan, Front. Microbiol) with Genbank nucleotide data has been analyzed in terms of the sulfite reductase (dsrAB) via fractal dimension and entropy values. Comparison to oil field sulfate-reducing sequences was included. The AUCG translational mass fractal dimension versus ATCG transcriptional mass fractal dimension for the low temperature dsrB and dsrA sequences reported in Reference Thirteen shows correlation R-sq ~ 0.79 , with a probably of about 3% in simulation. A recent report of using Cystathionine gamma-lyase sequence to produce CdS quantum dot in a biological method, where the sulfur is reduced just like in the H2S production process, was included for comparison. The AUCG mass fractal dimension versus ATCG mass fractal dimension for the Cystathionine gamma-lyase sequences was found to have R-sq of 0.72, similar to the low temperature dissimilatory sulfite reductase dsr group with 3% probability, in contrary to the oil field group having R-sq ~ 0.94, a high probable outcome in the simulation. The other two simulation histograms, namely, fractal dimension versus entropy R-sq outcome values, and di-nucleotide entropy versus mono-nucleotide entropy R-sq outcome values are also discussed in the data analysis focusing on low probability outcomes.
Horizontal gene transfer has been a major vehicle for efficient transfer of genetic materials among living species and could be one of the sources for noncoding DNA incorporation into a genome. Our previous study of lnc- RNA sequence complexity in terms of fractal dimension and information entropy shows a tight regulation among the studied genes in numerous diseases. The role of sequence complexity in horizontal transferred genes was investigated with Mealybug in symbiotic relation with a 139K genome microbe and Deinococcus radiodurans as examples. The fractal dimension and entropy showed correlation R-sq of 0.82 (N = 6) for the studied Deinococcus radiodurans sequences. For comparison the Deinococcus radiodurans oxidative stress tolerant catalase and superoxide dismutase genes under extracellular dGMP growth condition showed R-sq ~ 0.42 (N = 6); and the studied arsenate reductase horizontal transferred genes for toxicity survival in several microorganisms showed no correlation. Simulation results showed that R-sq < 0.4 would be improbable at less than one percent chance, suggestive of additional selection pressure when compared to the R-sq ~ 0.29 (N = 21) in the studied transferred genes in Mealybug. The mild correlation of R-sq ~ 0.5 for fractal dimension versus transcription level in the studied Deinococcus radiodurans sequences upon extracellular dGMP growth condition would suggest that lower fractal dimension with less electron density fluctuation favors higher transcription level.
The evolutionary rate co-variation in meiotic proteins has been reported for yeast and mammal using phylogenic branch lengths which assess retention, duplication and mutation. The bioinformatics of the corresponding DNA sequences could be classified as a diagram of fractal dimension and Shannon entropy. Results from biomedical gene research provide examples on the diagram methodology. The identification of adaptive selection using entropy marker and functional-structural diversity using fractal dimension would support a regression analysis where the coefficient of determination would serve as evolutionary pathway marker for DNA sequences and be an important component in the astrobiology community. Comparisons between biomedical genes such as EEF2 (elongation factor 2 human, mouse, etc), WDR85 in epigenetics, HAR1 in human specificity, clinical trial targeted cancer gene CD47, SIRT6 in spermatogenesis, and HLA-C in mosquito bite immunology demonstrate the diagram classification methodology. Comparisons to the SEPT4-XIAP pair in stem cell apoptosis, testesexpressed taste genes TAS1R3-GNAT3 pair, and amyloid beta APLP1-APLP2 pair with the yeast-mammal DNA sequences for meiotic proteins RAD50-MRE11 pair and NCAPD2-ICK pair have accounted for the observed fluctuating evolutionary pressure systematically. Regression with high R-sq values or a triangular-like cluster pattern for concordant pairs in co-variation among the studied species could serve as evidences for the possible location of common ancestors in the entropy-fractal dimension diagram, consistent with an example of the human-chimp common ancestor study using the FOXP2 regulated genes reported in human fetal brain study. The Deinococcus radiodurans R1 Rad-A could be viewed as an outlier in the RAD50 diagram and also in the free energy versus fractal dimension regression Cook’s distance, consistent with a non-Earth source for this radiation resistant bacterium. Convergent and divergent fluctuating evolutionary pressure could be studied with extension to genetic sequences in organisms in possible astrobiology conditions, with the assumption that the continuation of a book of life would require meiotic proteins everywhere in the universe.
A paleo-experimental evolution report on elongation factor EF-Tu structural stability results has provided an opportunity to rewind the tape of life using the ancestral protein sequence reconstruction modeling approach; consistent with the book of life dogma in current biology and being an important component in the astrobiology community. Fractal dimension via the Higuchi fractal method and Shannon entropy of the DNA sequence classification could be used in a diagram that serves as a simple summary. Results from biomedical gene research provide examples on the diagram methodology. Comparisons between biomedical genes such as EEF2 (elongation factor 2 human, mouse, etc), WDR85 in epigenetics, HAR1 in human specificity, DLG1 in cognitive skill, and HLA-C in mosquito bite immunology with EF Tu DNA sequences have accounted for the reported circular dichroism thermo-stability data systematically; the results also infer a relatively less volatility geologic time period from 2 to 3 Gyr from adaptation viewpoint. Comparison to Thermotoga maritima MSB8 and Psychrobacter shows that Thermus thermophilus HB8 EF-Tu calibration sequence could be an outlier, consistent with free energy calculation by NUPACK. Diagram methodology allows computer simulation studies and HAR1 shows about 0.5% probability from chimp to human in terms of diagram location, and SNP simulation results such as amoebic meningoencephalitis NAF1 suggest correlation. Extensions to the studies of the translation and transcription elongation factor sequences in Megavirus Chiliensis, Megavirus Lba and Pandoravirus show that the studied Pandoravirus sequence could be an outlier with the highest fractal dimension and lowest entropy, as compared to chicken as a deviant in the DNMT3A DNA methylation gene sequences from zebrafish to human and to the less than one percent probability in computer simulation using the HAR1 0.5% probability as reference. The diagram methodology would be useful in ancestral gene reconstruction studies in astrobiology and also be applicable to the study of point mutation in conformational thermostabilization research with Synchrotron based X-ray data for drug applications such as Parkinson’s disease.
The beetle, Agrilus planipennis Fairmaire, was introduced to Michigan in 2002 and has since spread to many other states. In recent years, it has been reported in parts of New York. The fluctuations in satellite data signal associated with indices describing ash tree health, such as leaf area index (LAI) and Normalized Difference Vegetation Index (NVDI) as reported by the MODIS, have been studied. The fraction of Photosynthetically Active Radiation (FPAR) data was also studied. MODIS hyperspectral data, as calibrated to winged aircraft hyperspectral data, was used for ash tree characterization.
Digital solar image data is available to users with access to standard, mass-market software. Many scientific
projects utilize the Flexible Image Transport System (FITS) format, which requires specialized software
typically used in astrophysical research. Data in the FITS format includes photometric and spatial calibration
information, which may not be useful to researchers working with self-calibrated, comparative approaches. This
project examines the advantages of using mass-market software with readily downloadable image data from the
Solar Dynamics Observatory for comparative analysis over with the use of specialized software capable of
reading data in the FITS format. Comparative analyses of brightness statistics that describe the solar disk in the
study of magnetic energy using algorithms included in mass-market software have been shown to give results
similar to analyses using FITS data. The entanglement of magnetic energy associated with solar eruptions, as
well as the development of such eruptions, has been characterized successfully using mass-market software. The
proposed algorithm would help to establish a publicly accessible, computing network that could assist in
exploratory studies of all FITS data. The advances in computer, cell phone and tablet technology could
incorporate such an approach readily for the enhancement of high school and first-year college space weather
education on a global scale. Application to ground based data such as that contained in the Baryon Oscillation
Spectroscopic Survey is discussed.
The Kepler mission is designed to survey a fist-sized patch of the sky within the Milky Way galaxy for the
discovery of exoplanets, with emphasis on near Earth-size exoplanets in or near the habitable zone. The Kepler
space telescope would detect the brightness fluctuation of a host star and extract periodic dimming in the
lightcurve caused by exoplanets that cross in front of their host star. The photometric data of a host star could be
interpreted as an image where fractal imaging would be applicable. Fractal analysis could elucidate the
incomplete data limitation posed by the data integration window. The fractal dimension difference between the
lower and upper halves of the image could be used to identify anomalies associated with transits and stellar
activity as the buried signals are expected to be in the lower half of such an image. Using an image fractal
dimension resolution of 0.04 and defining the whole image fractal dimension as the Chi-square expected value of
the fractal dimension, a p-value can be computed and used to establish a numerical threshold for decision
making that may be useful in further studies of lightcurves of stars with candidate exoplanets. Similar fractal
dimension difference approaches would be applicable to the study of photometric time series data via the
Higuchi method. The correlated randomness of the brightness data series could be used to support inferences
based on image fractal dimension differences. Fractal compression techniques could be used to transform a
lightcurve image, resulting in a new image with a new fractal dimension value, but this method has been found
to be ineffective for images with high information capacity. The three studied criteria could be used together to
further constrain the Kepler list of candidate lightcurves of stars with possible exoplanets that may be planned
for ground-based telescope confirmation.
Extremophilic cold-adapted sequences and their degradation codes have been studied using fractal dimension and Shannon entropy. The nucleotide fluctuation of a DNA and/or RNA sequence can be studied as a random series using the nucleotide atomic number differences between A, T, C, G, and U. Studies of degradation codes suggest a positive correlation of Shannon entropy with mRNA stability, and a negative correlation of fractal dimension with mRNA stability.
Pathogens could be inactivated via a light source coupled with a photosensitizing agent in photodynamic
antimicrobial chemotherapy (PACT). This project studied the effect of non-homogenous substrate on cell
colony. The non-homogeneity could be controlled by iron oxide nano-particles doping in porous glassy
substrates such that each cell would experience tens of hot spots when illuminated with additional light source.
The substrate non-homogeneity was characterized by Atomic Force Microscopy, Transmission Electron
Microscopy and Extended X-Ray Absorption Fine Structure at Brookhaven Synchrotron Light Source.
Microscopy images of cell motion were used to study the motility. Laboratory cell colonies on non-homogenous
substrates exhibit reduced motility similar to those observed with sub-lethal PCAT treatment. Such motility
reduction on non-homogenous substrate is interpreted as the presence of thermal stress. The studied pathogens
included E. coli and Pseudomonas aeruginosa. Non-pathogenic microbes Bacillus subtilis was also studied for
comparison. The results show that sub-lethal PACT could be effective with additional non-homogenous thermal
stress. The use of non-uniform illumination on a homogeneous substrate to create thermal stress in sub-micron
length scale is discussed via light correlation in propagation through random medium. Extension to sub-lethal
PACT application complemented with thermal stress would be an appropriate application.
Human (and other mammals) would secrete cerumen (ear wax) to protect the skin of the ear canal against
pathogens and insects. The studies of biodiversity of pathogen in human include intestine microbe colony, belly
button microbe colony, etc. Metals such as zinc and iron are essentials to bio-molecular pathways and would be
related to the underlying pathogen vitality. This project studies the biodiversity of cerumen via its metal content
and aims to develop an optical probe for metal content characterization. The optical diffusion mean free path and
absorption of human cerumen samples dissolved in solvent have been measured in standard transmission
measurements. EXFAS and XANES have been measured at Brookhaven Synchrotron Light Source for the
determination of metal contents, presumably embedded within microbes/insects/skin cells. The results show that
a calibration procedure can be used to correlate the optical diffusion parameters to the metal content, thus
expanding the diagnostic of cerumen in the study of human pathogen biodiversity without the regular use of a
synchrotron light source. Although biodiversity measurements would not be seriously affected by dead microbes
and absorption based method would do well, the scattering mean free path method would have potential to
further study the cell based scattering centers (dead or live) via the information embedded in the speckle pattern
in the deep-Fresnel zone.
Blood protein molecules could be embedded in porous glassy substrate with 10-nm pores. The embedding
principle is based on blood cell dehydration with the destruction of the cell membrane, and reconstitution and
centrifuge could yield a suitable solution for doping into a porous glassy medium. The doped glassy substrate
speckle pattern under laser illumination could be used to characterize the protein size distribution. Calibration
with known protein embedded samples would result in an optical procedure for the characterization of a blood
sample. Samples embedded with larger kilo-Dalton protein molecule show more variation in the speckle
patterns, consistent with protein folding interaction inside a pore cavity. A regression model has been used to
correlate the protein molecule sizes with speckle sizes. The use of diffusion mean free path information to study
protein folding in the embedding process is briefly discussed.
Cell density is an important parameter in the question of bio-variation and the studying of cell scattering could
be a viable tool. The development of spatially resolved optical fiber probe would enable the characterization of
optical scattering from cells within a colony. Single mode fiber probe would be budget friendly as compared to a
50-nm sub-cellular fiber probe. This project develops a calibration procedure to correlate the optical scattering
measured by a single mode fiber probe to that of a 50-nm sub-cellular fiber probe in the context of cell density
variation. The Fourier transform of intensity angular transmission would give correlation information in the Efield
in the spatial coordinate. Monte Carlo simulation could be used to constrain the input intensity function
spatial content resembling microscopy. The use of a 50-nm sub-cellular fiber probe for detailed study of
biological samples would give sub-micron scale length information.
Macro-bending fiber optic based heart-rate and pulse pressure shape monitors have been fabricated and tested for
non-invasive measurement. Study of fiber bending loss and its stability and variations are very important
especially for sensor designs based on optical fiber bending. Wavelengths from 1300 nm to 1550 nm have been
used with fabrication based on multimode fiber, single mode fiber, and photonic crystal fiber. The smallest
studied curvature would demand the use of single mode standard fibers. The collected data series show high
quality suitable for random series analysis. Fractal property of optically measured pulse pressure data has been
observed to correlate with physical activity. Correlation to EKG signal suggests that the fabricated monitors are
capable of measuring the differential time delays at wrist and leg locations. The difference in time delay could
be used to formulate a velocity parameter for diagnostics. The pulse shape information collected by the fiber
sensor provides additional parameters for the analysis of the fractal nature of the heart. The application to real
time measurement of blood vessel stiffness with this optical non-invasive fiber sensor is discussed.
Technologically important extremophiles including oil eating microbes, uranium and rocket fuel perchlorate
reduction microbes, electron producing microbes and electrode electrons feeding microbes were compared in
terms of their 16S rRNA sequences, a standard targeted sequence in comparative phylogeny studies. Microbes
that were reported to have survived a prolonged dormant duration were also studied. Examples included the
recently discovered microbe that survives after 34,000 years in a salty environment while feeding off organic
compounds from other trapped dead microbes. Shannon entropy of the 16S rRNA nucleotide composition and
fractal dimension of the nucleotide sequence in terms of its atomic number fluctuation analyses suggest a
selected range for these extremophiles as compared to other microbes; consistent with the experience of
relatively mild evolutionary pressure. However, most of the microbes that have been reported to survive in
prolonged dormant duration carry sequences with fractal dimension between 1.995 and 2.005 (N = 10 out of 13).
Similar results are observed for halophiles, red-shifted chlorophyll and radiation resistant microbes. The results
suggest that prolonged dormant duration, in analogous to high salty or radiation environment, would select high
fractal 16S rRNA sequences. Path analysis in structural equation modeling supports a causal relation between
entropy and fractal dimension for the studied 16S rRNA sequences (N = 7). Candidate choices for high fractal
16S rRNA microbes could offer protection for prolonged spaceflights. BioBrick gene network manipulation
could include extremophile 16S rRNA sequences in synthetic biology and shed more light on exobiology and
future colonization in shielded spaceflights. Whether the high fractal 16S rRNA sequences contain an asteroidlike
extra-terrestrial source could be speculative but interesting.
The Rubisco protein-enzyme is arguably the most abundance protein on Earth. The biology dogma of
transcription and translation necessitates the study of the Rubisco genes and Rubisco-like genes in various
species. Stronger correlation of fractal dimension of the atomic number fluctuation along a DNA sequence with
Shannon entropy has been observed in the studied Rubisco-like gene sequences, suggesting a more diverse
evolutionary pressure and constraints in the Rubisco sequences. The strategy of using metal for structural
stabilization appears to be an ancient mechanism, with data from the porphobilinogen deaminase gene in
Capsaspora owczarzaki and Monosiga brevicollis. Using the chi-square distance probability, our analysis
supports the conjecture that the more ancient Rubisco-like sequence in Microcystis aeruginosa would have
experienced very different evolutionary pressure and bio-chemical constraint as compared to Bordetella
bronchiseptica, the two microbes occupying either end of the correlation graph. Our exploratory study would
indicate that high fractal dimension Rubisco sequence would support high carbon dioxide rate via the Michaelis-
Menten coefficient; with implication for the control of the whooping cough pathogen Bordetella bronchiseptica,
a microbe containing a high fractal dimension Rubisco-like sequence (2.07). Using the internal comparison of
chi-square distance probability for 16S rRNA (~ E-22) versus radiation repair Rec-A gene (~ E-05) in high GC
content Deinococcus radiodurans, our analysis supports the conjecture that high GC content microbes containing
Rubisco-like sequence are likely to include an extra-terrestrial origin, relative to Deinococcus radiodurans.
Similar photosynthesis process that could utilize host star radiation would not compete with radiation resistant
process from the biology dogma perspective in environments such as Mars and exoplanets.
The fluctuations of leaf area index (LAI) and Fraction of Photosynthetically Active Radiation (FPAR) as
reported by the MODIS 8-day product MOD15A2 over a section of Harriman State Park, New York were
studied with reference to another nearby local park. The area selected for study, a seven km square grid with one
km resolution, is known for its biodiversity. Time series data points were generated using the sums of the grid's
49 pixel measurements for each of the 46 entries that make up the annual time series. A quadratic relation has
been observed that suggests that LAI/FPAR is proportional to FPAR if FPAR is considered as the forcing
parameter via chlorophyll (a, b, c, d and f), in an application model for the study of biodiversity. The LAI annual
time series from 2000 to 2009 follows the corresponding FPAR annual time series as expected, but with different
proportionality ratios in different seasons. The fractal analysis results of the time series data suggest that the LAI
sequences have a lower fractal dimension (~1.35) than those of the FPAR sequences (~1.55), consistent with the
idea that biological systems are capable of regulating fluctuation. The regression of LAI sequence fractal
dimension versus FPAR sequence fractal dimension exhibits an R-square of about 0.7 (N =10). The observed
regression outlier for the year 2009 could be indicative of the presence of additional factors. Synchrotron
EXAFS and XANES investigations of leaf samples reveal data consistent with metal absorption under stress.
Further studies of absorption under stress using remote sensing data are warranted.
The Zn-metalloprotease family contains conserved amino acid structures such that the nucleotide fluctuation at
the DNA level would exhibit correlated randomness as described by fractal dimension. A nucleotide sequence
fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide.
The structure's vibration modes can also be studied using a Gaussian Network Model. The vibration measure
and fractal dimension values form a two-dimensional plot with a standard vector metric that can be used for
comparison of structures. The preference for amino acid usage in extremophiles may suppress nucleotide
fluctuations that could be analyzed in terms of fractal dimension and Shannon entropy. A protein level cold
adaptation study of the thermolysin Zn-metalloprotease family using molecular dynamics simulation was
reported recently and our results show that the associated nucleotide fluctuation suppression is consistent with a
regression pattern generated from the sequences's fractal dimension and entropy values (R-square ~ 0.98, N =5).
It was observed that cold adaptation selected for high entropy and low fractal dimension values. Extension to the
Archaemetzincin M54 family in extremophiles reveals a similar regression pattern (R-square = 0.98, N = 6). It
was observed that the metalloprotease sequences of extremely halophilic organisms possess high fractal
dimension and low entropy values as compared with non-halophiles. The zinc atom is usually bonded to the
histidine residue, which shows limited levels of vibration in the Gaussian Network Model. The variability of the
fractal dimension and entropy for a given protein structure suggests that extremophiles would have evolved after
mesophiles, consistent with the bias usage of non-prebiotic amino acids by extremophiles. It may be argued that
extremophiles have the capacity to offer extinction protection during drastic changes in astrobiological
environments.
Daytime photosynthesis and nighttime nitrogen fixation metabolic processes have been reported in the
bacterium, Cyanothece 51142. The organism's auto-fluorescence with 532 nm excitation would place
cyanobacteria at the forefront in the remote sensing of microbial activity in astrobiology. The sensitivity of
nitrogenase to oxygen was studied in terms of sequence nucleotide fluctuation. A nucleotide sequence fractal
dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. The
fractal dimension and Shannon entropy form a two-dimensional measure that is useful in assessing evolutionary
pressures. The studied sequences include nitrogenase iron protein NifH, nitrogenase molybdenum-iron protein
alpha chain NifD and beta chain NifK. The photosynthesis-lacking UCYN-A cyanobacterium as reported
recently in the journal, Nature, was observed to have the lowest entropy with relatively high fractal dimension
values in the studied NifH, NifD and NifH sequences. The fractal dimension of NifH sequences correlates with
the NifD sequence values with an R-square of 0.91 (N = 8). The Shannon mononucleotide entropy of NifD
sequences correlates with the NifK sequence values with an R-square value of 0.92 (N = 8). The observed strong
correlation suggests the presence of gradual evolutionary pressure among the studied cyanobacteria, and throws
light on the reported paradox in evolution for the case of UCYN-A. The results show that diurnal oscillation
metabolic processes in cyanobacteria (including the photosynthesis-deficient case) are not associated with
extraordinary evolutionary pressures and thus are processes consistent with putative astrobiological organisms.
Archaea are important potential candidates in astrobiology as their metabolism includes solar, inorganic and
organic energy sources. Archaeal viruses would also be expected to be present in a sustainable archaeal
exobiological community. Genetic sequence Shannon entropy and fractal dimension can be used to establish a
two-dimensional measure for classification and phylogenetic study of these organisms. A sequence fractal
dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide.
Archaeal 16S and 23S ribosomal RNA sequences were studied. Outliers in the 16S rRNA fractal dimension and
entropy plot were found to be halophilic archaea. Positive correlation (R-square ~ 0.75, N = 18) was observed
between fractal dimension and entropy across the studied species. The 16S ribosomal RNA sequence entropy
correlates with the 23S ribosomal RNA sequence entropy across species with R-square 0.93, N = 18. Entropy
values correspond positively with branch lengths of a published phylogeny. The studied archaeal virus
sequences have high fractal dimensions of 2.02 or more. A comparison of selected extremophile sequences with
archaeal sequences from the Humboldt Marine Ecosystem database (Wood-Hull Oceanography Institute, MIT)
suggests the presence of continuous sequence expression as inferred from distributions of entropy and fractal
dimension, consistent with the diversity expected in an exobiological archaeal community.
The Single-Stranded DNA-Binding Protein (RPA) Genes in gamma ray radiation-resistant halophilic archaeon
Halobacterium sp. NRC-1 were analyzed in terms of their nucleotide fluctuations. In an ATCG sequence, each
base was assigned a number equal to its atomic number. The resulting numerical sequence was the basis of the
statistical analysis in this study. Fractal analysis using the Higuchi method gave fractal dimensions of 2.04 and
2.06 for the gene sequences VNG2160 and VNG2162, respectively. The 16S rRNA sequence has a fractal
dimension of 1.99. The di-nucleotide Shannon entropy values were found to be negatively correlated with the
observed fractal dimensions (R2~ 0.992, N=3). Inclusion of Deinococcus radiodurans Rad-A in the regression
analysis decreases the R2 slightly to 0.98 (N=4). A third VNG2163 RPA gene of unknown function but with upregulation
activity under irradiation was found to have a fractal dimension of 2.05 and a Shannon entropy of 3.77
bits. The above results are similar to those found in bacterial Deinococcus radiodurans and suggest that their
high radiation resistance property would have favored selection of CG di-nucleotide pairs. The two transcription
factors TbpD (VNG7114) and TfbA (VNG 2184) were also studied. Using VNG7114, VNG2184, and
VNG2163; the regression analysis of fractal dimension versus Shannon entropy shows that R2 ~ 0.997 for N =3.
The VNG2163 unknown function may be related to the pathways with transcriptions closely regulated to
sequences VNG7114 and VNG2184.
The discovery of perchlorate on Mars raises the possibility of the existence of perchlorate reduction microbes on
that planet. The perchlorate reductase gene sequence fractal dimensions of two Dechloromonas species were
compared with five other sequences in the microbial dimethyl sulfoxide (DMSO) reductase family. A nucleotide
sequence can be expressed as a numerical sequence where each nucleotide is assigned its proton number. The
resulting numerical sequence can be investigated for its fractal dimension in terms of evolution and chemical
properties for comparative studies. Analysis of the fractal dimensions for the DMSO reductase family supports
phylogenetic analyses that show that the perchlorate reductase gene sequences are members of the same family.
A sub-family with roughly the same nucleotide length emerges having the property that the gene fractal
dimension is negatively correlated with the Shannon di-nucleotide entropy (R2 ~ 0.95, N =5). The gene
sequence fractal dimension is found to be positively correlated with the neighbor joining distances reported in a
published protein phylogeny tree (R2~ 0.92, N = 5). The multi-fractal property associated with these genes
shows that perchlorate reductase has lower dimensionality as compared to the relatively higher dimensionality
DNA-break repair genes Rec-A and Rad-A observed in the Dechloromonas aromatica and Deinococcus
radiodurans genomes. The studied perchlorate gene sequences show a higher Shannon di-nucleotide entropy
(~3.97 bits) relative to Dechloromonas aromatica DNA repair sequences (~3.87 bits Rec-A, ~3.92 bits Rad-A),
suggesting that there are fewer constraints on nucleotide variety in the perchorlate sequences . These
observations thus allow for the existence of perchlorate reducing microbes on Mars now or in the past. Timeresolved
UV fluorescence study near the emission bands of nucleotide sequences could be used for bio-detection
on Mars-like surfaces and the results may further constrain the proposed conjectures.
The deep water chlorophyll concentration fluctuation from 2003 to 2007 has been studied using fractal analysis.
The SeaWiFS global daily mean chlorophyll concentration time series were used. The Higuchi fractal algorithm
was used to calculate fractal dimension, which is given by the slope of an associated length versus the lag. Short
range fluctuation investigation using a six point slope gives fractal dimensions from 1.80 to 1.85, suggesting the
presence of correlation, which was confirmed by computer simulations. The gradual increase of fractal
dimension to 1.9 in about 15 lag-days suggests that a long-range de-correlation mechanism favoring random
fluctuation is present. The 2007 times series shows a relatively low overall fractal dimension and exhibits a
peculiar multi-fractal behavior. This phenomenon and the observed low accumulated cyclone energy in 2007
support the interpretation that cyclone energy can promote deep-water chlorophyll concentration fluctuation. A
regression of fractal dimension at 10 lag-days versus the log of cyclone energy gives an R2 value of 0.75 (N =
5)., which suggests the presence of additional or related de-correlation mechanisms.
A nucleotide sequence can be expressed as a numerical sequence when each nucleotide is assigned its proton
number. A resulting gene numerical sequence can be investigated for its fractal dimension in terms of evolution
and chemical properties for comparative studies. We have investigated such nucleotide fluctuation in the RecA
repair gene of Psychrobacter cryohalolentis K5, Psychrobacter arcticus 273-4, and Psychrobacter sp. PRwf-1.
The fractal dimension was found to correlate with the gene's operating temperature with the highest fractal
dimension associated with P. cryohalolentis K5 living at the low temperatures found in Siberian permafrost.
The CpG dinucleotide content was found to be about 5% for the three species of Psychrobacters, which is
substantially lower than that of Deinococcus radiodurans at about 12%. The average nucleotide pair-wise free
energy was found to be lowest for Psychrobacter sp. PRwf-1, the species with the lowest fractal dimension of
the three, consistent with the recent finding that Psychrobacter sp. PRw-f1 has a temperature growth maximum
of 15-20°C higher than P. arcticus 273-4 and P. cryohaloentis K5. The results suggest that microbial vitality in
extreme environments is associated with fractal dimension as well as high CpG dinucleotide content, while the
average nucleotide pair-wise free energy is related to the operating environment. Evidence that extreme
temperature operation would impose constraints measurable by Shannon entropy is also discussed. A
quantitative estimate of an entropy-based measure having the characteristics of a mechanical pressure shows that
the Psychrobacter RecA sequence experiences lower pressure than that of the human HAR1 sequence.
Cultures of the methane-producing archaea Methanosarcina, have recently been isolated from Alaskan
sediments. It has been proposed that methanogens are strong candidates for exobiological life in extreme
conditions. The spatial environmental gradients, such as those associated with the polygons on Mars' surface,
could have been produced by past methanogenesis activity. The 16S rRNA gene has been used routinely to
classify phenotypes. Using the fractal dimension of nucleotide fluctuation, a comparative study of the 16S
rRNA nucleotide fluctuation in Methanosarcina acetivorans C2A, Deinococcus radiodurans, and E. coli was
conducted. The results suggest that Methanosarcina acetivorans has the lowest fractal dimension, consistent
with its ancestral position in evolution. Variation in fluctuation complexity was also detected in the transcription
factors. The transcription factor B (TFB) was found to have a higher fractal dimension as compared to
transcription factor E (TFE), consistent with the fact that a single TFB in Methanosarcina acetivorans can code
three different TATA box proteins. The average nucleotide pair-wise free energy of the DNA repair genes was
found to be highest for Methanosarcina acetivorans, suggesting a relatively weak bonding, which is consistent
with its low prevalence in pathology. Multitasking capacity comparison of type-I and type-II topoisomerases
has been shown to correlate with fractal dimension using the methicillin-resistant strain MRSA 252. The
analysis suggests that gene adaptation in a changing chemical environment can be measured in terms of
bioinformatics. Given that the radiation resistant Deinococcus radiodurans is a strong candidate for an extraterrestrial
origin and that the cold temperature Psychrobacter cryohalolentis K5 can function in Siberian
permafrost, the fractal dimension comparison in this study suggests that a chemical resistant methanogen could
exist in extremely cold conditions (such as that which existed on early Mars) where demands on gene activity are
low. In addition, the comparative study of the Methanococcoides burtonii cold shock domain sequence has
provided further support for the correlation between multitasking capacity and fractal dimension.
KEYWORDS: Fractal analysis, Organisms, Monte Carlo methods, Statistical analysis, Proteins, Image information entropy, Resistance, Biological research, Information science, Genetics
We have characterized function related DNA sequences of various organisms using informatics techniques,
including fractal dimension calculation, nucleotide and multi-nucleotide statistics, and sequence fluctuation
analysis. Our analysis shows trends which differentiate extremophile from non-extremophile organisms, which
could be reproduced in extraterrestrial life. Among the systems studied are radiation repair genes, genes involved
in thermal shocks, and genes involved in drug resistance. We also evaluate sequence level changes that have
occurred during short term evolution (several thousand generations) under extreme conditions.
Diatom bioactivity has been reported to be responsible for about 20% of carbon fixation globally and together
with other photosynthetic organisms, the bioactivity can be monitored via satellite ocean imaging. The bioinformatics
embedded in the nucleotide fluctuations of photosynthesis and bio-silicate genes in diatoms were
studied. The recently reported phosphoenolpyruvate carboxylase PEPC1 and PEPC2 C4-like photosynthesis
genes in Phaeodactylum tricornutum were found to have similar fractal dimensions of about 2.01. In
comparison, the green alga Chlamydomonas reinhardtii PEPC1 and PEPC2 genes have fractal dimensions of
about 2.05. The PEPC CpG dinucleotide content is 8% in P. tricornutum and 10% in C. reinhardtii. Further
comparison of the cell wall protein gene showed that the VSP1 gene sequence in C. reinhardtii has a fractal
dimension of 2.03 and the bio-silica formation silaffin gene in Thalassiosira pseudonana has a fractal dimension
of 2.01. The phosphoenolpyruvate carboxylase PPC1 and PPC2 in T. pseudonana were found to have fractal
dimensions and CpG dinucleotide content similar to that of P. tricornutum. The fractal dimension of the dnaB
replication helicase gene is about 1.98 for both diatoms as well as for the alga Heterosigma akashiwo. In
comparison, the E. coli dnaB gene has a fractal dimension of about 2.03. Given that high fractal dimension and
CpG dinucleotide content sequences have been associated with the presence of selective pressures, the relatively
low fractal dimension gene sequences of the two unique properties of Earth-bound diatoms (photosynthesis and
bio-silica cell wall) suggests the potential for the development of high fractal dimension sequences for adaptation
in harsh environments.
Cell scattering produces a speckle pattern, while imaging produces a contrast pattern. This family of fluctuation signals can be studied by analysis techniques such as correlation and fractal dimension. Human breast cell (normal and cancerous) samples were investigated using laser speckle and imaging microscopy. Image data from tetraploid human cell motion and quorum sensing biofilm growth were studied as well, and we found that the signal fluctuations could be interpreted as gene expression fluctuations occurring during inter-cell communication. We have mapped nucleotide sequences as images and studied the fluctuation. We showed that the fractal dimension and correlation can be used for the description of bio-information from the DNA (nanometer) scale to the tissue (millimeter) level. Fluctuations of the HAR1 nucleotide sequence and IRF-6 single-mutation cases in the van der Woude syndrome were discussed. Sub-cell structures such as the 40S ribosome, GroEL, and lysozyme, were shown to carry texture fractal dimension information in their images consistent with their biological states. Clinical applications to X-ray mammography and Parkinson disease MRI data were discussed.
The absorption effect of the back surface boundary of a diffuse layer was studied via laser generated reflection speckle pattern. The spatial speckle intensity provided by a laser beam was measured. The speckle data were analyzed in terms of fractal dimension (computed by NIH ImageJ software via the box counting fractal method) and weak localization theory based on Mie scattering. Bar code imaging was modeled as binary absorption contrast and scanning resolution in millimeter range was achieved for diffusive layers up to thirty transport mean free path thick. Samples included alumina, porous glass and chicken tissue. Computer simulation was used to study the effect of speckle spatial distribution and observed fractal dimension differences were ascribed to variance controlled speckle sizes. Fractal dimension suppressions were observed in samples that had thickness dimensions around ten transport mean free path. Computer simulation suggested a maximum fractal dimension of about 2 and that subtracting information could lower fractal dimension. The fractal dimension was shown to be sensitive to sample thickness up to about fifteen transport mean free paths, and embedded objects which modified 20% or more of the effective thickness was shown to be detectable. The box counting fractal method was supplemented with the Higuchi data series fractal method and application to architectural distortion mammograms was demonstrated. The use of fractals in diffusive analysis would provide a simple language for a dialog between optics experts and mammography radiologists, facilitating the applications of laser diagnostics in tissues.
The radiation resistance-repair genes in Deinococcus radiodurans (DR) and E-coli were analyzed in terms of the
A, T, C, G nucleotide fluctuations. The studied genes were Rec-A, Rec-Q, and the unique DR PprA gene. In an
ATCG sequence, each base was assigned a number equal to its atomic number. The resulting numerical
sequence was the basis of the statistical analysis. Fractal analysis using the Higuchi method gave a fractal
dimension increase of the Deinococcus radiodurans genes as compared to E-coli, which is comparable to the
enhancement observed in the human HAR1 region (HAR1F gene) over that of the chimpanzee. Near neighbor
fluctuation was also studied via the Black-Scholes model where the increment sequence was treated as a random
walk series. The Deinococcus radiodurans radiation gene standard deviations were consistently higher than that
of the E-coli deviations, and agree with the fractal analysis results. The sequence stacking interaction was
studied using the published nucleotide-pair melting free energy values and Deinococcus radiodurans radiation
genes were shown to possess larger negative free energies. The high sensitivity of the fractal dimension as a
biomarker was tested with correlation analysis of the gamma ray dose versus fractal dimension, and the R square
values were found to be above 0.9 (N=5). When compared with other nucleotide sequences such as the rRNA
sequences, HAR1 and its chimpanzee counterpart, the higher fluctuation (correlated randomness) and larger
negative free energy of a DR radiation gene suggested that a radiation resistance-repair sequence exhibited
higher complexity. As the HAR1 nucleotide sequence complexity and its transcription activity of co-expressing
cortex protein reelin supported a positive selection event in humans, a similar inference of positive selection of
coding genes could be drawn for Deinococcus radiodurans when compared to E-coli. The origin of such a
positive selection would be consistent with that of a Martian environment.
The shape of an exoplanet lightcurve is usually obtained by averaging the noise over multiple datasets. Fractal
analysis has been demonstrated to be an effective tool for the detection of exoplanet transits using lightcurves
summed over all wavelengths sensitive to the detector (G. Tremberger, Jr et. al, 2006 Proc SPIE Vol 6265). The
detection of spectral features would depend on the extent to which the signal was buried in the noise. Different
noise sources would have different fractal characteristics. Also, the signal strength could be discontinuous in
time depending on the exoplanet's local atmospheric environment. Such a discontinuity is unlikely to be
detected with time integrated data. The lightcurve noise and shape information were characterized with fractal
dimension analysis of a noise buried time series signal. Computer simulation revealed that when the noise is
three times that of the signal, the fractal algorithm could detect the signal at about the 87% confidence level.
Application to noise buried time series datasets (HD 209458b lightcurve, HD149026b lightcurve) detected
discontinuities consistent with the results obtained by averaging datasets. Extension to individual wavelength
lightcurves would establish a detection limit for the existence of spectral features at wavelengths important for
exoplanet study. Other applications such as pre-implantation genetic screening spectroscopy and spatially varied
aneuploidy bio-data could use the same analysis principle as well.
The Pacific Ocean deep sea height data around latitude 20 N from Jason-1 satellite was analyzed in terms of
standard deviation (std) and fractal dimension during a 90-day period that included the coronal mass ejection
event of 2003 Oct 29 where a peak solar energetic particles of about 30,000 pfu was measured. The surface
height standard deviation series was observed to have two peaks that corresponded to two typhoon events of Oct
25 and Nov 26, 2003. The cross correlation of the height-std series and average-height series showed a positive
correlation with time delay. The fractal dimension of the height series peaked on Nov 1 (fractal dimension ~1.96
with a background 90-day average of ~ 1.81) and no corresponding peak was observed in the other time series
data. Computer simulation of the fractal dimension of a finite random series suggested a standard deviation of
about 0.071. Annual and long-term trends of the fractal dimensions were also found and investigated. The
possible contribution of coronal mass ejection to the surface height series fractal dimension and the height
correlation to chlorophyll were discussed.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.