Gene Syncc9605_0099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0099 
Symbol 
ID3735450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp98442 
End bp99632 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID637774678 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_380430 
Protein GI78211651 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0722569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCT CACCGCATCC CAGCGCCCGC ACCATTCACG CCAGTGGCGG CAACTCCGCT 
CCAGGCTTGC TCAACAACTC CGTTGAATTG AGCGCTTGCC TGCTCAGCTG GTGGCAGGCC
CATGGCCGGC GTGATCCCGT GCAGAAGCCT TGGATGTTCA AGCCAGCCGG GACTTGGCCT
GAAGCGGTTC ACCAACTTGA TCCCTACGGC ATTTGGATTG CTGAGGTGAT GCTGCAGCAG
ACCCAGTTGG CCGTGGCGCT TCCCTATTGG ATGCGTTGGA TGGAGGCATT TCCAACTGTC
GAAACGTTGG CAGCGGCGTC CCTGGATGAG GTGCGGCTGC AGTGGCAGGG CCTCGGGTAT
TACTCCCGGG TCCGCCGGCT GCATGAGGCG GCGCAGCGGT TGGTGGGCCG ACCGTGGCCG
CGCAGCTTGG AGGAGTGGAT GGCGTTGCCT GGCATCGGCC GCACCACCGC CGGCAGCATC
CTCTCCAGTG CCTTCAATCT CCGGCTGCCG ATCCTGGATG GCAACGTAAA ACGGGTGCTG
GCGCGCTTGA CGGCCCATGC GCGCCCGCCG GCCCGTGACG ATGCCTTGTT CTGGTGCTGG
AGTGAGGCTC TGCTTGATCC GGTTCGGGCA CGGGATACCA ACCAGGCCTT GATGGATCTG
GGGGCCACGC TCTGCACCCC CCGCCAGCCG GCCTGTCACC GCTGCCCCTG GCACTCCCAG
TGCGCTGCCT ACGCTTCCGG CGATCCCTGC CGCTGGCCCG TGACCAATGC CCCCAAGCCC
CTGCCCTTCC AGGTGATCGG TGTGGGTGTC GTGCTCAACG CTGCCGGGGA GGTGTTGATC
GACCAGCGCC TAGAGGAAGG CCTGCTGGGG GGAATGTGGG AGTTCCCCGG TGGCAAACAA
GAACAAGGCG AAACGATCGA AACCTGCATT GCCCGCGAGT TGAAGGAGGA GCTCGGCATT
GCGGTGACAG TGGGCGCTGA ACTGATCACC GTTGATCACG CCTACAGCCA CAAGAAGTTG
CGCTTTGTGG TGCATCTCTG CGACTGGATG TCGGGGGAGC CGCAGCCCCT TGCCAGTCAG
CAGGTGCGTT GGGTGCGCCC AGATGACCTG GTGGATTACG CCTTTCCGGC CGCCAATGCT
CGGATCATTG AGGCGTTGCT TGGCAGCTTG GAAAGCTCTG CCCACCCTTA A
 
Protein sequence
MPRSPHPSAR TIHASGGNSA PGLLNNSVEL SACLLSWWQA HGRRDPVQKP WMFKPAGTWP 
EAVHQLDPYG IWIAEVMLQQ TQLAVALPYW MRWMEAFPTV ETLAAASLDE VRLQWQGLGY
YSRVRRLHEA AQRLVGRPWP RSLEEWMALP GIGRTTAGSI LSSAFNLRLP ILDGNVKRVL
ARLTAHARPP ARDDALFWCW SEALLDPVRA RDTNQALMDL GATLCTPRQP ACHRCPWHSQ
CAAYASGDPC RWPVTNAPKP LPFQVIGVGV VLNAAGEVLI DQRLEEGLLG GMWEFPGGKQ
EQGETIETCI ARELKEELGI AVTVGAELIT VDHAYSHKKL RFVVHLCDWM SGEPQPLASQ
QVRWVRPDDL VDYAFPAANA RIIEALLGSL ESSAHP