Gene Dgeo_0298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0298 
Symbol 
ID4058022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp289216 
End bp290289 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content70% 
IMG OID641229300 
Productpeptidase M42 
Protein accessionYP_603770 
Protein GI94984406 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC TGTCTGCCCC GACCTCCGCC GCCCGGCCCG GGGACTTCGA CCTGCCGTAC 
ACGACCGACC TCCTCCTGCG CCTGCTGAAC ACCCCCAGCC CGACCGGGTT CACCGAGGCC
GCCGTGCGTC TGCTGGAAGG GGAGCTGGAC GCGCTCGGCG TGCCGCACCG CCGCAGCAAG
AAGGGGGCGC TGACCTGGGA GATCGCGGGA CAACCTGGCC AGCCGCACAC CACCTTCAGC
GGTCACGTGG ACACGCTGGG CGCCATGGTG AAGGAGATCA AGGAGAACGG GCGGCTGCGT
CTCTTTCCCT TGGGCGGCTA CGACTGGGCC ACCATCGAGG GCGAGTACGT GCAGGTCCAC
ACCGGGCGGG GCGAGGCCGT CACCGGGACG GTCGTCAACA CCCACCAGAG CACCCACGTT
CACGGCCCTG CCCTACGGGA GCTGCGGCGC GAGCAGGCGG TGATGGAAGT CCGGCTGGAC
GCTCCCACCA CCTCTCCGGA GGAGACGCGG GCGCTGGGCA TCGAGGTGGG CGACTTCGTG
AGCTTCGATC CCCGCGCCAC CCTGACGGAC GCCGGGTACA TGAAGAGCCG CCACCTCGAC
AACAAGGCCG CGGTTGCCGT GTTCTTGGGC GTGACCCGTG CCCTGCTGGA GGCGCCACCT
GCTCGCACGG TCGCCTTCCA CGTCACCACC TACGAGGAGG TCGGGCACGG GGCCGCCACC
GGGATTCCGC CCCACACCGA CGAGCTGATC GCGGTGGACA TGGCCGCCGT GGGCGAGGGG
CAGACCAGCA GCGAGCACCA CGTCACCCTC TGCGTGGCCG ACAGCGGCGG GCCATATGAC
CACGCGCTCG GCAATCGGCT GCGGGCGGCG GCACGGCGGG CCGGGCTGGA GTTGCGGGTA
GACCTCTACC CCTACTACGC TTCGGACGGA ACGGCGGCCT GGCGCGCGGG CGGCGACTAT
CCGGTTGCCC TGATTGGACC TGGGGTGGAC GCGAGCCATG CTTACGAGCG CACCCACCTG
GACGCGTTGC GGGCGACGGC AGAACTGATG CTGGCGCATG TGCGGGGAGA GTGA
 
Protein sequence
MTVLSAPTSA ARPGDFDLPY TTDLLLRLLN TPSPTGFTEA AVRLLEGELD ALGVPHRRSK 
KGALTWEIAG QPGQPHTTFS GHVDTLGAMV KEIKENGRLR LFPLGGYDWA TIEGEYVQVH
TGRGEAVTGT VVNTHQSTHV HGPALRELRR EQAVMEVRLD APTTSPEETR ALGIEVGDFV
SFDPRATLTD AGYMKSRHLD NKAAVAVFLG VTRALLEAPP ARTVAFHVTT YEEVGHGAAT
GIPPHTDELI AVDMAAVGEG QTSSEHHVTL CVADSGGPYD HALGNRLRAA ARRAGLELRV
DLYPYYASDG TAAWRAGGDY PVALIGPGVD ASHAYERTHL DALRATAELM LAHVRGE