Gene Dgeo_3108 details


Gene Information

Locus tag: Dgeo_3108
Symbol:
ID: 5687571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_009939
Strand:
Start bp: 197661
End bp: 199493
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 66%
IMG OID: 641262571
Product: N-6 DNA methylase
Protein accession: YP_001527845
Protein GI: 158421618
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
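
The start, end, and length values in this record agree with one another. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 197661
end_bp = 199493

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1833
print(protein_length)   # 610
```

Both results match the Gene Length (1833 bp) and Protein Length (610 aa) fields.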


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAACA TGCCACCGCC GATCGACTTC CCCGCTCTTC TGCGCACGAT CCGCTCCGCG 
CTCGACCTCA CCCAGGAGCA GCTCGCGGAA CGCCTGGGTG TCTCCTTCGC CACGGTCAAC
CGCTGGGAAG GCGGCGGCAA CAAGCCGCAG CGTGCCGCGC AGGAGACCAT CCTCGCGCTG
GCGCGAGAGG CCGGGGTCGA GGGTGCCGAA AGCCCATCCG CCGCTGACGC TGCCGCTCAA
GTGACCCGGC GTCGCACCGG GCGGGCGGCG GCTGCGCCCA CCACCAAGCC GATGGAGCAG
ATGCTGTGGG ACGCCGCCTG TTCGATCCGA GGTGAGAAGG AGGCGGCCAA GTTCAAGGAT
TACCTGCTGC CGCTGCTCTT CCTCAAGCGC CTGTCCGACG TTTTCGACGA TGAAATCGAG
CGGCTGGCCG AGGAGTACGG CGACCGTGCC ACCGCGCTGG AGATTGCCGA GTCGGACCAC
TCCCTGCTGC GTTTCTACCT GCCGCCCGAA GCGCGCTGGA CGGTGATCAG CGGGCGCGAG
CCGTTCGACT GGCCGCGCGA TGTGCAAGGT CGCTCCACTG CGCCGCGCGA CATCGGCGAG
CATTTGACCC GCGCCGTACG CGCCGTGGTC AAGCACAACC CCTCGCTCTC TGGCGTGATC
GACGTGGTGG ACTTCGCCGC CGAAAGGAAC GGCGAGCGCG ACATCAACCC GGCCAAGCTG
CGCGGCGTGG TGGAGACGTT TTCCGATCCG CGCTACCGGC TGGGCCTCGC CGACGTGCAG
CCCGACTTCC TCGGTCGCGC CTACGAATAC CTGCTGCGCA AGTTCGCTGA AGGCTCCGGC
CAGAGCGCCG GCGAGTTCTT CACCCCGACC GAAGTGGGCT TTTTGATGGC CCACATCCTG
CGGCCCAAAC CCGGCGAGAC CTGCCACGAC TACGCCTGTG GTTCGGCGGG GCTGTTGATC
AAGCTCCAGC TCGTCGCCCG CGAACTCGAC CCCACCAGCC GTGTGCCGCT CAAGCTCTCC
GGCCAGGAAC TGCAGGCCGA GAGCTACGCC GTGGCGCAGA TGAACGCCAT CATCCACGAC
ATGGAGGTGG AGCTGGCGCG TGGCGACACC ATGATCAACC CCAAGTTCCG CAATGCGGAC
GGCTCCATCC GCCAACACGA CATCGTGGTG GCCAACCCGA TGTGGAACCA GTCCTTCGCA
CCGGACATCT TCGCCCACGA CCCGTTCGAC CGCTTCCGCA CGGCGGGCGG CATCACCAGC
GGAAAAGGAG ACTGGGCCTG GCTGCAACAC ACGCTGGCCT GCATGAACGA TCACGGCCGC
GCCGCCGTCG TGCTCGATAC CGGCGCGGTG ACGCGCGGCT CCGGCTCCAA GAACGAAGAC
AAGGAGCGCA CCATCCGCAA GTGGTTCGTC GAGCAGGACT TGATCGACGG GGTGATCCTG
CTGCCCGAGA ACCTCTTCTA CAACACCACG GCGGCGGGCG TAATCGTGGT GCTGAACAAG
CGCAAGCCTG CTGCGCGCAA AGGCAAGATC GTCCTGCTCA ACGCCAGCCG CCACTTCAGC
AAGGGCAGGC CCAAAAACTA CCTGCCCGAG GAAGACCTGC GCCCACTCGC CGCGATGTAC
CTCAAGGGGG AGCCGGTCGA CGGCGAGCTC GCTGTCATCA CCAAGCAGCA AGCGGAAGAA
GCGGACTACA ACCTCAGTCC TGGGCGCTGG ATCGCACAAG GCGGCAATGC TGACCACCGC
TCCATCAAGG CCATCGTCGC GGACATGCTG TCGCTCGACG AGAAAGCCCG AGAGATCGAT
CAAACCCTCG CTAAGTTGCT TGCACCGCTA TGA
 
Protein sequence
MSNMPPPIDF PALLRTIRSA LDLTQEQLAE RLGVSFATVN RWEGGGNKPQ RAAQETILAL 
AREAGVEGAE SPSAADAAAQ VTRRRTGRAA AAPTTKPMEQ MLWDAACSIR GEKEAAKFKD
YLLPLLFLKR LSDVFDDEIE RLAEEYGDRA TALEIAESDH SLLRFYLPPE ARWTVISGRE
PFDWPRDVQG RSTAPRDIGE HLTRAVRAVV KHNPSLSGVI DVVDFAAERN GERDINPAKL
RGVVETFSDP RYRLGLADVQ PDFLGRAYEY LLRKFAEGSG QSAGEFFTPT EVGFLMAHIL
RPKPGETCHD YACGSAGLLI KLQLVARELD PTSRVPLKLS GQELQAESYA VAQMNAIIHD
MEVELARGDT MINPKFRNAD GSIRQHDIVV ANPMWNQSFA PDIFAHDPFD RFRTAGGITS
GKGDWAWLQH TLACMNDHGR AAVVLDTGAV TRGSGSKNED KERTIRKWFV EQDLIDGVIL
LPENLFYNTT AAGVIVVLNK RKPAARKGKI VLLNASRHFS KGRPKNYLPE EDLRPLAAMY
LKGEPVDGEL AVITKQQAEE ADYNLSPGRW IAQGGNADHR SIKAIVADML SLDEKAREID
QTLAKLLAPL
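
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Since this gene begins with ATG, no start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon-to-amino-acid mapping, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not converted to M (not needed for ATG).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 bases of the gene sequence, copied from this record.
first_block = "ATGTCAAACA TGCCACCGCC GATCGACTTC CCCGCTCTTC TGCGCACGAT CCGCTCCGCG"
print(translate(first_block))  # MSNMPPPIDFPALLRTIRSA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the whole 610-residue protein to be compared.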