Gene Dgeo_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2017 
Symbol 
ID4058480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2127419 
End bp2128972 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content65% 
IMG OID641231055 
ProductN-6 DNA methylase 
Protein accessionYP_605480 
Protein GI94986116 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.54323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA ATGGCACCAC CAACAGCAAC GGGGGCAACC TCGGCTTCGA GGCTGACCTT 
TTCAAGGCCG CTGACAAGCT GCGCGGCAAC ATGGAGCCCA GCGACTACAA GCACGTCGCT
CTGGGCTTGA TCTTCCTCAA ATACATCTCG GACGCCTTCG AGGCCAGGCA CCAGGCGCTG
CTGGCCGAAG ACCCGCGCGC GGCCGAAGAC CGCGACGAGT ACCTGGCCGA CAACGTCTTC
TGGGTGCCGA AAGAGGCGCG CTGGTCGCAC TTGCGGGCCA ACGCCAGGCG GCCCGAGATC
GGCCTCCTGA TCGATGAGGC CATGCGCGCC ATCGAGAAGG AGAACGAGTC GCTCAAGGGC
GTGTTGCCCA AGGACTACGC CAGGCCCGCG CTCAACAAGG TGATGCTGGG CGAGTTGATC
GACCTGATCT CCGGCATTGC GCTGGGCGAG GAAGGCGACC GCTCCAAGGA CATCCTCGGG
CGCGTGTACG AATACTTCCT GGGCCAGTTC GCCGGCGCCG AGGGCAAGCG GGGCGGCGAG
TTCTACACCC CGCGCTCGGT GGTTCGCGTG CTGGTGGAGA TGCTGGAGCC TTACCACGGG
CGGGTGTATG ACCCCTGCTG CGGCTCGGGC GGCATGTTCG TGCAGAGCGA AAAGTTCGTG
CAGGAGCACG GCGGGCGCAT CGGTGACATC GCCATCTACG GGCAGGAGAG CAACTACACC
ACCTGGCGGC TGTGCAAGAT GAACCTGGCG GTGCGCGGCA TCGATGCCGA CATCCGCTGG
AACAACGAAG GCAGCTTCCA CAAGGACGAA CTGCGCGACC TCAAGGCCGA TTTCATCCTC
GCCAACCCGC CGTTCAACAT CTCCGACTGG GGCGGCGAGC GCCTGCGCGA GGACGTGCGC
TGGAGTTTTG GCGTGCCGCC CGTGGGCAAT GCCAACTACG CCTGGCTGCA GCACATCCAT
CACCACCTCG CCCCCAATGG CACCGCGGGT GTGGTCCTGG CCAACGGCTC GATGAGTTCG
AACCAGTCGG GCGAGGGCGA GATTCGCAAG GCCATGGTCG AGGCCGACGT GGTGGACTGC
ATGGTGGCGT TGCCCGGGCA GCTTTTCTAC TCCACGCAGA TTCCCGCCTG CCTGTGGTTC
CTGGCACGCA ACAAAAACCC CGGCAAGGGC CTGCGCGACC GCCGCGGACA GGTGCTGTTC
ATCGACGCGC GCAAGCTGGG CGTGCTGGTG GACCGCACCC GGCGCGAACT CACGGACGCC
GAAATCCAGA AGATCGCCGA CACCTACCAC GCCTGGCGCG GTGAACCCGA TGCGGGCGAA
TACCAGGACG TACCCGGCTT CTGCAAATCC GCCACGCTGG AGGAGATCCG TAAACACGGC
TTTGTCCTCA CCCCGGGCCG CTATGTGGGC GCGGCACAGC AAGAGGACGA CGGCGAGCCG
TTCGAGGAAA AAATGGCGCG GTTGGCGGCC CAGTGGCGCG AGCAGCGGGC AGCGGCGGCC
AAGCTGGATG CGGCGATTGA AGCCAACCTG AAGGAGCTTG GGTATGGCGG GTGA
 
Protein sequence
MKKNGTTNSN GGNLGFEADL FKAADKLRGN MEPSDYKHVA LGLIFLKYIS DAFEARHQAL 
LAEDPRAAED RDEYLADNVF WVPKEARWSH LRANARRPEI GLLIDEAMRA IEKENESLKG
VLPKDYARPA LNKVMLGELI DLISGIALGE EGDRSKDILG RVYEYFLGQF AGAEGKRGGE
FYTPRSVVRV LVEMLEPYHG RVYDPCCGSG GMFVQSEKFV QEHGGRIGDI AIYGQESNYT
TWRLCKMNLA VRGIDADIRW NNEGSFHKDE LRDLKADFIL ANPPFNISDW GGERLREDVR
WSFGVPPVGN ANYAWLQHIH HHLAPNGTAG VVLANGSMSS NQSGEGEIRK AMVEADVVDC
MVALPGQLFY STQIPACLWF LARNKNPGKG LRDRRGQVLF IDARKLGVLV DRTRRELTDA
EIQKIADTYH AWRGEPDAGE YQDVPGFCKS ATLEEIRKHG FVLTPGRYVG AAQQEDDGEP
FEEKMARLAA QWREQRAAAA KLDAAIEANL KELGYGG