Gene Dgeo_3110 details


Gene Information

Locus tag: Dgeo_3110
Symbol:
ID: 5687573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_009939
Strand:
Start bp: 200767
End bp: 203709
Gene Length: 2943 bp
Protein Length: 980 aa
Translation table: 11
GC content: 63%
IMG OID: 641262573
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_001527847
Protein GI: 158421620
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
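
The two length fields follow from the start and end coordinates listed above. The Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, which this record does not state but which matches the reported 2943 bp. It also assumes the last codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(200767, 203709)
print(length, protein_length_aa(length))  # prints "2943 980"
```

Both values agree with the Gene Length and Protein Length fields of this record.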


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGC TCAAGATCAG CGAAGCCGGC ACCGTGCAAT TCCCGATGGT GGAACACGCG 
GTGGAGATCG GCTGGACTTC AATCACGCCG GAGGACGCAC GCACCAAGCG CGGCGGCGAG
GCGGGCACCT TCTTCCGCGA CGTGCTGGAA GCCAAGCTCG CCGCGTTCAA CCCTTGGATG
TCCGCCAACG CGGTGCGCTC CGTGGTGGAA ACCCTGGACG CGCTGCCGGC CAGCATCGAG
GGCAATCGCG AGCGCCTGGC CTGGCTGCGA GGGGAACGCT CCTGGTTCGA CGAACAGGAA
AAGCGCCATC GACGCGTCCA CCTCATCGAC TTCGAGCACG TGACGGACAA CGCCTTTCAC
GTCACCTGGG AATGGAAGAT CAAGCCGCCC GCGCGCAAGG GCAACCGGGC CGACGTGATG
TTCCTGGTCA ACGGCGTGCC GGTGTGCATC GTCGAGCACA AGAACCCGAA AGACGGCGAC
GCTATCGAGC GCGCCGTCAA GCAACTGCGC CGCTACGAGC TGGAAACGCC GGAGCTCTTG
GCATGCCCGC AACTCTTCAA CGTCACCCAC CTGCTCGACT ATTGGTATGG CGTGACCTGG
AACGCCAACC GGCGCGACAT GGCGCGCTGG AAGCAGGCAC CGGAGGAAAC CTACCGCTTT
GCGGTGCAAT CCTTTTTCGA GCCGACCGAC TTCCTGCGCA CGCTGCGGCA CTGGATCTTG
TTCTACGTGC AGGACGGCGA GACCCGCAAG TCGGTGCTGC GCCAGCACCA GCGGCGCGCC
ATCGACGCCA TCCTGAACCG CTGTGCCGAC CCAACCAAGA CAAGGGGCCT CATCTGGCAC
ACCCAGGGCT CGGGTAAGAC CTTTACGCTG CTGACCGCCG CTCGCCTGAT CCTGGAGGAC
AAGGCGCGCT TCGCCAACGC GACGGTGATT CTGGTGGTGG ACCGCACCGA GCTGGAAGGC
CAGTTGAAGG GCTGGGTGGA GCGCCTGCTG GGCGAGATGC AGAGCCAGGA CATCGCGGTC
AGGCGCGCCA ACAACAAGGC CGAACTTCAG TCCCTGCTGG ATGCCGACTT CCGCGGCCTG
ATCCTCTCGA TGATCCATAA GTTCGAGGCC ATCCGCAAAG ACAGCGTTCT GCGCGACAAC
GTCTACGTAT TCATCGACGA AGCGCACCGA TCGGTCGCCA AGGACCTCGG CACCTACCTG
ATGGCGGCCG TACCCAAGGC CACAATCATC GGCTTCACCG GCACACCCAT CGCGCGTACG
GCGCAAGGCG AAGGTACGTT CAAGATCTTC GGCACGCAGG ACGAGCTTGG GTATCTCGAC
AAGTACTCCA TCGCCGAGAG CATCGCCGAC GAGACGACCC TGCCGATCAA ACACGTGATG
GCGCCCAGCG AGATGACGGT GCCTGCCGAA CGGCTGGACA AGGAGTTCTT CGCGCTGGCC
GAGAGCGAAG GCATGACCGA TGTCGAGGAA CTGAACAAGG TGCTCGACCG CGCGGTGGGC
TTGCGCACCT TCTTGACGGC GGACGACCGC ATCGAGAAGG TGTCGGCCTT CATCGCCGAG
CACTTCAAGG AGAACGTGCT GCCTCTAGGC TACAAGGCCT TCGTGGTGGC AGTGAACCGC
GAGGCCTGCG CTAAGTATAA GAAGGCGCTG GACAAGCTGC TGCCTCCTGA GTGGACCGCG
CCGGTCTACA CAGAGAACTC CGCCGATGTG GTGGACCGAC CGCTGGTGGC CGAGTTGCAG
CTGTCGGACG AACAGGAAGA ACAAGTCCGC CTGCTGTTCA AGAAGCCTGC CGAGAACCCG
AAGATCCTGA TTGTCACCGA CAAGCTGCTC ACCGGCTACG ACGCGCCGCC GCTTTACTGC
CTGTACCTCG ACAAGCCGAT GCGCGACCAC GTGCTGCTGC AGTCGATTGC GCGTGTGAAC
CGCCCTTATG TAGATGCCAA CGGCGTGCAG AAGCGGGTGG GCCTCGTGGT GGACTTCGTC
GGCGTGCTGC GCGAGCTGAA GAAGGCGCTG CAATTCGATT CCAGCGACGT CAGCGGTGTG
ATCGAGGATT TGGATGTGCT GCTGCAGGAC TGCTTGCAAC GCATCGAGCA GGCCAAAAAG
GACTACCTCG AGACGGACGC CAGCGGTACG CCCGACGAGC GGCTGGAGCG TCTGGTGTTC
GGCCGCTTCC TGACGCCTGA GGCGCGCAAG ACCTTCTTCG AGCACTACAA GGAGATCGAG
GCGCTGTGGG AAATCCTCTC GCCCGACCCT CAGCTCCGTG ACCACATTGC GACCTACAAG
CAGCTTAGTC AGCTCTATGC GGCCGTGCGC AATGCCTACG CCGAAAAAGT TGGGTTTGTG
GCTGACCTAG CCTACAAGAC GCGGCGACTG ATCGAGGAAA GCGCGGAGCA ACATGGTCTT
GGGCGATTGA CTAAGACTGT GACCTTTGAT GTGGCAACCT TGAAGTCGCT GCGCGGTGAG
AAAGGTTCCG ACGAGGGCAA GGTGTTCAAC CTGGTGCGCG GGCTGCAGCA CGAGATCGAC
GAGGACCCTG TGGCAGCGCC GGTGCTGCAA CCGCTGAAAG ATCGTGCCGA GCGCATCCTG
AAGGATCTGG AAGAGCGCAA GACGACCGGT CTGGCGGCGA TGGACCAACT GGCGGCGTTG
GCGGCGGAGA AGGAAGCGGC CATGAAGGCG GCGCGCGACA GCGGCCTTTC GCCCCGCGCC
TTTGCCGTCG CCTGGGCGCT GCGTGAGGAC GCGGCCATCA AGGCCGCGGG CATCGACCTC
ATGACGTTGG CCAAGGACGC CGAAGACTTG CTCGGGCGTT TCCCGAATGC CTCGGTCAAC
ACCGATGAAC AGCGACGGCT GCGGGCCTCG CTCTACAAGC CCTTGCTGGC CCTGGCCCCG
GACGAGCGGG CACGGATCGT CGATCTGGTG GTGCGGCAGC TGCTCACGGA GGGCAGCGAA
TGA
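
The gene sequence above is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The Python sketch below strips the spaces and line breaks and computes the GC fraction. The helper names are hypothetical. For brevity only the first displayed line of the gene is used, so the result describes those 60 bases and not the 63% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "ATGAGCACGC TCAAGATCAG CGAAGCCGGC ACCGTGCAAT TCCCGATGGT GGAACACGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))  # prints "60 0.6"
```

Pasting the full block of sequence lines into `clean_sequence` works the same way, because `str.split()` with no arguments splits on any run of whitespace, including newlines.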
 
Protein sequence
MSTLKISEAG TVQFPMVEHA VEIGWTSITP EDARTKRGGE AGTFFRDVLE AKLAAFNPWM 
SANAVRSVVE TLDALPASIE GNRERLAWLR GERSWFDEQE KRHRRVHLID FEHVTDNAFH
VTWEWKIKPP ARKGNRADVM FLVNGVPVCI VEHKNPKDGD AIERAVKQLR RYELETPELL
ACPQLFNVTH LLDYWYGVTW NANRRDMARW KQAPEETYRF AVQSFFEPTD FLRTLRHWIL
FYVQDGETRK SVLRQHQRRA IDAILNRCAD PTKTRGLIWH TQGSGKTFTL LTAARLILED
KARFANATVI LVVDRTELEG QLKGWVERLL GEMQSQDIAV RRANNKAELQ SLLDADFRGL
ILSMIHKFEA IRKDSVLRDN VYVFIDEAHR SVAKDLGTYL MAAVPKATII GFTGTPIART
AQGEGTFKIF GTQDELGYLD KYSIAESIAD ETTLPIKHVM APSEMTVPAE RLDKEFFALA
ESEGMTDVEE LNKVLDRAVG LRTFLTADDR IEKVSAFIAE HFKENVLPLG YKAFVVAVNR
EACAKYKKAL DKLLPPEWTA PVYTENSADV VDRPLVAELQ LSDEQEEQVR LLFKKPAENP
KILIVTDKLL TGYDAPPLYC LYLDKPMRDH VLLQSIARVN RPYVDANGVQ KRVGLVVDFV
GVLRELKKAL QFDSSDVSGV IEDLDVLLQD CLQRIEQAKK DYLETDASGT PDERLERLVF
GRFLTPEARK TFFEHYKEIE ALWEILSPDP QLRDHIATYK QLSQLYAAVR NAYAEKVGFV
ADLAYKTRRL IEESAEQHGL GRLTKTVTFD VATLKSLRGE KGSDEGKVFN LVRGLQHEID
EDPVAAPVLQ PLKDRAERIL KDLEERKTTG LAAMDQLAAL AAEKEAAMKA ARDSGLSPRA
FAVAWALRED AAIKAAGIDL MTLAKDAEDL LGRFPNASVN TDEQRRLRAS LYKPLLALAP
DERARIVDLV VRQLLTEGSE