Gene Information |
Locus tag | Dgeo_3110 |
Symbol | |
ID | 5687573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_009939 |
Strand | + |
Start bp | 200767 |
End bp | 203709 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641262573 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001527847 |
Protein GI | 158421620 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
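The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Dgeo_3110).
start_bp, end_bp = 200767, 203709

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2943, matching "Gene Length"
print(protein_length_aa(length))  # 980, matching "Protein Length"
```

Under these assumptions the reported gene length (2943 bp) and protein length (980 aa) agree with the start and end coordinates: 981 codons, the last of which is the TGA stop that ends the gene sequence.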
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGC TCAAGATCAG CGAAGCCGGC ACCGTGCAAT TCCCGATGGT GGAACACGCG GTGGAGATCG GCTGGACTTC AATCACGCCG GAGGACGCAC GCACCAAGCG CGGCGGCGAG GCGGGCACCT TCTTCCGCGA CGTGCTGGAA GCCAAGCTCG CCGCGTTCAA CCCTTGGATG TCCGCCAACG CGGTGCGCTC CGTGGTGGAA ACCCTGGACG CGCTGCCGGC CAGCATCGAG GGCAATCGCG AGCGCCTGGC CTGGCTGCGA GGGGAACGCT CCTGGTTCGA CGAACAGGAA AAGCGCCATC GACGCGTCCA CCTCATCGAC TTCGAGCACG TGACGGACAA CGCCTTTCAC GTCACCTGGG AATGGAAGAT CAAGCCGCCC GCGCGCAAGG GCAACCGGGC CGACGTGATG TTCCTGGTCA ACGGCGTGCC GGTGTGCATC GTCGAGCACA AGAACCCGAA AGACGGCGAC GCTATCGAGC GCGCCGTCAA GCAACTGCGC CGCTACGAGC TGGAAACGCC GGAGCTCTTG GCATGCCCGC AACTCTTCAA CGTCACCCAC CTGCTCGACT ATTGGTATGG CGTGACCTGG AACGCCAACC GGCGCGACAT GGCGCGCTGG AAGCAGGCAC CGGAGGAAAC CTACCGCTTT GCGGTGCAAT CCTTTTTCGA GCCGACCGAC TTCCTGCGCA CGCTGCGGCA CTGGATCTTG TTCTACGTGC AGGACGGCGA GACCCGCAAG TCGGTGCTGC GCCAGCACCA GCGGCGCGCC ATCGACGCCA TCCTGAACCG CTGTGCCGAC CCAACCAAGA CAAGGGGCCT CATCTGGCAC ACCCAGGGCT CGGGTAAGAC CTTTACGCTG CTGACCGCCG CTCGCCTGAT CCTGGAGGAC AAGGCGCGCT TCGCCAACGC GACGGTGATT CTGGTGGTGG ACCGCACCGA GCTGGAAGGC CAGTTGAAGG GCTGGGTGGA GCGCCTGCTG GGCGAGATGC AGAGCCAGGA CATCGCGGTC AGGCGCGCCA ACAACAAGGC CGAACTTCAG TCCCTGCTGG ATGCCGACTT CCGCGGCCTG ATCCTCTCGA TGATCCATAA GTTCGAGGCC ATCCGCAAAG ACAGCGTTCT GCGCGACAAC GTCTACGTAT TCATCGACGA AGCGCACCGA TCGGTCGCCA AGGACCTCGG CACCTACCTG ATGGCGGCCG TACCCAAGGC CACAATCATC GGCTTCACCG GCACACCCAT CGCGCGTACG GCGCAAGGCG AAGGTACGTT CAAGATCTTC GGCACGCAGG ACGAGCTTGG GTATCTCGAC AAGTACTCCA TCGCCGAGAG CATCGCCGAC GAGACGACCC TGCCGATCAA ACACGTGATG GCGCCCAGCG AGATGACGGT GCCTGCCGAA CGGCTGGACA AGGAGTTCTT CGCGCTGGCC GAGAGCGAAG GCATGACCGA TGTCGAGGAA CTGAACAAGG TGCTCGACCG CGCGGTGGGC TTGCGCACCT TCTTGACGGC GGACGACCGC ATCGAGAAGG TGTCGGCCTT CATCGCCGAG CACTTCAAGG AGAACGTGCT GCCTCTAGGC TACAAGGCCT TCGTGGTGGC AGTGAACCGC GAGGCCTGCG CTAAGTATAA GAAGGCGCTG GACAAGCTGC TGCCTCCTGA GTGGACCGCG CCGGTCTACA CAGAGAACTC CGCCGATGTG GTGGACCGAC CGCTGGTGGC CGAGTTGCAG CTGTCGGACG AACAGGAAGA ACAAGTCCGC CTGCTGTTCA AGAAGCCTGC CGAGAACCCG 
AAGATCCTGA TTGTCACCGA CAAGCTGCTC ACCGGCTACG ACGCGCCGCC GCTTTACTGC CTGTACCTCG ACAAGCCGAT GCGCGACCAC GTGCTGCTGC AGTCGATTGC GCGTGTGAAC CGCCCTTATG TAGATGCCAA CGGCGTGCAG AAGCGGGTGG GCCTCGTGGT GGACTTCGTC GGCGTGCTGC GCGAGCTGAA GAAGGCGCTG CAATTCGATT CCAGCGACGT CAGCGGTGTG ATCGAGGATT TGGATGTGCT GCTGCAGGAC TGCTTGCAAC GCATCGAGCA GGCCAAAAAG GACTACCTCG AGACGGACGC CAGCGGTACG CCCGACGAGC GGCTGGAGCG TCTGGTGTTC GGCCGCTTCC TGACGCCTGA GGCGCGCAAG ACCTTCTTCG AGCACTACAA GGAGATCGAG GCGCTGTGGG AAATCCTCTC GCCCGACCCT CAGCTCCGTG ACCACATTGC GACCTACAAG CAGCTTAGTC AGCTCTATGC GGCCGTGCGC AATGCCTACG CCGAAAAAGT TGGGTTTGTG GCTGACCTAG CCTACAAGAC GCGGCGACTG ATCGAGGAAA GCGCGGAGCA ACATGGTCTT GGGCGATTGA CTAAGACTGT GACCTTTGAT GTGGCAACCT TGAAGTCGCT GCGCGGTGAG AAAGGTTCCG ACGAGGGCAA GGTGTTCAAC CTGGTGCGCG GGCTGCAGCA CGAGATCGAC GAGGACCCTG TGGCAGCGCC GGTGCTGCAA CCGCTGAAAG ATCGTGCCGA GCGCATCCTG AAGGATCTGG AAGAGCGCAA GACGACCGGT CTGGCGGCGA TGGACCAACT GGCGGCGTTG GCGGCGGAGA AGGAAGCGGC CATGAAGGCG GCGCGCGACA GCGGCCTTTC GCCCCGCGCC TTTGCCGTCG CCTGGGCGCT GCGTGAGGAC GCGGCCATCA AGGCCGCGGG CATCGACCTC ATGACGTTGG CCAAGGACGC CGAAGACTTG CTCGGGCGTT TCCCGAATGC CTCGGTCAAC ACCGATGAAC AGCGACGGCT GCGGGCCTCG CTCTACAAGC CCTTGCTGGC CCTGGCCCCG GACGAGCGGG CACGGATCGT CGATCTGGTG GTGCGGCAGC TGCTCACGGA GGGCAGCGAA TGA
|
Protein sequence | MSTLKISEAG TVQFPMVEHA VEIGWTSITP EDARTKRGGE AGTFFRDVLE AKLAAFNPWM SANAVRSVVE TLDALPASIE GNRERLAWLR GERSWFDEQE KRHRRVHLID FEHVTDNAFH VTWEWKIKPP ARKGNRADVM FLVNGVPVCI VEHKNPKDGD AIERAVKQLR RYELETPELL ACPQLFNVTH LLDYWYGVTW NANRRDMARW KQAPEETYRF AVQSFFEPTD FLRTLRHWIL FYVQDGETRK SVLRQHQRRA IDAILNRCAD PTKTRGLIWH TQGSGKTFTL LTAARLILED KARFANATVI LVVDRTELEG QLKGWVERLL GEMQSQDIAV RRANNKAELQ SLLDADFRGL ILSMIHKFEA IRKDSVLRDN VYVFIDEAHR SVAKDLGTYL MAAVPKATII GFTGTPIART AQGEGTFKIF GTQDELGYLD KYSIAESIAD ETTLPIKHVM APSEMTVPAE RLDKEFFALA ESEGMTDVEE LNKVLDRAVG LRTFLTADDR IEKVSAFIAE HFKENVLPLG YKAFVVAVNR EACAKYKKAL DKLLPPEWTA PVYTENSADV VDRPLVAELQ LSDEQEEQVR LLFKKPAENP KILIVTDKLL TGYDAPPLYC LYLDKPMRDH VLLQSIARVN RPYVDANGVQ KRVGLVVDFV GVLRELKKAL QFDSSDVSGV IEDLDVLLQD CLQRIEQAKK DYLETDASGT PDERLERLVF GRFLTPEARK TFFEHYKEIE ALWEILSPDP QLRDHIATYK QLSQLYAAVR NAYAEKVGFV ADLAYKTRRL IEESAEQHGL GRLTKTVTFD VATLKSLRGE KGSDEGKVFN LVRGLQHEID EDPVAAPVLQ PLKDRAERIL KDLEERKTTG LAAMDQLAAL AAEKEAAMKA ARDSGLSPRA FAVAWALRED AAIKAAGIDL MTLAKDAEDL LGRFPNASVN TDEQRRLRAS LYKPLLALAP DERARIVDLV VRQLLTEGSE