Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2012 |
Symbol | |
ID | 4058475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 2119917 |
End bp | 2123057 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641231050 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_605475 |
Protein GI | 94986111 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.599838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTTC TGTCGGAAGC CGAGGTCGAA AACGCCCTTC TGGATCAGCT GCGCGCGCTC GGCTACAGCA TCGAACGCGA GGAGGACATC GGCCCCGACG GCCACCGGCC GGAGCGCGAG AGCCACGACG AGGTGGTGCT GCGCAAGCGG TTCGAGGACG CCGTTGCCCG CCTGAACCCC GGGGTGCCAC TGGAGGCGCG TCATGATGCC GTGCGGCGCG TGACGCAGTC CGAGCTGCCA TCGTTGCTTG CAGAAAACCG CCGCCTGCAC AAACTGCTGA CCGAAGGCGT GGATGTGGAG TACTACGCGG ACGATGGCGT GCTCACCGCC GGCAAGGCGC GGCTGATCGA CTTCGATGAC CCGGCCAACA ACGACTGGCT GGCGGTGAAC CAGTTCGTCG TCATCAACGG GCAGTACCAG CGGCGACCCG ATGTCGTGGT GTTCGTGAAC GGCTTGCCGC TTGCGGTGAT CGAGCTCAAG GCGCCGGGCA ACGACCAAGC CACGCTCACG GGCGCGTTCA ACCAGTTGCA GACCTACAAG GGGGAGATCA CCCAGCTTTT TCGCACCAAC GCACTGCTGG TCACGTCGGA CGGCATTTCC GCTCGGGTGG GGTCGCTGTC GGCCGACTTC GAGCGCTTCA TGCCGTGGCG CACCACCGAC GGTCGGGAGG TGGCGCCCAA GGGAGCGCCG GAGCTGGAGA CGCTGATCGA AGGCGTGTTC GAACATCGCC GCTTGCTCGA TCTGTTGCGT CACTTCACGG TCTTCGGCGA AACAGGCGCT GGGCTCATCA AGATCATCGC GGGCTACCAT CAGTTCCACG CGGTACGACA TGCGGTCGAG CGCACGGTGG CCGCATCCTC TGCCGGGGGA GACAGAAAGG CCGGGGTGAT CTGGCATACC CAGGGCTCGG GCAAGAGCCT GTTGATGGCG TTCTACGCAG GTCTTCTCGT TAGACACCCG GCGCTGGAAA ACCCGACCCT GGTCGTGCTG ACCGATCGCA ACGACCTGGA CGATCAGCTC TTCGCCACCT TTTCGATGTG CCGCGACCTG ATCCGGCAGA CGCCGGTGCA AGCAGAGGGA CGCGAGCACT TGAAAACGCT GCTGGACCGG GCCTCGGGCG GGGTGATCTT CACGACGCTG CAAAAGTTTG GCGAGATCGA CGGGCCACTG ACCACCCGGC GCAACGTGGT GGTCATCGCC GACGAGGCGC ACCGCAGCCA GTACGGCTTC AAGGCCAAGG TGGATGCCAA GACGGGCGAG ATCTCCTACG GCTTCGCCAA GTACCTGCGA GACGCGCTGC CGAACGCCTC GTTCATCGGC TTCACCGGTA CGCCCATCGA GGCAGGCGAC GTGAACACCC CGGCGGTGTT CGGCCACTAC ATCGACATTT ACGACATCAG CCGCGCGGTG GAAGACGGCG CGACGGTGCC CATCTACTAC GAATCGCGGT TGGCGCGCAT TGAACTCGAC GAGGACGAAA AGCCGAAGAT CGACGCCGAG ATCGAGGAGA TTCTGGAAGA CGAGGAAGAA CCCGCCCGCG AGCGCGCCAA GCAGAAGTGG GCGACGGTGG AGGCGCTCGT TGGCGCGGAC AAGCGCCTGC GACTGATCGC CCAGGATATC GTGCAGCACT TCGAGGCTCG CGTAGCCGCG CTGGACGGCA AGGCGATGAT CGTCTGCATG AGCCGGCGCA TCTGCGTCAA GCTCTACAAC GAGATCGTGA AGCTGCGTCC CGGATGGCAC AGCGATGACG ACAACGCCGG GGCCGTCAAG ATCGTGATGA CCGGGGCGGC CTCCGATCCG CCCGAGTGGC AGAAGCATAT CGGCAACAAG GCACGGCGCG ATCTGTTGGC CCGCCGCGCC CGCGACCCCA AAGACCCCTT GAAGCTCGTC ATCGTGCGCG ACATGTGGCT GACGGGCTTC GATGCGCCGT GCATGCACAC CATGTATGTG GACAAGCCGA TGCGCGGCCA CGGGCTGATG CAGGCGATTG CGCGGGTGAA CCGGGTGTTT CGCGACAAGC CCGCCGGGCT GATCGTGGAC TACATCGGCA TTGCGCAGAA CCTCAAAAAC GCGCTTGCGC AGTACTCGCC GCGCGACCGC GAAAACACCG GCATCGACGA AGCCGAAGCC ATCGCGGTAA TGCTGGAAAA ATACGAGGTC GTGCGCGACA TGTTCCACGG CTTTGACTAC CGCTCGGGTC TCAACGGTTC GCCCCAGGAG CGGCTGGCAA TGATGGCGGG GGCCATCGAG TGGATCCTGG AGAGGCAGCA GCAGTGGGCG GCGCAGGAAA CCACCCCGGA AGGCAAGAAG GCCGCGCACC GGCGCTTTGG CGATGCGGTG CTGGCCTTGT CCAAGGCGTA TGCCTTGGCT TCCGCCTCGG ACCCGGCGCG TGCTATCCGC GAAGAGGTGG GGTTTTTCCA GGCGATCCGT GCCGCGCTGA TCAAGAGCAG CACGGGCTCC GACGCAAACC CGCAAGCGCG CGAGTGGGCC ATCCAGCAGA TCGTCAGTCG CGCGGTGGTC TCGACCGAGA TTGTCGATAT CCTAACTGCC GCGGGCATCA AGAGTCCGGA CATCTCCATT CTGTCCGACG ACTTCCTGGC CGAAGTGCAG CAGATGGAGA AAAAGAACCT GGCGCTGGAA GCCCTGCGCA AGCTCATCAA CGACGGCATC CGCTCACGCG CCAAGGCCAA CGTCGTGCAG ACCCGTGCGT TTTCGCAGCG GCTGGAGGAT GCCGTTGCAC GCTACCACGC CAACGCCATC ACCACCGCCG AGGTGCTGCA GGAGCTGATC CACTTGGCCA AAGACATCCG CGCGGCGCGC CAGCGTGGCG AAGAGTCTGG ATTGTCCGAC GAGGAGATTG CCTTCTACGA CGCCCTGGCC GAGAACGAAA GCGCGGTTCA GGTCATGGGG GATGAGAAGC TGCGCGTGAT TGCCCACGAG CTGCTGGTGA ACCTGCGCGA AAACGTCTCC GTGGACTGGG CCCACCGTGA ATCGGCCCGC GCTCGCCTGC GCGTGCTGGT CAAGCGCATC CTGCGCAAGT ACGGTTACCC GCCTGATTTG CAGGACGCGG CGGTGCAGAC GGTGCTGCAG CAGGCCGAGG CACTGTCGGC GGTGTGGAGT CTGGCTCGTA ACTCTGGGTA G
|
Protein sequence | MAFLSEAEVE NALLDQLRAL GYSIEREEDI GPDGHRPERE SHDEVVLRKR FEDAVARLNP GVPLEARHDA VRRVTQSELP SLLAENRRLH KLLTEGVDVE YYADDGVLTA GKARLIDFDD PANNDWLAVN QFVVINGQYQ RRPDVVVFVN GLPLAVIELK APGNDQATLT GAFNQLQTYK GEITQLFRTN ALLVTSDGIS ARVGSLSADF ERFMPWRTTD GREVAPKGAP ELETLIEGVF EHRRLLDLLR HFTVFGETGA GLIKIIAGYH QFHAVRHAVE RTVAASSAGG DRKAGVIWHT QGSGKSLLMA FYAGLLVRHP ALENPTLVVL TDRNDLDDQL FATFSMCRDL IRQTPVQAEG REHLKTLLDR ASGGVIFTTL QKFGEIDGPL TTRRNVVVIA DEAHRSQYGF KAKVDAKTGE ISYGFAKYLR DALPNASFIG FTGTPIEAGD VNTPAVFGHY IDIYDISRAV EDGATVPIYY ESRLARIELD EDEKPKIDAE IEEILEDEEE PARERAKQKW ATVEALVGAD KRLRLIAQDI VQHFEARVAA LDGKAMIVCM SRRICVKLYN EIVKLRPGWH SDDDNAGAVK IVMTGAASDP PEWQKHIGNK ARRDLLARRA RDPKDPLKLV IVRDMWLTGF DAPCMHTMYV DKPMRGHGLM QAIARVNRVF RDKPAGLIVD YIGIAQNLKN ALAQYSPRDR ENTGIDEAEA IAVMLEKYEV VRDMFHGFDY RSGLNGSPQE RLAMMAGAIE WILERQQQWA AQETTPEGKK AAHRRFGDAV LALSKAYALA SASDPARAIR EEVGFFQAIR AALIKSSTGS DANPQAREWA IQQIVSRAVV STEIVDILTA AGIKSPDISI LSDDFLAEVQ QMEKKNLALE ALRKLINDGI RSRAKANVVQ TRAFSQRLED AVARYHANAI TTAEVLQELI HLAKDIRAAR QRGEESGLSD EEIAFYDALA ENESAVQVMG DEKLRVIAHE LLVNLRENVS VDWAHRESAR ARLRVLVKRI LRKYGYPPDL QDAAVQTVLQ QAEALSAVWS LARNSG
|
| |