Gene Dgeo_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2012 
Symbol 
ID4058475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2119917 
End bp2123057 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content64% 
IMG OID641231050 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_605475 
Protein GI94986111 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.599838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTTC TGTCGGAAGC CGAGGTCGAA AACGCCCTTC TGGATCAGCT GCGCGCGCTC 
GGCTACAGCA TCGAACGCGA GGAGGACATC GGCCCCGACG GCCACCGGCC GGAGCGCGAG
AGCCACGACG AGGTGGTGCT GCGCAAGCGG TTCGAGGACG CCGTTGCCCG CCTGAACCCC
GGGGTGCCAC TGGAGGCGCG TCATGATGCC GTGCGGCGCG TGACGCAGTC CGAGCTGCCA
TCGTTGCTTG CAGAAAACCG CCGCCTGCAC AAACTGCTGA CCGAAGGCGT GGATGTGGAG
TACTACGCGG ACGATGGCGT GCTCACCGCC GGCAAGGCGC GGCTGATCGA CTTCGATGAC
CCGGCCAACA ACGACTGGCT GGCGGTGAAC CAGTTCGTCG TCATCAACGG GCAGTACCAG
CGGCGACCCG ATGTCGTGGT GTTCGTGAAC GGCTTGCCGC TTGCGGTGAT CGAGCTCAAG
GCGCCGGGCA ACGACCAAGC CACGCTCACG GGCGCGTTCA ACCAGTTGCA GACCTACAAG
GGGGAGATCA CCCAGCTTTT TCGCACCAAC GCACTGCTGG TCACGTCGGA CGGCATTTCC
GCTCGGGTGG GGTCGCTGTC GGCCGACTTC GAGCGCTTCA TGCCGTGGCG CACCACCGAC
GGTCGGGAGG TGGCGCCCAA GGGAGCGCCG GAGCTGGAGA CGCTGATCGA AGGCGTGTTC
GAACATCGCC GCTTGCTCGA TCTGTTGCGT CACTTCACGG TCTTCGGCGA AACAGGCGCT
GGGCTCATCA AGATCATCGC GGGCTACCAT CAGTTCCACG CGGTACGACA TGCGGTCGAG
CGCACGGTGG CCGCATCCTC TGCCGGGGGA GACAGAAAGG CCGGGGTGAT CTGGCATACC
CAGGGCTCGG GCAAGAGCCT GTTGATGGCG TTCTACGCAG GTCTTCTCGT TAGACACCCG
GCGCTGGAAA ACCCGACCCT GGTCGTGCTG ACCGATCGCA ACGACCTGGA CGATCAGCTC
TTCGCCACCT TTTCGATGTG CCGCGACCTG ATCCGGCAGA CGCCGGTGCA AGCAGAGGGA
CGCGAGCACT TGAAAACGCT GCTGGACCGG GCCTCGGGCG GGGTGATCTT CACGACGCTG
CAAAAGTTTG GCGAGATCGA CGGGCCACTG ACCACCCGGC GCAACGTGGT GGTCATCGCC
GACGAGGCGC ACCGCAGCCA GTACGGCTTC AAGGCCAAGG TGGATGCCAA GACGGGCGAG
ATCTCCTACG GCTTCGCCAA GTACCTGCGA GACGCGCTGC CGAACGCCTC GTTCATCGGC
TTCACCGGTA CGCCCATCGA GGCAGGCGAC GTGAACACCC CGGCGGTGTT CGGCCACTAC
ATCGACATTT ACGACATCAG CCGCGCGGTG GAAGACGGCG CGACGGTGCC CATCTACTAC
GAATCGCGGT TGGCGCGCAT TGAACTCGAC GAGGACGAAA AGCCGAAGAT CGACGCCGAG
ATCGAGGAGA TTCTGGAAGA CGAGGAAGAA CCCGCCCGCG AGCGCGCCAA GCAGAAGTGG
GCGACGGTGG AGGCGCTCGT TGGCGCGGAC AAGCGCCTGC GACTGATCGC CCAGGATATC
GTGCAGCACT TCGAGGCTCG CGTAGCCGCG CTGGACGGCA AGGCGATGAT CGTCTGCATG
AGCCGGCGCA TCTGCGTCAA GCTCTACAAC GAGATCGTGA AGCTGCGTCC CGGATGGCAC
AGCGATGACG ACAACGCCGG GGCCGTCAAG ATCGTGATGA CCGGGGCGGC CTCCGATCCG
CCCGAGTGGC AGAAGCATAT CGGCAACAAG GCACGGCGCG ATCTGTTGGC CCGCCGCGCC
CGCGACCCCA AAGACCCCTT GAAGCTCGTC ATCGTGCGCG ACATGTGGCT GACGGGCTTC
GATGCGCCGT GCATGCACAC CATGTATGTG GACAAGCCGA TGCGCGGCCA CGGGCTGATG
CAGGCGATTG CGCGGGTGAA CCGGGTGTTT CGCGACAAGC CCGCCGGGCT GATCGTGGAC
TACATCGGCA TTGCGCAGAA CCTCAAAAAC GCGCTTGCGC AGTACTCGCC GCGCGACCGC
GAAAACACCG GCATCGACGA AGCCGAAGCC ATCGCGGTAA TGCTGGAAAA ATACGAGGTC
GTGCGCGACA TGTTCCACGG CTTTGACTAC CGCTCGGGTC TCAACGGTTC GCCCCAGGAG
CGGCTGGCAA TGATGGCGGG GGCCATCGAG TGGATCCTGG AGAGGCAGCA GCAGTGGGCG
GCGCAGGAAA CCACCCCGGA AGGCAAGAAG GCCGCGCACC GGCGCTTTGG CGATGCGGTG
CTGGCCTTGT CCAAGGCGTA TGCCTTGGCT TCCGCCTCGG ACCCGGCGCG TGCTATCCGC
GAAGAGGTGG GGTTTTTCCA GGCGATCCGT GCCGCGCTGA TCAAGAGCAG CACGGGCTCC
GACGCAAACC CGCAAGCGCG CGAGTGGGCC ATCCAGCAGA TCGTCAGTCG CGCGGTGGTC
TCGACCGAGA TTGTCGATAT CCTAACTGCC GCGGGCATCA AGAGTCCGGA CATCTCCATT
CTGTCCGACG ACTTCCTGGC CGAAGTGCAG CAGATGGAGA AAAAGAACCT GGCGCTGGAA
GCCCTGCGCA AGCTCATCAA CGACGGCATC CGCTCACGCG CCAAGGCCAA CGTCGTGCAG
ACCCGTGCGT TTTCGCAGCG GCTGGAGGAT GCCGTTGCAC GCTACCACGC CAACGCCATC
ACCACCGCCG AGGTGCTGCA GGAGCTGATC CACTTGGCCA AAGACATCCG CGCGGCGCGC
CAGCGTGGCG AAGAGTCTGG ATTGTCCGAC GAGGAGATTG CCTTCTACGA CGCCCTGGCC
GAGAACGAAA GCGCGGTTCA GGTCATGGGG GATGAGAAGC TGCGCGTGAT TGCCCACGAG
CTGCTGGTGA ACCTGCGCGA AAACGTCTCC GTGGACTGGG CCCACCGTGA ATCGGCCCGC
GCTCGCCTGC GCGTGCTGGT CAAGCGCATC CTGCGCAAGT ACGGTTACCC GCCTGATTTG
CAGGACGCGG CGGTGCAGAC GGTGCTGCAG CAGGCCGAGG CACTGTCGGC GGTGTGGAGT
CTGGCTCGTA ACTCTGGGTA G
 
Protein sequence
MAFLSEAEVE NALLDQLRAL GYSIEREEDI GPDGHRPERE SHDEVVLRKR FEDAVARLNP 
GVPLEARHDA VRRVTQSELP SLLAENRRLH KLLTEGVDVE YYADDGVLTA GKARLIDFDD
PANNDWLAVN QFVVINGQYQ RRPDVVVFVN GLPLAVIELK APGNDQATLT GAFNQLQTYK
GEITQLFRTN ALLVTSDGIS ARVGSLSADF ERFMPWRTTD GREVAPKGAP ELETLIEGVF
EHRRLLDLLR HFTVFGETGA GLIKIIAGYH QFHAVRHAVE RTVAASSAGG DRKAGVIWHT
QGSGKSLLMA FYAGLLVRHP ALENPTLVVL TDRNDLDDQL FATFSMCRDL IRQTPVQAEG
REHLKTLLDR ASGGVIFTTL QKFGEIDGPL TTRRNVVVIA DEAHRSQYGF KAKVDAKTGE
ISYGFAKYLR DALPNASFIG FTGTPIEAGD VNTPAVFGHY IDIYDISRAV EDGATVPIYY
ESRLARIELD EDEKPKIDAE IEEILEDEEE PARERAKQKW ATVEALVGAD KRLRLIAQDI
VQHFEARVAA LDGKAMIVCM SRRICVKLYN EIVKLRPGWH SDDDNAGAVK IVMTGAASDP
PEWQKHIGNK ARRDLLARRA RDPKDPLKLV IVRDMWLTGF DAPCMHTMYV DKPMRGHGLM
QAIARVNRVF RDKPAGLIVD YIGIAQNLKN ALAQYSPRDR ENTGIDEAEA IAVMLEKYEV
VRDMFHGFDY RSGLNGSPQE RLAMMAGAIE WILERQQQWA AQETTPEGKK AAHRRFGDAV
LALSKAYALA SASDPARAIR EEVGFFQAIR AALIKSSTGS DANPQAREWA IQQIVSRAVV
STEIVDILTA AGIKSPDISI LSDDFLAEVQ QMEKKNLALE ALRKLINDGI RSRAKANVVQ
TRAFSQRLED AVARYHANAI TTAEVLQELI HLAKDIRAAR QRGEESGLSD EEIAFYDALA
ENESAVQVMG DEKLRVIAHE LLVNLRENVS VDWAHRESAR ARLRVLVKRI LRKYGYPPDL
QDAAVQTVLQ QAEALSAVWS LARNSG