Gene Dgeo_0312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0312 
Symbol 
ID4058036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp308870 
End bp311197 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content67% 
IMG OID641229315 
Producthypothetical protein 
Protein accessionYP_603784 
Protein GI94984420 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0887413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00223033 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAAACGGT GGATTCGGGC GCTGCTGCCC GCGCTGGGGC TGCTGGGCCT GGGAACGGTG 
ACCGCCCAGA GCGTCCCCCT GGCCCAGCGG GTCACCGAGA CGGGCACCAT CAACTACGTC
ACGACCGGGG CGTCGTTCCG CGCCAACAGC ACCAACAGCA CGACCGGGGC CTCCTGCCTG
CTCACCTCCA GCACGGCGGC GAGTGTTGCC GGCTACAGCG TGGCGAACGG GGCCGTCTCT
CCGGCCCGTA CCGCTGACCA GACCGTGCCC ACCGGGGCGA ACGTCACCAA GGCGTACCTC
TACTGGACCG CCTCGGCGGG GGAGACGGGC TACAATGGTG GCGCGCCCCT CATTGACAAC
TCAGTGAAGT TCTACGTGAA CGGAACGAAC CCCCCTGCAA CCAACAACGT GACGGCGAGC
CGCACCTGGT CGGGGAGTGT GGCCCCCAGC GGCGGAAATG CTGCCGTTAC CCGGACCCAG
TTTGGCATGG GGGCCTTTGC CGATGTCACC AGCATCGTGC GCTCTAACCC TAATGCCCAG
TTTCGGATGG ACGACCTCAC CGTCTTCAAC GCGAGCGGCT CGCAGACCTG CAACACCTCC
ACGATGTACG GCAACTGGGG CCTGTACATC ATCTATAGCC TGCCGAGCGA GAGCAACAAA
ACGCTCGCCC TCTTCGACGG CCTCCAGTAC ATCGGCGGGA CCGGGGGCTA CGCTTCGGCG
GCCAGCGCCA GCGTGACCCT CTCCGGGCTG CGGGTGCCCA ACGCGGCCCC CGGCACTGAG
AAGATCGCCA AGACCACCCT GCTCGTCTCG GAGGGCGATG CAAGCACCGG AGCCTCCAAC
GACTCGCTCA CCTTGAACAC CAACCTTGAC GCGGCATTCG GGGTCAGCAA CTCACTCAAC
CCTGCCAACG ATGTGTTCAA CGGCAGCATC ACCGTGGGGC CGACCGATGG CGGCACGGCA
ACGGGCTACA CGTCAAGTAC CAGCCCCGGC GTGGTGGGGG GGCTGGACCT CGATACCTTC
GACCTCTCCA ACCGGGTGTC GAGTGGTACG ACCTCGCTCA CCGCGACCGT GGACTCGGCC
TCCGGCGAGC TGCTGATGCT CTACAGCGCC GTCCTGATGG CGACGACCAC GACCGCTGAC
CTCAGCGTCA CCAAGTCGGC TCCGGCCACC CAACAGGGGG CGGGCACCCT GACCTACACC
ATCACCGCGA GCAACGCCGG ACCCCACGAG GCGTACAACG TCGTCGTGAG CGACCCGCTG
CCCGCCGGGG TGACTTTTGT GAGCGCAAGC GGGGGTGGCA GTTACGACGC AGCGACCCGC
CGGGTAACCT GGACGATCGG AAAGTTCCTT GCCAACACCT CTCAGACCTA CACAGTGGCT
GTCACCGTGC CCAATGCGGC GGCGACCTAT CCGAACACCG TCAGCGTGAG CAGCGGCAGC
TTCGACCCGG TGTCGGCGAA CAATAGCGCG ACCGCGAGCA CCGTGGTGAC CCCCACCCCC
GACCTCACCC TCACCAAGAC CGGCCCGCAG TACGCCCGCC CCAGCACTGT GGCGAACACC
GATCCCACTG CCGGACCTGT GGTGGCAGCC CAGGACAGCT TCATCTCCTA CACGCTGACG
GTGAACACGG CGAACGCGAG CGCCACAGGG ACGACCACCG TGACCGACAC CCTTCCCGCG
GGCCTAAGCT GGGCGGGTGG AACGTCCAAC TACACGGCGG GGCCGGGCAC CTGGACCTGC
GGCGTTTCCG GACAGACCAT CACCTGCACG ACGCCCGGCC CCATCGTGGT AGGCACCCCT
CAGACCATCA CCCTGCAGAA CGTGCGGGTG GGTCCGGGGA CGGCGGCGGG CGCGACCTTT
ACCAATACGG CGACCGTGAG CAACCCGAAC GAGGCGGCAG CAGACAACAA TGCGGGCAAT
ACCGGGACGG CGACCACCCG GCTGATCCTC ACTCAGGTGA GCAAGCAGGT GCGGACGCTG
CCGGGAGGGA CGTTCGGCAC CAGCGCTTCG GTGCGGCCCG GTGACCTGCT GGAATACTGT
ATCGACACCC GCAACTTGGG TGGTGCCGAC CTCGCCAACT ATGTCCTCAG CGATACGTTG
AATCGCAATG GACGCTCGCT GACCAGCGTC ACCACCGACC CCGCCTATGG CGGGAAGGCC
ATCAAGTGGA CGCGTACCCC GGCCAGCGGA ACGGCGACCT CCTCCAACGC GACGGCTGCG
GCAGGGGATG ATGCCGGAAC CCTGACGGAT ACCAGCTTGT CCGTCAATCT GGGGACGCTG
GCTGCCGGGG AAACGGTGCG GACCTGCTTC CAGGTTCAGG TCAGGTAA
 
Protein sequence
MKRWIRALLP ALGLLGLGTV TAQSVPLAQR VTETGTINYV TTGASFRANS TNSTTGASCL 
LTSSTAASVA GYSVANGAVS PARTADQTVP TGANVTKAYL YWTASAGETG YNGGAPLIDN
SVKFYVNGTN PPATNNVTAS RTWSGSVAPS GGNAAVTRTQ FGMGAFADVT SIVRSNPNAQ
FRMDDLTVFN ASGSQTCNTS TMYGNWGLYI IYSLPSESNK TLALFDGLQY IGGTGGYASA
ASASVTLSGL RVPNAAPGTE KIAKTTLLVS EGDASTGASN DSLTLNTNLD AAFGVSNSLN
PANDVFNGSI TVGPTDGGTA TGYTSSTSPG VVGGLDLDTF DLSNRVSSGT TSLTATVDSA
SGELLMLYSA VLMATTTTAD LSVTKSAPAT QQGAGTLTYT ITASNAGPHE AYNVVVSDPL
PAGVTFVSAS GGGSYDAATR RVTWTIGKFL ANTSQTYTVA VTVPNAAATY PNTVSVSSGS
FDPVSANNSA TASTVVTPTP DLTLTKTGPQ YARPSTVANT DPTAGPVVAA QDSFISYTLT
VNTANASATG TTTVTDTLPA GLSWAGGTSN YTAGPGTWTC GVSGQTITCT TPGPIVVGTP
QTITLQNVRV GPGTAAGATF TNTATVSNPN EAAADNNAGN TGTATTRLIL TQVSKQVRTL
PGGTFGTSAS VRPGDLLEYC IDTRNLGGAD LANYVLSDTL NRNGRSLTSV TTDPAYGGKA
IKWTRTPASG TATSSNATAA AGDDAGTLTD TSLSVNLGTL AAGETVRTCF QVQVR