Gene Dret_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0076 
Symbol 
ID8417880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp99320 
End bp102208 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content65% 
IMG OID645036641 
ProductDEAD/DEAH box helicase domain protein 
Protein accessionYP_003196956 
Protein GI258404214 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTT CCGCCCAGGT CGACGCCTAC CTGCGGGCCC TGAAGGACTC CGGCCCTTAC 
GGGAGTCTGG TCCGGCACCA CGCCACCCTC CCCGCCCGAG CACCGTCCTA TAGCACCCCG
CGCGAGCCGT GGCCGGCGCC GGTCCAGGAA TTATGCCGCC AAGCCGGCAT AGACCGCCTC
TTCAGCCACC AGGCCGAAGC TGTGGATCAC ATCCGCGCCG GGCGCCACAC GGCCATTGCC
ACCCCCACGG CCAGCGGCAA GAGCCTGACC TTTGCTCTGC CCATTGCCGA AGCCCTGTGC
CAGGACCCGG AGAGCAGGGC GTTGCTGCTC TATCCGCTCA AAGCCCTGGC TCAAGACCAG
CTGCGGGCCC TGCAGGCCTT CACCCAGCCG TGGCCGCTGG AACAGCAGCC CCGCGCCGCC
ATTTACGACG GGGACACGCC ACAGGCCGAG CGGGGGAGGA TCCGCGCCAA CCCGCCCCAT
TTGCTGTGCA CCAACCCGGA CATGCTCCAT CTCGGTCTGA CCCCGCACCA CCACAGCTGG
GGGGAGTTTT TCAGCCGCCT GCGGTTCGTG ATCGTGGACG AGGTCCACAC CTACCGCGGG
GTCATGGGCT CGCACATGGC CTGGGTTTTC CGGCGCTTGC GCCGCATCTG CCGCTATTGG
GGCAGCGACC CCACGTTCAT CTTTGCCTCG GCGACCATCG GCAATCCGGC CCAGTTGGCG
CAGAACCTGG CCGGCGTGCC GGTCGAGGCC GTAACCGAAA ACGGGGCCGC GCAAGGCAAA
CGGCACATGG TCTTTCTCGA CGCCCTGGAA GGTGCCCCGC AGACCGCCCT GGGCCTTCTG
CAGGCCGCCC TGGCCCGGGG ACTACGGACC ATCGTTTACA CCCAGTCACG CAAATACACT
GAACTCCTCG CCATGTGGCT GTCCCAGCGC AGCCCCCGGT TCGCGGACCG TGTCAGCGCC
TACCGCTCCG GTTTTTTGCC CGAAGAACGC CGGGAGATCG AGACCGCCCT GGCCAACGGC
TCCCTGCTCG GGGTCATTTC CACCAGTGCC CTGGAACTCG GCATCGATAT CGGCAACCTC
GATCTCTGCC TTTTGGTCGG CTACCCCGGC TCGGTCATGG CTACCTGGCA GCGGTCCGGA
CGGGTCGGAC GCGGCCGGCA GGACAGCGCC ACCATCCTCT TGGGCGGTGA GGACGCCCTC
GACCAATATT TCATGCACCA TCCACAGGCC TTTTTCGCGC TGCCGCCGGA GGCCGCGGTC
ATCAATCCCG ACAACCCGGT CATCCGCGAC CGCCATCTGG TCTGCGCCGC CGCGGATCTC
CCCTTGCAGC ACACGGAACC GCTTGTAACC GATCCCGTCC ACCGCCGGGC CGTGCACCGT
CTTGAACAGC GCGGCGAACT GCTGCAAAGC GGCAGCGGCG AGGAGTGGCT GTCGCACCGC
AAGACCCCGC ACCGTCAGGT CAGCCTCCGC GGCACCGGGC GGACCCTGCC GATCCTGGAC
AGCGATTCCC GGGAACATAT CGGCAGCGTT GACGGCTATC GCGCCCTGCG AGAGACCCAT
CCCGGAGCGG TCTATCTCCA CCGGGGCCAG ACCTATGTCG TCGACCGCCT TGACCTCGAA
GAAGGCATGG TCCTGGCCCG CAAGCAGACG GTGCAATATT TCACCCGGGT CCGGGCCCAC
AAAGAAACCA CGATTCTGGA GACGACCGAA TGGGCCCATG TCTGGGGCAG TCGCATCCAC
CGCGGCCGCC TGCGGGTCAC AGAGACGGTC ACCGGCTACG AAAAACGCCA TGTCCGCGGC
CAACGTCTGT TGCGCATGCA TGAACTCGAC CTGCCGCCCC ACGTCTTTGA AACCGAGGGG
ATCTGGCTGG AGATCCCGGA CACCGTGCGA CAGGAGCTGG AAAAACGGTA CCTCCATTTC
ATGGGCGGCA TTCACGCCGT GGAACACGCC CTGATCGGGA TTATGCCCCT GCTGGTCCTC
GCCGACCGCA ACGACCTCGG CGGCATCTCC ACGGTCGGCC ACGAGCAACT CACCACCGCC
GCTGTCTTTG TCTACGACGG CACTCCCGGC GGGGCCGGAC TGACCCGTCA GGCCTTCCAG
CGTGCCGGAG AACTTTTGGA GCACACCCAG CGGACCATCC GCGACTGCCC CTGTGAAACC
GGTTGCCCCG GCTGCGTCCA TTCGCCAAAA TGCGGCTCCG GCAACCGGCC CATCGCCAAG
GACGCGGCCT TGGCGGTGCT GCAAACCCTG CACACCAGCC AAACGCCCCG CTCCGAACCA
GTTATCGTCC AACCCGCCGA GGAGCCCGAC ATGACCGTTA CAACGGACAT CCCCGACAAC
GCGGGACTAC CCCCGCTCCA TTACACCGTC TTCGACCTCG AAACCCAGCG TTCGGCCCAG
GAGGTCGGCG GCTGGAACCG GGCCAAGGAC ATGGGCATCA GCTGCGCGGT GGTCTACGAC
TCGGCCCTGG ACACCTATGT TGAATACGAG GAAAAAGATA TCCCGCAGCT GGTGGAACAC
CTGCACAAGT GCGACCTCGT CATCGGCTTC AACATCCTGC GCTTTGATTA TCAGGTCCTC
TCCGGCTACA GCCGGGCCGA TTTCCAGGCC CTGCCTTCGC TGGACCTGCT CCGTGTGGTC
CACAAGCAAT TGGGCTATCG CCTCTCCCTG GACAAATTGG CTCAGGCGAC CCTGCAAGCC
CAAAAAACCG CCAATGGCCT CCAGGCCCTG CAATGGTGGA AGGAGGGCCG GATCCGCGAC
ATCATCGACT ACTGCCGCCA GGATGTGGCC GTAACCCGGG ATCTCTACCG CTTCGGCCGC
GACAACGGCT ACCTCTTGTT CCACAACAAG GCCAAGCAGC TCGTCCGCGT CCCGGTCCAA
TGGGAATAG
 
Protein sequence
MSSSAQVDAY LRALKDSGPY GSLVRHHATL PARAPSYSTP REPWPAPVQE LCRQAGIDRL 
FSHQAEAVDH IRAGRHTAIA TPTASGKSLT FALPIAEALC QDPESRALLL YPLKALAQDQ
LRALQAFTQP WPLEQQPRAA IYDGDTPQAE RGRIRANPPH LLCTNPDMLH LGLTPHHHSW
GEFFSRLRFV IVDEVHTYRG VMGSHMAWVF RRLRRICRYW GSDPTFIFAS ATIGNPAQLA
QNLAGVPVEA VTENGAAQGK RHMVFLDALE GAPQTALGLL QAALARGLRT IVYTQSRKYT
ELLAMWLSQR SPRFADRVSA YRSGFLPEER REIETALANG SLLGVISTSA LELGIDIGNL
DLCLLVGYPG SVMATWQRSG RVGRGRQDSA TILLGGEDAL DQYFMHHPQA FFALPPEAAV
INPDNPVIRD RHLVCAAADL PLQHTEPLVT DPVHRRAVHR LEQRGELLQS GSGEEWLSHR
KTPHRQVSLR GTGRTLPILD SDSREHIGSV DGYRALRETH PGAVYLHRGQ TYVVDRLDLE
EGMVLARKQT VQYFTRVRAH KETTILETTE WAHVWGSRIH RGRLRVTETV TGYEKRHVRG
QRLLRMHELD LPPHVFETEG IWLEIPDTVR QELEKRYLHF MGGIHAVEHA LIGIMPLLVL
ADRNDLGGIS TVGHEQLTTA AVFVYDGTPG GAGLTRQAFQ RAGELLEHTQ RTIRDCPCET
GCPGCVHSPK CGSGNRPIAK DAALAVLQTL HTSQTPRSEP VIVQPAEEPD MTVTTDIPDN
AGLPPLHYTV FDLETQRSAQ EVGGWNRAKD MGISCAVVYD SALDTYVEYE EKDIPQLVEH
LHKCDLVIGF NILRFDYQVL SGYSRADFQA LPSLDLLRVV HKQLGYRLSL DKLAQATLQA
QKTANGLQAL QWWKEGRIRD IIDYCRQDVA VTRDLYRFGR DNGYLLFHNK AKQLVRVPVQ
WE