Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0076 |
Symbol | |
ID | 8417880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 99320 |
End bp | 102208 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645036641 |
Product | DEAD/DEAH box helicase domain protein |
Protein accession | YP_003196956 |
Protein GI | 258404214 |
COG category | [R] General function prediction only |
COG ID | [COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTT CCGCCCAGGT CGACGCCTAC CTGCGGGCCC TGAAGGACTC CGGCCCTTAC GGGAGTCTGG TCCGGCACCA CGCCACCCTC CCCGCCCGAG CACCGTCCTA TAGCACCCCG CGCGAGCCGT GGCCGGCGCC GGTCCAGGAA TTATGCCGCC AAGCCGGCAT AGACCGCCTC TTCAGCCACC AGGCCGAAGC TGTGGATCAC ATCCGCGCCG GGCGCCACAC GGCCATTGCC ACCCCCACGG CCAGCGGCAA GAGCCTGACC TTTGCTCTGC CCATTGCCGA AGCCCTGTGC CAGGACCCGG AGAGCAGGGC GTTGCTGCTC TATCCGCTCA AAGCCCTGGC TCAAGACCAG CTGCGGGCCC TGCAGGCCTT CACCCAGCCG TGGCCGCTGG AACAGCAGCC CCGCGCCGCC ATTTACGACG GGGACACGCC ACAGGCCGAG CGGGGGAGGA TCCGCGCCAA CCCGCCCCAT TTGCTGTGCA CCAACCCGGA CATGCTCCAT CTCGGTCTGA CCCCGCACCA CCACAGCTGG GGGGAGTTTT TCAGCCGCCT GCGGTTCGTG ATCGTGGACG AGGTCCACAC CTACCGCGGG GTCATGGGCT CGCACATGGC CTGGGTTTTC CGGCGCTTGC GCCGCATCTG CCGCTATTGG GGCAGCGACC CCACGTTCAT CTTTGCCTCG GCGACCATCG GCAATCCGGC CCAGTTGGCG CAGAACCTGG CCGGCGTGCC GGTCGAGGCC GTAACCGAAA ACGGGGCCGC GCAAGGCAAA CGGCACATGG TCTTTCTCGA CGCCCTGGAA GGTGCCCCGC AGACCGCCCT GGGCCTTCTG CAGGCCGCCC TGGCCCGGGG ACTACGGACC ATCGTTTACA CCCAGTCACG CAAATACACT GAACTCCTCG CCATGTGGCT GTCCCAGCGC AGCCCCCGGT TCGCGGACCG TGTCAGCGCC TACCGCTCCG GTTTTTTGCC CGAAGAACGC CGGGAGATCG AGACCGCCCT GGCCAACGGC TCCCTGCTCG GGGTCATTTC CACCAGTGCC CTGGAACTCG GCATCGATAT CGGCAACCTC GATCTCTGCC TTTTGGTCGG CTACCCCGGC TCGGTCATGG CTACCTGGCA GCGGTCCGGA CGGGTCGGAC GCGGCCGGCA GGACAGCGCC ACCATCCTCT TGGGCGGTGA GGACGCCCTC GACCAATATT TCATGCACCA TCCACAGGCC TTTTTCGCGC TGCCGCCGGA GGCCGCGGTC ATCAATCCCG ACAACCCGGT CATCCGCGAC CGCCATCTGG TCTGCGCCGC CGCGGATCTC CCCTTGCAGC ACACGGAACC GCTTGTAACC GATCCCGTCC ACCGCCGGGC CGTGCACCGT CTTGAACAGC GCGGCGAACT GCTGCAAAGC GGCAGCGGCG AGGAGTGGCT GTCGCACCGC AAGACCCCGC ACCGTCAGGT CAGCCTCCGC GGCACCGGGC GGACCCTGCC GATCCTGGAC AGCGATTCCC GGGAACATAT CGGCAGCGTT GACGGCTATC GCGCCCTGCG AGAGACCCAT CCCGGAGCGG TCTATCTCCA CCGGGGCCAG ACCTATGTCG TCGACCGCCT TGACCTCGAA GAAGGCATGG TCCTGGCCCG CAAGCAGACG GTGCAATATT TCACCCGGGT CCGGGCCCAC AAAGAAACCA CGATTCTGGA GACGACCGAA TGGGCCCATG TCTGGGGCAG TCGCATCCAC CGCGGCCGCC TGCGGGTCAC AGAGACGGTC ACCGGCTACG AAAAACGCCA TGTCCGCGGC CAACGTCTGT TGCGCATGCA TGAACTCGAC CTGCCGCCCC ACGTCTTTGA AACCGAGGGG ATCTGGCTGG AGATCCCGGA CACCGTGCGA CAGGAGCTGG AAAAACGGTA CCTCCATTTC ATGGGCGGCA TTCACGCCGT GGAACACGCC CTGATCGGGA TTATGCCCCT GCTGGTCCTC GCCGACCGCA ACGACCTCGG CGGCATCTCC ACGGTCGGCC ACGAGCAACT CACCACCGCC GCTGTCTTTG TCTACGACGG CACTCCCGGC GGGGCCGGAC TGACCCGTCA GGCCTTCCAG CGTGCCGGAG AACTTTTGGA GCACACCCAG CGGACCATCC GCGACTGCCC CTGTGAAACC GGTTGCCCCG GCTGCGTCCA TTCGCCAAAA TGCGGCTCCG GCAACCGGCC CATCGCCAAG GACGCGGCCT TGGCGGTGCT GCAAACCCTG CACACCAGCC AAACGCCCCG CTCCGAACCA GTTATCGTCC AACCCGCCGA GGAGCCCGAC ATGACCGTTA CAACGGACAT CCCCGACAAC GCGGGACTAC CCCCGCTCCA TTACACCGTC TTCGACCTCG AAACCCAGCG TTCGGCCCAG GAGGTCGGCG GCTGGAACCG GGCCAAGGAC ATGGGCATCA GCTGCGCGGT GGTCTACGAC TCGGCCCTGG ACACCTATGT TGAATACGAG GAAAAAGATA TCCCGCAGCT GGTGGAACAC CTGCACAAGT GCGACCTCGT CATCGGCTTC AACATCCTGC GCTTTGATTA TCAGGTCCTC TCCGGCTACA GCCGGGCCGA TTTCCAGGCC CTGCCTTCGC TGGACCTGCT CCGTGTGGTC CACAAGCAAT TGGGCTATCG CCTCTCCCTG GACAAATTGG CTCAGGCGAC CCTGCAAGCC CAAAAAACCG CCAATGGCCT CCAGGCCCTG CAATGGTGGA AGGAGGGCCG GATCCGCGAC ATCATCGACT ACTGCCGCCA GGATGTGGCC GTAACCCGGG ATCTCTACCG CTTCGGCCGC GACAACGGCT ACCTCTTGTT CCACAACAAG GCCAAGCAGC TCGTCCGCGT CCCGGTCCAA TGGGAATAG
|
Protein sequence | MSSSAQVDAY LRALKDSGPY GSLVRHHATL PARAPSYSTP REPWPAPVQE LCRQAGIDRL FSHQAEAVDH IRAGRHTAIA TPTASGKSLT FALPIAEALC QDPESRALLL YPLKALAQDQ LRALQAFTQP WPLEQQPRAA IYDGDTPQAE RGRIRANPPH LLCTNPDMLH LGLTPHHHSW GEFFSRLRFV IVDEVHTYRG VMGSHMAWVF RRLRRICRYW GSDPTFIFAS ATIGNPAQLA QNLAGVPVEA VTENGAAQGK RHMVFLDALE GAPQTALGLL QAALARGLRT IVYTQSRKYT ELLAMWLSQR SPRFADRVSA YRSGFLPEER REIETALANG SLLGVISTSA LELGIDIGNL DLCLLVGYPG SVMATWQRSG RVGRGRQDSA TILLGGEDAL DQYFMHHPQA FFALPPEAAV INPDNPVIRD RHLVCAAADL PLQHTEPLVT DPVHRRAVHR LEQRGELLQS GSGEEWLSHR KTPHRQVSLR GTGRTLPILD SDSREHIGSV DGYRALRETH PGAVYLHRGQ TYVVDRLDLE EGMVLARKQT VQYFTRVRAH KETTILETTE WAHVWGSRIH RGRLRVTETV TGYEKRHVRG QRLLRMHELD LPPHVFETEG IWLEIPDTVR QELEKRYLHF MGGIHAVEHA LIGIMPLLVL ADRNDLGGIS TVGHEQLTTA AVFVYDGTPG GAGLTRQAFQ RAGELLEHTQ RTIRDCPCET GCPGCVHSPK CGSGNRPIAK DAALAVLQTL HTSQTPRSEP VIVQPAEEPD MTVTTDIPDN AGLPPLHYTV FDLETQRSAQ EVGGWNRAKD MGISCAVVYD SALDTYVEYE EKDIPQLVEH LHKCDLVIGF NILRFDYQVL SGYSRADFQA LPSLDLLRVV HKQLGYRLSL DKLAQATLQA QKTANGLQAL QWWKEGRIRD IIDYCRQDVA VTRDLYRFGR DNGYLLFHNK AKQLVRVPVQ WE
|
| |