Gene Dret_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0035 
Symbol 
ID8417837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp39394 
End bp42723 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content59% 
IMG OID645036598 
Producttrehalose synthase 
Protein accessionYP_003196915 
Protein GI258404173 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGC AGAAGATGAT CCCCGTGACC CAGGACCCCC AATGGTACAA AGACGCCGTG 
ATTTATGAAG TGCCGGTGAA GTCCTTTTGT GACAGCAACG GGGACGGCAT TGGCGATTTC
CGGGGGCTTT TGCATAAACT GGATTACCTC GAGCGCCTGG GGGTGACGGC GTTGTGGCTG
TTGCCGTTTT ATCCTTCCCC GCTCAAGGAT GACGGATACG ATATCGCGGA GTATTTTTCC
GTGCACAAGG ATTACGGTCA GCTACAAGAT TTCAAAGCCT TTTTGCGTGA AGCCCATCAA
CGGGGATTGA AGGTCATCAC TGAATTGGTC ATTAATCACA CATCCAGCGA TCACCTGTGG
TTCAAGAAAT CCCGCCAAGC CGAACCCGGC AGCTATTGGC GCGATTTTTA CGTCTGGAGC
GACACGCCCA ATCGGTACGA AGACGCGCGG ATCATCTTCA AGGATTTTGA ACAATCGAAC
TGGACCTGGG ACCCTGTAGC TGGCGCCTAT TATTGGCACC GGTTCTATTC GCATCAGCCG
GACCTCAATT TCGACAACCC TGAAGTCCAA AAGGCGGTCT TCGAGGCCCT GGATTTCTGG
CTGGATATGG GCGTCGACGG TCTGCGGCTG GACGCGATCC CCTATCTCTA CGAACGTGAG
GGGACGAATT GCGAGAATCT TCCGGAAACC CACGCGTTTT TGAAAAAGCT CCGGGCCCAT
GTCGATGCCA GACATCGGAA CAAGATGCTC CTGGCTGAGG CCAACCAATG GCCGGAAGAT
GCGGTGGCCT ATTTTGGCGA CGGCGACGAA TGCCACATGT CTTTCCACTT TCCGATCATG
CCGCGCATCT TCATGTCCCT GTGGATGGAA GACCGCTTCC CCTTGATCGA CATGATTGAG
CAGACACCGG AGATTCCGGC CGGGTGCCAG TGGGCGCTGT TTTTGCGCAA CCATGACGAA
CTCACCTTGG AAATGGTCTC TGACGAAGAG CGGGACTATA TGTACCGCGT CTATGCCAGC
GACCCCCGGG CACGGATCAA TCTGGGCATT CGGCGGCGAC TGGCCCCGTT GATGCACAAC
AACCGGCGCA AACTGGAACT TTTGACCCTG CTCCTGCTCT CCCTGCCGGG CACGCCGGTG
CTCTATTATG GCGACGAGAT CGGCATGGGC GACAATTTTT TCCTCGGCGA TCGCAACGGG
GTCCGGACGC CCATGCAGTG GACGCCGGAC CGCAACGCCG GGTTCTCTAC GGCCAATCCC
CAGCAGTTGT ATTTGCCGGT TATCCACGAC CCCGAATACC ATTTTCAGTC CATCAACGTC
GAAAACCAGG AAAAAAACCC CTCCTCCCTG TTGTGGTGGA TGCGGCGGGT CATCGCCATG
CGGCGGCGTT TCCGGGCCTT TAGCCGTGGT GCAATCGATT TTCTCCTGCC GGACAATGAA
AAGGTCCTGA CCTTCATTCG CAGCTACGGC GAGGAGCACA TCCTTGTCGT GGTCAACCTC
TCCCGGTTTT CGCAATCGGT ACGCCTGGAC CTCTCGGAAT ACGCGGGCCG GGTGCCGGAA
GAACTCTTCA GCGGCAACCG GTTCCCGGAG ATCGGCGAGG ACCCCTATCC ATTGACCTTG
GGATTCAACG ACTATTTCTG GTTCGTGCTG CGCGAGCCGC AATCCCGACT GCAGACACCG
CAGGGGCCGC CGCGCCTGGA GATGGACACC GACTGGAAGC ACCTGCTGCA CGGGACCTTC
CGTGAGTATC TGGAGATGGA AGTGCTGCCG CGGTTTTTGC GTCAGAGCCG CTGGTTCGGT
TCCAAGGCCA AAACCATGCG CCATCTGCAG GTCGTGGAAG ACGTGAACAT GGGCCACAAC
GGGGAGAAGA CCCATCTGCT CGTTGTGCGG GTCGACTATA CCGAAGGAGG GATGGAACAC
TATCTCCTGC CCTTGTCCTA TACGGACCGG GCCGAGGCGG AATCCCTGCT GCAGGAACAT
CCCCAGGCGG TGATCGCCTA TCTTGAGCTT CAGGACAGCA GCGGCGTCCT CTATGATGGG
CTTTTCAGCG CCAGCTTCCG CACCGTGCTT TTGGAGATGA TCCTCGGACA GCGCAAAAAG
AGCGGTCCCG GCGGCGAGGT CCACGGTGTG CGGGGGCGTT GTCTGAAATC CCTGATCAAG
GACGGCCACC ATATCCCCGC TTCTCGAGTT TTGGCCGCGG AGCAGAGCAA CAGTTCCATC
CTCTATGGCC AGTCGGTGAT CCTCAAGTTG TACCGCCGTC TGGAGCAGGG CACGAATCCG
GACGCGGAGA TCACTCGTCA TCTTGGCCGG TTGCGTCACG GGCCCAAGGT TCCCGGCTTC
GCGGGCCTGC TCGAATACCG CCGGGAGGAC CAGGAACCTG TGACCCTTGG CCTGGCCCAG
CAGTATGTGC CGAGCCGCGG CGATGCCTGG ACCTTTGTCC TATCGGAGTT GGACAGTTTT
TGGGATCGCG TGGCGCGTGA TGAAACGCGC TGGCAGGGCC CGGAACCCGG CTGGCTTCCC
CGGGCCGGCA ATGCCGCGAT GCCGGAGGAA CTGCTGGATC GGGTCGGCCA GGAGTTTTTG
GACAAGATCG AGCTCCTGGG GCGGCGGACC GGAGAACTCC ACCGGGCGCT GACCGATTCG
GACCCGGAAT CTCCCTTCGC CCCCGAACCG TTTTCCAAGC TTTATCAGCG CTCGCTGTAC
CAGTCGGTGC GCTATCAGGT CCGTAAAACG CTGCACAGTG TGCGTCGCCA TCTGGACGAG
CTTCCCGAGG CGATCCGTCC ACAGGCAGAG GCCTTGCTGG TCAATGAGCA TCTGGTGCTC
GAACGCCTGG GCGGACTGAC CGCCCATCGG GTCGAGGCCC AGAAGATTCG CATCCACGGC
GATTACCACC TCGGTCAGGT GCTTTACACC GGCGAGGATT TCTGGATTAT AGACTTTGAG
GGAGAGCCAG CCAGGCCGCT GAGCGAACGG CGGTTGAAGC GGTCTCCACT GCGGGATGTG
GCCGGGATGC TGCGCTCTTT CGACTACGCG GTGCACACCT CGCTCTCGCG GCAAGAGAGC
GGGGTGACCT CAGCAGTCGG CAGGAGTTGG ACCGCCCCCT GGTACGCCGC GGTCTGCCGG
ACGTATCTGC GCGGGTATCT CGACCAGGTT GAAGACGCCG CTTTCGTCCC CAGGGATCCG
GAGGACATCT GGCGTCTGCT CGAAGGATTT TTGATTGAGA AGGCGGTCTA CGAAGTCGGC
TATGAAGCCA ACAACCGTCC CCACTGGATT TGGCTCCCTC TCGGCGGCCT GTTGCGCCTG
CTGGGCAAGG AGCCCGATGT GGACAGTTAA
 
Protein sequence
MALQKMIPVT QDPQWYKDAV IYEVPVKSFC DSNGDGIGDF RGLLHKLDYL ERLGVTALWL 
LPFYPSPLKD DGYDIAEYFS VHKDYGQLQD FKAFLREAHQ RGLKVITELV INHTSSDHLW
FKKSRQAEPG SYWRDFYVWS DTPNRYEDAR IIFKDFEQSN WTWDPVAGAY YWHRFYSHQP
DLNFDNPEVQ KAVFEALDFW LDMGVDGLRL DAIPYLYERE GTNCENLPET HAFLKKLRAH
VDARHRNKML LAEANQWPED AVAYFGDGDE CHMSFHFPIM PRIFMSLWME DRFPLIDMIE
QTPEIPAGCQ WALFLRNHDE LTLEMVSDEE RDYMYRVYAS DPRARINLGI RRRLAPLMHN
NRRKLELLTL LLLSLPGTPV LYYGDEIGMG DNFFLGDRNG VRTPMQWTPD RNAGFSTANP
QQLYLPVIHD PEYHFQSINV ENQEKNPSSL LWWMRRVIAM RRRFRAFSRG AIDFLLPDNE
KVLTFIRSYG EEHILVVVNL SRFSQSVRLD LSEYAGRVPE ELFSGNRFPE IGEDPYPLTL
GFNDYFWFVL REPQSRLQTP QGPPRLEMDT DWKHLLHGTF REYLEMEVLP RFLRQSRWFG
SKAKTMRHLQ VVEDVNMGHN GEKTHLLVVR VDYTEGGMEH YLLPLSYTDR AEAESLLQEH
PQAVIAYLEL QDSSGVLYDG LFSASFRTVL LEMILGQRKK SGPGGEVHGV RGRCLKSLIK
DGHHIPASRV LAAEQSNSSI LYGQSVILKL YRRLEQGTNP DAEITRHLGR LRHGPKVPGF
AGLLEYRRED QEPVTLGLAQ QYVPSRGDAW TFVLSELDSF WDRVARDETR WQGPEPGWLP
RAGNAAMPEE LLDRVGQEFL DKIELLGRRT GELHRALTDS DPESPFAPEP FSKLYQRSLY
QSVRYQVRKT LHSVRRHLDE LPEAIRPQAE ALLVNEHLVL ERLGGLTAHR VEAQKIRIHG
DYHLGQVLYT GEDFWIIDFE GEPARPLSER RLKRSPLRDV AGMLRSFDYA VHTSLSRQES
GVTSAVGRSW TAPWYAAVCR TYLRGYLDQV EDAAFVPRDP EDIWRLLEGF LIEKAVYEVG
YEANNRPHWI WLPLGGLLRL LGKEPDVDS