Gene Information |
Locus tag | Dret_0035 |
Symbol | |
ID | 8417837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 39394 |
End bp | 42723 |
Gene Length | 3330 bp |
Protein Length | 1109 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645036598 |
Product | trehalose synthase |
Protein accession | YP_003196915 |
Protein GI | 258404173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
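The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon; the record does not state these conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag Dret_0035.
length = gene_length(39394, 42723)
print(length)                  # 3330
print(protein_length(length))  # 1109
```

Under these assumptions the computed values agree with the Gene Length (3330 bp) and Protein Length (1109 aa) fields.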
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGC AGAAGATGAT CCCCGTGACC CAGGACCCCC AATGGTACAA AGACGCCGTG ATTTATGAAG TGCCGGTGAA GTCCTTTTGT GACAGCAACG GGGACGGCAT TGGCGATTTC CGGGGGCTTT TGCATAAACT GGATTACCTC GAGCGCCTGG GGGTGACGGC GTTGTGGCTG TTGCCGTTTT ATCCTTCCCC GCTCAAGGAT GACGGATACG ATATCGCGGA GTATTTTTCC GTGCACAAGG ATTACGGTCA GCTACAAGAT TTCAAAGCCT TTTTGCGTGA AGCCCATCAA CGGGGATTGA AGGTCATCAC TGAATTGGTC ATTAATCACA CATCCAGCGA TCACCTGTGG TTCAAGAAAT CCCGCCAAGC CGAACCCGGC AGCTATTGGC GCGATTTTTA CGTCTGGAGC GACACGCCCA ATCGGTACGA AGACGCGCGG ATCATCTTCA AGGATTTTGA ACAATCGAAC TGGACCTGGG ACCCTGTAGC TGGCGCCTAT TATTGGCACC GGTTCTATTC GCATCAGCCG GACCTCAATT TCGACAACCC TGAAGTCCAA AAGGCGGTCT TCGAGGCCCT GGATTTCTGG CTGGATATGG GCGTCGACGG TCTGCGGCTG GACGCGATCC CCTATCTCTA CGAACGTGAG GGGACGAATT GCGAGAATCT TCCGGAAACC CACGCGTTTT TGAAAAAGCT CCGGGCCCAT GTCGATGCCA GACATCGGAA CAAGATGCTC CTGGCTGAGG CCAACCAATG GCCGGAAGAT GCGGTGGCCT ATTTTGGCGA CGGCGACGAA TGCCACATGT CTTTCCACTT TCCGATCATG CCGCGCATCT TCATGTCCCT GTGGATGGAA GACCGCTTCC CCTTGATCGA CATGATTGAG CAGACACCGG AGATTCCGGC CGGGTGCCAG TGGGCGCTGT TTTTGCGCAA CCATGACGAA CTCACCTTGG AAATGGTCTC TGACGAAGAG CGGGACTATA TGTACCGCGT CTATGCCAGC GACCCCCGGG CACGGATCAA TCTGGGCATT CGGCGGCGAC TGGCCCCGTT GATGCACAAC AACCGGCGCA AACTGGAACT TTTGACCCTG CTCCTGCTCT CCCTGCCGGG CACGCCGGTG CTCTATTATG GCGACGAGAT CGGCATGGGC GACAATTTTT TCCTCGGCGA TCGCAACGGG GTCCGGACGC CCATGCAGTG GACGCCGGAC CGCAACGCCG GGTTCTCTAC GGCCAATCCC CAGCAGTTGT ATTTGCCGGT TATCCACGAC CCCGAATACC ATTTTCAGTC CATCAACGTC GAAAACCAGG AAAAAAACCC CTCCTCCCTG TTGTGGTGGA TGCGGCGGGT CATCGCCATG CGGCGGCGTT TCCGGGCCTT TAGCCGTGGT GCAATCGATT TTCTCCTGCC GGACAATGAA AAGGTCCTGA CCTTCATTCG CAGCTACGGC GAGGAGCACA TCCTTGTCGT GGTCAACCTC TCCCGGTTTT CGCAATCGGT ACGCCTGGAC CTCTCGGAAT ACGCGGGCCG GGTGCCGGAA GAACTCTTCA GCGGCAACCG GTTCCCGGAG ATCGGCGAGG ACCCCTATCC ATTGACCTTG GGATTCAACG ACTATTTCTG GTTCGTGCTG CGCGAGCCGC AATCCCGACT GCAGACACCG CAGGGGCCGC CGCGCCTGGA GATGGACACC GACTGGAAGC ACCTGCTGCA CGGGACCTTC CGTGAGTATC TGGAGATGGA AGTGCTGCCG CGGTTTTTGC GTCAGAGCCG CTGGTTCGGT 
TCCAAGGCCA AAACCATGCG CCATCTGCAG GTCGTGGAAG ACGTGAACAT GGGCCACAAC GGGGAGAAGA CCCATCTGCT CGTTGTGCGG GTCGACTATA CCGAAGGAGG GATGGAACAC TATCTCCTGC CCTTGTCCTA TACGGACCGG GCCGAGGCGG AATCCCTGCT GCAGGAACAT CCCCAGGCGG TGATCGCCTA TCTTGAGCTT CAGGACAGCA GCGGCGTCCT CTATGATGGG CTTTTCAGCG CCAGCTTCCG CACCGTGCTT TTGGAGATGA TCCTCGGACA GCGCAAAAAG AGCGGTCCCG GCGGCGAGGT CCACGGTGTG CGGGGGCGTT GTCTGAAATC CCTGATCAAG GACGGCCACC ATATCCCCGC TTCTCGAGTT TTGGCCGCGG AGCAGAGCAA CAGTTCCATC CTCTATGGCC AGTCGGTGAT CCTCAAGTTG TACCGCCGTC TGGAGCAGGG CACGAATCCG GACGCGGAGA TCACTCGTCA TCTTGGCCGG TTGCGTCACG GGCCCAAGGT TCCCGGCTTC GCGGGCCTGC TCGAATACCG CCGGGAGGAC CAGGAACCTG TGACCCTTGG CCTGGCCCAG CAGTATGTGC CGAGCCGCGG CGATGCCTGG ACCTTTGTCC TATCGGAGTT GGACAGTTTT TGGGATCGCG TGGCGCGTGA TGAAACGCGC TGGCAGGGCC CGGAACCCGG CTGGCTTCCC CGGGCCGGCA ATGCCGCGAT GCCGGAGGAA CTGCTGGATC GGGTCGGCCA GGAGTTTTTG GACAAGATCG AGCTCCTGGG GCGGCGGACC GGAGAACTCC ACCGGGCGCT GACCGATTCG GACCCGGAAT CTCCCTTCGC CCCCGAACCG TTTTCCAAGC TTTATCAGCG CTCGCTGTAC CAGTCGGTGC GCTATCAGGT CCGTAAAACG CTGCACAGTG TGCGTCGCCA TCTGGACGAG CTTCCCGAGG CGATCCGTCC ACAGGCAGAG GCCTTGCTGG TCAATGAGCA TCTGGTGCTC GAACGCCTGG GCGGACTGAC CGCCCATCGG GTCGAGGCCC AGAAGATTCG CATCCACGGC GATTACCACC TCGGTCAGGT GCTTTACACC GGCGAGGATT TCTGGATTAT AGACTTTGAG GGAGAGCCAG CCAGGCCGCT GAGCGAACGG CGGTTGAAGC GGTCTCCACT GCGGGATGTG GCCGGGATGC TGCGCTCTTT CGACTACGCG GTGCACACCT CGCTCTCGCG GCAAGAGAGC GGGGTGACCT CAGCAGTCGG CAGGAGTTGG ACCGCCCCCT GGTACGCCGC GGTCTGCCGG ACGTATCTGC GCGGGTATCT CGACCAGGTT GAAGACGCCG CTTTCGTCCC CAGGGATCCG GAGGACATCT GGCGTCTGCT CGAAGGATTT TTGATTGAGA AGGCGGTCTA CGAAGTCGGC TATGAAGCCA ACAACCGTCC CCACTGGATT TGGCTCCCTC TCGGCGGCCT GTTGCGCCTG CTGGGCAAGG AGCCCGATGT GGACAGTTAA
|
Protein sequence | MALQKMIPVT QDPQWYKDAV IYEVPVKSFC DSNGDGIGDF RGLLHKLDYL ERLGVTALWL LPFYPSPLKD DGYDIAEYFS VHKDYGQLQD FKAFLREAHQ RGLKVITELV INHTSSDHLW FKKSRQAEPG SYWRDFYVWS DTPNRYEDAR IIFKDFEQSN WTWDPVAGAY YWHRFYSHQP DLNFDNPEVQ KAVFEALDFW LDMGVDGLRL DAIPYLYERE GTNCENLPET HAFLKKLRAH VDARHRNKML LAEANQWPED AVAYFGDGDE CHMSFHFPIM PRIFMSLWME DRFPLIDMIE QTPEIPAGCQ WALFLRNHDE LTLEMVSDEE RDYMYRVYAS DPRARINLGI RRRLAPLMHN NRRKLELLTL LLLSLPGTPV LYYGDEIGMG DNFFLGDRNG VRTPMQWTPD RNAGFSTANP QQLYLPVIHD PEYHFQSINV ENQEKNPSSL LWWMRRVIAM RRRFRAFSRG AIDFLLPDNE KVLTFIRSYG EEHILVVVNL SRFSQSVRLD LSEYAGRVPE ELFSGNRFPE IGEDPYPLTL GFNDYFWFVL REPQSRLQTP QGPPRLEMDT DWKHLLHGTF REYLEMEVLP RFLRQSRWFG SKAKTMRHLQ VVEDVNMGHN GEKTHLLVVR VDYTEGGMEH YLLPLSYTDR AEAESLLQEH PQAVIAYLEL QDSSGVLYDG LFSASFRTVL LEMILGQRKK SGPGGEVHGV RGRCLKSLIK DGHHIPASRV LAAEQSNSSI LYGQSVILKL YRRLEQGTNP DAEITRHLGR LRHGPKVPGF AGLLEYRRED QEPVTLGLAQ QYVPSRGDAW TFVLSELDSF WDRVARDETR WQGPEPGWLP RAGNAAMPEE LLDRVGQEFL DKIELLGRRT GELHRALTDS DPESPFAPEP FSKLYQRSLY QSVRYQVRKT LHSVRRHLDE LPEAIRPQAE ALLVNEHLVL ERLGGLTAHR VEAQKIRIHG DYHLGQVLYT GEDFWIIDFE GEPARPLSER RLKRSPLRDV AGMLRSFDYA VHTSLSRQES GVTSAVGRSW TAPWYAAVCR TYLRGYLDQV EDAAFVPRDP EDIWRLLEGF LIEKAVYEVG YEANNRPHWI WLPLGGLLRL LGKEPDVDS
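The gene and protein sequences are printed in space-separated blocks of ten characters. As a rough sketch of how they relate, the Python below strips that spacing, computes a GC fraction, and translates codons. It assumes the amino-acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons, which this sketch does not model); the helper names are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCTTTGC AGAAGATGAT CCCCGTGACC"))  # MALQKMIPVT
# Last nine bases of the gene sequence, ending in the TAA stop codon.
print(translate("GACAGTTAA"))                         # DS
```

The first ten residues and the final two residues produced this way match the start (MALQKMIPVT) and end (DS) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.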