Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4164 |
Symbol | |
ID | 6411848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4460873 |
End bp | 4464175 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714046 |
Product | trehalose synthase |
Protein accession | YP_001993135 |
Protein GI | 192292530 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TGACACCGAT CGCCCCGAGC GACAGGCCGG ACGCGGAAAC TTCTGACGAA CTCTGGTACA AAGACGCCAT CATCTACCAG TTGCACGTCA AGGCGTTCGC CGACAGCAAC AATGACGGCA TCGGCGACTT CGCGGGTCTC ACCGAGAAGC TGGATTATCT GCAGGAGCTT GGCGTCAATA CGCTGTGGCT GCTGCCGTTC TATCCCTCGC CGCAGCGCGA CGACGGATAC GACATCGCCG ACTACGGATC GATCAATCCC GACTTCGGCA CGATGAAGGA CTTCCGCCGC TTCATCGTCG AGGCCAAGAA GCGCAATCTG CGGGTCATCA CCGAGCTGGT CATCAACCAC ACCTCGGATC AGCACGCCTG GTTCAAGCGG GCACGACGCA GCCCGAAGGA CTCCAGCGCG CGCGACTGGT ACGTCTGGAG CGACACCGAC CAGAAGTATC TCGGCACCCG GATCATTTTC ACCGATACCG AGAAATCGAA CTGGACCTGG GATCCCGAGG CCGGCCAGTA TTACTGGCAC CGGTTCTTCT CGCACCAGCC GGATCTCAAC TTCGACAATC CGCGCGTCGT CAGCGCCGTG GTCAAGGTGA TGAAACGCTG GCTCGACACC GGCGTCGACG GCTTCCGGCT CGACGCGATC CCGTATCTGT GCGAGCGCGA CGGCACCAAC AACGAGAACC TTCCCGAGAC CCACGCCGTC ATCAAGCGGC TGCGCGCCGA GCTCGACGCC TACGCCAAGG GCAAGCTGCT GCTCGCCGAG GCCAATCAAT GGCCGGAGGA CGTGCAGCAG TATTTCGGCG ATGGCGACGA ATGCCACATG GCCTACCACT TCCCGCTGAT GCCGCGGATC TATATGGCGA TCGCGCAGGA AGACCGTTTC CCGATCACCG ACATCATGCG GCAGACGCCG GAGATCCCGG CGAACTGCCA GTGGGCGATG TTCCTGCGCA ACCACGACGA GCTGACGCTC GAAATGGTCA CCGACGTCGA GCGCGACTAT CTGTGGAAGA CCTACGCGGC CGATCCGCGA GCGCGGATCA ACGTCGGCAT CCGTCGCCGC CTCGCGCCGC TGATGGACAA TGATCGCCGC AAGATCGAAC TAATGAACTC GCTGCTGATG TCGTTCCCCG GCACGCCGAT CATCTACTAT GGCGACGAAA TAGGCATGGG CGACAACATC TATCTCGGCG ACCGCAACGG CGTGCGGACA CCGATGCAGT GGTCGCCTGA CCGCAACGGT GGATTTTCGC GGGCCGACCC GGCGCGGCTG TACGCGCCGC CGATCATGGA CCCGGTGTAC GGCTACGCCT CGGTCAACGT CGAAGCCCAG CAGCGCAGCC TGTCGTCGTT GCTGAGCGCG ATGAAGCGCT TGATCGCGGT GCGCAAATCG ACGCCGGCGT TCGGCCGCGG CAGCATCACC TTCATCCGTC CGAGCAACCG CGCGGTGATC GCCTACGTTC GCCAATACAA GGACGAGGTC ATCCTGTGCG TCGCCAATCT GTCGCGTGCC GCGCAGGCGA CCGAACTCGA CCTGTCGCCG TGGAAGGACC GCGTGCCGCA GGAGATGCTC GGCCGCACCA AATTCCCGCC GATCGGCGAT CCGGCCTACA TGATCACGCT GGCGCCCTTC GGCTTCTACT GGTTCAAGCT GCAGGAGCCC GGCACGGCCG CGCATGTCGC GCCGGTCTCC ACCGTGCCTG AATTCGTCAC GCTGGTGGTG CCGCTCGGTT CGACCTGGAT GACGCTCGGC CGCACCCGCT CGATGTTCGA GCACGAGGTG CTACCGACCT TCCTGTCCCG CACCCGCTGG TTTCCGGAGC GCAATCCGCG GGCGATCCAT CCGCGGCTCA CCTCGGCGAT TCCGTTCGCC AATGACGGCG ACAACCGTCC GTGGCTGGCG TTCTTCGAGG CCACGCAGCG GGGGACGACG GCGCGCTATC TGCTGCCGAT GCAGATCGAC TGGGTGCGGT TCGATCGCGA ACGCTACAAT CCGCGGGCCT ATGCGGCGGT GCGCCAGGGC GCCCGCGAAG GCACCCTGCT CGATGTCGCC GCCGATCCCT CGTTCATCGA CCTGCTGCTG GAGAACGTCC GCGACGCCGT CACCGTGACC GGCGATAACG GCGACCGACT GGAATTCCGC CCTGGCTCCA AGCTCGCGGA ACGGCCCGCC GGACCGTTCC AGAACATTCG CGCGGTCGAA ACCGAGCAGT CGAATTCGAC CGCACTGGTC GACGGCGACT ACGTCGTCAA ACTGTATCGG CGGCTGCAGA TCGGCATCAA TCCCGAGCTG GAGATGGGAC GCTTCCTCAC CGAGGTCGCC GGCTATGCCA ACACCCCGGC CCTGCTCGGC AGCGTCGAGC TGATCGAAGG CGACAAGACC AGCGCGGTCG CGGTGGTGCA CGAATTCGCC CGCAATCAGG GCGACGGCTG GACCGTCACC TCCGGCTATC TCGATCGCTA CATCGACGAA CAACGCGTGC TGAGCCGCGC CGAGGAAAAG ACCGAGAGCG ATCAGCTCGC GCCCTACCTG CATTTCATGC AGCAGACCGG CAAGCGGGTC GCGGAGATGC ACATCGCGCT CGCCAGCCAT CCCGAGGTGC CGGACTTCGC CCCGGAGCCG ATCACCCTCG ACGCCTCGCG CGACTGGGCC GAAACCGTTG CGGCATCCGC CGACCGGATG CTCGACGAAC TGCGGCGCCG GCGCGACACG CTGAAGGAAG GCGACCGGAC GCTGGTCGAC GAACTGCTGG CGGCCCGCGA CGTGCTGATG TCCCGCATCC ACGGCCTGCT CGGCGACGAC GGCGGCCTGA ACATCCGCCA TCACGGCGAC TTCCATCTCG GCCAGATGTT GATCGTCAAG GACGACATCT ACATCATCGA CTTCGAAGGT GAGCCGCGCC GCACCTTGGA CGAACGCCGC GCCAAAGCAC CGGCGGCTCG CGACGTGGCC GGGCTGATCC GGTCGATCGA CTATTCGACG ACCGCCGCTC TCGAGCGAGC GCTGAAAGCA GCCTCCGACG AGCCCGGCCG GCTGACCGAG GCGCTCGACC TGTGGCGGAT CCGCGCCACA GGCGCGTTCC TGGACGCCTA TCGCCAAACC ATGGGCGACA GCCCGGTGTG GCCGGCGGAT ATCGCGGCCG CCGACCGGGT GCTCGACTTC TTCCTGATTG AAAAGGCGCT GTACGAGATC GACTACGAGA TCGCGCATCG TCCCGACTGG GTGCATGTGC CGCTGGCCGG CATTCTCCGT ATCCTGTCGC CGCCTCCCGA GGAGCTTCCA TGA
|
Protein sequence | MNKMTPIAPS DRPDAETSDE LWYKDAIIYQ LHVKAFADSN NDGIGDFAGL TEKLDYLQEL GVNTLWLLPF YPSPQRDDGY DIADYGSINP DFGTMKDFRR FIVEAKKRNL RVITELVINH TSDQHAWFKR ARRSPKDSSA RDWYVWSDTD QKYLGTRIIF TDTEKSNWTW DPEAGQYYWH RFFSHQPDLN FDNPRVVSAV VKVMKRWLDT GVDGFRLDAI PYLCERDGTN NENLPETHAV IKRLRAELDA YAKGKLLLAE ANQWPEDVQQ YFGDGDECHM AYHFPLMPRI YMAIAQEDRF PITDIMRQTP EIPANCQWAM FLRNHDELTL EMVTDVERDY LWKTYAADPR ARINVGIRRR LAPLMDNDRR KIELMNSLLM SFPGTPIIYY GDEIGMGDNI YLGDRNGVRT PMQWSPDRNG GFSRADPARL YAPPIMDPVY GYASVNVEAQ QRSLSSLLSA MKRLIAVRKS TPAFGRGSIT FIRPSNRAVI AYVRQYKDEV ILCVANLSRA AQATELDLSP WKDRVPQEML GRTKFPPIGD PAYMITLAPF GFYWFKLQEP GTAAHVAPVS TVPEFVTLVV PLGSTWMTLG RTRSMFEHEV LPTFLSRTRW FPERNPRAIH PRLTSAIPFA NDGDNRPWLA FFEATQRGTT ARYLLPMQID WVRFDRERYN PRAYAAVRQG AREGTLLDVA ADPSFIDLLL ENVRDAVTVT GDNGDRLEFR PGSKLAERPA GPFQNIRAVE TEQSNSTALV DGDYVVKLYR RLQIGINPEL EMGRFLTEVA GYANTPALLG SVELIEGDKT SAVAVVHEFA RNQGDGWTVT SGYLDRYIDE QRVLSRAEEK TESDQLAPYL HFMQQTGKRV AEMHIALASH PEVPDFAPEP ITLDASRDWA ETVAASADRM LDELRRRRDT LKEGDRTLVD ELLAARDVLM SRIHGLLGDD GGLNIRHHGD FHLGQMLIVK DDIYIIDFEG EPRRTLDERR AKAPAARDVA GLIRSIDYST TAALERALKA ASDEPGRLTE ALDLWRIRAT GAFLDAYRQT MGDSPVWPAD IAAADRVLDF FLIEKALYEI DYEIAHRPDW VHVPLAGILR ILSPPPEELP
|
| |