Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3487 |
Symbol | |
ID | 4024001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3876534 |
End bp | 3879323 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963691 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_570611 |
Protein GI | 91977952 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCTG CGATTCCGAC TGCAACCTAC CGCATTCAGC TCACCGCCGA TTTCGGCTTC GACGATGCGG CCGCGATCGT GCCGTATCTG AAAGCGCTCG GCATTTCGCA TCTCTACGCC TCGCCCTTCA CCAAAGCGCG CAAGGGCTCG ACCCACGGCT ACGACATCGT CGATCACACC CAACTCAATC CCGAGTTGGG CGGCGAAGAA GGATTCGCGC GCCTGTCCGC AGCGTTGAAG AGCCACGATA TCGGTCTGAT CCTCGACTTC GTGCCCAACC ATGTCGGCGT GCACTTCGCC GACAATCCGT GGTGGCTCGA TGTGCTGGAA TGGGGCCCGG CGTCGCCGCA CGCGGCGTCG TTCGATATCG ACTGGGACAT CCTGCCGTTT CGAACCCGCG GCGGCGTGCT GCTGCCGATC ATCGGCTCCT CTTACGGCAA GGCGCTGGAA AGCGGCGAGA TCGAGCTGCG CTACGATCCC GACGAAGGCA GTTTCTCGGC GTGGTACTTC GAGCACCGAC TGCCGATCGC GCCGCAGCGC TACAGCGAGA TCCTGCGCGC CATCGTGCGC GAGGCCGACG CCGCCGATGA TCCTGCCGGC AAAGCGATCC TCGACCTCGC CGCACGCTAT CGCGGGCTGC GCCATCCCGA CCGCAACGAG GCCCCTGCGT TCAAGGCCGC GCTGAAAGCG ATCCCCGGCA GCGCGGCGCT GATCGACAAG GGGCTGCTCG CCTATCGCGC GGGCGAAGGC CGCACTGCGC AGATCCAGGC GCTGCACAAT CTGCTGGAGC GCCAGCACTA CAAGCTCGGC CATTGGCAGC TCGCGGCGAG CGAGATCAAC TATCGGCGTT TCTTCGACGT CAATACCCTC GCCGGCCTGC GCGTCGAGGA CGGCGGCACG TTCGAGGCGA TCCACCGGCT GGTGAAGCGG CTGATCGCGG ACGGTCAGCT GCAGGGGCTG CGGCTCGATC ACATCGACGG CCTGCGCGAC CCCGCGCAAT ATTTCCAGCG GCTGCGCCGG CTCGCCCGCG ATGCGCAGGG AAAAGCCGCC GCGCCGCTCT ACATGGTGAT CGAAAAGATT CTCGGCGAAG GCGAAGCGCT GCCGCGCTTC GCCGGCGTGC ATGGCACCAC CGGCTACGAA TGGCTGAACG TGATCACCCA TGCGCTGGTC GACGGCGCCG GCCTGCAGCC GCTCGACGAA GTCTGGCGGC AGGTGAGCAA CACCTCGCCG GATTTCGCGC CGGTGCTGAA GGAAGCCAAG CGCCGCGTAC TGCAGACGCT GCTGCTCAGC GAATTCACCG TGCTGACCCG GCTGCTGGCG CGGATCGCCG GCGGCCATTA TTCGACGCGG GATTTTTCCG CCGACAATCT GCGCCAGGTG TTCGAACTCT ACGTGCTGCA CTTCCCGGTG TATCGCACCT ACCTCACCGC GTCGGGTCCG ACCGCACTCG ACCGCGAGCT GATCGCGCAA ACCATCGAGA AGGCGCGCGC CGACTGGTTC GGCGCCGACG ATGGCATCTT CGACTTCCTG CAGGACGCGC TGACGATGGA CCTGCTGAAG CCGGGCCGCG CCGCGCACAG CAAGCCGCGG GTGCGCCGCT TCGCGCTCAA GGTCCAGCAA TTCACCGGGC CGACCATGGC GAAGTCGCTC GAGGACACCT CGTTCTATCG CTACCACCGC CTGCTCGCGC TCAACGAAGT CGGCGGCGAG GCCTCCGCGC ACGCGCTGGC CCCCGACGCG TTCCACCGAC AGATGACGCA GCGCGCCAGG GATTGGCCGC ACGGCATGAC CGCGACGATG ACGCACGACG CCAAGCGCGG CGAGGACGCG CGGACGCGGC TGCTGGCGCT GGCGGAAATG CCGGGCGAAT GGGCGAGCCT GGTCGCCAAA TGGAAGCTGC TCAACGCACC GCATCTGGTG ACCCACGGCG AGAGGCGCGC GCCGTCCGCG ACGTTCGAAT ACATGTTGTA TCAGGCCCTG ATCGGCGCGT GGCCGCTGCA GCCCGACGCG GATTTCACCG ACCGGATGCA GGGCTATGCG CTGAAGGCAG CGCGCGAAGG CAAGCAGGAA ACCAACTGGA TCAACCCGGA CCTCGCCTAT GAGGAAGGTA TCCGCACCTT CATCGACCGC ATTCTCGACC CGGCGCAATC CGGGCCGTTT CTGGAATCGC TGCAGAACTT GTCACAGCGC GTCTCGGTGA TCGGCGCGCT GAATTCGCTG AGCCAGATGA CGTTGAAAGC GACGATGCCC GGCGTGCCGG ACTTCTATCA GGGTACCGAG TTCTGGGACT TCTCGCTGGT CGACCCGGAC AACCGACGCA AGGTCGATTT CGCCGCACGC GAAACCTCGC TTGCGGCGCT CGCCGCCCCG GACTGGGACG CGTTGCTGAC GACCTGGAGC GACGGCCGGC TCAAGCTGGC CTGGACGCGG CAACTGCTGA AGCTGCGGTC CGAGCTGCGC GACGTCTTCA CCGACGGCGA CTACCGGCCG CTTGCCGTCA ATGGCCCGCA TCGCGACCAC GCGATCGCCT TCGCCCGCAG CCGCGGCGCG GACGCCGCGA TCATCGTCGT CGGAAAGAAC TTTGCGCCGC TGTCCGATCA GGGCCGGCAA TGGCCGCGCG GCGACGCATT CGACGCAACG GTGGAAATTT CGGGCCTTGT CATCGATGGC GACGACCGCA CCGAGCTGCC TCTCAGCGAG CTGTTTTGCG ATTTGCCGGT CGCTATCCGC CGGGCGCGTG TTGTAAACGC GAAGCGGACA GTCAGGACAC GCCGCAAGAG CGACGCATAG
|
Protein sequence | MPPAIPTATY RIQLTADFGF DDAAAIVPYL KALGISHLYA SPFTKARKGS THGYDIVDHT QLNPELGGEE GFARLSAALK SHDIGLILDF VPNHVGVHFA DNPWWLDVLE WGPASPHAAS FDIDWDILPF RTRGGVLLPI IGSSYGKALE SGEIELRYDP DEGSFSAWYF EHRLPIAPQR YSEILRAIVR EADAADDPAG KAILDLAARY RGLRHPDRNE APAFKAALKA IPGSAALIDK GLLAYRAGEG RTAQIQALHN LLERQHYKLG HWQLAASEIN YRRFFDVNTL AGLRVEDGGT FEAIHRLVKR LIADGQLQGL RLDHIDGLRD PAQYFQRLRR LARDAQGKAA APLYMVIEKI LGEGEALPRF AGVHGTTGYE WLNVITHALV DGAGLQPLDE VWRQVSNTSP DFAPVLKEAK RRVLQTLLLS EFTVLTRLLA RIAGGHYSTR DFSADNLRQV FELYVLHFPV YRTYLTASGP TALDRELIAQ TIEKARADWF GADDGIFDFL QDALTMDLLK PGRAAHSKPR VRRFALKVQQ FTGPTMAKSL EDTSFYRYHR LLALNEVGGE ASAHALAPDA FHRQMTQRAR DWPHGMTATM THDAKRGEDA RTRLLALAEM PGEWASLVAK WKLLNAPHLV THGERRAPSA TFEYMLYQAL IGAWPLQPDA DFTDRMQGYA LKAAREGKQE TNWINPDLAY EEGIRTFIDR ILDPAQSGPF LESLQNLSQR VSVIGALNSL SQMTLKATMP GVPDFYQGTE FWDFSLVDPD NRRKVDFAAR ETSLAALAAP DWDALLTTWS DGRLKLAWTR QLLKLRSELR DVFTDGDYRP LAVNGPHRDH AIAFARSRGA DAAIIVVGKN FAPLSDQGRQ WPRGDAFDAT VEISGLVIDG DDRTELPLSE LFCDLPVAIR RARVVNAKRT VRTRRKSDA
|
| |