Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3682 |
Symbol | |
ID | 3971655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4104270 |
End bp | 4107050 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637926792 |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_533536 |
Protein GI | 90425166 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTG CCATTCCGAC CGCGACCTAT CGCGTGCAAT TGACCGCCGA TTTCGGCTTC GACGATGCCG CCGCCATCGT CCCCTATCTG AAGTCGCTGG GCATCACCCA CCTGTATGCG TCGCCGTTCC TGAAAGCCCG CAAGGGCTCT AGCCACGGCT ACGACATCGT CGACCACACC AAGATCAATC CTGAACTCGG CGGCGAGGAC GGCTTCGAGC GGCTGTCCGC AACGCTGAAA CAACATGATC TCGGCCTGAT CCTGGACTTC GTGCCCAACC ATGTCGGGGT GCATTTCGCC GACAACCCGT GGTGGCTCGA CGTGTTGGAA TGGGGACCGG CGTCGCCGCA CGCCGCTTCG TTCGACATCG ATTGGGATAT CCTGCCGCAT CGGCCGCGCG GCGGCGTGCT GTTGCCGATC ATCGGCTCGT CCTACGGCGA GGCGCTGGAG CGCGGCGAGA TCGAGCTGCG CTACGACGCC GAGAGCGGCA GTCTGTCGGC CTGGTACTTC GAGCATCGGC TGCCGATCGC ACCGCAGCGC TATGGCGAAG TGCTGCGCAA CGTGGTGAAG GCGGCCGAGG CCGAGCAGCA ACCGGGCGGC CGCGCCATCC TCGAACTCGT GCAACGCTCC CCGGTGCCGC GCCAGCCGGA CCGCGCCGCG GCGCCGGCGT TCAAGGACGC GTTGAAACGC ATTCCCGGCA GCGCCGCGAT CATCGCGCGC GGCTTAGAGG CCTATCGCGC TGGCCAGGAT CGCCCGGCGC AGACCCAGAT GCTGCACCTG TTGCTGGAAC GCCAGCACTA CAAGCTCGGG CATTGGCGAC TGGCGTCCAG CGAAATCAAC TACCGGCGGT TCTTCGACGT CAACAGCCTG GCCGGGCTGC GGGTCGAGGA CGCCGGCACC TTCGAGGCGA TCCACCAGCT AGTGCGGCGG CTGATCGCCG AAGACAAGCT GCAGGGCCTG CGGCTCGACC ATATCGACGG GCTGCGCGAC CCCGCGCAAT ATTTCCAGCG GCTGCGCCGG CTGCTGCGCG AGGCGCGGGG TGACACCGCA CAGCCGTTCT ACATGCTGAT CGAAAAGATC CTCGGCGAGG ACGAGAGCCT GCGCCGCTTC ACCGGGGTGC ACGGCACCAC CGGCTACGAG TGGATGAACG TGATCACGCA GGTGCTGGTC GACGGCGCCG GGCTTGCCGC ATTGGACGAG GTGTGGCGCC AGGTCAGCAA CACGCCGCCG AAATTCGCCC CGGTGTTGAA AGAGGCCAAG CGGCGGGTGC TGGAAACCCT GCTGCTCAGC GAATTCACCG TGTTGTCGCG GTTGTTGGCG CGGATCGCCG CCGGGCATTA CTCGACGCGG GATTTTTCCG CCGACAATCT GCGGCAGATC CTGGAACTCT ATGTGCTGCA CTTCCCGGTC TATCGCACTT ATCTGACCGC CGCGGGGCCG ACCGCGCTCG ACCGCGAACT GATCGCGCAG ACCATCGAAA AGGCCCGCGC CGAATGGTTC AACGCCGACG AGGGGATTTT CGATTTTCTG CGCGACGTGC TGACGCTGGA TCTGATCAAG CCCGGCCGCG CCGCGCATTC CAAGCCGCGG GTGCGCCGCT TCGCGCTGAA GCTGCAGCAA TTCACCGGCC CGACCATGGC GAAGTCGCTG GAGGATACCG CGTTCTATCG CTATCACCGG CTGCTGGCGC TGAACGAGGT CGGCGGCGAT CCGTCCGCTG ACGCGATGTC GATCGACAGC TTCCACGAAA CCATGCGCAA GCGCGCGATC GACTGGCCGC ACGGCATGAC GGCGACCGCC ACCCACGACA CCAAACGCGG CGAGGACGCC CGCGCGCGGC TGTTGGCGCT GGCGGAAATC CCCGGCGAAT GGTCGGCCTT GGTGGCGAAA TGGAAGATGC TGAACGCGCC GCATCTCGTC ATCAAGGGCG ACGCCCGAAC CCCGTCGGCG CCGTTCGAAT ACATGCTGTA TCAGGCGCTG GTCGGCGCCT GGCCGCTCGA CGGCGATCCG GCCTTTCTCG ACCGCATGCA GGCCTATGCG CTGAAGGCGG CGCGCGAGGG CAAGCAGGAA ACCAGCTGGC TCAATCCCAA CCTGGATTAC GAGGAAGGCG TCAACGGATT CCTGGCGCGG ATCCTCGATC CCGCCGTCGC CGGCGACTTC ATCGCCCAGA TGCAGACGCT GGTGCAACGC GTGGCGTTGC TCGGCGCGCT GAACTCGCTG AGCCAGGTGA CGCTGAAGGC GATGCTGCCG GGCGTGCCGG ATTTCTATCA GGGCACCGAG ATGTGGGATA CATCGCTGGT CGATCCGGAC AACCGGCGAG CGGTGGACTT TGCCGCGCGC AGCACGGCGC TGCACGGCCT CGAGCAGCCG GATTGGACCG ACCTCGCCGC GAACTGGCAG GACGGCCGCA TCAAGCTGGC CTGGACACGG CAGCTGTTGA AGCTGCGCGC CGAAAAGCCC GAACTGTTTC TCAACGGCAG CTACGAGCCG TTGCCGGTCA CCGGGCCGCA CGCCGACCGG GTGATCGCCT TCGCCCGGCG CCATGAACGC GACGCGGTGA TCGTCGTGGT GGCAAAAGCG ATGGCCGCGG CGACGAAGGA CGGCCGCCGC TGGCCGGCGC CCGATGCCTT CGAAGGAACG GTGGTGGCCG AACATTACGA GGTCGATGGC GCCCCGCTGC AGCTCGCCGA ACTGTTCGCG CAGATGCCGG TGGCGGTGCG TCCCGCGCGC TTCAAAGGCA CGTTGAGCGC CACCCGGCTG CGCGCCACGC GCAAAGGCTG A
|
Protein sequence | MPPAIPTATY RVQLTADFGF DDAAAIVPYL KSLGITHLYA SPFLKARKGS SHGYDIVDHT KINPELGGED GFERLSATLK QHDLGLILDF VPNHVGVHFA DNPWWLDVLE WGPASPHAAS FDIDWDILPH RPRGGVLLPI IGSSYGEALE RGEIELRYDA ESGSLSAWYF EHRLPIAPQR YGEVLRNVVK AAEAEQQPGG RAILELVQRS PVPRQPDRAA APAFKDALKR IPGSAAIIAR GLEAYRAGQD RPAQTQMLHL LLERQHYKLG HWRLASSEIN YRRFFDVNSL AGLRVEDAGT FEAIHQLVRR LIAEDKLQGL RLDHIDGLRD PAQYFQRLRR LLREARGDTA QPFYMLIEKI LGEDESLRRF TGVHGTTGYE WMNVITQVLV DGAGLAALDE VWRQVSNTPP KFAPVLKEAK RRVLETLLLS EFTVLSRLLA RIAAGHYSTR DFSADNLRQI LELYVLHFPV YRTYLTAAGP TALDRELIAQ TIEKARAEWF NADEGIFDFL RDVLTLDLIK PGRAAHSKPR VRRFALKLQQ FTGPTMAKSL EDTAFYRYHR LLALNEVGGD PSADAMSIDS FHETMRKRAI DWPHGMTATA THDTKRGEDA RARLLALAEI PGEWSALVAK WKMLNAPHLV IKGDARTPSA PFEYMLYQAL VGAWPLDGDP AFLDRMQAYA LKAAREGKQE TSWLNPNLDY EEGVNGFLAR ILDPAVAGDF IAQMQTLVQR VALLGALNSL SQVTLKAMLP GVPDFYQGTE MWDTSLVDPD NRRAVDFAAR STALHGLEQP DWTDLAANWQ DGRIKLAWTR QLLKLRAEKP ELFLNGSYEP LPVTGPHADR VIAFARRHER DAVIVVVAKA MAAATKDGRR WPAPDAFEGT VVAEHYEVDG APLQLAELFA QMPVAVRPAR FKGTLSATRL RATRKG
|
| |