Gene Information |
Locus tag | RPB_2781 |
Symbol | |
ID | 3910574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3169108 |
End bp | 3171270 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884681 |
Product | glucosyltransferase MdoH |
Protein accession | YP_486394 |
Protein GI | 86749898 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
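The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The sketch below is only a sanity check under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts a terminal stop codon while Protein Length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3169108
end_bp = 3171270
gene_length_bp = 2163
protein_length_aa = 720

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```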
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.373951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.861199 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTCCCG TAGCCCGGCC GATGGGCGCG GCCGAGGCCG AGATCGTGCT GGGGCTGCCA GATCCCCTCC CTGCAGACGC CCCGCTGGCG ATGCCCGTGC AATCCTTCAC CGAGGCTCCC GAGCCCGTGC ATCTGCCGTC GGCCAGCTCC GGCCGATGGC GTCGGTCGCT GATCATGCTG GCCACGGCGC TGCTGACCTA TGCGGGCGTC GACGAGATGT ATGAGGTCCT CAAGGTCGGC GGCGTCACCA TCATCGAAGG CATCGTGCTG GCGCTGTTCG CCGTACTGTT CGCGTGGGTG GCGCTGTCGT TCGTCTCCGG GCTCGCCGGC TTTGCCGCGC TGCTCCGCGG CTGGCGCGAC AACCTCGATC TCGACACCGA TGGCCCGCTA CCTGCGGTGA GCGCCAAAGT CGCAATGCTG CTGCCGACCT ACAACGAAGA CCCGCAAACC GTCCTCGCCC GCCTGCAGGC GACCCGCGAG TCGACCGACG CAACCGGCAG CGGCGCTCAG TTCGACTGGT ATATCCTCAG CGATACGACC GATCCCGCCG TCTGGGCGCT CGAAGAAAAG TGCTTTCTGG CGCTGGCGAC GACCAACACC CGGCTGTTCT ATCGCCACCG CCCCGACAAT CACGCCCGCA AGTCCGGCAA CATCGAGGAT TGGGTCAAGC GGTTCGGCGG CGGCTACGAC TTCATGGTCA TTCTCGACGC CGACAGCGTG ATGACCGGCG ACGCGCTGGT GCGGCTCGCG GCAGCGATGG AGCGGCATCC CGACGTCGGG CTGATCCAGA CTCTTCCGGT CGTCGTCAAT GCGCGTTCGC TGTTCGCGCG GCTGCAGCAG TTCGCCGGGC GGATGTACGG CCCGATGATC GCCGCCGGCG TGGCCTGGTG GCACGGCTCG GAAAGTAATT ACTGGGGCCA CAACGCGATC ATCCGGGTGG CGGCGTTCGC CGCCTGCACC GGCCTGCCGA ACCTGTCCGG GCGCAAGCCG TTCGGCGGCA GCATTCTCAG CCACGATTTC GTTGAAGCGG CATTGTTGCG TCGCGCCGGC TGGCGCATCC ACATGGTGCC GACGCTACGC GGCAGTTACG AAGAGGTGCC GCCGACGATG CTGGATTTCG CGGCGCGTGA CCGGCGCTGG TGCCAGGGCA ATCTGCAGCA CGCCGCGATC CTGCCGGCCC ACGGCCTGCA TTGGGTCTCG CGTTTGCATT TCCTCACCGG GATCGGCGCC TATATCACCG CGCCGATGTG GCTGGCCTTT CTGGTCGCCG GTATCCTGAT TTCGCTGCAG GCGCAGTATG TGCGACCGGA ATACTTCCCG AAGGACTTCT CGTTGTTTCC GGTCTGGCCG GCGCAGGACC CGATCCGCGC GGCCTGGGTG TTCGCCGCCA CGATGGGCCT GCTGATCGTG CCGAAGCTGC TCGCGCTGAT CCTGGTGCTG ATCCGGCGCG ACACGCGGCT GGGCTTCGGC GGCGGCTTTC GCGCTTTCGC GGGCCTGATG TTCGAAACGC TGATGTCGGG CCTCACCGCA CCGGTGATGA TGATCTTCCA ATCTACTGCG GTCGGCCAGA TCGTGATGGG TCAGGATTCC GGCTGGCAGC TTCAACATCG CGGCGACGGC TCGATCCCGT TCGGCGACGT CGCCCGCCGC TACGCGTTGC CGACGCTGAT CGGCATCGCC ATGGCGACCA GCGCGCTGCT GGTCTCCTGG CCGCTGTTTT GGTGGATGAC CCCGGTGGTC CTCGGCCTCG TGCTGGCAAT CCCTGTGGCT GCGCTCACCA ATCGCTCGAC GTCGGCGCGG 
CCCGCGCTGC TGGCAACGCC GGAGGACCTC GATCCGCCGC CGATCCTCGC CCGCGTCCGC GACATCGCCG CGACGCTGCC GGCGTCCGAG GGCGAGGACG ACCCGTTGGT CGCGTTCCGG CAGAACCGTC CCCTGTTCGA CCTCCACATC GCCGGCCTGT CGCACCACCC GCCGCGCGCG CGCGGCCGCG TCGATCCGAA CCTGGCGCTC GCCCGCGCCA TGGTCGACGA CGCGGACTAT TTCGAGGAGA TCGTGGCCTG GCTGAACAAG CCGGAAAAAC GCGCGCTGAT GGGTAACGCG GACCTGCTCC GCCGTGTCGT GGCGATGCCC GCATCGCCGG AACACGCCGG CGAGTCACAC TGA
|
Protein sequence | MGPVARPMGA AEAEIVLGLP DPLPADAPLA MPVQSFTEAP EPVHLPSASS GRWRRSLIML ATALLTYAGV DEMYEVLKVG GVTIIEGIVL ALFAVLFAWV ALSFVSGLAG FAALLRGWRD NLDLDTDGPL PAVSAKVAML LPTYNEDPQT VLARLQATRE STDATGSGAQ FDWYILSDTT DPAVWALEEK CFLALATTNT RLFYRHRPDN HARKSGNIED WVKRFGGGYD FMVILDADSV MTGDALVRLA AAMERHPDVG LIQTLPVVVN ARSLFARLQQ FAGRMYGPMI AAGVAWWHGS ESNYWGHNAI IRVAAFAACT GLPNLSGRKP FGGSILSHDF VEAALLRRAG WRIHMVPTLR GSYEEVPPTM LDFAARDRRW CQGNLQHAAI LPAHGLHWVS RLHFLTGIGA YITAPMWLAF LVAGILISLQ AQYVRPEYFP KDFSLFPVWP AQDPIRAAWV FAATMGLLIV PKLLALILVL IRRDTRLGFG GGFRAFAGLM FETLMSGLTA PVMMIFQSTA VGQIVMGQDS GWQLQHRGDG SIPFGDVARR YALPTLIGIA MATSALLVSW PLFWWMTPVV LGLVLAIPVA ALTNRSTSAR PALLATPEDL DPPPILARVR DIAATLPASE GEDDPLVAFR QNRPLFDLHI AGLSHHPPRA RGRVDPNLAL ARAMVDDADY FEEIVAWLNK PEKRALMGNA DLLRRVVAMP ASPEHAGESH
|
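The relationship between the two sequences above can be spot-checked by translating the first and last codons of the gene. The sketch below assumes the gene is read in frame from its first base and uses the amino acid assignments of translation table 11, which match the standard genetic code for these codons. The `translate` helper and the `gene_start` and `gene_end` excerpts (the first 30 and final 12 bases of the gene sequence) are illustrative names, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGGGTCCCG TAGCCCGGCC GATGGGCGCG"  # first 30 bases of the gene
gene_end = "GAGTCACAC TGA"                        # final 12 bases, in frame

print(translate(gene_start))  # MGPVARPMGA
print(translate(gene_end))    # ESH
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.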