Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3223 |
Symbol | |
ID | 4884849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3153423 |
End bp | 3154571 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640129151 |
Product | sugar transferase family protein |
Protein accession | YP_001060234 |
Protein GI | 126442241 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCATG CGGGAAGCGC GTTCGGCATG AACGTAGAAG TACGAATCAA ATTATTGACG AATCGCAATC GAGGTTCACT TATGCCAGCG ATATTTCCCA GGGTCGTCGA CATTGCGGTG ATCGTGTTCG GCGCGTTCCT TCCGATTCTG ATCGAGGCGT CGGGCGGCCC GCATGCCGAG ATCTTCGACG GCACGCTGGT GGCGTTCGCC GCCGCGCTAT CGTTGTCGGT CTTTCCCGCG TGCGGGATCT ATGAGGCGTC CCGGCGGCGC TCGCCCGTGC ATCTGATCAG CCGCACGGCG CTCGCGTGGC TCGTCGTGCA GGGCGGCACC GTCGTGCTGC TGTACGTGCT GCATCGCGCG CAGATCCTGT CGAGCGCCTG GTTCGCCTAC TGGACCGTGA CGACGGGCAT TGGGCTGCTG ATCTTTCGCG CAGTGACGCT CGCGATCTTC GGTCTCCTCG CGCGCGCGAG CCGGCAGGTG AAGGCCGCGA CGCTCGATCA GGTGGGGCAC GTCGCGCAGC GCATGCGCAC GTCGGGTATC GCCAAGCGCA TCGTCAAGCG CGCGTTCGAC GTTGCCGCCG CGTCGTGCCT GATCGTCGTG CTGTCGCCCG CGCTCGCCGT CATCGCGTTC CTCGTGAAGC GCGACGGCGG CCCCGCCGTG TTCGGGCACG TGCGCATCGG GCGCGACGGC CGTCCGTTCA AGTGCCTGAA GTTCCGCTCG ATGGTGATGA ACGCCGACGC CGTGCTGAAG GCGCTGCTCG AGCGCGACCC GCACGCGCGC GCGGAATGGG AGCGCGAGTT CAAGTTGAAG AACGACGTGC GGATCACGCC GATCGGCCGC TTCCTGCGCC GCAGCAGTCT CGACGAGCTG CCGCAGTTGA TGAACGTCGT GAGAGGCGAG ATGAGCCTGG TCGGGCCGCG TCCGGTCGTC GAGGCCGAGC TCGCGCGCTA CGGCGAGGAC GTGCGCTATT ACCTCGCCGC GAAGCCCGGC ATGACGGGTC TCTGGCAAGT GAGCGGGCGC AACGATACCA GCTACGCGAC ACGGGTGTCG CTCGACGTGT CGTACGTGAA GGAATGGTCG CTTCGCCGCG ACCTGGTCAT CCTGTTGAAG ACCGTCAACG TCGTCCTGCG GGGATCGGGT GCGTATTGA
|
Protein sequence | MPHAGSAFGM NVEVRIKLLT NRNRGSLMPA IFPRVVDIAV IVFGAFLPIL IEASGGPHAE IFDGTLVAFA AALSLSVFPA CGIYEASRRR SPVHLISRTA LAWLVVQGGT VVLLYVLHRA QILSSAWFAY WTVTTGIGLL IFRAVTLAIF GLLARASRQV KAATLDQVGH VAQRMRTSGI AKRIVKRAFD VAAASCLIVV LSPALAVIAF LVKRDGGPAV FGHVRIGRDG RPFKCLKFRS MVMNADAVLK ALLERDPHAR AEWEREFKLK NDVRITPIGR FLRRSSLDEL PQLMNVVRGE MSLVGPRPVV EAELARYGED VRYYLAAKPG MTGLWQVSGR NDTSYATRVS LDVSYVKEWS LRRDLVILLK TVNVVLRGSG AY
|
| |