Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3217 |
Symbol | |
ID | 4884761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3146869 |
End bp | 3148065 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640129145 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001060228 |
Protein GI | 126439364 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3562] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTCG TCGTCGTCGA TTCGATGGAG CGGTACTACT TTGCCGTGCG GCTCGTGAAG GCGGTCAGAA AGGAATTCGA CTTTCTGTTC GCCGCGAGCG AGCCGCTCGC GCACCTGATG GCACTGGCCG CGGGCTTTCG GTCGGTCTAT CTGCGGCGCG GGGCGCATGC GCCCGCCATG CTCGATGCGG CGGCCGGGAT TCGGTCCGAT GCATCGATCG AGGTGCTGAA CGGACAGATG ACGCCCGAGC GCGCGCGCGC CGACGCGCAG GCGGTCTTCG CCGCGATGTC CGGGGTGTTC AGCCGTCATC TCGTGTCGCA GTGCCTGATG TGGAACGGCC AGCAGCTCGT TTGCCGCGCG GTCGCGCATG CGTGTGCGGC GCACGGGGTA CCGACGAAAT TCGTCGAGAT CTCGAATTTG CCGGACAAGC TGTTCGTCGA TCGGCTCGGC GTCAACGCGC TGTCGTCGAT CAGCCGCAAT CCGGCCGTCA TCGACGGCTT GCCGATGCCG ACCGAAGGCG AGCATCGCCG CTGGTTCGCG CGCTACGAGG CGTACAAGGC GCGGCCGCTG CCGCAGTCGC GCACGTCGTG GGCGCGCAAG GCGATGTCGG CCGCGAACCA TGCGCTGAAG CTCGCCACGC AAGGCGTGGC GCGCAAACGT TTGGACGCGG CGCGCGCGAC GAACGGCGCA CGCGCGCCGG CGCAGGCGAA AGTGCTGAGC ACGAAGGAAC TCTCGGCGCT GCGCTACGTG TTCCTGCCGC TGCAGGTGTC GGGCGACACG CAGATCAAGC TGCATTCGGA TGTCGACAAC CTGAAGGCGA TCCGGCTCGC GTTCGAGCAC GCGGCGAACG AGAACGCGGA CTTGATCGTC AAGCTGCATC CGGCCGAGCG TGACGTGGCC GTGATCGACG AGGTGGTGCG GATGCAGCGC GTCTATCACT TCGACCTCGT GACGTCGCCG ACCACCGATC TGATCAAGCA CGCGCATTCG GTCGTCACGA TCAATTCGAC GGTCGGCCTC GAGGCGCTGC TGTACGGCAA GCCGGTCGTG TCGCTCGGCC GGTGCTTCTA CAAGGAGTTC GATCGCGCGA GGCTGCTCAA GTACATCCAT GCGTTCCTGA TCGACGGCAT CGATTACTTC GGTCGAGCGG ACATCGCGCC GCGCGCCGCG CGAAACGTGT TCTCGATGAA GCACTGA
|
Protein sequence | MIVVVVDSME RYYFAVRLVK AVRKEFDFLF AASEPLAHLM ALAAGFRSVY LRRGAHAPAM LDAAAGIRSD ASIEVLNGQM TPERARADAQ AVFAAMSGVF SRHLVSQCLM WNGQQLVCRA VAHACAAHGV PTKFVEISNL PDKLFVDRLG VNALSSISRN PAVIDGLPMP TEGEHRRWFA RYEAYKARPL PQSRTSWARK AMSAANHALK LATQGVARKR LDAARATNGA RAPAQAKVLS TKELSALRYV FLPLQVSGDT QIKLHSDVDN LKAIRLAFEH AANENADLIV KLHPAERDVA VIDEVVRMQR VYHFDLVTSP TTDLIKHAHS VVTINSTVGL EALLYGKPVV SLGRCFYKEF DRARLLKYIH AFLIDGIDYF GRADIAPRAA RNVFSMKH
|
| |