Gene Information |
Locus tag | BURPS1106A_3289 |
Symbol | |
ID | 4901776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3205515 |
End bp | 3206636 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640136515 |
Product | capsular polysaccharide biosynthesis/export periplasmic protein |
Protein accession | YP_001067526 |
Protein GI | 126453256 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
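
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(3205515, 3206636)
print(length_bp)                  # 1122
print(protein_length(length_bp))  # 373
```

Both values match the Gene Length and Protein Length fields of this record.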
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGCGG TCACGCTCGC CGGTTGCTCA AGTATCCCTA CGTCGGGGGC CAGTGGCGCG CAAATCGCGC GGGCTGCGCA GAGTCCATCC GGAATTCAGA TCGTCGATGT GACCGAGGAT GTCGCGCGCC AGCTGTTTGC TGATCGAAAC ACGGCGGACT TCGTGACGGC GCTGGGCGGC GGTGCGTCGT TCCGGCAACA GTTGGGCGTC GGCGATACGA TTCAGGTGTC CATCTGGGAG GCGCCACCCG CCACGCTTTT TGGCGCGGCT CAGTCGGAAG GGAGTTCGGG GCCGGCGAAC GCGCGCGTGA CGGTGTTGCC CGATCAAGCC ATCGATGGCG ACGGCAATGT CAATATTCCG TTTGCGGGCC AGGTGAAGGC GGCCGGCCGC TCGCCCACGC AGTTGGCGCG TGAGATTGCC GCGCGGCTGA AGAGCATGGC GCACGATCCG CAAGTGCTCG TGAAGCTTTC ACGCAACGAG ACGTCATATG TGACGGTCGT GGGCGATGTG GCCGAAAACG CTCGCATGGC TCTGACCGCT CGGGGCGAGC GCCTGCTTGA TGCATTGGCG AGCGCAGGCG GGGCGAAGCA CCCGGTTGAC AAAGTTACGA TCCAGATAAC GCGCGGCAAG ACGGTGGCCT CGTTGCCGCT CGACATGGTT ATTCGTGATC CGCGGCAGAA CGTCCCGTTG CATGCGGGCG ATGTGGTCAC TGTCCTGTTT CAGCCATATA GCTTTACGGT GCTCGGCGCG ACGGGCAAGA ATGACGAAAT CAATTTTGAA GCGAAGGGCA TCACGCTTGC GCAGGCCCTG GCGCGTGCTG GCGGCTTGCA GGATTCGCGC GCCGATGCAA AGGGCGTATT CATCTTCCGA CTTGAAGACG CCAACGCGCT GAAATGGCCG ACGGCTCCCG TGCGTACGAC TGCGGATGGA AAGGTGCCTG TCGTGTATCG CGTGAATCTT CGCGATCCGA ATTCGTTCTT CGTGGCTCAG AGCTTCAGGG TCGACAACAA CGATCTGTTG TACGTTTCGA ATGCGCCGAT TGCCGAACTT CAAAAATTCT TGAATGTCGT GTTCTCCGTT GCGTATCCGG TGATTACCGG CGTTCAGACA GTCAGGTACT GA
|
Protein sequence | MGAVTLAGCS SIPTSGASGA QIARAAQSPS GIQIVDVTED VARQLFADRN TADFVTALGG GASFRQQLGV GDTIQVSIWE APPATLFGAA QSEGSSGPAN ARVTVLPDQA IDGDGNVNIP FAGQVKAAGR SPTQLAREIA ARLKSMAHDP QVLVKLSRNE TSYVTVVGDV AENARMALTA RGERLLDALA SAGGAKHPVD KVTIQITRGK TVASLPLDMV IRDPRQNVPL HAGDVVTVLF QPYSFTVLGA TGKNDEINFE AKGITLAQAL ARAGGLQDSR ADAKGVFIFR LEDANALKWP TAPVRTTADG KVPVVYRVNL RDPNSFFVAQ SFRVDNNDLL YVSNAPIAEL QKFLNVVFSV AYPVITGVQT VRY
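
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 12 nucleotides of the gene sequence in this record.
print(translate("ATGGGCGCGG TCACGCTCGC CGGTTGCTCA"))  # MGAVTLAGCS
print(translate("GTCAGGTACT GA"))                     # VRY
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be compared against the Protein sequence and GC content fields.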
|
| |