Gene Information |
Locus tag | BURPS1106A_A2482 |
Symbol | |
ID | 4905350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2445293 |
End bp | 2446468 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640145586 |
Product | capsular polysaccharide biosynthesis/export protein |
Protein accession | YP_001076513 |
Protein GI | 126456441 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
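The coordinate and length fields in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic follows; `gene_length` and `protein_length` are hypothetical helper names, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2445293, 2446468)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1176 391
```

The strand field does not change this calculation: a minus-strand gene spans the same number of bases between its start and end coordinates.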
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.153813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATC GTTCGCTTAG ACCCCTGGCG CTCGCCGTCG CCGCCGCCAC GCTGCTGCAG GCGTGCGCGA CGGCGCCCGG CAACTACCTC GACACGTCGC GTCTCGACGA CAAGGACAGC CAGTCCGCCG AGCATTACAA CGTGCAGCTC ATTACCGCGC AGCTCGTCGT TTCGCAGGCC GACGCGCAGC GCAAGGCTGG GCCGTTGCCG CCGGCGCGCT TCGTCGATCC GATGCAGTAC GTCTACCGGA TCGCGCCGCA GGACATTCTC GGCGTGACCG TCTGGGATCA TCCGGAGCTC ACGACGCCGC AAGGCCAATC GTTCTCGAGC GGCGGCAACA CGACGCAGAC GGTCGCGGGC GCGCTGCAGC AGCCGTATGC GAATGCGTTG CCCGGCCAGG CCGATCCGTA CGGCCAGACG GTGATGTCCG ACGGCACGAT CTACTTTCCG TTCGTCGGCC GCCTGCACGC GGCGGGCAAG ACGGTCGGCC AGGTGCGCGA CGAACTCGCC GCGCGGCTGG CGCGTTACGT GAAGAATCCG CAGGTCGACG TGCGCGTGCT GTCGTATCGC AGCCAGAAGG TGCAGGTGAC CGGCGAAGTG AAGACGCCCG GCCCGCTTGC GATCACCGAT GTGCCGCTCA CGCTCGTGGA CGCGATCACG CGCTCGGGCG GCTCGACGAA CGAGGCCGAC CTGCAGCGCG TGCGCCTCAC GCGCGACGGC AAGTTCTACC AACTCGACGC GAACGGCATG CTCGATCGCG GCGACGTCAC GCAGAACGTG ATGCTGCAGC CGGGCGACAT CGTCAACGTG CCGGACCGCG GCGACAGCCG CGTGTTCGTG ATGGGCGAGG TGAAGACGCC CGCGACGGTG CCGATGCTCA AGGGGCGCTT GACGATCGCG GACGCGCTCA CGGCGGGAGG CGGCATTCTC GATACCGATG CGAATCCGCG TCAGGTGTAC GTGTTGCGCG ATCTGCAGGA CAAACCGAAC ACACCGGACA TCTTCCGCCT CGACATGACG CAGCCCGACG CGCTGATGCT GTCGAGCCGC TTCCAGTTGA AGCCGCTCGA CGTCGTGTAC GTCGGCACGG CGGGATCGGT GCGCTTCAAC CGCCTGCTGC AGCAGATCTT CCCGACGATC CAGTCGATTT ACTACATGAA GCAGATCACG CGCTGA
|
Protein sequence | MLNRSLRPLA LAVAAATLLQ ACATAPGNYL DTSRLDDKDS QSAEHYNVQL ITAQLVVSQA DAQRKAGPLP PARFVDPMQY VYRIAPQDIL GVTVWDHPEL TTPQGQSFSS GGNTTQTVAG ALQQPYANAL PGQADPYGQT VMSDGTIYFP FVGRLHAAGK TVGQVRDELA ARLARYVKNP QVDVRVLSYR SQKVQVTGEV KTPGPLAITD VPLTLVDAIT RSGGSTNEAD LQRVRLTRDG KFYQLDANGM LDRGDVTQNV MLQPGDIVNV PDRGDSRVFV MGEVKTPATV PMLKGRLTIA DALTAGGGIL DTDANPRQVY VLRDLQDKPN TPDIFRLDMT QPDALMLSSR FQLKPLDVVY VGTAGSVRFN RLLQQIFPTI QSIYYMKQIT R
|
| |
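The sequence fields above are printed in space-separated blocks of ten characters. A short Python sketch of how they might be cleaned and cross-checked is given here. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), so a standard codon table is used for translation. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, codons enumerated in TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGCTGAATC GTTCGCTTAG ACCCCTGGCG"
print(translate(fragment))  # MLNRSLRPLA
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.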