Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3292 |
Symbol | |
ID | 4902265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3208289 |
End bp | 3210025 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640136518 |
Product | capsular polysaccharide biosynthesis protein |
Protein accession | YP_001067529 |
Protein GI | 126451734 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.333396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAGCGC TCGATGCGGC ACTCGAGCAA GACTGTACTG CGGGCACCGC AGCGGCCGTT GCCGAGTTGA TGAAACGCGT GCTCGCCAGT CACGCAATAC GAGGCCGCGA CGGGGTGTCG GAATTTCGTG CGCCCCCTCG GCTGCCCGGC GAAACCCGCG TGTTGCTGAT CGACGAGCGC AAGTATTCGC AAGGGATCGG CGCCGTCGCG ACGCGCAACA ACCGTGGCGC GTTCGAGCGG ATGATCCGGG CTGCCCGCGC GGCCCATCCA GATGCCGAAT TTTGGCTCGC CCGCACGAGA GATCGTGGCT CCGGTGTGTG GCTATCGGCG TCCGCGGCCG ACATCCTCCC TGCCGACATA CACCGCCTGG GTGAACACGA ATCGCTATGC GCCGCGCTGG AGCACGTCGA CCACGTCTAC ACGGTGGGCG CCTCCGAGGG AATGCAGGCG CTGCTGGCCG GCCGGCGAGT GCATGTGTTC GGCGCGCCAT ACTATGCCGG CTGGGGCCTG ACCGACGATG CCGTTCAGTT GCCCGGTCGC CACGCGCGGC CCACACTCGC GGCGTTGTTC GATGTCGTCT TTCTGCGCTT TGCCCGCTAC CTGAATCCCG CCACGCACGC GCCCGGCCGC ATCGACGATC TACTCGACGC GATCGAATGG CAGAACACCG TTCGCCGGCG ATTCGCCGAT CTGCGGCAGG TGGCCGGCAT ACGCTTCCAA TGGTGGAAGC GCCCATTCGC CACCCCATAT CTCACGGCCG GGGGCGGAAC GCTGAGGTGG ACTCGCGACG CAAGCCGTCT GCGCGAAGGG GAGCACGCCG CGCTCTGGGG GGCACGCGGC ACGAACGACT TGCCCCCCGG CACAAGGGTC ATACGCATCG AAGACGGATT CCTGCATTCG ACCGGCCTCG GCTCGGACCA CGTGGCGCCG TGCAGCCAGG TCATCGATCG AAGCGGCCTC TATTTCGATC CGAGCCGGCC GAGCGATCTC ACGACCATTC TGAACGAAAC CGACTTCGAC GATGCCGAAC TGGTCCGGGC GAACAGGCTA CGCCGCGAAA TCGCCCGCCT GGGCCTGACC AAGTACAACC TCGGTCGCCG CAAACCGGCA TGGTCCCCTC CTCCGGGCAA GCGCGTGGTA CTCGTACCCG GTCAGGTGGC GGACGATGCC TCCATCCGGC TCGGCACGCG CGGCATTACG ACCGCGGAAG ATCTCCTTCG TGAGGTTCGC GCCAGGCGCC CGGACGCCTT CATCGTCTAC AAGCCTCACC CGGACGTCCT GTCGGGCAAT CGCCGGGGGG CAATCGAGGT GAATGCATGG GCCGACCTGA TCGAACAGGA TGCCGACCTG ATCTCGCTGA TAGAAGTGGC CGACGAGATC CACACCCTTT CGTCGCTGTC CGGCTTCGAA GCGCTGATCC GCGGCAAGGC CGTGCATACC TATGGTCTGC CGTTCTATGC AGGATGGGGG CTGACGCAGG ACGCGCTCGC GCAACCCTGG CGCAAGCGCA CGCTTTCTCT TGATATGCTG ACAGCCGGCG TGTTGCTGCG CTATCCGGTC TACTGGGATT GGTCTCTCCG GCTGTTCGCC TCGCCCGAAC TCGTTGTTCG GCAACTGGCC ATTCCGGCCG CGCGACCGCT GACGAGTATC CGCGGCGATC GCCTGCGGCC GGTTCGGAAA GCATCCCGCT GGATTGCAAG CTGTCTGCGC CATCTCCTCT GGCAATGCGG AAAGTAG
|
Protein sequence | MQALDAALEQ DCTAGTAAAV AELMKRVLAS HAIRGRDGVS EFRAPPRLPG ETRVLLIDER KYSQGIGAVA TRNNRGAFER MIRAARAAHP DAEFWLARTR DRGSGVWLSA SAADILPADI HRLGEHESLC AALEHVDHVY TVGASEGMQA LLAGRRVHVF GAPYYAGWGL TDDAVQLPGR HARPTLAALF DVVFLRFARY LNPATHAPGR IDDLLDAIEW QNTVRRRFAD LRQVAGIRFQ WWKRPFATPY LTAGGGTLRW TRDASRLREG EHAALWGARG TNDLPPGTRV IRIEDGFLHS TGLGSDHVAP CSQVIDRSGL YFDPSRPSDL TTILNETDFD DAELVRANRL RREIARLGLT KYNLGRRKPA WSPPPGKRVV LVPGQVADDA SIRLGTRGIT TAEDLLREVR ARRPDAFIVY KPHPDVLSGN RRGAIEVNAW ADLIEQDADL ISLIEVADEI HTLSSLSGFE ALIRGKAVHT YGLPFYAGWG LTQDALAQPW RKRTLSLDML TAGVLLRYPV YWDWSLRLFA SPELVVRQLA IPAARPLTSI RGDRLRPVRK ASRWIASCLR HLLWQCGK
|
| |