Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3257 |
Symbol | |
ID | 4881967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3191990 |
End bp | 3193726 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640129185 |
Product | capsular polysaccharide biosynthesis protein |
Protein accession | YP_001060268 |
Protein GI | 126442114 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAGCGC TCGATGCGGC ACTCGAGCAA GACTGTACTG CGGGCACCGC AGCGGCCGTT GCCGAGTTGA TGAAACGCGT GCTCGCCAGT CACGCAATAC GAGGCCGCGA CGGGGTGTCG GAATTTCGTG CGCCCCCTCG GCTGCCCGGC GAAACCCGCG TGTTGCTGAT CGACGAGCGC AAGTATTCGC GAGGGATCGG CGCCGTCGCG ACGCGCAACA ACCGTGGCGC TTTCGAGCGG ATGATCCAGG CTGCCCGCGC GGCCCATCCA AATGCCGAAT TTTGGCTCGC CCGCACGAGA GATCGTGGCT CCGGTGTGTG GCTATCGGCG TCCGCGGCCG ACATCCTCCC TGCCGACATA CAACGCCTGG GTGAACACGA ATCGCTATGC GCCGCGCTGG AGCACGTCGA CCACGTCTAC ACGGTGGGCG CCTCCGAGGG AATGCAGGCG CTGCTGGCCG GCCGGCGAGT GCATGTGTTC GGCGCGCCAT ACTATGCCGG CTGGGGCCTG ACCGACGATG CCGTTCAGTT GCCCGGTCGC CACGCGCGGC CCACACTCGC GGCGTTGTTC GATGTCGTCT TTCTGCGCTT TGCCCGTTAC CTGAATCCCG CCACGCACGC GCCCGGCCGC ATCGACGATC TACTCGACGC GATCGAATGG CAGAACACCG TTCGCCGGCG ATTCGCCGAT CTGCGGCAGG TGGCCGGCAT ACGCTTCCAA TGGTGGAAGC GCCCATTCGC CACCCCATAT CTCACGGCCG GGGGCGGAAC GCTGAGGTGG ACTCGCGACG CAAGCCGTCT GCGCGAAGGG GAGCACGCCG CGCTCTGGGG GGCACGCGGC ACGAACGACT TGTCCCCCGG CACAAGGGTC ATACGCATCG AAGATGGATT CCTGCATTCG ACCGGCCTCG GCTCGGACCA CGTGGCGCCG TGCAGCCAGG TCATCGATCG AAGCGGCCTC TATTTCGATC CGAGCCGGCC GAGCGATCTC ACGACCATTC TGAACGAAAC CGACTTCGAC GATGCCGAAC TGGTCCGGGC GAACAGGCTA CGCCGTGAAA TCGCCCGCCT GGGCCTGACC AAGTACAACC TCGGTCGCCG CAAACCGGCA TGGTCCCCTC CTCCGGGCAA GCGCGTGGTA CTCGTACCCG GTCAGGTGGC GGACGATGCC TCCATCCGGC TCGGCACGCG CGGCATTACG ACCGCGGAAG ATCTCCTTCG CGAGGTTCGC GCCAGGCGCC CGGACGCCTT CATCGTCTAC AAGCCTCACC CGGACGTCCT GTCGGGCAAT CGCCGGGGGG CAATCGAGGT GAATGCATGG GCCGACCTGA TCGAACAGGA TGCCGACCTG ATCTCGCTGA TAGAAGTGGC CGACGAGATC CACACCCTTT CGTCGCTGTC CGGCTTCGAA GCGCTGATCC GCGGCAAGGC CGTGCATACC TATGGTCTGC CGTTCTATGC AGGATGGGGG CTGACGCAGG ACGCGCTCGC GCAACCCTGG CGCAAGCGCA CGCTTTCTCT TGATATGCTG ACAGCCGGCG TGTTGCTGCG CTATCCGGTC TACTGGGATT GGTCTCTCCG GCTGTTCGCC TCGCCCGAAC TCGTTGTTCG GCAACTGGCC ATTCCGGCCG CGCGACCGCT GACGAGTATC CGCGGCGATC GCCTGCGGCC GGTTCGGAAA GCATCCCGCT GGATTGCAAG CTGTCTGCGC CATCTCCTCT GGCAATGCGG AAAGTAG
|
Protein sequence | MQALDAALEQ DCTAGTAAAV AELMKRVLAS HAIRGRDGVS EFRAPPRLPG ETRVLLIDER KYSRGIGAVA TRNNRGAFER MIQAARAAHP NAEFWLARTR DRGSGVWLSA SAADILPADI QRLGEHESLC AALEHVDHVY TVGASEGMQA LLAGRRVHVF GAPYYAGWGL TDDAVQLPGR HARPTLAALF DVVFLRFARY LNPATHAPGR IDDLLDAIEW QNTVRRRFAD LRQVAGIRFQ WWKRPFATPY LTAGGGTLRW TRDASRLREG EHAALWGARG TNDLSPGTRV IRIEDGFLHS TGLGSDHVAP CSQVIDRSGL YFDPSRPSDL TTILNETDFD DAELVRANRL RREIARLGLT KYNLGRRKPA WSPPPGKRVV LVPGQVADDA SIRLGTRGIT TAEDLLREVR ARRPDAFIVY KPHPDVLSGN RRGAIEVNAW ADLIEQDADL ISLIEVADEI HTLSSLSGFE ALIRGKAVHT YGLPFYAGWG LTQDALAQPW RKRTLSLDML TAGVLLRYPV YWDWSLRLFA SPELVVRQLA IPAARPLTSI RGDRLRPVRK ASRWIASCLR HLLWQCGK
|
| |