Gene BURPS1106A_3289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3289 
Symbol 
ID4901776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3205515 
End bp3206636 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID640136515 
Productcapsular polysaccharide biosynthesis/export periplasmic protein 
Protein accessionYP_001067526 
Protein GI126453256 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCGG TCACGCTCGC CGGTTGCTCA AGTATCCCTA CGTCGGGGGC CAGTGGCGCG 
CAAATCGCGC GGGCTGCGCA GAGTCCATCC GGAATTCAGA TCGTCGATGT GACCGAGGAT
GTCGCGCGCC AGCTGTTTGC TGATCGAAAC ACGGCGGACT TCGTGACGGC GCTGGGCGGC
GGTGCGTCGT TCCGGCAACA GTTGGGCGTC GGCGATACGA TTCAGGTGTC CATCTGGGAG
GCGCCACCCG CCACGCTTTT TGGCGCGGCT CAGTCGGAAG GGAGTTCGGG GCCGGCGAAC
GCGCGCGTGA CGGTGTTGCC CGATCAAGCC ATCGATGGCG ACGGCAATGT CAATATTCCG
TTTGCGGGCC AGGTGAAGGC GGCCGGCCGC TCGCCCACGC AGTTGGCGCG TGAGATTGCC
GCGCGGCTGA AGAGCATGGC GCACGATCCG CAAGTGCTCG TGAAGCTTTC ACGCAACGAG
ACGTCATATG TGACGGTCGT GGGCGATGTG GCCGAAAACG CTCGCATGGC TCTGACCGCT
CGGGGCGAGC GCCTGCTTGA TGCATTGGCG AGCGCAGGCG GGGCGAAGCA CCCGGTTGAC
AAAGTTACGA TCCAGATAAC GCGCGGCAAG ACGGTGGCCT CGTTGCCGCT CGACATGGTT
ATTCGTGATC CGCGGCAGAA CGTCCCGTTG CATGCGGGCG ATGTGGTCAC TGTCCTGTTT
CAGCCATATA GCTTTACGGT GCTCGGCGCG ACGGGCAAGA ATGACGAAAT CAATTTTGAA
GCGAAGGGCA TCACGCTTGC GCAGGCCCTG GCGCGTGCTG GCGGCTTGCA GGATTCGCGC
GCCGATGCAA AGGGCGTATT CATCTTCCGA CTTGAAGACG CCAACGCGCT GAAATGGCCG
ACGGCTCCCG TGCGTACGAC TGCGGATGGA AAGGTGCCTG TCGTGTATCG CGTGAATCTT
CGCGATCCGA ATTCGTTCTT CGTGGCTCAG AGCTTCAGGG TCGACAACAA CGATCTGTTG
TACGTTTCGA ATGCGCCGAT TGCCGAACTT CAAAAATTCT TGAATGTCGT GTTCTCCGTT
GCGTATCCGG TGATTACCGG CGTTCAGACA GTCAGGTACT GA
 
Protein sequence
MGAVTLAGCS SIPTSGASGA QIARAAQSPS GIQIVDVTED VARQLFADRN TADFVTALGG 
GASFRQQLGV GDTIQVSIWE APPATLFGAA QSEGSSGPAN ARVTVLPDQA IDGDGNVNIP
FAGQVKAAGR SPTQLAREIA ARLKSMAHDP QVLVKLSRNE TSYVTVVGDV AENARMALTA
RGERLLDALA SAGGAKHPVD KVTIQITRGK TVASLPLDMV IRDPRQNVPL HAGDVVTVLF
QPYSFTVLGA TGKNDEINFE AKGITLAQAL ARAGGLQDSR ADAKGVFIFR LEDANALKWP
TAPVRTTADG KVPVVYRVNL RDPNSFFVAQ SFRVDNNDLL YVSNAPIAEL QKFLNVVFSV
AYPVITGVQT VRY