Gene BURPS1710b_3289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3289 
SymbolwcbO 
ID3690450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3598043 
End bp3599350 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content59% 
IMG OID637729744 
Productcapsular polysaccharide export protein 
Protein accessionYP_334660 
Protein GI76809884 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTGAAA CGGCGTGGGT GACGGGCTGC AAGGCGGGTG TTGTCATTTT GTCGGCGACT 
TGTTCGAGCG CGTCGGCCTT CATAACTCAA TCAGATAAGA ATCGTATGTC CCGCTTCTTC
CTTGCCCTGC AAGGCACGGC CTCTCCTTTT TTTGGTCGAC TCGCTGCCGG TCTCGGCCAG
CGGGGCCACC AGGTTCGGCG TGTGAATTTT TGCGGCGGAG ATCTCGCGTA TCAAGGTTCG
GAAAGCGCTT GGAACTATCG CGACGAACCC GAAGGCCTGG TTGCGTGGTA TCGCGATGCC
ATTGCGACCA ATGGAGTGAC GGATGTGCTT CTGTTTGGCG ACTGCCGTGC GATCCACCGG
CCGATGCATG AGATCGCTCG CGCATCGGGG GTGCGTGTTC ACGTATTCGA AGAGGGGTAT
GTTCGACCGC ACTGGATCAC AATGGAAAGG CACGGCGTCA ACGGCCGATC GTTGCTGCCG
CGCGACCCGG CTTACTATCT CGACGCACGC CGGCATATCC CGCCAGCGGT ACCCGGGAAA
CCGACCGGCT ACAACCTGTA CGAGCGCGCC TGCCACGATA TCAGGTATCG CATGGCCAAC
GCGTTGTACG CGCATCGTTT CCCGCATTAC AAGTCGCACC GTCCGAGAAA CGGCTTACAG
GAGTACGCGG GCCTCGCGTA TCGCGCCGTT CAGCAACACG TGCGCGATAG GGAGGCCGAG
AACGTCACCC GTGATCTGCT GGAACGAAAA CGCCGCTACT ATCTGTTTCC GCTGCAGCTC
AATTCCGACT CCCAGATCGT CGATCATTCC CCTTTTGGCG GCATTTGCGA CGCGATAGCG
ATTGTTTTAC ACTCATTCGC CGAAAATGCG CCCGACGACA GTTGGCTTGT CATCAAGAAT
CATCCGTTGG ACACCGGTCT GATCGGCTAC CGTCAATTTG CAACGGCATT GGCCACTGAA
CTGGGTATCG AGAAGAGAAT GGCCTTCATC GATGCGGGCC ACTTGCCGAC GTTACTCGAT
CAATGTCGTG GCGTGGTCGT GATAAACAGC ACGGTCGGTT TGTCCGCCGT CCACCATCGA
CGCCCGCTCG TTGCATTGGG CACCGCGATC TATTCGATGC CGGGGCTGAC TTGGCAAGGC
AGCCTGGCGG ACTTTTGGAC GGAGGCTGGT AGCCCGGACA TGAATCTCTA TCAGGCTTTT
CTCGACTACG TGATGCACCA TACGCAGATC AACGGAGATT TCTATACGCG CACCGGTATA
GAGATGAGCG TCGCCGGCGC CGTGAGCCGG CTCGAGGCGG TGTCGTGA
 
Protein sequence
MGETAWVTGC KAGVVILSAT CSSASAFITQ SDKNRMSRFF LALQGTASPF FGRLAAGLGQ 
RGHQVRRVNF CGGDLAYQGS ESAWNYRDEP EGLVAWYRDA IATNGVTDVL LFGDCRAIHR
PMHEIARASG VRVHVFEEGY VRPHWITMER HGVNGRSLLP RDPAYYLDAR RHIPPAVPGK
PTGYNLYERA CHDIRYRMAN ALYAHRFPHY KSHRPRNGLQ EYAGLAYRAV QQHVRDREAE
NVTRDLLERK RRYYLFPLQL NSDSQIVDHS PFGGICDAIA IVLHSFAENA PDDSWLVIKN
HPLDTGLIGY RQFATALATE LGIEKRMAFI DAGHLPTLLD QCRGVVVINS TVGLSAVHHR
RPLVALGTAI YSMPGLTWQG SLADFWTEAG SPDMNLYQAF LDYVMHHTQI NGDFYTRTGI
EMSVAGAVSR LEAVS