Gene BURPS1106A_A0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0566 
Symbol 
ID4905817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp557090 
End bp559492 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content69% 
IMG OID640143672 
Productcapsular polysaccharide biosynthesis/export protein 
Protein accessionYP_001074602 
Protein GI126456078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCGCGC ATGCGCAAAT CGTCCCTTTC GTTCCGTCTG CATCCGCCGG CGCGGACGCG 
CGTTCGGCGG TGCCGCTGCC GCCGATCGAC GGCCTGCCGG CGGCGGACGG CCTGAGCGTG
CGGCAACGCG ACGCGCTTCA GCCGCCGCTG CCGGGCGCGA AGCTGACGAA GGACGCCACG
ACGAAAGACG GCCTGTTGAA GGACGGCGCC GCGCTGAACG AGCCGCCGCG CCCCTGCGGG
CCGCTCGGAT GCCGTGGCGA AGCGATCGAT TGCCCGCCGT CGGCCGACGG CCGGCCGACG
TCCATTACGC GCCGGGCGGA CGGCACGCCC GAATTGCCGG ATCCGTGCGG GCGGCCGCGC
GCCGCTTCGC AACCGGCGCT GACCGATTTT CAGGTCTTCG TGCAGCAGAG CACGGGCCGC
GCGTTGCCGC TGTTCGGCTA CAACTTCTTT TCGACGACGA CGACCTACCG GTCGCTCGAC
AACGTTCCGG TTCCCGACGA TTACGTGCTC GGCCCCGGCG ACGAAGTGCT GCTGCACGCG
TGGGGCGGCG TCGCGGGCGA TCAGCGCCTC GTCGTCGACC GCAACGGTCA GGTGAGCATT
CCGGGCGTCG GTGTCGTGAC GGTGGCGGGC GCGCGCGCGT CCGAGCTCGA TTCGCTGCTG
CGCGGCGGGC TTGGCCGATA TTTCACGGAT TTCAACGTGA ATGCGGCGCT CGGGCGTCTG
CGATCGATCC CGGTGTATGT CGTCGGCAAG GCGATGTCGC CCGGCAGCTA CACGGTGCCC
GGCACGTCGA CGATCATCGG CGCGTTGTTC GCGAGCGGCG GCCCCGCGTT CAACGGCTCG
ATGCGCGCGG TGGCGCTGTA TCGCAATGGC CGGCAGATCG CCGCGCTCGA CGTCTACGGC
TTTCTCACGC GCGGCGCGGT GAAGGACGAT GTCCATCTGC TGCCGGGCGA CGTCATCGTC
ATTCCGCCCG CCGGGCCGCG CGTCGCGCTG ACGGGCGCGC TCGACGCGCC CGGCATCTAC
GAGCTGCGCA AGACGAGCGA AACGTTGCGC GAGCTGCTCG ACGACGCGGG CGGCGTCACC
GCGCTCACGA GCCTCGACCG CGTGATGATC GAACGGGTGA ACCCGGCCGA CGGCGCCGCG
CCGCGTTCCG TGCAGGAAGT GCGGCTCGAC GAAGCGGGGC TCGCGACGGT CGTGAAGGAC
GGCGACATCG TGACGCTGTC GATGGTGTCG CAGAAATTCT CGAACGCCGT GACGCTGCGC
GGCAACGTCG CCGCGGCGCT CAGGTATCCG TACAAACCGG GCATGACGCT CGCCGATTTG
CTGCCGTCCC CGGAGGCCGT GCTGACGCCC GACTACTTCA CGCGCAAGAA CATTCTCGTC
GAATACGCGA AGAGCGGCGG CGCCGACCGG CGGCCGTTCG ACGGCATCCG CAATCTCGTC
GACGAGCCCA ACTGGCACTA CGCGATCATT CAGCGGCTCG ACCCCGTCAC GCTGACCGAG
AACGTGATCG CGTTCAACCT GCGCGCGGTG ATGACGAAAG GCAGCGCCGA GGCCGGCATC
GCGCTGCAGC CCGGCGACGT GGTGACGATT TTCGGCAAGC GGGACCTGCG CAACCCGGTC
GACGACAACA AGAGCCTCGT TCGCGTGGAC GGCGAAGTGC GCGCACCCGG CGTCTACCAG
CTCAACGCGG GCGAATCGCT GCGCGAGCTG CTGCAGCGCG CGGGCGGCCT GACAAGCAAC
GCGTATGTGT TCGGCCTCGA GTTCACGCGC GAATCCACCC GCAGCCAGCA GCAGCAGAAC
CTGAACGCCG CGCTCGACCG CGCGACGTTG CAGGCGAACA GCAAGCTCGC CGCGGCGCTC
GCGAACCTGC CGAGCGGCGA CACGCAGTAT CTGCAGGCTC AGATGGCGGC CGCGCAGCAG
GCGCAACTCG CGCGGCTGCG CGAGCTCAAG CCGACGGGGC GCGTGTCGCT CGAGCTGAGC
ACGCGCGCGA GCCGCATCGG CGATCTGCCC GATCTGCCGC TGAACGACGG CGATGCGATT
TACGTGCCGC CGGTGCCGGC GTTCGTGACC GTCTACGGCG CGGTCGACAA TCAGAACGCG
GTGATCTGGA AGCTGCATCG AACGGTCGCC GATACGTTGA GGGTCGCGGG CGTCCAGCGG
GACATCGCGG ACCTCGACGC CGCATTCGTG CTGCGCGCCG ACGGCAGCGT CGCGTCGGCG
AGCAGCGCCG GCTGGTTCGG CAGCTTCGGC GCGCTCGAGC TGATGCCGGG CGACGCGCTC
GTGGTTCCCG AGAAGCTCGA CCGGCGCACC GCGATGACGA AGTTCCTCGC CGGCCTCAAG
GACTGGTCGC AGGTGCTGGC CAATTTCGGC CTGGGCGCGG CGGCAATCAA GGTGCTCAAG
TAA
 
Protein sequence
MFAHAQIVPF VPSASAGADA RSAVPLPPID GLPAADGLSV RQRDALQPPL PGAKLTKDAT 
TKDGLLKDGA ALNEPPRPCG PLGCRGEAID CPPSADGRPT SITRRADGTP ELPDPCGRPR
AASQPALTDF QVFVQQSTGR ALPLFGYNFF STTTTYRSLD NVPVPDDYVL GPGDEVLLHA
WGGVAGDQRL VVDRNGQVSI PGVGVVTVAG ARASELDSLL RGGLGRYFTD FNVNAALGRL
RSIPVYVVGK AMSPGSYTVP GTSTIIGALF ASGGPAFNGS MRAVALYRNG RQIAALDVYG
FLTRGAVKDD VHLLPGDVIV IPPAGPRVAL TGALDAPGIY ELRKTSETLR ELLDDAGGVT
ALTSLDRVMI ERVNPADGAA PRSVQEVRLD EAGLATVVKD GDIVTLSMVS QKFSNAVTLR
GNVAAALRYP YKPGMTLADL LPSPEAVLTP DYFTRKNILV EYAKSGGADR RPFDGIRNLV
DEPNWHYAII QRLDPVTLTE NVIAFNLRAV MTKGSAEAGI ALQPGDVVTI FGKRDLRNPV
DDNKSLVRVD GEVRAPGVYQ LNAGESLREL LQRAGGLTSN AYVFGLEFTR ESTRSQQQQN
LNAALDRATL QANSKLAAAL ANLPSGDTQY LQAQMAAAQQ AQLARLRELK PTGRVSLELS
TRASRIGDLP DLPLNDGDAI YVPPVPAFVT VYGAVDNQNA VIWKLHRTVA DTLRVAGVQR
DIADLDAAFV LRADGSVASA SSAGWFGSFG ALELMPGDAL VVPEKLDRRT AMTKFLAGLK
DWSQVLANFG LGAAAIKVLK