Gene BURPS668_3253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3253 
Symbol 
ID4883374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3187586 
End bp3189133 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content56% 
IMG OID640129181 
Productcapsule polysaccharide exporter 
Protein accessionYP_001060264 
Protein GI126438767 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.51682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCG TTGCATTCGC TGACTTCTTT TCTGGTAATG ACTTGCCGAT GCTGGAATCG 
GTCATTGCTG TTTCGAACCC GCAAGCGGCG CTTGCCCGTG CCAGGGAGCG TTATTCCGAT
TGTTCGCCCG CGATAGATAC GCCGACTGCT TTGGGATATG TATTCAATTC GTGGCGTTTC
CTGCTCGAAA CCGGCGACGG TCAGGTGCGC CTGCGCATTC AATCGGAATT CGAGGGTACG
GACCTCGAGC ATCGCCAACT CGTCTTGAGA TATTTGCTGA CCGATGCGCG CGCATCGTCC
ATTTCCGTCG ACGAATTGCT TCCCGCGTCA TCGCCATACT CGGTGGACGC GGCGGAACTG
GTTGATAATC CTTCGTCCGT CGTGTCGAAA CGAACCACCA TGGAGAGCTT CGATGCTCCG
GCGCAGGAGT CGTCATGGTT CTGGCGGGTA TGGACGATCG GGCGGCTGAG GCTGACCCCG
TTGTTTCTGG TGACGGTGGT GCTGCCGACT GCAATCGCTA CCCTCTACTA TGGGCTGATT
GCGTCGGATG TCTACATCTC GGAATCGCAG TTTATCTTGC GTACGCCCAA GCGAACGAGC
GAACCGGGTT TGGGGGCATT GCTTCAGGGC GTCTCCCTAT CAAAGACCAG TGACGACACC
AGTGTCGTTC AAGATTACAT AAAGTCGAGA GACGCACTCG TCGTTCTCGA GCAGAAAATG
CCGCTCAGAG ATGCATTTGG CGGTCGCTAT GCTGATATCT TCAGTCGATT CCCGGGCGTG
ATCGGGCGCG ATGGCTTCGA GTATTTCTAC CGCTATTTCA AAAACCATGT TGCCGTCGAC
GTAGGGACTG CAACTGCGGT CACTACGCTA CGCGTTCAGG CCTATACGTC GAAAGACGCA
TACCTGATCA ACAAGTATTT GCTCGAGATG GCGGAGAAGC GAGTCAATGA ACTCAACGAT
CGCTCACGTC AGGATTCGCT CCGTTACGCC CTTGCGGAGG TGGCGGGAGC GGAAGCGAAG
GTAAAGGCTG CGTCGACGGC GTTGTCGAAA TTCCGAAGCA AGGCGAGTCT GTTCGATCCG
GATCGGCAGT CCAATATCCA ACTGGAACTG GTAGCGAAGT TGCGGGGCGA GCTCATTGCC
AAGCAGGCGC AGCTATCGCA ATTGCGCATG CTCAGCCCGC AGAATCCGCA GATCCCGAGT
CTGGCCGCGT CTATCGGCGC GTTGGAAGCG TCGATTGCGA GCGAGAAAGG GGGCGTGGCG
GGAGAGAAGA ATTCGTTGTC GGACCGTTCG GTCGAGTACC AGCGCCTGGC TTTGGAGCAA
TCGTTCGGGG AGAAGCTTCT TGCCTCTGCG CTGACATCGT TGGAGCAAGC CAGGGCAGAT
GCGGAAAGAA AACAAATCTA TCTGGAACGG GTTGCGGAAC CGAACGAACC GGACGTGGCG
ATGGAACCCA AGCGTGTCCG GAACATATTT GCCTGTTTCA TTTTGGGGCT CGCGGCCTGG
GGCGTGTTGA GCATGCTGGT AGCGGGTATT CGTGAACATC AGGAATGA
 
Protein sequence
MNGVAFADFF SGNDLPMLES VIAVSNPQAA LARARERYSD CSPAIDTPTA LGYVFNSWRF 
LLETGDGQVR LRIQSEFEGT DLEHRQLVLR YLLTDARASS ISVDELLPAS SPYSVDAAEL
VDNPSSVVSK RTTMESFDAP AQESSWFWRV WTIGRLRLTP LFLVTVVLPT AIATLYYGLI
ASDVYISESQ FILRTPKRTS EPGLGALLQG VSLSKTSDDT SVVQDYIKSR DALVVLEQKM
PLRDAFGGRY ADIFSRFPGV IGRDGFEYFY RYFKNHVAVD VGTATAVTTL RVQAYTSKDA
YLINKYLLEM AEKRVNELND RSRQDSLRYA LAEVAGAEAK VKAASTALSK FRSKASLFDP
DRQSNIQLEL VAKLRGELIA KQAQLSQLRM LSPQNPQIPS LAASIGALEA SIASEKGGVA
GEKNSLSDRS VEYQRLALEQ SFGEKLLASA LTSLEQARAD AERKQIYLER VAEPNEPDVA
MEPKRVRNIF ACFILGLAAW GVLSMLVAGI REHQE