Gene BURPS1106A_1792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1792 
Symbol 
ID4900643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1753644 
End bp1754651 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content73% 
IMG OID640135022 
Producttype II secretion system protein 
Protein accessionYP_001066061 
Protein GI126453046 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4965] Flp pilus assembly protein TadB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0984165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGCG CGGCGCTCTG GGCGCTCGCG CTCGCGCTGC TGTGCGTCGC CGGGGCGTTC 
GCGCTATGGC GGCGCGGCGA GGCGAACAGG GAGCGCGCGC ATGCGGCGCG CTACATCGAC
AGCCGGCTCG AGCCCGGCGC GCGCGCGAGC GCGCAGCCGA AGATGCCGGC CGCGGCCGAG
CCCAAGCGCG CGGCGCCCAT GCCGGCCGCG GGCGCGGCGG GCGGCGCGCG CGCGGAGAAG
CCCGCCGAAG GGCTCGCGCG CTGGCGTGAG CGCGCGGCCG ACGCATGGCT GAACGTGTCG
AACCGCGCGG GCGTGTCCGA GATCCGCGCG CCGCTCGCCG CGCTCGCCGC GACGACGGCC
GTCGCCACGC TGTGGGCGGG CCTGCGCGGC GGGCTGCTCG CCGCCTGCGC GGCGCTCGTC
GCGGGCGCGA CGCTCGCGGT CTTCTGGCTC GTGTCGCGGA TGCAGAAGCG GCGGCTGCGG
ATCGTGCGCC AACTGCCGTC GTTCCTCGAC GGCATCGTGC GTCTCGTCAC GCTCGGCAAC
AGCGTGCCGG CCGCGTTCCA GGCGACGCTG CAGACGACCG AGGCGCCGCT GCGCGGCTGT
CTCGATCACG TGTCGCGGAT GCTGCGCTCG GGCGTCGAGA TCGACCGTGC GATGGTGTCC
ATCGCGGCGC TCTACCGGAT CAAGGAATTC GAGCTCGTCG GCTCGGTGCT GCGGTTGTCC
GTCAAGTACG GCGGCCGCGC CGACGTGATG CTCGACCGAA TGGCCGTGTT CATGCGCGAT
CTCGAGCAGG CCGAGCGCGA GCTCGTCGCG ATGTCGGCGG AGACGCGGCT GTCGGCATGG
GTGCTCGGCG CGCTGCCCGT GGGCATCGGC AGCTTCGTGA TCGCGACGAA TCCGAAATAT
TTCAGCGCGA TGTGGCTTGA CCCGACGGGC CGCCAGCTCG TGTATCTCGC ATTCATCCTG
CAAATCGCCG GCGGCTACTG GCTGTACCGG CTCGCCCGAT TGAGGTGA
 
Protein sequence
MSSAALWALA LALLCVAGAF ALWRRGEANR ERAHAARYID SRLEPGARAS AQPKMPAAAE 
PKRAAPMPAA GAAGGARAEK PAEGLARWRE RAADAWLNVS NRAGVSEIRA PLAALAATTA
VATLWAGLRG GLLAACAALV AGATLAVFWL VSRMQKRRLR IVRQLPSFLD GIVRLVTLGN
SVPAAFQATL QTTEAPLRGC LDHVSRMLRS GVEIDRAMVS IAALYRIKEF ELVGSVLRLS
VKYGGRADVM LDRMAVFMRD LEQAERELVA MSAETRLSAW VLGALPVGIG SFVIATNPKY
FSAMWLDPTG RQLVYLAFIL QIAGGYWLYR LARLR