Gene BURPS1106A_A2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2202 
Symbol 
ID4906289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2151501 
End bp2152355 
Gene Length855 bp 
Protein Length284 aa 
Translation table11 
GC content72% 
IMG OID640145307 
ProductYscJ/HrcJ family type III secretion apparatus lipoprotein 
Protein accessionYP_001076235 
Protein GI126457668 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4669] Type III secretory pathway, lipoprotein EscJ 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain
[TIGR02544] type III secretion apparatus lipoprotein, YscJ/HrcJ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCATGA AACCACTCCG TCTCCCGATT TCCGCCGCCG GCGCGCGCCG CGCCGCCCGC 
CTCGCCGCGC TCGTCGCGTG CGTGGCGCTC TTCGCCGGCT GCCGGCAGGA GCTGTACGGC
GGCCTCGCCG AGCGCGACTG CAACGAGATG ATGGCCGCGC TGCTGCAAAA CGGCGTCGAC
GCGCAGAAGA AGACGCCCGA CGGCGGCAAG ACATGGACGC TCGCCGTCGA CGACAAGCAG
ATCGTCAAGG CGATGGAAGT GCTGCGCGCG CGCGGGCTGC CCGCGACGCG CTACGACGAT
CTCGGCGCGC TGTTCAAGAA GGACGGCCTC GTGTCGACGC CGACCGAGGA GCGCGTGCGC
TTCATCTACG GCGTGTCGCA GGAGCTGTCG GACACGCTGT CGAAAATCGA CGGCGTCGTC
GTCGCGCGCG TGCACATCGT GCTGCCGAAC AACGATCCGC TCGCGCAGGT CGCGAAGCCC
TCGTCGGCCT CGGTGTTCAT CAAGTACCGG CCGAACGCGA ATCTCGCGAC GCTCACGCCG
CAGATCAAGA ACCTCGTCGT TCATAGCGTC GAAGGGCTGA CGTACGACGA AGTGAGCGTC
ACCTCCGTCG CGGCCGATCC GGTCGATCTC GTGTCGGCCG CGCAGCCCGC CGCGCAGAAC
TCCCGCGGCG CGACGCTCGT CGGCGTGCTG ATCGCGCTCG CCGTGGGCGG CGCGCTCGCG
GCCGCGGGCG GCGCGCTGTG GTGGCGCGCG CGCAAGCGCG GCGGCGGCGC GGGCGCGCAC
GGGATCGCCG CGCGGCCGCG CGGCGGCGCC CGCGACGCGA AGGCCGCCGC GCCCCGGCAG
GCCGGCGCGC AATGA
 
Protein sequence
MIMKPLRLPI SAAGARRAAR LAALVACVAL FAGCRQELYG GLAERDCNEM MAALLQNGVD 
AQKKTPDGGK TWTLAVDDKQ IVKAMEVLRA RGLPATRYDD LGALFKKDGL VSTPTEERVR
FIYGVSQELS DTLSKIDGVV VARVHIVLPN NDPLAQVAKP SSASVFIKYR PNANLATLTP
QIKNLVVHSV EGLTYDEVSV TSVAADPVDL VSAAQPAAQN SRGATLVGVL IALAVGGALA
AAGGALWWRA RKRGGGAGAH GIAARPRGGA RDAKAAAPRQ AGAQ