Gene BURPS1106A_A2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2091 
Symbol 
ID4906310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2054027 
End bp2055313 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content70% 
IMG OID640145196 
Producttype III secretion system protein PrgH/EprH 
Protein accessionYP_001076124 
Protein GI126457081 
COG category 
COG ID 
TIGRFAM ID[TIGR02554] type III secretion system protein PrgH/EprH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0979619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC CAGATTCCGG ACTTGAAGCT CTGCAACTCC GAATCCTGTT CGGTCCGCTA 
TTCGGCTCGG ATATCGCGAT TCCGTCAGGG GAAGTATTTT TCTGCGTCGG CGAGCAGGTG
ATCGACGATC GTCCGGCGGA GCATCCGGAA AATCGCGCCG GCCATTTACT GGAGCGCGCG
GTCGATACGC TGTATATCCC GCACCGGGCC GGCGCGCCGA ATTTCCGCCT GCGTTTTCCG
GGCGCGCCGA CGCAGGCGGC GCGAACCGCC GAAACCGGCG AAGCCGCGCC CGGCGATTTC
GAAGTCGATT TTCTGTCGGC GGACGGTTGC GTCACGCAAC GCGCGGCATT CAACACCGTC
TGCCGCTTCG GCGATATCGC GTTCGCGCTC AGGCGTCAGC GCGAGCCATG GAGCGAGGCG
GTCATGCACT ACGCGCCGCA CGCGCCTTCG CGTGCGGCGG ACGCCGCCGA GCCGGGCGCG
CCCGGTGAGC CCGGCGATGG CGGCGAGCGC GCATCGCGCT TCGCGCTGAA GCTCGGCGCG
CTGCTCGTCG CGGGGGTCGC GCTCGCGGCG CTCGCGTACT GGCAGGTGCA GCGCTATGTC
GGCGCGCAGA AGCTCGCGAG CGTCAACGGC GTGCTGGCGG GCGCGCCCGT GCCCAACGCG
ATCCTGCCCG GCGACGACGG CCGGATCTAC GTGCTGAGCG CGTCGCAGGA CGGCGCCGAA
TGGGACCGCG AGGCGCTGCT GAAGGCGGCG CTGCCGGAGA AGATCGAAGT CGCCGTGATC
GGCGCGGAGC GGCAACGCGT CGAGCGCCGG CTCGACGAAG CCGGCGTCGA TTTCGTGACC
GTGCGCCTCG ACGCGCCCGA GCACCCGGAG CTGATCCTCA CCGGCGCCGC GCCCGCCGCC
GCGCGCGCAC GCGCGATCGG CGAGCTGCGG CACGCGGCCC CGTACGTCCG GGACGTGCGC
GTGATCGACG CGAGCCTCGG CGCGATCGAG CAGGAGGCGC GCAACGCGCT CGACAAGGTG
GGCGCGCGCT ACCGGCTGCT CGCGCGGCGC GGCGGCGCGA CGTTCGAGGT GGCGAGCTCG
TTCGGCGACG AGGAGCTCGC CGCCTTGCAG AACCTCATGC GCTCGTTCGG CCACAAGTGG
GGCACGCGCC GCGTCGATTT CAAGATCGCG CTGCGCACCG ACTGGCTGAA GGGCAAATCG
TATCGGGAAG GCGGCGACGG CTACGTGCTG CTCGATCACG CGTCCTGGTA TTTCCCGCAA
CCCCTGGAAG GAGCACATTA CCGATGA
 
Protein sequence
MNKPDSGLEA LQLRILFGPL FGSDIAIPSG EVFFCVGEQV IDDRPAEHPE NRAGHLLERA 
VDTLYIPHRA GAPNFRLRFP GAPTQAARTA ETGEAAPGDF EVDFLSADGC VTQRAAFNTV
CRFGDIAFAL RRQREPWSEA VMHYAPHAPS RAADAAEPGA PGEPGDGGER ASRFALKLGA
LLVAGVALAA LAYWQVQRYV GAQKLASVNG VLAGAPVPNA ILPGDDGRIY VLSASQDGAE
WDREALLKAA LPEKIEVAVI GAERQRVERR LDEAGVDFVT VRLDAPEHPE LILTGAAPAA
ARARAIGELR HAAPYVRDVR VIDASLGAIE QEARNALDKV GARYRLLARR GGATFEVASS
FGDEELAALQ NLMRSFGHKW GTRRVDFKIA LRTDWLKGKS YREGGDGYVL LDHASWYFPQ
PLEGAHYR