Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2179 |
Symbol | |
ID | 4886890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2115415 |
End bp | 2116701 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640132116 |
Product | type III secretion system protein PrgH/EprH |
Protein accession | YP_001063173 |
Protein GI | 126445031 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02554] type III secretion system protein PrgH/EprH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.767802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC CAGATTCCGG ACTTGAAGCT CTGCAACTCC GAATCCTGTT CGGTCCGCTA TTCGGCTCGG ATATCGCGAT TCCGTCAGGG GAAGTATTTT TCTGCGTCGG CGATCAGGTG ATCGACGATC GTCCGGCGGA GCATCCGGAA AATCGCGCCG GCCATTTACT GGAGCGCGCG GTCGATACGC TGTATATCCC GCACCGGGCC GGCGCGCCGA ATTTCCGCCT GCGTTTTCCG GGCGCACCGA CGCAGGCGGC GCGAACCGCC GAAACCGGCG AAGCCGCGCC CGGCGATTTC GAAGTCGATT TTCTGTCGGC GGACGGTTGC GTCACGCAAC GCGCGGCATT CAACACCGTC TGCCGCTTCG GCGATATCGC GTTCGCGCTC AGGCGTCAGC GCGAGCCATG GAGCGAGGCG GTCATGCACT ACGCGCCGCA CGCGCCTTCG CGTGCGGCGG ACGCCGCCGA GCCGGGCGCA CCCGGTGAGC CCGGCGATGG CGGCGAGCGC GCATCGCGCT TCGCGCTGAA GCTCGGCGCG CTGCTCGTCG CGGGGGGCGC GCTCGCGGCG CTCGCGTACT GGCAGGTGCA GCGCTATGTC GGCGCGCAGA AGCTCGCGAG CGTCAACGGC GTGCTGGCGG GCGCGCCCGT GCCCAACGCG ATCCTGCCCG GCGACGACGG CCGGATCTAC GTGCTGAGCG CGTCGCAGGA CGGCGCCGAA TGGGACCGCG AGGCGCTGCT GAAGGCGGCG CTGCCGGAGA AGATCGAAGT CGCCGTGATC GGCGCGGAGC GGCAACGCGT CGAGCGCCGG CTCGACGAAG CCGGCGTCGA TTTCGTGACC GTGCGCCTCG ACGCGCCCGA GCACCCGGAG CTGATCCTCA CCGGCGCCGC GCCCGCCGCC GCGCGCGCAC GCGCGATCGG CGAGCTGCGG CGCGCGGCCC CGTACGTCCG GGACGTGCGC GTGATCGACG CGAGCCTCGG CGCGATCGAG CAGGAGGCGC GCAACGCGCT CGACAAGGTG GGCGCGCGCT ACCGGCTGCT CGCGCGGCGC GGCGGCGCGA CGTTCGAGGT GGCGAGCTCG TTCGGCGACG AGGAGCTCGC CGCCTTGCAG AACCTCATGC GCTCGTTCGG CCACAAGTGG GGCACGCGCC GCGTCGATTT CAAGATCGCG CTGCGCACCG ACTGGCTGAA GGGCAAATCG TATCGGGAAG GCGGCGACGG CTACGTGCTG CTCGATCACG CGTCCTGGTA TTTCCCGCAA CCCCTGGAAG GAGCACATTA CCGATGA
|
Protein sequence | MNKPDSGLEA LQLRILFGPL FGSDIAIPSG EVFFCVGDQV IDDRPAEHPE NRAGHLLERA VDTLYIPHRA GAPNFRLRFP GAPTQAARTA ETGEAAPGDF EVDFLSADGC VTQRAAFNTV CRFGDIAFAL RRQREPWSEA VMHYAPHAPS RAADAAEPGA PGEPGDGGER ASRFALKLGA LLVAGGALAA LAYWQVQRYV GAQKLASVNG VLAGAPVPNA ILPGDDGRIY VLSASQDGAE WDREALLKAA LPEKIEVAVI GAERQRVERR LDEAGVDFVT VRLDAPEHPE LILTGAAPAA ARARAIGELR RAAPYVRDVR VIDASLGAIE QEARNALDKV GARYRLLARR GGATFEVASS FGDEELAALQ NLMRSFGHKW GTRRVDFKIA LRTDWLKGKS YREGGDGYVL LDHASWYFPQ PLEGAHYR
|
| |