Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1996 |
Symbol | |
ID | 4886527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1932464 |
End bp | 1933522 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640131934 |
Product | type III secretion system protein HrcU |
Protein accession | YP_001062991 |
Protein GI | 126442638 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1377] Flagellar biosynthesis pathway, component FlhB |
TIGRFAM ID | [TIGR01404] type III secretion protein, YscU/HrpY family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.920837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG AAAAGACCGA GGAACCCACC GACAAGAAGC TGCGTGACGC GCGTCGAGAC GGTGAGGTAT CCCGTAGCAC GGATCTCTCC GACGCCGTAT CCATGTCCGC TGCGATTCTG TTGCTGGTTG CGGCCGCCGA TCATTTCGGC GATGCAATGC GAGCGTTGGT CAACGGCGCG CTAGCGTTCG TTTCCGCCGA TCATTCTCTC GTCGAGATGA CCGCGCGGCT GTACCAGTTC GGCGGCATCG CGCTATCGGC GGTCATGCCG CTGCTGTTCG TCGCGGCGCT CGCGGGTATC GGCGGATCGG TCCTCCAGGT CGGGCTGCAG ATATCGCTGA AACCGGTCAT GCCGAATCTC GGCGCACTCA ATCCGGCTGA AGGTCTGAAG AAGCTTTTTT CGCCGCGTAG CGCGATCGAG TCCATCAAGA TGATCGTCAA GGCCGTCATC GTGTTCTGCG TGGCGTGGAA AACGATCGTA TGGCTGTTCC CGCTCATCGC CGGCGCGCTG TATCAATCGC CGCCCGAACT GTCACGCATA TTCCGGGAGA TCCTGGCGAA GTGGCTGATG GTGGTGGCCG GTCTATGCCT TCTGATGGGG GCGGCCGACG TGAAACTCCA GCGCTTCATG TTCATGCAGA AGATGAAGAT GACGAAGGAC GAGGTGAAGC GCGAATCCAA AAACGACGAA GGCGATCCGC TGCTCAAGGG CGAGCGCAAG CGGCTTGCGC GCGAACTGGC GGCCGCGCCG CCACAGCATC AGGTCGCGCA CGCGAATTTC GTCGTCGTCA ACCCCACCCA CTACGCGGTC GCGGTTCGTT ACGCGCCCGA CGAGCATCCG CTCCCCCGCG TGGTCGCGAA GGGCCTCGAC GAAGCGGCCA TCGCACTGCG GCGGGCCGCG CAAGACGCGA ACATCCCGAT CATCGGCAAT CCCCCTGTCG CGCGCGCGTT GTTCCGAATT GGCGTCGAGG AGCCGGTGCC CGAAGAACTG TTCGAGATCG TTGCCGCGAT CCTGCGCTGG ATCGACGCGA TCGGCCCGCG CCGAAACGAA CGGGCCTGA
|
Protein sequence | MSDEKTEEPT DKKLRDARRD GEVSRSTDLS DAVSMSAAIL LLVAAADHFG DAMRALVNGA LAFVSADHSL VEMTARLYQF GGIALSAVMP LLFVAALAGI GGSVLQVGLQ ISLKPVMPNL GALNPAEGLK KLFSPRSAIE SIKMIVKAVI VFCVAWKTIV WLFPLIAGAL YQSPPELSRI FREILAKWLM VVAGLCLLMG AADVKLQRFM FMQKMKMTKD EVKRESKNDE GDPLLKGERK RLARELAAAP PQHQVAHANF VVVNPTHYAV AVRYAPDEHP LPRVVAKGLD EAAIALRRAA QDANIPIIGN PPVARALFRI GVEEPVPEEL FEIVAAILRW IDAIGPRRNE RA
|
| |