Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2076 |
Symbol | |
ID | 4905922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2040377 |
End bp | 2041612 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640145181 |
Product | surface presentation of antigens protein SpaS |
Protein accession | YP_001076109 |
Protein GI | 126455636 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1377] Flagellar biosynthesis pathway, component FlhB |
TIGRFAM ID | [TIGR01404] type III secretion protein, YscU/HrpY family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGA AAACCGAGAA GCCGACCGCG AAGAAGCTGC GCGACGCGGC GAAGAAGGGG CAGACGTTCA AGGCGCGGGA CATCGTCGCG CTCATCGTGA TCGCCACGGG CGCGCTGGCC GCGCCCGCGC TCGTCGATCT GACGCGCATC GCGGCCGAAT TCGTGCGGAT CGCGTCGACG GGCGCGCAGC CGAACCCGGG TGCGTACGCA TTCGCGTGGG CGAAGCTGTT CCTGCGCATC GCCGCGCCGT TCGTGCTGCT CTGCGCGGCG GCGGGCGCGC TGCCGTCGCT TGTGCAAAGC CGCTTCACGC TCGCGGTCGA ATCGATCCGC TTCGATCTCA CCGCGCTCGA TCCGGTCAAG GGAATGAAGC GGCTCTTCAG CTGGCGCTCG GCGAAGGACG CGGTGAAGGC GCTGCTCTAT GTCGGCGTGT TCGCGCTCAC GGTGCGCGTG TTCGCCGATC TCTACCACGC CGACGTGTTC GGGCTGTTCC GCGCGCGCCC GGCGCTGCTC GGCCACATGT GGATCGTGCT CACGGTGCGC CTCGTGCTGC TGTTCCTGCT GTGCGCACTG CCCGTGCTGA TCCTCGACGC CGCCGTCGAA TACTTCCTGT ACCACCGCGA ACTGAAGATG GACAAGCACG AGGTGAAGCA GGAATACAAG GAGAGCGAGG GCAATCACGA GATCAAGAGC AAGCGGCGCG AGATTCATCA GGAACTGCTG TCGGAGGAGA TCAAGGCGAA CGTCGAGCAG TCCGATTTCA TCGTCGCGAA CCCGACCCAC ATCGCGATCG GCGTCTACGT GAATCCGGAC ATCGTGCCGA TTCCGTTCGT GTCGGTGCGC GAGACCAACG CACGCGCGCT CGCCGTCATT CGGCATGCCG AAGCGTGCGG CGTGCCCGTC GTGCGCAACG TCGCGCTCGC GCGCTCGATC TATCGCAACT CGCCGCGCCG CTACAGCTTC GTGAGCCACG ACGACATCGA CGGCGTGATG CGCGTGCTGA TCTGGCTCGG CGAGGTCGAG GCGGCCAATC GCGGCGGGCC GCCGCCCGAG ACGCGCGCGC CGACTTCGGC CGAGCCGCAA GCGCGCGACG GCGTGGCCCC GCCGGGCGAC GCCTGCGCGG ACAACGCCTT TCCCGACGAC GCCCCACCGG GCGCCGCCGC GCCGAACGCC GGTTCGCCGG ACAGCCCGGC GCCGGACGGC GGCGCGCCGG CCCGAACGGG CGATCAAAAC GCTTGA
|
Protein sequence | MAEKTEKPTA KKLRDAAKKG QTFKARDIVA LIVIATGALA APALVDLTRI AAEFVRIAST GAQPNPGAYA FAWAKLFLRI AAPFVLLCAA AGALPSLVQS RFTLAVESIR FDLTALDPVK GMKRLFSWRS AKDAVKALLY VGVFALTVRV FADLYHADVF GLFRARPALL GHMWIVLTVR LVLLFLLCAL PVLILDAAVE YFLYHRELKM DKHEVKQEYK ESEGNHEIKS KRREIHQELL SEEIKANVEQ SDFIVANPTH IAIGVYVNPD IVPIPFVSVR ETNARALAVI RHAEACGVPV VRNVALARSI YRNSPRRYSF VSHDDIDGVM RVLIWLGEVE AANRGGPPPE TRAPTSAEPQ ARDGVAPPGD ACADNAFPDD APPGAAAPNA GSPDSPAPDG GAPARTGDQN A
|
| |