Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_2071 |
Symbol | bsaZ |
ID | 4789566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 2122209 |
End bp | 2123444 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | surface presentation of antigens protein SpaS |
Protein accession | YP_001025867 |
Protein GI | 124382305 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.086614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAGA AAACCGAGAA GCCGACCGCG AAGAAGCTGC GCGACGCGGC GAAGAAGGGG CAGACGTTCA AGGCGCGGGA CATCGTCGCG CTCATCGTGA TCGCCACGGG CGCGCTGGCC GCGCCCGCGC TCGTCGATCT GACGCGCATC GCGGCCGAAT TCGTGCGGAT CGCGTCGACG GGCGCGCAGT CGAACCCGGG TGCGTACGCA TTCGCGTGGG CGAAGCTGTT CCTGCGCATC GCCGCGCCGT TCGTGCTGCT CTGCGCGGCG GCGGGCGCGC TGCCGTCGCT TGTGCAAAGC CGCTTCACGC TCGCGGTCGA ATCGATCCGC TTCGATCTCA CCGCGCTCGA TCCGGTCAAG GGAATGAAGC GGCTCTTCAG CTGGCGCTCG GCGAAGGACG CGGTGAAGGC GCTGCTCTAT GTCGGCGTGT TCGCGCTCAC GGTGCGCGTG TTCGCCGATC TCTACCACGC CGACGTGTTC GGGCTGTTCC GCGCGCGCCC GGCGCTGCTC GGCCACATGT GGATCGTGCT CACGGTGCGC CTCGTGCTGC TGTTCCTGCT GTGCGCACTG CCCGTGCTGA TCCTCGACGC CGCCGTCGAA TACTTCCTGT ACCACCGCGA ACTGAAGATG GACAAGCACG AGGTGAAGCA GGAATACAAG GAGAGCGAGG GCAATCACGA GATCAAGAGC AAGCGGCGCG AGATTCATCA GGAACTGCTG TCGGAGGAGA TCAAGGCGAA CGTCGAGCAG TCCGATTTCA TCGTCGCGAA CCCGACCCAC ATCGCGATCG GCGTCTACGT GAATCCGGAC ATCGTCCCGA TTCCGTTCGT GTCGGTGCGC GAGACCAACG CACGCGCGCT CGCCGTCATT CGGCATGCCG AAGCGTGCGG CGTGCCCGTC GTGCGCAACG TCGCGCTCGC GCGCTCGATC TATCGCAACT CGCCGCGCCG CTACAGCTTC GTGAGCCACG ACGACATCGA CGGCGTGATG CGCGTGCTGA TCTGGCTCGG CGAGGTCGAG GCGGCCAATC GCGGCGGGCC GCCGCCCGAG ACGCGCGCGC CGACTTCGGC CGAGCCGCAA GCGCGCGACG GCGTGGCCCC GCTGGGCGAC GCCTGCGCGG ACAACGCCTT TCCCGACGAC GCCCCACCGG GCGCCGCCGC GCCGAACGCC GGTTCGCCGG ACAGCCCGGC GCCGGACGGC GGCGCGCCGG CCCGAACGGG CGATCAAAAC GCTTGA
|
Protein sequence | MAEKTEKPTA KKLRDAAKKG QTFKARDIVA LIVIATGALA APALVDLTRI AAEFVRIAST GAQSNPGAYA FAWAKLFLRI AAPFVLLCAA AGALPSLVQS RFTLAVESIR FDLTALDPVK GMKRLFSWRS AKDAVKALLY VGVFALTVRV FADLYHADVF GLFRARPALL GHMWIVLTVR LVLLFLLCAL PVLILDAAVE YFLYHRELKM DKHEVKQEYK ESEGNHEIKS KRREIHQELL SEEIKANVEQ SDFIVANPTH IAIGVYVNPD IVPIPFVSVR ETNARALAVI RHAEACGVPV VRNVALARSI YRNSPRRYSF VSHDDIDGVM RVLIWLGEVE AANRGGPPPE TRAPTSAEPQ ARDGVAPLGD ACADNAFPDD APPGAAAPNA GSPDSPAPDG GAPARTGDQN A
|
| |