Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA1544 |
Symbol | bsaO |
ID | 3087750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | - |
Start bp | 1673077 |
End bp | 1674897 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637565428 |
Product | type III secretion system protein BsaO |
Protein accession | YP_106133 |
Protein GI | 53716325 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02516] type III secretion outer membrane pore, YscC/HrcC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.643588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGCG CGGCATTCGC CGCGCCGACC GACGCGGCGC CGCTCGCCGA CGCGCCCGCG GCGGCGAGCG CGACGCCCGA CGCACGGGAC GGCGACGCGT CGCGCGTGGC CGCGCCGCCG GCGCGCGCGC CGCAGGACGA CGAGCGCCAC TTCGTCGCGA ACGACGCGAG CATCAGCGTG CTGCTCAATG CGCTGTCGGG CCGGCTGCAC AAGCCGATCG TCGCGAGCGA GAAGGTGCGC CGCAAGCACG TGACGGGCGA GTTCGACCTC GCGCAGCCCC GCGCGCTGCT CGCGCGGCTC GGCGAATCGA TGTCGCTGCT CTGGTACGAC GACGGCGCGT CGATCTACAT CTACGACAAC TCGGAGATCA AGAACGCGGT CGTCTCGATG CGCCATGCGA CGGTGCGCAA CCTGCGCAAT TTCATCCGGC AGACGCGGCT CTACGATCCG CGCTTTCCGG TGCGCGGCGA CGACCTGAGC AACACGTTCT ACGTGACGGG CGCGCCCGTC TACGTGAATC TCGTCGCCGC CGCCGCGCGC TATCTCGACG AGGTGCGCTC GAACGAGGCG AGCGACCGGC AGGTCGTGCG CGTCGTGCAG CTTCACAACA GCTTCGTCGT CGACCGCCAG TACACGCTGC GCGACAAGGC AGTCGACATC CCGGGCATGG CGACCGTGCT CGGCCGCATC TTCGGCCCGG CGCGGCCGGG CGCGCCGGCG GACTCGCCCG TCGCGGCGGC CGACGCCACG GCGCGCGGCG GCGCGGGCGG CGCGGCGGGC AAGCCGGCGT TCTCGCTTGC CGATGCGTTG CCCGCGCCGC TCGACGCCGG CAACGCGCCG GGCGGCGCGG GCTCGACGCA TTCGACAAAC CCGGCGAACG CCGCGAGCCC GATGGGCGGC GCGGCGGGCG GCGTCGCGCT GCCCGCGTCG GACGGCGTGC GCGCGGTGGC GTATCCGGAC ACGAACAGCG TGATCCTCGT CGGCCGGCTC GACAAGGTGC AGGACATGGA GGCGCTGATC CGCTCGCTCG ATGTCGAGAA GCGGCAGATC GAGCTGTCGC TGTGGATCAT CGACATCCGC AAGAGCCGGC TCGATCAGCT CGGCATCGAC TGGCAGGGCG CGCTCAATGC GCCGGGCATC GGCGTCGGCT TCAACAATCG CGGCGGCAAC GTGACGACGC TCGACGGCAC CAGGTTCCTC GCATCGGTCG CGGCGCTCAG CCAGACGGGC GACGCCACCG TGATCTCGCG GCCGATCGTG CTCACGCAGG AGAACGTGCC GGCGACGTTC GACAGCAACC AGACGTTCTA CGCGAAGCTG ATCGGCGAGC GCACGGTGCA GCTCGATCAC GTGACCTACG GCACGCTCGT CAACGTGCTG CCGCGGCTCA CGCGCGATGG GTCGCAGGTC GAGATGATCG TCGACATCGA GGACGGCAAC ACCGACGGCG CGACGAGCGA CGGCCAGATC GTCATCGACA ACAACACGAT GCCGCTCGTG AACCGCACCG AGATCAACAC GGTCGCGCGC GTGCCGCACG AGATGAGCCT GCTCATCGGC GGCAACACGC GCGACGACGT CACGCGCCGC ACGTTCCGGA TTCCCGGGCT GGCCAGCATT CCGCTGATCG GCGGGCTGTT TCGCGGGCAT TCGGATCGGC ACGAGCAGGT GGTGCGCGTG TTCCTGATCC AGCCGAAGCT GCTGCGCGCG GGCGCGGCCT GGCCCGACGG CCAGCCGTGG GAATCGGGCG ATCCGGCGGA CAACGCGACG CTGCGCGCAA CCGTGCAGAT GCTCAAACCC TACATGGACG ACAAGTCATG A
|
Protein sequence | MTGAAFAAPT DAAPLADAPA AASATPDARD GDASRVAAPP ARAPQDDERH FVANDASISV LLNALSGRLH KPIVASEKVR RKHVTGEFDL AQPRALLARL GESMSLLWYD DGASIYIYDN SEIKNAVVSM RHATVRNLRN FIRQTRLYDP RFPVRGDDLS NTFYVTGAPV YVNLVAAAAR YLDEVRSNEA SDRQVVRVVQ LHNSFVVDRQ YTLRDKAVDI PGMATVLGRI FGPARPGAPA DSPVAAADAT ARGGAGGAAG KPAFSLADAL PAPLDAGNAP GGAGSTHSTN PANAASPMGG AAGGVALPAS DGVRAVAYPD TNSVILVGRL DKVQDMEALI RSLDVEKRQI ELSLWIIDIR KSRLDQLGID WQGALNAPGI GVGFNNRGGN VTTLDGTRFL ASVAALSQTG DATVISRPIV LTQENVPATF DSNQTFYAKL IGERTVQLDH VTYGTLVNVL PRLTRDGSQV EMIVDIEDGN TDGATSDGQI VIDNNTMPLV NRTEINTVAR VPHEMSLLIG GNTRDDVTRR TFRIPGLASI PLIGGLFRGH SDRHEQVVRV FLIQPKLLRA GAAWPDGQPW ESGDPADNAT LRATVQMLKP YMDDKS
|
| |