Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_2062 |
Symbol | bsaQ |
ID | 4789964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 2113968 |
End bp | 2116040 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | type III secretion system protein BsaQ |
Protein accession | YP_001025858 |
Protein GI | 124382333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.287404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAGA ATCTCCTGAT CAAGGCCCAA GCGCGCCCCG AACTGATCGT CCTGTGCCTG ATGGTGCTCG TGATCGCGAT GCTGATCGTG CCGCTGCCGC CGTACGTGCT CGATTTCCTG ATCGGCCTGA ACATCGTGAC GGCGCTGCTC GTGTTCATGG GCTCGTTCTA CATCGTCAAC ATCCTCGAGT TCTCGACGTT TCCGCCGATC CTGCTGATCA CGACGCTGTT CCGGCTCGCG CTGTCGATCA GCACGAGCCG GATGATCCTG CTCACCGCCG AGGGCGGCAA GATCATCACG ACGTTCGGGC AGTTCGTGAT CGGCGATAAC CTCGTGGTCG GCTTCGTCGT GTTCGTGATC GTGACGATCG TGCAGTTCAT CGTGATCACG AAGGGCTCGG AGCGCGTCGC CGAAGTCGCG GCGCGCTTCT CGCTCGACGC GATGCCCGGC AAGCAGATGA GCATCGACGC GGACCTGCGC GCCGGCATCA TCGACAACGA AGGCGTGAAG GCGCGCCGCT CGGCGCTCGA GCGTGAAAGC CAGCTGTACG GCTCGTTCGA CGGCGCGATG AAGTTCATCA AGGGCGATGC GATCGCCGGC ATCGTCGTCA TCTTCGTGAA CCTGATCGGC GGCATCTCGG TCGGCGTGAT GCAGCACGGC ATGTCGGTGT CCGACGCGCT CACGACCTAC ACGATCCTGT CGATCGGCGA CGGCCTCGTC GCGCAGATCC CGGCGCTCCT GATCGCGATC GGCGCGGGCT TCGTCGTCAC CCGCGTGGGC GGCTCGGGCA ACAACCTCGG CGCGAACATC GTCGGCGAGC TGTTCTCGAA CCCGTTCGTG CTCGTCGTCA CGGCCGTGCT CGCGCTCGCG ATCGGCCTGC TGCCCGGCTT TCCGCTGATC GTCTTCCTGC TGATCGCGCT CGCGCTCGGC GCGCTGTACG TGCGCCGCGA ATGGCGCAAG CCGGGCGAGC CGGCGCGCGA GGGCGGGCTC GTCGGCAAGG TCGCGGGCGC GCTCTCGGGC GGCGGCGCGG CGAAAGCGGC CGACGGCGGC GCCGGCGATG TCGACATCGA CAAGCTGATT CCGGAGACGG TGCCGCTGAT GCTGATGGTG CCCGAGGCGG CGCAGCCGAT GTTCGAGCAG GAAGGCGTGA TCGGCGCGTT CCGGCGGCGC GCGTTCGTCG ACATGGGGCT GCGCCTGCCC GACATCCGCG TCGTGTATTC GCCGCAGGTG CATCCGCGCG AGGCGATCGT GCTGATCAAC GAGATTCGCG CGGCGACGTT CGCGATCTGC TTCGACCGGC ACCGCGTGGT CGGCTCGACG CTCGCGCTCG AAGGGCTGCC CGTCGACGTC GTGACGCTGC CGGACGGCGC GGGCGGCGAC GCGTGGTGGG TGCCGGGCGC GCAGACCGAC GCGCTCGCGA AGATCGACGT GCTCACGCGC TCGGCGATCG ACGATCTGTA CGGGCAGTTT CTCGCGGTGA TGCTCGCCAA CGTATCGGAA TTCTTCGGCG TGCAGGAAGC CAAGCGCCTG CTCGACGACA TGGACCAGAA GTACCCGGAG CTCATCAAGG AGAGCTATCG GCACATTTCG GTGCAGCGCA TCGCGGAGGT GTTCCAGCGC CTGCTCGCCG AGAAGATCTC GATTCGCAAC ATGAAGCTGA TTCTCGAATC GCTCGCGCAA TGGGGGCCGA AGGAGAAGGA TTCGATCCTG CTCGTCGAGC ACGTGCGCGC GGCGCTCGCG CGCTACATCT CGAACCGCTT CGCCGCGGGC GGCAAGCTGC GCGCGCTCGT GCTGTCGGCG CAGTTCGAGG ACGCGGTGGG CAAGGGCGTG CGGCAGACCT CGGGCGGCGC GTATCTGAAC CTGGAGCCCG CGACGAGCGA GCAACTGCTC GACCGGCTCG CGCTCGAGCT CGCGCGCGCG GGCTTCTCGC AGCGCGACAT GGTGCTGCTT GCCTCGATGG AAGTCAGGCG TTTCGTCAAG CGGCTGATCG AGAGCCGCTT TCCGGAACTG GAGGTGCTGT CGTTCGGCGA AGTGGCCGAC AGCGTCGCAA TCGATGTGTT GAAGACGATC TAA
|
Protein sequence | MLKNLLIKAQ ARPELIVLCL MVLVIAMLIV PLPPYVLDFL IGLNIVTALL VFMGSFYIVN ILEFSTFPPI LLITTLFRLA LSISTSRMIL LTAEGGKIIT TFGQFVIGDN LVVGFVVFVI VTIVQFIVIT KGSERVAEVA ARFSLDAMPG KQMSIDADLR AGIIDNEGVK ARRSALERES QLYGSFDGAM KFIKGDAIAG IVVIFVNLIG GISVGVMQHG MSVSDALTTY TILSIGDGLV AQIPALLIAI GAGFVVTRVG GSGNNLGANI VGELFSNPFV LVVTAVLALA IGLLPGFPLI VFLLIALALG ALYVRREWRK PGEPAREGGL VGKVAGALSG GGAAKAADGG AGDVDIDKLI PETVPLMLMV PEAAQPMFEQ EGVIGAFRRR AFVDMGLRLP DIRVVYSPQV HPREAIVLIN EIRAATFAIC FDRHRVVGST LALEGLPVDV VTLPDGAGGD AWWVPGAQTD ALAKIDVLTR SAIDDLYGQF LAVMLANVSE FFGVQEAKRL LDDMDQKYPE LIKESYRHIS VQRIAEVFQR LLAEKISIRN MKLILESLAQ WGPKEKDSIL LVEHVRAALA RYISNRFAAG GKLRALVLSA QFEDAVGKGV RQTSGGAYLN LEPATSEQLL DRLALELARA GFSQRDMVLL ASMEVRRFVK RLIESRFPEL EVLSFGEVAD SVAIDVLKTI
|
| |