Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_A3311 |
Symbol | |
ID | 4791967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | - |
Start bp | 3362564 |
End bp | 3363778 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_001029247 |
Protein GI | 124386290 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00468101 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTCTCGCAGC CGTCGGATTG TCGCTGATCC TCCTGTCGGC CGCCGCGAAC GCGGCGGTGC CGTCGCTGCA ACAAATCCAG CAATCGATCG CGCAAGGCAA CTGGCAGCGC GCCGATGCGC AGCTCTCGCA AGTGATCGAC GCGTACCCGG ACAACGCGCG CGCCCGCTAT CTGTACGGCC AGGTGCTCGA CCGCGAAGGC CGCCCCGCCG AGGCGCTCGC GCAGATCGAA CGGGCGAAGT CGCTCGATCC GCAACTGCGC TTCACCGATC CGTCGCGCTT CGCGCAAACT GAAGCGCGCG TGCGGGCCGA CGCGCGCCGC GCGACGGCCG CGCAGGACTC GCGCTCGGCG ACCTCGGGCG GCATGCTCGC CGCGCCGCAG GCGCCGGCCC AGGCCCGCGC GCCATTCTCC GCCGCCCCTG TCGCGCCTGC CGCGCCCGTG CATCGCGGCC CGTCGGTGGG TATGTGGATC GGCTTCGCGG TGCTGCTCGG CGTGATCGTG ATCGTGCTGC GCAAAACGTT GCGCCGCGCG CGCTCGGCGG ACGATCAGCG CGCCGACGAC GAACGCCGCG CGCAGTTGAA GCGCGCAACC GACATCCTCA ACGAAGTGCG TCCGCTCAAG CTCGACGCGC GGCTGTCGAC GGCGCCGGGC GCCGCCGCGC TCAACGGCGA GATCGAGGGG CTCGAAGCCC AGGCGCGCGA GCTCGTCGAG ACCCTGTCGA ACGGCAAGAA TCCCGCGCCG CCGTACCGGC TCGACGAGTT GGAGAAACAG TTCGCCAGCC TGAAGGCGCG CGTCGAGGGG CGCCCGGATC CGAACGCGGC CGCGCCGGGC GGGCCTGGCC AAACGGGCTC GGTATTTGCT CAGGAGGCCG ATCGGTTGAC GGGGGCACAG GGCCAGCCGC CCTACTCGCC GTATCCGCCG CAGCCGCAAC AGCCGCCGCC CGTCGTGATC CAGCAAGGCG GCGGCGGCTT CGGCGGCGGC ATGGGCGGGC TGCTCACGGG CGTCCTGCTC GGCCAGGCGA TGTCGCACGG CCGCGACCGC GTGATCGAGC GCGACGTGAT CGTCGACGAC GAAGCGCGGC GCCGCGCGGG CGCCGATCCC GGCATCGACT TCGGCCAGGG CGACAGCTGG GACAGCGGCG GCTCGGACGG CGGCGGGAGC ATCGATCTCG GCAGCAGCGG CGACGATTGG AGCAACAACG GTTGA
|
Protein sequence | MKKLLAAVGL SLILLSAAAN AAVPSLQQIQ QSIAQGNWQR ADAQLSQVID AYPDNARARY LYGQVLDREG RPAEALAQIE RAKSLDPQLR FTDPSRFAQT EARVRADARR ATAAQDSRSA TSGGMLAAPQ APAQARAPFS AAPVAPAAPV HRGPSVGMWI GFAVLLGVIV IVLRKTLRRA RSADDQRADD ERRAQLKRAT DILNEVRPLK LDARLSTAPG AAALNGEIEG LEAQARELVE TLSNGKNPAP PYRLDELEKQ FASLKARVEG RPDPNAAAPG GPGQTGSVFA QEADRLTGAQ GQPPYSPYPP QPQQPPPVVI QQGGGGFGGG MGGLLTGVLL GQAMSHGRDR VIERDVIVDD EARRRAGADP GIDFGQGDSW DSGGSDGGGS IDLGSSGDDW SNNG
|
| |