Gene Information |
Locus tag | BMA10229_A0063 |
Symbol | hisS |
ID | 4792353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | + |
Start bp | 59259 |
End bp | 60599 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001026072 |
Protein GI | 124385953 |
COG category | |
COG ID | |
TIGRFAM ID | |
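The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(59259, 60599)
print(length)                      # 1341
print(protein_length_aa(length))   # 446
```

Under these assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields in the record.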
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00160741 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC CTCCCGCAGG ATGCCGGCTT GTGGGAATTC TTCGAGGCGA CGGTGAAGTC GCTGCTGCGC GCATACGGCT ATCAGAACAT CCGCACGCCG ATCGTCGAGC ATACGCAGCT CTTCACGCGC GGCATCGGCG AGGTGACCGA CATCGTCGAA AAGGAGATGT ACAGCTTCGT CGATGCGTTG AACGGCGAGA ACCTGACGCT GCGCCCCGAG AACACCGCGG CCGTCGTGCG CGCGGCGATC GAGCACAACA TGCTGTATGA CGGCCCGAAA CGCCTGTGGT ATCTCGGGCC GATGTTCCGC CACGAGCGCC CGCAGCGCGG CCGTTATCGC CAGTTCCATC AGGTCGGCGT CGAGGCGCTC GGCTTCGCGG GCCCGGACGC GGACGCGGAA ATCATCATGA TGTGCCAGCG CCTGTGGGAC GATCTCGGCC TCACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGTCTCGC CGAGGAGCGC GCCGCGCACC GCGTCGAGCT CATCAAGTAT CTCGAGCAGC ACGTCGACAA GCTCGACGAC GACGCGCAGC GCCGCCTCTA CACCAACCCG CTGCGCGTGC TCGACACGAA GAATCCGGCG CTGCAGGAGA TCGTGCGGAA CGCGCCGCAG CTGATCGATT TCCTCGGCGA CGTGTCGCGC GCGCACTTCG ACGGCTTGCA GCAGCTGCTG AAGGCGAACA ACCTGCCGTT TACGATCAAT CCGCGGCTCG TGCGCGGGCT CGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC AAGCTCGGCG CGCAGGGCAC GGTCGCCGCG GGCGGCCGCT ACGATCCGCT GATCGAGCAG TTGGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCGATGG GTGTCGAGCG CATCCTCGAG CTCCTGAAGG AAGAGCACCT CGTGCCGGAG CAGGAAGGCG TCGACGTGTA CGTCGTCCAT CAGGGCGACG CGGCGCGCGA GCAGGCGTTC ATCGTCGCCG AGCGTCTGCG CGACACCGGC CTCGACGTGA TCCTGCATTG CAGCGCGGAC GGCGCGGGCG CGAGCTTCAA GTCGCAGATG AAGCGCGCGG ATGCAAGCGG CGCGGCGTTC GCGGTGATCT TCGGCGAAGA CGAGGTCGCG AACGGCACGG TGAGCGTGAA GCCGCTGCGC GGCACGGGCG CCGAAGGCGA GAAGAACGTT CAGCAGTCCG TGCCGGTCGA AAGCTTGACC GAATTTCTAA TCAATGCGAT GGTTGCAACC GCCGAAGACG GCGACGACTG A
Protein sequence | MTEQKRKLEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR GIGEVTDIVE KEMYSFVDAL NGENLTLRPE NTAAVVRAAI EHNMLYDGPK RLWYLGPMFR HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER AAHRVELIKY LEQHVDKLDD DAQRRLYTNP LRVLDTKNPA LQEIVRNAPQ LIDFLGDVSR AHFDGLQQLL KANNLPFTIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ LGGKPTAACG WAMGVERILE LLKEEHLVPE QEGVDVYVVH QGDAAREQAF IVAERLRDTG LDVILHCSAD GAGASFKSQM KRADASGAAF AVIFGEDEVA NGTVSVKPLR GTGAEGEKNV QQSVPVESLT EFLINAMVAT AEDGDD
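The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned, how a GC fraction could be computed, and how the gene sequence could be translated. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping (start codon handling is not modeled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGACAGAAC AAAAGCGAAA GCTCGAGAAG"))  # MTEQKRKLEK
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow comparison against the Protein sequence and GC content fields of the record.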