Gene BMA10229_A0063 details

Gene Information

Locus tag: BMA10229_A0063
Symbol: hisS
ID: 4792353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 59259
End bp: 60599
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: histidyl-tRNA synthetase
Protein accession: YP_001026072
Protein GI: 124385953
COG category:
COG ID:
TIGRFAM ID:
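
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(59259, 60599)
print(length)                           # 1341
print(expected_protein_length(length))  # 446
```

Both values agree with the Gene Length and Protein Length fields of this record.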


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00160741
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC 
CTCCCGCAGG ATGCCGGCTT GTGGGAATTC TTCGAGGCGA CGGTGAAGTC GCTGCTGCGC
GCATACGGCT ATCAGAACAT CCGCACGCCG ATCGTCGAGC ATACGCAGCT CTTCACGCGC
GGCATCGGCG AGGTGACCGA CATCGTCGAA AAGGAGATGT ACAGCTTCGT CGATGCGTTG
AACGGCGAGA ACCTGACGCT GCGCCCCGAG AACACCGCGG CCGTCGTGCG CGCGGCGATC
GAGCACAACA TGCTGTATGA CGGCCCGAAA CGCCTGTGGT ATCTCGGGCC GATGTTCCGC
CACGAGCGCC CGCAGCGCGG CCGTTATCGC CAGTTCCATC AGGTCGGCGT CGAGGCGCTC
GGCTTCGCGG GCCCGGACGC GGACGCGGAA ATCATCATGA TGTGCCAGCG CCTGTGGGAC
GATCTCGGCC TCACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGTCTCGC CGAGGAGCGC
GCCGCGCACC GCGTCGAGCT CATCAAGTAT CTCGAGCAGC ACGTCGACAA GCTCGACGAC
GACGCGCAGC GCCGCCTCTA CACCAACCCG CTGCGCGTGC TCGACACGAA GAATCCGGCG
CTGCAGGAGA TCGTGCGGAA CGCGCCGCAG CTGATCGATT TCCTCGGCGA CGTGTCGCGC
GCGCACTTCG ACGGCTTGCA GCAGCTGCTG AAGGCGAACA ACCTGCCGTT TACGATCAAT
CCGCGGCTCG TGCGCGGGCT CGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC
AAGCTCGGCG CGCAGGGCAC GGTCGCCGCG GGCGGCCGCT ACGATCCGCT GATCGAGCAG
TTGGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCGATGG GTGTCGAGCG CATCCTCGAG
CTCCTGAAGG AAGAGCACCT CGTGCCGGAG CAGGAAGGCG TCGACGTGTA CGTCGTCCAT
CAGGGCGACG CGGCGCGCGA GCAGGCGTTC ATCGTCGCCG AGCGTCTGCG CGACACCGGC
CTCGACGTGA TCCTGCATTG CAGCGCGGAC GGCGCGGGCG CGAGCTTCAA GTCGCAGATG
AAGCGCGCGG ATGCAAGCGG CGCGGCGTTC GCGGTGATCT TCGGCGAAGA CGAGGTCGCG
AACGGCACGG TGAGCGTGAA GCCGCTGCGC GGCACGGGCG CCGAAGGCGA GAAGAACGTT
CAGCAGTCCG TGCCGGTCGA AAGCTTGACC GAATTTCTAA TCAATGCGAT GGTTGCAACC
GCCGAAGACG GCGACGACTG A
 
Protein sequence
MTEQKRKLEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR 
GIGEVTDIVE KEMYSFVDAL NGENLTLRPE NTAAVVRAAI EHNMLYDGPK RLWYLGPMFR
HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER
AAHRVELIKY LEQHVDKLDD DAQRRLYTNP LRVLDTKNPA LQEIVRNAPQ LIDFLGDVSR
AHFDGLQQLL KANNLPFTIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ
LGGKPTAACG WAMGVERILE LLKEEHLVPE QEGVDVYVVH QGDAAREQAF IVAERLRDTG
LDVILHCSAD GAGASFKSQM KRADASGAAF AVIFGEDEVA NGTVSVKPLR GTGAEGEKNV
QQSVPVESLT EFLINAMVAT AEDGDD
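
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used by the source database: it assumes the sense-codon assignments of translation table 11 (which match the standard code), stops at the first stop codon, and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon-to-residue assignments as the
# standard code; it differs only in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC"
print(translate(first_row))  # MTEQKRKLEKLTGVKGMNDI
```

The printed residues match the first 20 residues of the protein sequence in this record.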