Gene BMASAVP1_A1834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1834 
SymbolhisS 
ID4680465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1821080 
End bp1822420 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID639846098 
Producthistidyl-tRNA synthetase 
Protein accessionYP_993153 
Protein GI121598300 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.283546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC 
CTCCCGCAGG ATGCCGGCTT GTGGGAATTC TTCGAGGCGA CGGTGAAGTC GCTGCTGCGC
GCATACGGCT ATCAGAACAT CCGCACGCCG ATCGTCGAGC ATACGCAGCT CTTCACGCGC
GGCATCGGCG AGGTGACCGA CATCGTCGAA AAGGAGATGT ACAGCTTCGT CGATGCGTTG
AACGGCGAGA ACCTGACGCT GCGCCCCGAG AACACCGCGG CCGTCGTGCG CGCGGCGATC
GAGCACAACA TGCTGTATGA CGGCCCGAAA CGCCTGTGGT ATCTCGGGCC GATGTTCCGC
CACGAGCGCC CGCAGCGCGG CCGTTATCGC CAGTTCCATC AGGTCGGCGT CGAGGCGCTC
GGCTTCGCGG GCCCGGACGC GGACGCGGAA ATCATCATGA TGTGCCAGCG CCTGTGGGAC
GATCTCGGCC TCACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGTCTCGC CGAGGAGCGC
GCCGCGCACC GCGTCGAGCT CATCAAGTAT CTCGAGCAGC ACGTCGACAA GCTCGACGAC
GACGCGCAGC GCCGCCTCTA CACCAACCCG CTGCGCGTGC TCGACACGAA GAATCCGGCG
CTGCAGGAGA TCGTGCGGAA CGCGCCGCAG CTGATCGATT TCCTCGGCGA CGTGTCGCGC
GCGCACTTCG ACGGCTTGCA GCAGCTGCTG AAGGCGAACA ACCTGCCGTT TACGATCAAT
CCGCGGCTCG TGCGCGGGCT CGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC
AAGCTCGGCG CGCAGGGCAC GGTCGCCGCG GGCGGCCGCT ACGATCCGCT GATCGAGCAG
TTGGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCGATGG GTGTCGAGCG CATCCTCGAG
CTCCTGAAGG AAGAGCACCT CGTGCCGGAG CAGGAAGGCG TCGACGTGTA CGTCGTCCAT
CAGGGCGACG CGGCGCGCGA GCAGGCGTTC ATCGTCGCCG AGCGTCTGCG CGACACCGGC
CTCGACGTGA TCCTGCATTG CAGCGCGGAC GGCGCGGGCG CGAGCTTCAA GTCGCAGATG
AAGCGCGCGG ATGCAAGCGG CGCGGCGTTC GCGGTGATCT TCGGCGAAGA CGAGGTCGCG
AACGGCACGG TGAGCGTGAA GCCGCTGCGC GGCACGGGCG CCGAAGGCGA GAAGAACGTT
CAGCAGTCCG TGCCGGTCGA AAGCTTGACC GAATTTCTAA TCAATGCGAT GGTTGCAACC
GCCGAAGACG GCGACGACTG A
 
Protein sequence
MTEQKRKLEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR 
GIGEVTDIVE KEMYSFVDAL NGENLTLRPE NTAAVVRAAI EHNMLYDGPK RLWYLGPMFR
HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER
AAHRVELIKY LEQHVDKLDD DAQRRLYTNP LRVLDTKNPA LQEIVRNAPQ LIDFLGDVSR
AHFDGLQQLL KANNLPFTIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ
LGGKPTAACG WAMGVERILE LLKEEHLVPE QEGVDVYVVH QGDAAREQAF IVAERLRDTG
LDVILHCSAD GAGASFKSQM KRADASGAAF AVIFGEDEVA NGTVSVKPLR GTGAEGEKNV
QQSVPVESLT EFLINAMVAT AEDGDD