Gene BURPS668_2189 details


Gene Information

Locus tag: BURPS668_2189
Symbol: hisS
ID: 4881700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2181878
End bp: 2183218
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 66%
IMG OID: 640128117
Product: histidyl-tRNA synthetase
Protein accession: YP_001059224
Protein GI: 126440200
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
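
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2181878, 2183218)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```

Both printed values match the Gene Length and Protein Length fields of this record.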


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC 
CTCCCGCAGG ATGCCGGCTT GTGGGAATTC TTCGAGGCGA CGGTGAAGTC GCTGCTGCGC
GCATACGGCT ATCAGAACAT CCGCACGCCG ATCGTCGAGC ATACGCAGCT CTTCACGCGC
GGCATCGGCG AGGTGACCGA CATCGTCGAA AAGGAGATGT ACAGCTTCGT CGATGCGTTG
AACGGCGAGA ACCTGACGCT GCGCCCCGAG AACACCGCGG CCGTCGTGCG CGCGGCGATC
GAGCACAACA TGCTGTACGA CGGCCCGAAA CGCCTGTGGT ATCTCGGGCC GATGTTCCGC
CACGAGCGCC CGCAGCGCGG CCGTTATCGC CAGTTCCATC AGGTCGGCGT CGAGGCGCTC
GGCTTCGCGG GCCCCGACGC GGACGCGGAA ATCATCATGA TGTGCCAGCG CCTGTGGGAC
GATCTCGGCC TCACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGCCTCGC CGAGGAGCGC
GCCGCGCACC GCGTCGAGCT CATCAAGTAT CTCGAGCAGC ACGTCGACAA GCTCGACGAC
GACGCGCAGC GCCGCCTCTA CACCAACCCG CTGCGCGTGC TCGACACGAA GAACCCGGCG
CTGCAGGAGA TCGTGCGGAA CGCGCCGCAG CTGATCGATT TCCTCGGCGA CGTGTCGCGC
GCGCACTTCG ACGGCCTGCA GCGGCTGCTG AAGGCGAACA ACCTGCCGTT CACGATCAAT
CCGCGGCTCG TGCGCGGGCT CGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC
AAGCTCGGCG CGCAGGGCAC GGTCGCCGCG GGCGGCCGCT ACGATCCGCT GATCGAGCAG
TTGGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCGATGG GCGTCGAGCG CATCCTCGAG
CTCCTGAAGG AAGAGCACCT CGTGCCGGAG CAGGAAGGCG TCGACGTGTA CGTCGTCCAC
CAGGGCGACG CGGCGCGCGA GCAGGCGTTC ATCGTCGCCG AGCGTCTGCG CGACACCGGC
CTCGACGTGA TCCTGCATTG CAGCGCGGAC GGCGCGGGCG CGAGCTTCAA GTCGCAGATG
AAGCGCGCGG ATGCAAGCGG CGCAGCGTTC GCGGTGATCT TGGGCGAAGA CGAGGTCGCG
AACGGCACGG TGAGCGTGAA GCCGCTGCGC GGCACGGGCG CCGAAGGCGA GAAGAACGTT
CAGCAGTCCG TGCCGGTCGA AAGCTTGACC GAATTTCTAA TCAATGCGAT GGTTGCAACC
GCCGAAGACG GCGACGACTG A
 
Protein sequence
MTEQKRKLEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR 
GIGEVTDIVE KEMYSFVDAL NGENLTLRPE NTAAVVRAAI EHNMLYDGPK RLWYLGPMFR
HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER
AAHRVELIKY LEQHVDKLDD DAQRRLYTNP LRVLDTKNPA LQEIVRNAPQ LIDFLGDVSR
AHFDGLQRLL KANNLPFTIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ
LGGKPTAACG WAMGVERILE LLKEEHLVPE QEGVDVYVVH QGDAAREQAF IVAERLRDTG
LDVILHCSAD GAGASFKSQM KRADASGAAF AVILGEDEVA NGTVSVKPLR GTGAEGEKNV
QQSVPVESLT EFLINAMVAT AEDGDD
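
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). The helper name `translate` is hypothetical, and the function stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acid
# assignments shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence
print(translate("ATGACAGAAC AAAAGCGAAA G"))  # MTEQKRK
# Final line of the gene sequence, ending in the TGA stop codon
print(translate("GCCGAAGACG GCGACGACTG A"))  # AEDGDD
```

The two outputs match the first seven and last six residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the full protein under these assumptions.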