Gene BURPS1106A_2227 details


Gene Information

Locus tag: BURPS1106A_2227
Symbol: hisS
ID: 4901911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2215435
End bp: 2216775
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 66%
IMG OID: 640135456
Product: histidyl-tRNA synthetase
Protein accession: YP_001066491
Protein GI: 126453409
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
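
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length does not count the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2215435
end_bp = 2216775

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1341 446
```

Under these assumptions the computed values agree with the listed gene length of 1341 bp and protein length of 446 aa.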


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAAC AAAAGCGAAA GCTCGAGAAG CTGACGGGCG TGAAGGGCAT GAACGACATC 
CTCCCGCAGG ATGCCGGCTT GTGGGAATTC TTCGAGGCGA CGGTGAAGTC GCTGCTGCGC
GCATACGGCT ATCAGAACAT CCGCACGCCG ATCGTCGAGC ATACGCAGCT CTTCACGCGC
GGCATCGGCG AGGTGACCGA CATCGTCGAA AAGGAGATGT ACAGCTTCGT CGATGCGTTG
AACGGCGAGA ACCTGACGCT GCGCCCCGAG AACACCGCGG CCGTCGTGCG CGCGGCGATC
GAGCACAACA TGCTGTATGA CGGCCCGAAA CGCCTGTGGT ATCTCGGGCC GATGTTCCGC
CACGAGCGCC CGCAGCGCGG CCGTTATCGC CAGTTCCATC AGGTCGGCGT CGAGGCGCTC
GGCTTCGCGG GCCCCGACGC GGACGCGGAA ATCATCATGA TGTGCCAGCG CCTGTGGGAC
GATCTCGGTC TCACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGTCTCGC CGAGGAGCGC
GCCGCGCACC GCGTCGAGCT CATCAAGTAT CTCGAGCAGC ACGTCGACAA GCTCGACGAC
GACGCGCAGC GCCGCCTCTA CACCAACCCG CTGCGCGTGC TCGACACGAA GAACCCGGCG
CTGCAGGAGA TCGTGCGGAA CGCGCCGCAG CTGATCGATT TCCTCGGCGA CGTGTCGCGC
GCGCACTTCG ACGGCCTGCA GCGGCTGCTG AAGGCGAACA ACCTGCCGTT TACGATCAAT
CCGCGGCTCG TGCGCGGGCT CGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC
AAGCTCGGCG CGCAGGGCAC GGTCGCCGCG GGCGGCCGCT ACGATCCGCT GATCGAGCAG
TTGGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCGATGG GTGTCGAGCG CATCCTCGAG
CTCCTGAAGG AAGAGCACCT CGTGCCGGAG CAGGAAGGCG TCGACGTGTA CGTCGTCCAC
CAGGGCGACG CGGCGCGCGA GCAGGCGTTC ATCGTCGCCG AGCGTCTGCG CGACACCGGC
CTCGACGTGA TCCTGCATTG CAGCGCGGAC GGCGCGGGCG CGAGCTTCAA GTCGCAGATG
AAGCGCGCGG ATGCAAGCGG CGCGGCGTTC GCGGTGATCT TCGGCGAAGA CGAGGTCGCG
AACGGCACGG TGAGCGTGAA GCCGCTGCGC GGCACGGGCG CCGAAGGCGA GAAGAACGTT
CAGCAGTCCG TGCCGGTCGA AAGCTTGACC GAATTTCTAA TCAATGCGAT GGTTGCAACC
GCCGAAGACG GCGACGACTG A
 
Protein sequence
MTEQKRKLEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR 
GIGEVTDIVE KEMYSFVDAL NGENLTLRPE NTAAVVRAAI EHNMLYDGPK RLWYLGPMFR
HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER
AAHRVELIKY LEQHVDKLDD DAQRRLYTNP LRVLDTKNPA LQEIVRNAPQ LIDFLGDVSR
AHFDGLQRLL KANNLPFTIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ
LGGKPTAACG WAMGVERILE LLKEEHLVPE QEGVDVYVVH QGDAAREQAF IVAERLRDTG
LDVILHCSAD GAGASFKSQM KRADASGAAF AVIFGEDEVA NGTVSVKPLR GTGAEGEKNV
QQSVPVESLT EFLINAMVAT AEDGDD
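
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three blocks and the last row of the gene sequence are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the last row of the gene sequence shown above.
first_blocks = "ATGACAGAAC AAAAGCGAAA GCTCGAGAAG"
last_row = "GCCGAAGACG GCGACGACTG A"

print(translate(first_blocks))  # MTEQKRKLEK
print(translate(last_row))      # AEDGDD
```

The two outputs match the first ten and last six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.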