Gene STER_1950 details

Gene Information

Locus tag: STER_1950
Symbol: hisS
ID: 4437461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus thermophilus LMD-9
Kingdom: Bacteria
Replicon accession: NC_008532
Strand:
Start bp: 1807719
End bp: 1808999
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 40%
IMG OID: 639677515
Product: histidyl-tRNA synthetase
Protein accession: YP_821256
Protein GI: 116628637
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
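
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1807719, 1808999)
print(length, protein_length(length))  # prints "1281 426"
```

Both values agree with the Gene Length and Protein Length fields of the record.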


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTTC AAAAACCTAA GGGAACGCAG GATATTTTAC CTGGGGATAG TGCCAAATGG 
CAGTACGTGG AGAATGTTGC ACGTGAAACA TTTAAAAAAT ACAATTATGG TGAAATTCGT
ACGCCTATGT TTGAACATTA CGAGGTCATT TCACGTTCAG TAGGTGATAC AACTGATATC
GTTACTAAGG AAATGTATGA TTTTCATGAT AAGGGAGACC GTCATATTAC ACTCCGCCCA
GAAGGAACAG CACCGGTTGT ACGCTCTTAT GTAGAAAACA AACTCTTTGC GCCAGAGGTC
CAAAAACCTG TTAAAGTTTA TTATATTGGA TCAATGTTCC GTTATGAACG TCCTCAAGCA
GGACGCTTGC GCGAGTTCCA CCAACTAGGT GTAGAGTGCT TTGGCTCAAA AAATCCAGCA
ACAGATGTTG AAACAATTGC CATGGCCTAC CAACTCTTTA ATACGCTTGG CATTAAGGAT
GTTACTCTTC ATTTGAATAG TCTTGGAAAT ACTGACAGTC GTCTGGCTTA TCGTCAGGCC
TTGATTGACT ATTTGACACC AATGCGCGAG AGTTTGTCAA AAGATAGCCA ACGCCGTTTG
GAAGAAAATC CTTTGCGAGT ACTTGATTCA AAAGAAAAAG AAGATAAGGT TGCAGTTGAA
AATGCTCCAT CTATCCTTGA TTATTTAGAT GAAGAAAGTC AAACTCACTT TGATGAAGTG
CGTGCCATGC TCGATAGTCT TAACATTCCA TATGTGATTG ATACCAATAT GGTACGTGGT
CTGGATTACT ATAACCACAC GATTTTTGAA TTTATTACCA CTATTGACAA GTCTGAGTTA
ACAATCTGTG CGGGCGGTCG TTATGATAGT TTGGTTGAAT ATTTCGGTGG TCCAGAAACA
GCTGGATTTG GTTTTGGACT TGGTTTAGAA CGCTTGCTTT TGGTTCTTGA TAAGCAAGGC
ATTAAACTTC CGGTAGAAGA AAGTCTTGAT GTCTACATTG CAGTACTTGG TTCGGGCGCT
AATGGCAAAG CTCTTGAGTT AGTTCAATCC ATCCGCTACC AAGGATTTAA AGCTGAACGT
GATTACCTTG GACGTAAGAT TAAGGCACAG TTTAAGTCAG CAGATACCTT CAAAGCCAAG
ACTGTTATCA CATTAGGTGA GAGTGAAGTG GAGTCAGGTG TGGTTAAGGT CAAAAATAAT
GCTACTCGTG AGGAAGTTAC TGTAAGTTTT GAAGAGCTAA CTACAAACTT CGCAACAGTC
CTCAAACAGT TAGAAAAGTA G
 
Protein sequence
MKLQKPKGTQ DILPGDSAKW QYVENVARET FKKYNYGEIR TPMFEHYEVI SRSVGDTTDI 
VTKEMYDFHD KGDRHITLRP EGTAPVVRSY VENKLFAPEV QKPVKVYYIG SMFRYERPQA
GRLREFHQLG VECFGSKNPA TDVETIAMAY QLFNTLGIKD VTLHLNSLGN TDSRLAYRQA
LIDYLTPMRE SLSKDSQRRL EENPLRVLDS KEKEDKVAVE NAPSILDYLD EESQTHFDEV
RAMLDSLNIP YVIDTNMVRG LDYYNHTIFE FITTIDKSEL TICAGGRYDS LVEYFGGPET
AGFGFGLGLE RLLLVLDKQG IKLPVEESLD VYIAVLGSGA NGKALELVQS IRYQGFKAER
DYLGRKIKAQ FKSADTFKAK TVITLGESEV ESGVVKVKNN ATREEVTVSF EELTTNFATV
LKQLEK
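
The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. This is a hedged sketch rather than the database's own method: it builds the codon-to-amino-acid mapping shared by translation tables 1 and 11 (the two tables differ only in which codons may act as start codons), strips the 10-base block spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final in-frame stretch of the gene sequence above.
print(translate("ATGAAACTTC AAAAACCTAA GGGAACGCAG"))  # prints "MKLQKPKGTQ"
print(translate("CTCAAACAGT TAGAAAAGTA G"))           # prints "LKQLEK"
```

The two printed fragments match the first and last blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the remaining residues and the GC content field to be compared in the same way.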