Gene GYMC61_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_0944 
SymbolhisS 
ID8524767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp943040 
End bp944320 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content57% 
IMG OID 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003252093 
Protein GI261418411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTC AAATTCCAAG AGGGACGCAA GATGTGCTGC CAGGGGACAC GGAAAAATGG 
CAATATGTCG AACATGTCGC CCGCAACCTT TGCAGCCGAT ACGGGTATCG GGAAATCCGG
ACGCCGATTT TCGAGCATAC GGAACTTTTT TTGCGCGGCG TGGGGGATAC AACGGACATT
GTGCAAAAGG AAATGTACAC GTTCGAAGAC AAAGGGGGGC GCGCGCTGAC GCTCCGCCCG
GAAGGCACGG CGCCGGTTGT CAGAGCGTTT GTCGAGCATA AGCTGTACGG CAGTCCGCAC
CAGCCGCTGA AACTGTATTA CAGCGGGCCG ATGTTTCGCT ACGAGCGGCC GGAAGCGGGA
CGGTTCCGCC AGTTCGTCCA GTTCGGCGTC GAGGCGCTCG GCAGCAGCGA TCCGGCGATT
GACGCCGAGG TGATGGCGCT AGCGATGCAT ATTTACGAAG CGCTTGGCTT AAAACGGATC
CGGCTTGTCA TCAACAGCTT GGGCGACCTT GACAGCCGCC GGGCGCACCG AGAGGCGCTT
GTTCGCCATT TTTCAAGCCG CATCCACGAG CTGTGCCCGG ACTGCCAGAC GAGGCTTCAT
ACGAATCCGC TCCGCATTCT CGACTGCAAA AAAGACCGCG ATCATGAGCT GATGGCGACG
GCGCCGTCGA TTTTGGATTA TTTGAACGAA GACTCGCGTG CTTATTTTGA AAAAGTCAAA
CAATATTTAA CTAACCTTGG CATTCCGTTT GTCATTGATT CGCGTCTTGT GCGCGGACTT
GATTACTACA ATCATACGAC GTTCGAAATT ATGAGCGAAG CGGAAGGATT TGGCGCGGCG
GCGACGCTGT GCGGCGGCGG GCGTTACAAC GGGCTTGTCC AAGAAATCGG CGGTCCGGAG
ACGCCGGGCA TCGGCTTTGC CTTGAGCATC GAGCGGCTGT TGGCCGCCCT TGACGCCGAA
GGAGTGGAAT TGCCGGTGGA GAGCGGGCTT GACTGCTATG TCGTCGCTGT CGGTGAGCGG
GCGAAAGATG AGGCGGTCCG CCTCGTTTAT GCATTGCGCC GCTCCGGATT GAGGGTCGAT
CAAGATTATT TGGGCCGAAA ATTGAAGGCG CAGCTGAAGG CCGCCGACCG GCTTGGGGCA
TCGTTTGTCG CCATTATCGG CGATGAAGAA CTCGAGAGAC AGGAAGCGGC GGTAAAGCAT
ATGGCGAGCG GCGAGCAAAC GAATGTACCG CTCGGCGAGT TGGCGCACTT TTTGCATGAA
CGGATCGGGA AGGAGGAGTG A
 
Protein sequence
MAFQIPRGTQ DVLPGDTEKW QYVEHVARNL CSRYGYREIR TPIFEHTELF LRGVGDTTDI 
VQKEMYTFED KGGRALTLRP EGTAPVVRAF VEHKLYGSPH QPLKLYYSGP MFRYERPEAG
RFRQFVQFGV EALGSSDPAI DAEVMALAMH IYEALGLKRI RLVINSLGDL DSRRAHREAL
VRHFSSRIHE LCPDCQTRLH TNPLRILDCK KDRDHELMAT APSILDYLNE DSRAYFEKVK
QYLTNLGIPF VIDSRLVRGL DYYNHTTFEI MSEAEGFGAA ATLCGGGRYN GLVQEIGGPE
TPGIGFALSI ERLLAALDAE GVELPVESGL DCYVVAVGER AKDEAVRLVY ALRRSGLRVD
QDYLGRKLKA QLKAADRLGA SFVAIIGDEE LERQEAAVKH MASGEQTNVP LGELAHFLHE
RIGKEE