Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0944 |
Symbol | hisS |
ID | 8524767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 943040 |
End bp | 944320 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | histidyl-tRNA synthetase |
Protein accession | YP_003252093 |
Protein GI | 261418411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTTC AAATTCCAAG AGGGACGCAA GATGTGCTGC CAGGGGACAC GGAAAAATGG CAATATGTCG AACATGTCGC CCGCAACCTT TGCAGCCGAT ACGGGTATCG GGAAATCCGG ACGCCGATTT TCGAGCATAC GGAACTTTTT TTGCGCGGCG TGGGGGATAC AACGGACATT GTGCAAAAGG AAATGTACAC GTTCGAAGAC AAAGGGGGGC GCGCGCTGAC GCTCCGCCCG GAAGGCACGG CGCCGGTTGT CAGAGCGTTT GTCGAGCATA AGCTGTACGG CAGTCCGCAC CAGCCGCTGA AACTGTATTA CAGCGGGCCG ATGTTTCGCT ACGAGCGGCC GGAAGCGGGA CGGTTCCGCC AGTTCGTCCA GTTCGGCGTC GAGGCGCTCG GCAGCAGCGA TCCGGCGATT GACGCCGAGG TGATGGCGCT AGCGATGCAT ATTTACGAAG CGCTTGGCTT AAAACGGATC CGGCTTGTCA TCAACAGCTT GGGCGACCTT GACAGCCGCC GGGCGCACCG AGAGGCGCTT GTTCGCCATT TTTCAAGCCG CATCCACGAG CTGTGCCCGG ACTGCCAGAC GAGGCTTCAT ACGAATCCGC TCCGCATTCT CGACTGCAAA AAAGACCGCG ATCATGAGCT GATGGCGACG GCGCCGTCGA TTTTGGATTA TTTGAACGAA GACTCGCGTG CTTATTTTGA AAAAGTCAAA CAATATTTAA CTAACCTTGG CATTCCGTTT GTCATTGATT CGCGTCTTGT GCGCGGACTT GATTACTACA ATCATACGAC GTTCGAAATT ATGAGCGAAG CGGAAGGATT TGGCGCGGCG GCGACGCTGT GCGGCGGCGG GCGTTACAAC GGGCTTGTCC AAGAAATCGG CGGTCCGGAG ACGCCGGGCA TCGGCTTTGC CTTGAGCATC GAGCGGCTGT TGGCCGCCCT TGACGCCGAA GGAGTGGAAT TGCCGGTGGA GAGCGGGCTT GACTGCTATG TCGTCGCTGT CGGTGAGCGG GCGAAAGATG AGGCGGTCCG CCTCGTTTAT GCATTGCGCC GCTCCGGATT GAGGGTCGAT CAAGATTATT TGGGCCGAAA ATTGAAGGCG CAGCTGAAGG CCGCCGACCG GCTTGGGGCA TCGTTTGTCG CCATTATCGG CGATGAAGAA CTCGAGAGAC AGGAAGCGGC GGTAAAGCAT ATGGCGAGCG GCGAGCAAAC GAATGTACCG CTCGGCGAGT TGGCGCACTT TTTGCATGAA CGGATCGGGA AGGAGGAGTG A
|
Protein sequence | MAFQIPRGTQ DVLPGDTEKW QYVEHVARNL CSRYGYREIR TPIFEHTELF LRGVGDTTDI VQKEMYTFED KGGRALTLRP EGTAPVVRAF VEHKLYGSPH QPLKLYYSGP MFRYERPEAG RFRQFVQFGV EALGSSDPAI DAEVMALAMH IYEALGLKRI RLVINSLGDL DSRRAHREAL VRHFSSRIHE LCPDCQTRLH TNPLRILDCK KDRDHELMAT APSILDYLNE DSRAYFEKVK QYLTNLGIPF VIDSRLVRGL DYYNHTTFEI MSEAEGFGAA ATLCGGGRYN GLVQEIGGPE TPGIGFALSI ERLLAALDAE GVELPVESGL DCYVVAVGER AKDEAVRLVY ALRRSGLRVD QDYLGRKLKA QLKAADRLGA SFVAIIGDEE LERQEAAVKH MASGEQTNVP LGELAHFLHE RIGKEE
|
| |