Gene Information |
Locus tag | GWCH70_2507 |
Symbol | hisS |
ID | 7979059 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2535385 |
End bp | 2536656 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644799308 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_002950468 |
Protein GI | 239827844 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
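The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (neither convention is stated in the record); the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop contributes one residue.
    return cds_length_bp // 3 - 1


length = gene_length(2535385, 2536656)
print(length)                  # 1272
print(protein_length(length))  # 423
```

Both results match the Gene Length (1272 bp) and Protein Length (423 aa) fields.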
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000231867 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTTTTC AAATTCCGCG AGGAACGCAG GATATTTTGC CTGGTGAAGT AGAAAAGTGG CAATATGTTG AAAAAATAGC GCGTGATATT TGCAAACGTT ACAATTACCA TGAAATTCGC ACTCCGATCT TTGAACATAC CGAATTATTT TTGCGCGGTG TTGGCGATAC GACGGATATT GTGCAAAAAG AAATGTATAC GTTTGAAGAC CGGGCGGGAA GAAGCATGAC GCTTCGTCCG GAAGGGACTG CGCCGGTTGT GCGTTCGTTC GTTGAAAATA AAATGTACGG CAATCCAAAT CAGCCAATTA AGTTATATTA CATAGGTCCG ATGTTCCGCT ACGAAAGGCC GCAGGCAGGG CGGTTCCGTC AATTTGTTCA ATTTGGTGTG GAGGCGATTG GAAGCAATGA TCCGGCGATT GACGCGGAAG TTATTGCAAT GGCGATGGAA TTGTATCGAT CGTTAGGATT AAAAAAATTA AGACTGGTCA TTAACAGCCT AGGGGATGTA GAAACCCGAA AAGCGCATCG TCAAGCGCTG ATCGATCATT TTAAAAGCCG GATTCATGAA CTTTGCGAAG ATTGCCAGGT GCGTCTGGAA AAAAATCCGC TTCGCATTTT AGATTGCAAA AAAGACCGTG ACCATGAATT AATGGCAACG GCGCCATCGA TTCTCGATTA TTTAAATGAT GAATCGCGGC ATTATTTTGA AAAAGTAAAA GCTTACTTAA CGAAATTGGG AATTCCATTT GAAGTAGATC CTCGCCTCGT ACGCGGATTA GATTATTATC ATCATACGAC GTTTGAAATT ATGAGTGATG CAGAAGGATT TGGAGCGATT ACAACGTTAT GCGGTGGAGG TCGCTATAGC GGGTTGGTGC AAGAAATAGG CGGGCCAGAA ACACCTGGGA TTGGATTTGC GCTAAGTATT GAGCGGTTGC TCGCCGCTTT AGAGGCAGAA GGGATTACAT TGCCGATTAG CGAAGGCATT GATTGCTATG TTGTCGCTGT TGGCGAAAAA GCAAAAGATG AGTCCATATT ACTTGTTCAC AAGCTGCGAA AAGCTGGAAT TGTCGCTGAT AAAGATTATC AGGACCGGAA AATAAAAGCG CAATTAAAAT CGGCAGACCG CTTGAACGCC AAGTTTGTGG CGATTCTCGG CGATGATGAA CTCGCAAAAG AAGTAATCAA TATAAAAGAG ATGAGCACAG GAGAGCAAAC AGAAGTACCG CTACATTCTG TCGTCGATTA TTTAAAAGAA AGATTGTCAT AG
Protein sequence | MAFQIPRGTQ DILPGEVEKW QYVEKIARDI CKRYNYHEIR TPIFEHTELF LRGVGDTTDI VQKEMYTFED RAGRSMTLRP EGTAPVVRSF VENKMYGNPN QPIKLYYIGP MFRYERPQAG RFRQFVQFGV EAIGSNDPAI DAEVIAMAME LYRSLGLKKL RLVINSLGDV ETRKAHRQAL IDHFKSRIHE LCEDCQVRLE KNPLRILDCK KDRDHELMAT APSILDYLND ESRHYFEKVK AYLTKLGIPF EVDPRLVRGL DYYHHTTFEI MSDAEGFGAI TTLCGGGRYS GLVQEIGGPE TPGIGFALSI ERLLAALEAE GITLPISEGI DCYVVAVGEK AKDESILLVH KLRKAGIVAD KDYQDRKIKA QLKSADRLNA KFVAILGDDE LAKEVINIKE MSTGEQTEVP LHSVVDYLKE RLS
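The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the two inputs are short excerpts copied from the start and end of the gene sequence rather than the full record.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


head = clean_sequence("ATGGCTTTTC AAATTCCGCG")  # first two blocks of the record
tail = clean_sequence("AGATTGTCAT AG")          # last two blocks of the record

print(len(head))             # 20
print(gc_fraction(head))     # 0.45
print(ends_with_stop(tail))  # True
```

Applying `gc_fraction` to the full cleaned gene sequence is one way to compare against the GC content field.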