Gene GWCH70_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2507 
SymbolhisS 
ID7979059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2535385 
End bp2536656 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content43% 
IMG OID644799308 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002950468 
Protein GI239827844 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000231867 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTC AAATTCCGCG AGGAACGCAG GATATTTTGC CTGGTGAAGT AGAAAAGTGG 
CAATATGTTG AAAAAATAGC GCGTGATATT TGCAAACGTT ACAATTACCA TGAAATTCGC
ACTCCGATCT TTGAACATAC CGAATTATTT TTGCGCGGTG TTGGCGATAC GACGGATATT
GTGCAAAAAG AAATGTATAC GTTTGAAGAC CGGGCGGGAA GAAGCATGAC GCTTCGTCCG
GAAGGGACTG CGCCGGTTGT GCGTTCGTTC GTTGAAAATA AAATGTACGG CAATCCAAAT
CAGCCAATTA AGTTATATTA CATAGGTCCG ATGTTCCGCT ACGAAAGGCC GCAGGCAGGG
CGGTTCCGTC AATTTGTTCA ATTTGGTGTG GAGGCGATTG GAAGCAATGA TCCGGCGATT
GACGCGGAAG TTATTGCAAT GGCGATGGAA TTGTATCGAT CGTTAGGATT AAAAAAATTA
AGACTGGTCA TTAACAGCCT AGGGGATGTA GAAACCCGAA AAGCGCATCG TCAAGCGCTG
ATCGATCATT TTAAAAGCCG GATTCATGAA CTTTGCGAAG ATTGCCAGGT GCGTCTGGAA
AAAAATCCGC TTCGCATTTT AGATTGCAAA AAAGACCGTG ACCATGAATT AATGGCAACG
GCGCCATCGA TTCTCGATTA TTTAAATGAT GAATCGCGGC ATTATTTTGA AAAAGTAAAA
GCTTACTTAA CGAAATTGGG AATTCCATTT GAAGTAGATC CTCGCCTCGT ACGCGGATTA
GATTATTATC ATCATACGAC GTTTGAAATT ATGAGTGATG CAGAAGGATT TGGAGCGATT
ACAACGTTAT GCGGTGGAGG TCGCTATAGC GGGTTGGTGC AAGAAATAGG CGGGCCAGAA
ACACCTGGGA TTGGATTTGC GCTAAGTATT GAGCGGTTGC TCGCCGCTTT AGAGGCAGAA
GGGATTACAT TGCCGATTAG CGAAGGCATT GATTGCTATG TTGTCGCTGT TGGCGAAAAA
GCAAAAGATG AGTCCATATT ACTTGTTCAC AAGCTGCGAA AAGCTGGAAT TGTCGCTGAT
AAAGATTATC AGGACCGGAA AATAAAAGCG CAATTAAAAT CGGCAGACCG CTTGAACGCC
AAGTTTGTGG CGATTCTCGG CGATGATGAA CTCGCAAAAG AAGTAATCAA TATAAAAGAG
ATGAGCACAG GAGAGCAAAC AGAAGTACCG CTACATTCTG TCGTCGATTA TTTAAAAGAA
AGATTGTCAT AG
 
Protein sequence
MAFQIPRGTQ DILPGEVEKW QYVEKIARDI CKRYNYHEIR TPIFEHTELF LRGVGDTTDI 
VQKEMYTFED RAGRSMTLRP EGTAPVVRSF VENKMYGNPN QPIKLYYIGP MFRYERPQAG
RFRQFVQFGV EAIGSNDPAI DAEVIAMAME LYRSLGLKKL RLVINSLGDV ETRKAHRQAL
IDHFKSRIHE LCEDCQVRLE KNPLRILDCK KDRDHELMAT APSILDYLND ESRHYFEKVK
AYLTKLGIPF EVDPRLVRGL DYYHHTTFEI MSDAEGFGAI TTLCGGGRYS GLVQEIGGPE
TPGIGFALSI ERLLAALEAE GITLPISEGI DCYVVAVGEK AKDESILLVH KLRKAGIVAD
KDYQDRKIKA QLKSADRLNA KFVAILGDDE LAKEVINIKE MSTGEQTEVP LHSVVDYLKE
RLS