Gene Mthe_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0933 
Symbol 
ID4463326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1016729 
End bp1018141 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content57% 
IMG OID639699952 
Producthistidyl-tRNA synthetase 
Protein accessionYP_843361 
Protein GI116754243 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.638014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAA GATATAAAGG CATTTTTGCG AATCCGTGGA TGATGATCCA GAGGCCCAGA 
GGAACCAGGG ACTTCCCGCC TGAGGAGGCT TACAGAAGGC GTGCTGTCAG GGAGAAGATG
ATCGATGTGA TGGAGAGATG GGGCTACCGC GAGGTTGCAA CCCCGACATT CGAGCATCTC
GAGCTCTTCA CGCTCAAATC AGGTGAGGGG GTCATAGAGG AGATATACAG CTTCAAGGAC
AAGGGCGGCA GGGACATCGC CCTAAGGCCG GAGCTCACCG CACCCGTGAT GAGGATGTAC
GTCAGCGAGC TTCACAGCTC CCCGAAGCCG TTGAGGCTCT ACTACTTCGC CAACTGCTTC
AGGTACGAGA GGCCGCAGAA GGGCAGGTTC AGGGAGTTCT GGCAGCTCGG CTGCGAGCTC
ATCGGCGGAA GGCGTCCCGA CTCAGAGGCA GAGGTCATAG CGATGGCTGA CGAGGTCCTG
AGAGCAGTTG GGATCAGAGG GGATATCCAC ATCGGGTATC TGGGACTGAT AAGATCGATG
CTGAAGAAGG TCCCCGAGGC GCACAGGCAG AGCATAATGA GGCTCATCGA CAAGAAGGAG
AGGGATGCGC TCAGGGAGCT GCTCCAGAGC ATAGGGGCTG AGGATCTCGG GATCATTGAG
CTGATCAGCC TGAAGGGGAA GGATGCGCTC GAAAGAGCTG AGGAGCTGAG CGCTGCCCTT
TCATCAATCG AGATTCCCGG AGAGAATGCC AGCGCCGCGC AATCTGTTGC GGCACCAGGA
GCGAGCGGGA GAGAGGGTGC ACGCGCCTCT GCGAGAGCGG GCAGATCTCC TGAGTCTGAG
ATGGACCTGA GCGAGTTCAG GGAGATGCTG GAGCTTCTGG ATGCATACGA TGTTGAGGCG
ACGATCGACT TCGAGATAGT GCGCGGTCTC GAGTACTACA CCGGAACGGT CTTCGAGATC
TACGCCTCCG GCCTGGGAGC CCAGAACCAG ATATGCGGCG GCGGATCCTA TGAGCTTGCA
AGTCTATTTG GAGGATCCGA GACGTTCTCC ACCGGATTCG GCCTCGGGTT CGACAGGATA
ATGGAGGTCG TCGGAGAGGT CGATCGACAA AAGCCTCCCG TGGTTCTTGC ATTCACCCCC
GATGTGAAGA TTGATGCGAT CAGGATCGCG AAGCGTCTCA GGAATATCGT TCCCGTTGTC
ATGGATGTCA TGGGACGCTC TCTGAGCGCA CAGCTGAAAT CCGCATCTGC GATCAACGCT
GAGCACGTCA TAATCATCGG GAGAAGGGAG CTCGACTCCG GGAAGCTCGT TCTCAGGGAT
ATGGTGAAAG GCTCTCAGGA GGAGCTGAGC ATCGAGGAGA TCGAGGAACA GCTGAGAATC
GCCTTTGAAC AAACCACCCC TGTGCGAAGA TAA
 
Protein sequence
MAARYKGIFA NPWMMIQRPR GTRDFPPEEA YRRRAVREKM IDVMERWGYR EVATPTFEHL 
ELFTLKSGEG VIEEIYSFKD KGGRDIALRP ELTAPVMRMY VSELHSSPKP LRLYYFANCF
RYERPQKGRF REFWQLGCEL IGGRRPDSEA EVIAMADEVL RAVGIRGDIH IGYLGLIRSM
LKKVPEAHRQ SIMRLIDKKE RDALRELLQS IGAEDLGIIE LISLKGKDAL ERAEELSAAL
SSIEIPGENA SAAQSVAAPG ASGREGARAS ARAGRSPESE MDLSEFREML ELLDAYDVEA
TIDFEIVRGL EYYTGTVFEI YASGLGAQNQ ICGGGSYELA SLFGGSETFS TGFGLGFDRI
MEVVGEVDRQ KPPVVLAFTP DVKIDAIRIA KRLRNIVPVV MDVMGRSLSA QLKSASAINA
EHVIIIGRRE LDSGKLVLRD MVKGSQEELS IEEIEEQLRI AFEQTTPVRR