Gene Mpal_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2198 
SymbolhisS 
ID7270283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2339258 
End bp2340487 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID643570812 
Producthistidyl-tRNA synthetase 
Protein accessionYP_002467217 
Protein GI219852785 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACAGC GACCACGCGG CACACGTGAC TTTTTACCAG ACGAGATGGA AGCACGGCGG 
ATGATCGAGG GACGGATGCG CGAGGCGGTC AGACGCTGGG GGTACCGCGA GGTCGCGACC
CCGATCTTTG AGGACCTCTC CCTCTTCACG ATGCGGTCAG GGCAGGGGAT CATCGACGAA
ATGTATGTCT TTCAGGACAA GGGGGGCAGG GATCTCGCAC TTCGGCCCGA GTTGACCGCG
GCCGTCCTTC GGATGTATGT GAACGAGGCC CGGGTGCTCG CAAAGCCACT CCGATGGTGC
TATTTCGCCG ATTGTTTCCG GTACGAAAGG CCTCAGAAGG GGCGGTACCG GCAGTTCTGG
CAGTTCGGGG TCGAACTGAT CGGGGCCGAT ACCGCAGCGG CCGATGCCGA GGTGATCCTG
GTCGCCGATA ATGCGATCCG GTCCAGCGGG CTTGACTACG ACCTGAAGAT CGGTCATCTC
GGGCTGATGA AGCACCTGCT CGCCGATGTA GGCGAGGAGG TCCGGCGGAA GGTGATGGCC
TACCTCGACA AGAAGGAGTT CGACGGCCTC AAGGACTACC TGGAGACCGC CGGACTGGCC
GCCCTGACCG ACCCGCTCAC CCGGCTGCTG GCGGCCGAGA CCCTGGACGA GGCCTTCGCG
GTCACCGGTC CTTTGCCTGA GGAGGGGCGG ATCCGGGAGA TGGCAGCGCT GCTCGACGCC
ACCGGTGTTC GCTACTCGTA CAACTTCGGG ATCGCCCGGG GGCTGGATTA CTATACAGGA
ATGGTTTTCG AAGGGTTCGC GAAGAACCTC GGGGCCGAGA GCCAGATCGT CGGCGGCGGG
GCGTACCGGC TCGCGCAGGT CTTCGGCGGG GACGATGCTC CGTCGGTCGG GTTTGCGATC
GGCTTCGACC GGGTGATGGT CGCACTCGGC GAGGTCCAGA CCAGAAAGCA GAAGATCGTC
GGGGTCGTCT CCACGGCCGA GGGCCGGCCC TTTGCATTCC GGGTGGCAAA CGCCTTTCGG
ACGGCCGGCA TCCGCGCCGA CCTCGACCTC TCCGATCGCG GGCTCGGGGC CCAGCTGGCC
CGGGCTGCGA AGGAGGCCGA CTTCGCCGTG CTGATCGGCG AGCGGGAGGT CGCCACCGGG
ACAGTCACTC TCAAGAACCT CGCCACCGGG ACGCAGACCG CCGTGACGAT CGACCTGGCG
GTCAGGACGG TGATCGATGG TTCTCGCTGA
 
Protein sequence
MVQRPRGTRD FLPDEMEARR MIEGRMREAV RRWGYREVAT PIFEDLSLFT MRSGQGIIDE 
MYVFQDKGGR DLALRPELTA AVLRMYVNEA RVLAKPLRWC YFADCFRYER PQKGRYRQFW
QFGVELIGAD TAAADAEVIL VADNAIRSSG LDYDLKIGHL GLMKHLLADV GEEVRRKVMA
YLDKKEFDGL KDYLETAGLA ALTDPLTRLL AAETLDEAFA VTGPLPEEGR IREMAALLDA
TGVRYSYNFG IARGLDYYTG MVFEGFAKNL GAESQIVGGG AYRLAQVFGG DDAPSVGFAI
GFDRVMVALG EVQTRKQKIV GVVSTAEGRP FAFRVANAFR TAGIRADLDL SDRGLGAQLA
RAAKEADFAV LIGEREVATG TVTLKNLATG TQTAVTIDLA VRTVIDGSR