Gene Msed_2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2267 
SymbolhisS 
ID5104219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2168940 
End bp2170232 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content40% 
IMG OID640508164 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001192329 
Protein GI146305013 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.921097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.960691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCCT ATGAGCCTAT GCGCGGGATG GAGGATTATT TTGATGTGGA TTCAAAGATC 
ATTAGGTGGA TAGAGAGCAA TTTTAGGGAA ACTGTAGAGA AAGCAGGTTA TAAGGAGGCC
ATGACCCCAA TTGTCGAGGA CTTTGAACTG TTTTCCCTTA AGGGTGGAGA GGAACTAAGG
AACACGATGT ACGTGTTTAA GGATAAGGGG GAGAGAGAAG TGGCTCTAAG ACCTGAGATA
ACTCCCAGTA TCGTGAGGTT ATATCTTAAT TCTTTACAAC ATTATCCAAA ACCCCTAAGA
ATTTTCTATA TAGGGAGGGT ATATAGGTAT GATGAGCCTC AACAGGGCAG ATATAGGGAA
TTTAGGCAAG CAGGGGTTGA ATTACTAGGC TCGGATTCTA TTCTAGCCGA CATAGAAGTT
CTCCATCTCT TAGAAAATTT CTATAGAAGA ATAAACCTTA AGGACAAAAT ATCTTTGAAA
ATAAATAATA TAGGAATTTT CAGAATAATC TTTAATAAAC TCTCATTTGA TGAGCAAGTT
CAGGAACACC TGTTACACCT TTTGGATAAG GGAAAGATAG AGGAGGCGGA AAAAATCTTA
GATGAGAAGA TACGCGATAA CTCTAAAATA AGACAATTCA TTTATACACT AATTACTAAC
GGTAGATCAC TCAAATTAGA AGAGGCAATG AGGGAGGCAG AAAAGACAGA ATTGTCTGAA
CTTGTAAATG AGATAGAGAA TCTAAAACTC ATCTCAGAAA TTCTGTCGTC CCTGAATCTT
GACCATATTG TAGATCTTGG TTTCGTTAGA GGACTTGCAT ACTACACTGG CCCTATCTTT
GAAGTGGTGA AAAGAGATCT TCCGTTCAGT ATCGCTGGAG GCGGAAGATA TGACTCGCTA
GTGGAAGTTT ACGGAGGAAA TAGGACGCCT GCGGTTGGAT TTGCCATAGG AATAGAGAGA
ACTATGTATG CACTTAACAA AGATGGGATT AAATTCGACT CCAATGCCCC ATTGGTAGCT
GTTGTAGCCT TAGACAGGTC AGTTATCCCC CATGCGCTTT CCATAGTTTC CATGCTCAGG
GACAAGGGGT TCATAACTGT GCTAAACAAC AAGGAGATTC CGCTATCAAA ACTCGTACCC
CTATATGCCG AACAGGGATT TACACATCTA ATCATCATGG GGCAAAAAGA AATCACTTCA
GGGAAAGTCA CTGTTAGGAA CCTTGTTAAA AGGGAACAGA TAACAACTGA TGTAAAGGAG
TTAACGAACG TGATTACTGC TCCCGATAAA TAA
 
Protein sequence
MVSYEPMRGM EDYFDVDSKI IRWIESNFRE TVEKAGYKEA MTPIVEDFEL FSLKGGEELR 
NTMYVFKDKG EREVALRPEI TPSIVRLYLN SLQHYPKPLR IFYIGRVYRY DEPQQGRYRE
FRQAGVELLG SDSILADIEV LHLLENFYRR INLKDKISLK INNIGIFRII FNKLSFDEQV
QEHLLHLLDK GKIEEAEKIL DEKIRDNSKI RQFIYTLITN GRSLKLEEAM REAEKTELSE
LVNEIENLKL ISEILSSLNL DHIVDLGFVR GLAYYTGPIF EVVKRDLPFS IAGGGRYDSL
VEVYGGNRTP AVGFAIGIER TMYALNKDGI KFDSNAPLVA VVALDRSVIP HALSIVSMLR
DKGFITVLNN KEIPLSKLVP LYAEQGFTHL IIMGQKEITS GKVTVRNLVK REQITTDVKE
LTNVITAPDK