Gene Msed_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2157 
Symbol 
ID5104896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2072318 
End bp2073349 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content47% 
IMG OID640508048 
Productradical SAM domain-containing protein 
Protein accessionYP_001192220 
Protein GI146304904 
COG category[R] General function prediction only 
COG ID[COG2108] Uncharacterized conserved protein related to pyruvate formate-lyase activating enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.350893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000485535 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACTCCT TGCTCAAGGG AAATCCTGAG ATAGGACTTT ACAATAGGGA ATTGCCCAGG 
GGATGCGAAC TCTGTAGAAT GGGCGGTAAG ATGGTTGTGT TCATTTCTGG AGAGTGTGGA
GACTCCTGCT ATTACTGTCC CGTTAGCGAG GGAAGGTTTG GTAAGGATTC AGCGTATGCC
AATGAGTACA GGGTAAAGGA GTTACAAGAC TTCATTTATG AGTCCTACAG GATGAATGCC
CTTGGTGCAG GGATAACTGG TGGAGATCCA CTACTTCACC TGGACAGGGT AGTGGAATTA
ATTACGTTAC TCAAGGACGA ATTTGGTAGA TCTTATCACA TACACCTTTA CACCACTGGT
AGATACGCCT CAACTGACGC ACTTTTGGAG CTGGCGAAGG CTGGTCTTGA CGAAATAAGG
TTCCATCCTG TAAAGGACCA GTACCTTTCA GCAGTGGAGA GGGCACTCAA GGTTGGGATA
GATGTGGGAC TGGAACTGCC AGTTATACCC GGAGAGGAGG ATAGGCTATC TAAGCTGATT
AATTGGGCTA GGGAAAAGGG CGTGAAGTTC GTTAACCTCA ACGAACTTGA GCTAACCGAG
AGAAATTTCC ATAGCCTCAA TTCCAAGGGT TTCAGGATAG GTCATGGGTT AGCCGGTGTA
TCTGGGAGTT TCGAGACCTC CATGAAGGTG CTTGAGACAT TTCATGAAGC GAACATATCA
CTTCACTACT GTAGTTCGGT ATACAAGGAT GTCGTAGAGA CTAGAACTAG GTTCATCAGA
ACCTTGAGAG CTAGCGGTAA ACCCTACGAG GACATCACAG GAGAGGGTAC CTCATTGCGG
GCCATAGTCA AGTCATCCGC GGATCTTTCG GATTTCGGGG AAAAGATAGG AGACACGTTT
GTGACCAGTC CATCTCTAGT TAACGTCCTT CCCAAGGAAA AGGTTGACGA GATATGGATT
GTGGAGGAAC TACCATATGG TCAAAGACTC TCAGAGAAAC TAGTTTATTC TAAATCTAAG
AATGGCCAGT AG
 
Protein sequence
MNSLLKGNPE IGLYNRELPR GCELCRMGGK MVVFISGECG DSCYYCPVSE GRFGKDSAYA 
NEYRVKELQD FIYESYRMNA LGAGITGGDP LLHLDRVVEL ITLLKDEFGR SYHIHLYTTG
RYASTDALLE LAKAGLDEIR FHPVKDQYLS AVERALKVGI DVGLELPVIP GEEDRLSKLI
NWAREKGVKF VNLNELELTE RNFHSLNSKG FRIGHGLAGV SGSFETSMKV LETFHEANIS
LHYCSSVYKD VVETRTRFIR TLRASGKPYE DITGEGTSLR AIVKSSADLS DFGEKIGDTF
VTSPSLVNVL PKEKVDEIWI VEELPYGQRL SEKLVYSKSK NGQ