Gene Msed_2227 details


Gene Information

Locus tag: Msed_2227
Symbol:
ID: 5104288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 2131844
End bp: 2132812
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 45%
IMG OID: 640508120
Product: aldo/keto reductase
Protein accession: YP_001192289
Protein GI: 146304973
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
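
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2131844
END_BP = 2132812


def gene_length_bp(start, end):
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    return length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 969 322
```

Under these assumptions the result agrees with the listed gene length of 969 bp and protein length of 322 aa.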


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000023844
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTTGCTCA GGGACTTGGG TCATACTGGC ATAAAGACTT CAGAGCTGGG AATTGGAATG 
TGGACATTGG TTACAGATTG GTGGGGTGAA CCAGATAAAG CACAGGAGAT AGTTCGGCGC
GCTATTGAGC TAGGAATTAA CTTCTTTGAC ACGGCAGATA TGTATGGCAA CGGAAGGGCA
GAGGAGGTAC TGGGAAGATC CCTAGGATCT AAGAGGGACA AGGTAGTAAT CCTAACTAAG
GTGGGTTACG ATTTCTATTC GTCACCGCAA AGGCCTAGAC AAAGGTTCGA TCTAGATTAT
CTCAGGACCG CTGTGGATAG ATCGCTGAAA AGACTCTCAA CTAACTATGT CGACATTCTC
ATGATACATA ATCCGAAGAT GAAGGACATA ACCAGGAGGG ATCTGTTAGA TTTTATGAGG
TCACTTAAAT CAGATGGGAT TGCGAGGGCA GTTGGGGTGG CGTTGGGCCC CACATTGGGT
TGGGAAGATG AAGGGTTGAA GGCCATAGAG ATGGGGTATG AGGCCCTGGA ACACATATTC
AATCTAATCG AGCTATATCC AGGGTTAAGG TTTCTAGAGT TTGATGTGGG CCATATAGTT
AGGGTACCAC ATGCATCTGA CGTGCTAAAC GAATCAAAGT GGCCCCTGAA CTACGATCCG
AAGTTGCACA GACACTTCAA GAGTCAGCAA TGGATAAATA CTGCAGTGGA TAGGACTAAG
GGTCTACTGG ACTACGCTAG TAAGCTTGGA GTTACGCTAA GCCAGCTAGC CTTAAGTTTC
GTGCTGTCCC ACAAAAGGGT TTCAACAGTA ATTCCCAACA TCACTACGGT CAGGGAATTG
GAAGAGTTTG TGAAATCCAC AGAATTTGTT TTGAACAACG ATGACGTGAA TTTCCTTATG
GACTATTACG AGAGGAATTA TAGGGACCTT AACGAAGAGA GTATTAAAGA AACGCAAGCT
TACAAATGA
 
Protein sequence
MLLRDLGHTG IKTSELGIGM WTLVTDWWGE PDKAQEIVRR AIELGINFFD TADMYGNGRA 
EEVLGRSLGS KRDKVVILTK VGYDFYSSPQ RPRQRFDLDY LRTAVDRSLK RLSTNYVDIL
MIHNPKMKDI TRRDLLDFMR SLKSDGIARA VGVALGPTLG WEDEGLKAIE MGYEALEHIF
NLIELYPGLR FLEFDVGHIV RVPHASDVLN ESKWPLNYDP KLHRHFKSQQ WINTAVDRTK
GLLDYASKLG VTLSQLALSF VLSHKRVSTV IPNITTVREL EEFVKSTEFV LNNDDVNFLM
DYYERNYRDL NEESIKETQA YK
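
The gene and protein sequences are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the sequence is pasted with the 10-base spacing used in this record, uses the amino acid assignments of the standard code (which table 11 shares; the tables differ in permitted start codons), and stops at the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built in TCAG order; amino acid assignments of the standard code,
# which translation table 11 also uses for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    bases = clean(seq)
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon."""
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_bases = "ATGTTGCTCA GGGACTTGGG TCATACTGGC"
print(translate(first_bases))  # MLLRDLGHTG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_content` and comparing with the listed 45%, gives a quick consistency check on the record.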