Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2227 |
Symbol | |
ID | 5104288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2131844 |
End bp | 2132812 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640508120 |
Product | aldo/keto reductase |
Protein accession | YP_001192289 |
Protein GI | 146304973 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000023844 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTGCTCA GGGACTTGGG TCATACTGGC ATAAAGACTT CAGAGCTGGG AATTGGAATG TGGACATTGG TTACAGATTG GTGGGGTGAA CCAGATAAAG CACAGGAGAT AGTTCGGCGC GCTATTGAGC TAGGAATTAA CTTCTTTGAC ACGGCAGATA TGTATGGCAA CGGAAGGGCA GAGGAGGTAC TGGGAAGATC CCTAGGATCT AAGAGGGACA AGGTAGTAAT CCTAACTAAG GTGGGTTACG ATTTCTATTC GTCACCGCAA AGGCCTAGAC AAAGGTTCGA TCTAGATTAT CTCAGGACCG CTGTGGATAG ATCGCTGAAA AGACTCTCAA CTAACTATGT CGACATTCTC ATGATACATA ATCCGAAGAT GAAGGACATA ACCAGGAGGG ATCTGTTAGA TTTTATGAGG TCACTTAAAT CAGATGGGAT TGCGAGGGCA GTTGGGGTGG CGTTGGGCCC CACATTGGGT TGGGAAGATG AAGGGTTGAA GGCCATAGAG ATGGGGTATG AGGCCCTGGA ACACATATTC AATCTAATCG AGCTATATCC AGGGTTAAGG TTTCTAGAGT TTGATGTGGG CCATATAGTT AGGGTACCAC ATGCATCTGA CGTGCTAAAC GAATCAAAGT GGCCCCTGAA CTACGATCCG AAGTTGCACA GACACTTCAA GAGTCAGCAA TGGATAAATA CTGCAGTGGA TAGGACTAAG GGTCTACTGG ACTACGCTAG TAAGCTTGGA GTTACGCTAA GCCAGCTAGC CTTAAGTTTC GTGCTGTCCC ACAAAAGGGT TTCAACAGTA ATTCCCAACA TCACTACGGT CAGGGAATTG GAAGAGTTTG TGAAATCCAC AGAATTTGTT TTGAACAACG ATGACGTGAA TTTCCTTATG GACTATTACG AGAGGAATTA TAGGGACCTT AACGAAGAGA GTATTAAAGA AACGCAAGCT TACAAATGA
|
Protein sequence | MLLRDLGHTG IKTSELGIGM WTLVTDWWGE PDKAQEIVRR AIELGINFFD TADMYGNGRA EEVLGRSLGS KRDKVVILTK VGYDFYSSPQ RPRQRFDLDY LRTAVDRSLK RLSTNYVDIL MIHNPKMKDI TRRDLLDFMR SLKSDGIARA VGVALGPTLG WEDEGLKAIE MGYEALEHIF NLIELYPGLR FLEFDVGHIV RVPHASDVLN ESKWPLNYDP KLHRHFKSQQ WINTAVDRTK GLLDYASKLG VTLSQLALSF VLSHKRVSTV IPNITTVREL EEFVKSTEFV LNNDDVNFLM DYYERNYRDL NEESIKETQA YK
|
| |