Gene Information |
Locus tag | Msed_2072 |
Symbol | |
ID | 5105052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1989497 |
End bp | 1990276 |
Gene Length | 780 bp |
Protein Length | 259 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507962 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001192136 |
Protein GI | 146304820 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTGG GAATAAGGGG TAAAAGGGTT CTTGTAACTG CCTCAAGCAA AGGGATAGGT TTCGCCACAG CAAAAAGGTT TCTGGAGGAA GGGGCTGTAG TCACCATTTC GTCCCATAAC CTCGAAACGT TGAAATCTGC CTACGAGAAA CTGAGAAACT TGGGTCAAGT TTACATGGTC CAGGCAGACT TGACTAAACC AGATGAGGCT AGGGAGCTTG TGAAGATGGC CCACGATACC ATGAATGGGT TAGACGTAAC GGTATACGTC ACTGGAAGCC CAAAGCCAGG TAACCTTCTT GAGCTCACGG ATAAGGACTG GATGGACGCA TTCAATTTAC TATTAATGAG TGCAGTTGTC GTTACGAGGG AATCTGCCAA GTACATGAAA CCCGGTGGCA GGATAATTCT TTCGACCTCT ATGACCCTAA AACAACCAAT CGATAACCTG GACCTTTCCA ACGTTGTTAG GCTATCCCTT GCAGGTCTCA TTAAGTCTGC CTCCAGAGAA CTAGGTCTTA AGGGGATTCT CGTGAATGGG GTCATGCCAG GCTGGACTCT CACAGAGAGA GTTAATCAAT TAGCCCGGGA CAGGGCAAGA AGAGAGGGGA AGACTGAGGA ACAGGTTATA TCTGAGATTG TCAAGGAAGT CCCGCTAAAT AGGATAGGCC TTCCAGAGGA AGTTGCTAAC GTTATCCTCT TCTTGAGCTC TTCCCTATCC ACATACGTCA CTGGAACCCT CATCCCCGTG GATGGAGGAC TGATCAGGAC AACCCTTTAA
Protein sequence | MDLGIRGKRV LVTASSKGIG FATAKRFLEE GAVVTISSHN LETLKSAYEK LRNLGQVYMV QADLTKPDEA RELVKMAHDT MNGLDVTVYV TGSPKPGNLL ELTDKDWMDA FNLLLMSAVV VTRESAKYMK PGGRIILSTS MTLKQPIDNL DLSNVVRLSL AGLIKSASRE LGLKGILVNG VMPGWTLTER VNQLARDRAR REGKTEEQVI SEIVKEVPLN RIGLPEEVAN VILFLSSSLS TYVTGTLIPV DGGLIRTTL
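
The length and composition fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len < 3 or gene_len % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC content in percent; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values taken from the Gene Information table above.
    length = gene_length(1989497, 1990276)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare with the GC content row; the whitespace handling lets the sequence be pasted in as displayed.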