Gene Information |
Locus tag | Msed_1761 |
Symbol | |
ID | 5104761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1697872 |
End bp | 1698669 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507656 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001191840 |
Protein GI | 146304524 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
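The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the 798 bp gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 1697872
END_BP = 1698669


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids: codon count minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 798
print(protein_length(length_bp))  # 265
```

Under these assumptions the results agree with the Gene Length (798 bp) and Protein Length (265 aa) fields.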
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.254427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCCTG TTCAGGGAAA AAGGGTACTT GTTACCTCAT CCACAGAGGG AATAGGCAGG GGAGTTGCAG AGACCTTTGC CTCCCACGGC GCCGTGGTCA CAATCACCTC TAGATCTGGG GAGAAGTTGC ACCGGGCACT TCACGACCTA AGGAAGATCA GCCCAGCAGT TTACGGAACC CAGTCAGACA TGACCAATCT GGGCTCGTTG AACTCCCTAG TCTCATACGC TCTCCACGTT ATGGGAGGAA TAGACATCCT AGTGGTGAAT TCAGGGAATC CACCCAGGGA ACCCATCACC TTCTCAGAGG CTGACATTCA CGACTGGGAA TATGCGACCA AGCTCTACCT TCTCAGCGCC GTCTCCCTTT CCAAGCTGGT TATCCCCGAC ATGATTTCCA GGCAATGGGG CAGGATATTC TTCCTCTCCT CCTGGACCGT GAGGGAGCCC CAGAGCATCC TAGTCCTCGC TGACGTTTCC CGTTCCCCAC TTCTTCAGTT AACCAAGATC CTCTCAAGGG ACTACGGCAG ACACGGGATC ACCGTCAACA CGATCCTCAT GGGTAGCTTC CCCACCGAAG GCGCCAAGAA AACCCTATCG CGATACGCTG AATCCAAGGG CCTCCCCTTC GAACAAGTTT GGAAGGAGAG GGTCCTCGAC CCCATCTCCG TTGGTAGGCT GGGCGATGTG AAGAGGGACC TAGGCTCCCT CCTTCTCTTT CTCTCCACGG ACATGGGAAG TTACATCACG GGCACAAGCA TCCTGGTGGA CGGAGGAACA ACGTCATCCG TGGGTTAA
|
Protein sequence | MFPVQGKRVL VTSSTEGIGR GVAETFASHG AVVTITSRSG EKLHRALHDL RKISPAVYGT QSDMTNLGSL NSLVSYALHV MGGIDILVVN SGNPPREPIT FSEADIHDWE YATKLYLLSA VSLSKLVIPD MISRQWGRIF FLSSWTVREP QSILVLADVS RSPLLQLTKI LSRDYGRHGI TVNTILMGSF PTEGAKKTLS RYAESKGLPF EQVWKERVLD PISVGRLGDV KRDLGSLLLF LSTDMGSYIT GTSILVDGGT TSSVG
|
| |
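The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first 30 bases of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; the amino acid assignments are the
# same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGTTTCCTG TTCAGGGAAA AAGGGTACTT"
print(translate(prefix))  # MFPVQGKRVL
print(round(gc_content(prefix), 1))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.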