Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0196 |
Symbol | |
ID | 5103940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 159566 |
End bp | 160582 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506101 |
Product | alcohol dehydrogenase |
Protein accession | YP_001190297 |
Protein GI | 146302981 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.583804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0665338 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCA TTATCATGGA GGGAGGAAAG GCAGTACTCA AGGAAGTCCC AATTCCCAAG CTGGGTCAGG GCGACGTCCT TGTGGAGATG AAGGCATGTG GACTTTGTGG AACAGATATA GAGAAAATGT GCGGACAATA CACGGCCTCT CAACCCATAC TGGGACACGA GCCCGCTGGG GTAGTAGCCG AATCCATGTC AGACGCAGTT AAGCCTGGGG ATAGGGTATT CGCACATCAT CACGTCCCAT GTTACGAATG TTATTACTGT AAGAAGGGAA GCCCTACCAT GTGCCCGTAT TATAGGAAGA CGAACTTAGA TCCTGGGGGC TTCGCCGAGT TCTTCAGGGT GCCAGCGTGG AACGTGGAGA AAGGAGGAAT ACTCGTATTG CCCAGTAACG TTTCATTTGA GGAGGGTTCC TTCGTGGAGC CCCTAGCTAC AGTGGTTAGG GCCCAGAGGC GCGTAGGAAT ATCGAGGGAC GACTCGGTTC TCATAGTCGG TGCAGGTCCC ATGGGGCTAT TACATCTTCT AAAGATTAAG GACATGGGAG TGTCAAACGT AATCATTTCT GACGTATCGG AGTACAGGTT GAACTTCGCT GAATCTCTAG GTGCTTCCCT CTTGCTCAAT CCCACCAAAC ACCAAGTGGA ACAGGAAGCG AAGAAGGCCA CGGATGGTCG TGGCGTAGAT GTGGCGATAA TAGCCTCAGG GGCTCCTCAA GCTATCCTTT CAGGTCTGAA CTCTGTGAGG AAGGGAGGTA GGGTGCTACT GTTTGGGGTT CCCTACAAAG GAACGATACT CAACTACGAT ATTAGCGAGC TCCTTAACAA CGAAATCTCA GTAATTCCCA GTAATGCAGC TGTAGAGGAG GACACAAGGG AGGCCCTGAA ACTCATCTCT GAGAGAAGGG TTGATGTAAG GAAGCTTGTG ACACATAGAT ATGACCTAGA GCAGTTCCAC GAGGCCGTAA GGGTAGCCAA ACAAGGAAAC GCCATAAAGG TAGTAATAAC TAGTTAA
|
Protein sequence | MKAIIMEGGK AVLKEVPIPK LGQGDVLVEM KACGLCGTDI EKMCGQYTAS QPILGHEPAG VVAESMSDAV KPGDRVFAHH HVPCYECYYC KKGSPTMCPY YRKTNLDPGG FAEFFRVPAW NVEKGGILVL PSNVSFEEGS FVEPLATVVR AQRRVGISRD DSVLIVGAGP MGLLHLLKIK DMGVSNVIIS DVSEYRLNFA ESLGASLLLN PTKHQVEQEA KKATDGRGVD VAIIASGAPQ AILSGLNSVR KGGRVLLFGV PYKGTILNYD ISELLNNEIS VIPSNAAVEE DTREALKLIS ERRVDVRKLV THRYDLEQFH EAVRVAKQGN AIKVVITS
|
| |