Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0601 |
Symbol | |
ID | 5105573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 554262 |
End bp | 555236 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506505 |
Product | alcohol dehydrogenase |
Protein accession | YP_001190700 |
Protein GI | 146303384 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.474293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCTG CAGTTCTTAA AGAGTTCGGT AGACCCCTAG TCCTGGTGGA CGTTCCTCCT CCAAGCGTTT CCATGAGGGT GGGGGCTACA GGTCTATGTC ACGGGGATTT ACACGTCCTC ACAGGTCAAT GGAGTTGGGA TGTTCAAACT AGGTTACCCC TAGTCCTTGG GCACGAGATA TCAGTGGTGG ATGAGACTGG AAATACATTT CTCGTTTACA ACGCCAAGGG TTGCGGTACC TGCAGGGAAT GCAGATCCGG TTTTCCACAG TTCTGTGAGA GGGTTGAGGT TTTAGGCATT CAAAGAAATG GCGGATTTGC GGAGAGAGTT GACGTCACTG GTTTTCCCCT TGTCCCTGTT AGGGGTTCAC CCCTTGAGGT TGCCCCCCTG GCTGACGCCG GTGTGACGGC GATGAGCTCT GTGGAGGGAA TAACTGAGGG AAGCAGGGTC GCGGTAATTG GTACGGGAGC AGTGGCTTTA CTTTCAATTC AACTTCTCAA GAACCTCAAT TCCGAGGTGT GGGTTGTGGG TAGGAACCCA CTGAAGCTGA AGAAGGCTAG GGAATTGGGG GCCGACGAGA TAGTTTTCAC TAAGGGGGAA TATTCAACGG ATCTCTCCGG TTCGGTGGGG TTGAGGAAGT TCGACTTTAT CCTGGACTAC GTGGGAAGCG ACTTCACGTT GAGGGATTTG CCTTGGCTGT TGAGGAGGAT GGGAGAGCTG AGGGTAGTTG GCGAGTTTGG CGGGGAACTA TCTATCCCAG ATCAGCTCCT AGTTCTCAGG GGACTCAGGG TGAGGGGAAT ACTTTACGGT ACTATGAAGA ATCTAGTGGA TGTGGTAAAG TTGTTTGAGG ATGGTAAGCT GAAGACCTTA ACGGTACCTT ACCCTCTAGA CGAGGTAAAC CAGGCCATTA TGGATTTAAT GGAGGGGAGA ATCGTGGGAA GGGCCGTGAT TTATCCTACT TCCTCTTCAA CCTAA
|
Protein sequence | MRAAVLKEFG RPLVLVDVPP PSVSMRVGAT GLCHGDLHVL TGQWSWDVQT RLPLVLGHEI SVVDETGNTF LVYNAKGCGT CRECRSGFPQ FCERVEVLGI QRNGGFAERV DVTGFPLVPV RGSPLEVAPL ADAGVTAMSS VEGITEGSRV AVIGTGAVAL LSIQLLKNLN SEVWVVGRNP LKLKKARELG ADEIVFTKGE YSTDLSGSVG LRKFDFILDY VGSDFTLRDL PWLLRRMGEL RVVGEFGGEL SIPDQLLVLR GLRVRGILYG TMKNLVDVVK LFEDGKLKTL TVPYPLDEVN QAIMDLMEGR IVGRAVIYPT SSST
|
| |