Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2268 |
Symbol | |
ID | 3831379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2375283 |
End bp | 2376359 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830188 |
Product | alcohol dehydrogenase |
Protein accession | YP_431098 |
Protein GI | 83591089 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.396502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0088894 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTATA TGATCCCTGA AAAAATGAAA GCCCTGGTTC TCTTCGGCCC TAACGACGTG CGCCTGGTAG AAAAGCCGGT GCCCAAACCC GGCCCCGGCG AGGTGCTGGT CAGGGTGGCG GCCTGCGGCA TCTGCGGCAC CGATGTAAAG ATTATCACCA AGGGCATGCC AAAGATGCCT CCCTACGGTG AATTTACCTT CGGCCATGAA TGGGCCGGGA CCATTGTCGC CCTGGGAGAA ACAGTGGACG AATTCCAGGT CGGCGACCGG GTAGCCATCG AGGCTCACAA GGGTTGCGGC CGCTGTGAAA ACTGCATCGA CGGCAAGTAC ACTGCCTGCC TAAACTACGG CCGCCTGGAC AAGGGCCACC GGGCCGCGGG CATGACGGTA GACGGTGGCT TTGCCGAGTA TGCCGTCCAG CATGTAAATT CAGTCTACAA GATTCCCGAC AATATTACTT TCAACGAAGC CACCTATGTG ACTACGGCCG GCTGTGCCCT CTACGCCATC GACAAGAGCG GCGGTTATAT TGCCGGGGAT ACGGTCCTGG TCATCGGCCC CGGCCCTATT GGTCTCTCTG TGGTCCAGGG AGCCCGGTCC CTGGGGGCCG AAAAGATCAT CCTCATGGGA ACCCGAGAAG ACCGCCTGGT CAAGGGTCGT GAGCTCGGCG CCACCCATAC CATTAATATT CGTGAGGTGG CCGATCCCGT TGCCGAAGTA ATGGCCATCA CCGGTGGTAA GGGCTGCGAG CGGGTTTTTG AGTGCGCCGG CAACTCTCAG TCCTTCGAGT ACGGCATCAA AGCGGCTAAA AAGGGCGGTG TCATGGTCCT GGTTTCCTTC TATAAGGAAC CGGTAATGGC TAACCTGGAT TATGTCGTCT TAAACCAGAT CAGCCTGCTC ACCGTACGCG GTGAGGGCAA CCAGAACTGT AAGCGCGCCC TGTCACTGAT GGCCCAGGGT AAGATTGACG CCAAACCAAT TATGACCCAC GCCTTCCCGT TAGAAGAGTT CCAGAAGGGC CTGGATTACT TCGTCAACCG CAAAGACGGG GCCATGAAGG TGGTTATCAA CCCCTAA
|
Protein sequence | MTYMIPEKMK ALVLFGPNDV RLVEKPVPKP GPGEVLVRVA ACGICGTDVK IITKGMPKMP PYGEFTFGHE WAGTIVALGE TVDEFQVGDR VAIEAHKGCG RCENCIDGKY TACLNYGRLD KGHRAAGMTV DGGFAEYAVQ HVNSVYKIPD NITFNEATYV TTAGCALYAI DKSGGYIAGD TVLVIGPGPI GLSVVQGARS LGAEKIILMG TREDRLVKGR ELGATHTINI REVADPVAEV MAITGGKGCE RVFECAGNSQ SFEYGIKAAK KGGVMVLVSF YKEPVMANLD YVVLNQISLL TVRGEGNQNC KRALSLMAQG KIDAKPIMTH AFPLEEFQKG LDYFVNRKDG AMKVVINP
|
| |