Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1641 |
Symbol | |
ID | 3831270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1676296 |
End bp | 1677243 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829566 |
Product | aldo/keto reductase |
Protein accession | YP_430486 |
Protein GI | 83590477 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.316031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.116242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGTATA CTACCCTGGG AACCAGCGGG ATTCGGGTGT CCCGCCTGTG TTTCGGTTCC CTGACCATGG GCCCGCTCCA GGCGGGCCTT ACTATCAAGG AAGGAGCCGG GCTTATTCGT CGTGCCCTGG AACTGGGAGT AAACTTTATC GACACGGCCC AGTGTTATAA CAACTACCCC TACATCCGGG CCGCCCTAAA AGGTTGGTCC GGGGAGGTGG TAGTAGCGAC CAAATCCTAC GACTACACGG CGGAAGGCAT GGCCTTAAGC CTGGAAGAGG CCCGGCGGGA AATGGACCGG GAGGTTATCG ACATCTTTCT CCTCCACGAG CAGGAAACGG CCCTTACCCT GGCCGGGCAC CGGCCGGCCC TGGATTATCT TCTGGAGGCC AGGGAGCGGG GGCTGGTCCG GGCGGTGGGG ATTTCCTCCC ACGCCGTAGA AGCTGTGCGG GTAGCCGCGA CTATGCCGGA AATTGATGTC ATTCATCCCC TCTACAACCG TGCCGGGTTA GGGATCCTCG ACGGGACCAG GGAAGAGATG CTGGCCGCCA TTACCCTGGC CCACACCAGG GGTAAGGGAA TCTATGCCAT GAAGGCCCTG GGGGGCGGCC ACCTGGCCGG AGAGGCCCGG GCGGCCCTGG AGTTCGTCCT TAAGACGCCC GGGATTGACG CCGTGGCCGT AGGCATGCAG TCTCCAGCCG AGGTGGAGGC CAATGTAGCC TGGTGCAGCG GGCGGGAACC CGGAGAGGAC GTTCTCAGCC GGCTCCAGGC CCGGAAACGA TGCCTGCTAA TAGAAGAATG GTGCCAGGGG TGCGGCAACT GCGTGCGGCG TTGCCCCCAG GGGGCCCTGG AGGTTATCGA GGGCCGCGCT GTCGTCGACC CGGAGCGCTG CATCCTCTGC GGCTACTGCG CCGGGGCCTG CCGTGATTTC TGTATTAAGG TGATCTAA
|
Protein sequence | MQYTTLGTSG IRVSRLCFGS LTMGPLQAGL TIKEGAGLIR RALELGVNFI DTAQCYNNYP YIRAALKGWS GEVVVATKSY DYTAEGMALS LEEARREMDR EVIDIFLLHE QETALTLAGH RPALDYLLEA RERGLVRAVG ISSHAVEAVR VAATMPEIDV IHPLYNRAGL GILDGTREEM LAAITLAHTR GKGIYAMKAL GGGHLAGEAR AALEFVLKTP GIDAVAVGMQ SPAEVEANVA WCSGREPGED VLSRLQARKR CLLIEEWCQG CGNCVRRCPQ GALEVIEGRA VVDPERCILC GYCAGACRDF CIKVI
|
| |