Gene Information |
Locus tag | Moth_1956 |
Symbol | |
ID | 3832307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2034459 |
End bp | 2035346 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637829887 |
Product | molybdopterin dehydrogenase, FAD-binding |
Protein accession | YP_430797 |
Protein GI | 83590788 |
COG category | [C] Energy production and conversion |
COG ID | [COG1319] Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs |
TIGRFAM ID | |
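The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is a minimal Python example. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, but both are consistent with the values listed. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(2034459, 2035346)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (888 bp) and Protein Length (295 aa) rows.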
| |
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0844554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGA AGTATCTCTT CGCCACTTCC CCGGCCGACT GCCTGGCCCT GCTGGCCGCG AACCCTGGGG CGCGCCTCAT TGCCGGCGGT ACTGACCTGG TCATCGACCT CAAGGAAAAA AGGCGGCAGG TTACCACCCT GGTGGATATC ACCAGGATAC CCGAATTAAA AATAATTACC GAAACAAATG GTAAGATTAT CCTGGGCGGG GCGGCCACCC ACACCCGCGT GGCCACCTCA TCCCTTATCC GGCAGAAGCT CCCGGCCCTG GCTGCCGCCG CTGCCGCGGT CGGATCCCCC CAGGTGCGCA ATGTCGGCAC CCTGGCCGGC AATGTGGTCA ATGCCCAGCC GGCGGCGGAT ACGGCTGTGG CCCTGGTAGC CCTGGGAGCC GTGGCCACCA TCCTGGGAGC GGAGGGGGAA CGCCAGGTGC CGGTGGCCGA CCTCTACGCC GGTGTGGGCC GCTCCCTGGT AGATGCCGGC CGCGAGATTA TAACCAGATT TACCGTGGAC CTGTGGGGGG AGGGCGAATC CTCGGCTTTT GTCCGCCTGT CGCCCCGGCG CGCCCTCTCC CTGCCCATGC TAAACGTGGC CGTCCGGGTG CAGGTCAGGG AAGGGATATG CACCAGGGCG CGTATCTCAA TCGCCCCGGT GGCGCCCCGG CCCTTCCTGT GCGAAGAAGC GGCCGCCAGC CTCGTGGGCC GGGAACCCAC GGCGAAAGCA ATTGCCAGGG CGGCTTCCGT AGCTAAGGAG GCAGCCCGGC CCAGGGACAG CCTCCTCCGG GGTTCCGGCG CCTACCGGAA GGACATGACG GCTGTGCTGG TAGCCAGGGC TTTAAGCGAA GCTTTTTCCC GCGCAACCAG CAGGAAAATA GAAAACGATG TCGAATAG
|
Protein sequence | MPEKYLFATS PADCLALLAA NPGARLIAGG TDLVIDLKEK RRQVTTLVDI TRIPELKIIT ETNGKIILGG AATHTRVATS SLIRQKLPAL AAAAAAVGSP QVRNVGTLAG NVVNAQPAAD TAVALVALGA VATILGAEGE RQVPVADLYA GVGRSLVDAG REIITRFTVD LWGEGESSAF VRLSPRRALS LPMLNVAVRV QVREGICTRA RISIAPVAPR PFLCEEAAAS LVGREPTAKA IARAASVAKE AARPRDSLLR GSGAYRKDMT AVLVARALSE AFSRATSRKI ENDVE
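The gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the 10-character block spacing, compute a GC fraction, and translate the result. It is a minimal Python example that hard-codes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names are hypothetical, and the code does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGCCCGAGA AGTATCTCTT CGCCACTTCC"))
```

Applied to the first three blocks of the gene sequence, `translate` returns `MPEKYLFATS`, which matches the first block of the listed protein sequence.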
|
| |