Gene Information |
Locus tag | Moth_1226 |
Symbol | |
ID | 3832861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1263698 |
End bp | 1264978 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829161 |
Product | aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead |
Protein accession | YP_430083 |
Protein GI | 83590074 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
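The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1263698
end_bp = 1264978

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per three bases. Assumption: the last codon is the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1281 0 426
```

Under those assumptions the result agrees with the listed gene length of 1281 bp and protein length of 426 aa.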
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.847916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGAGTGA TCGGTGCTTC CCCCGCCAGA GGTGATGCCC GAGCCAAGGT CACCGGTGAG GCCATCTACC CGGCTGATAT CGTCTTCCCG GGCATGATTT ACGGCCAGGC TATTCGCAGC CCCCACCCCC ACGCCAGGAT TGTCAATATC GACACCGCCG CAGCCCTGAA GGTACCCGGG GTCCTCTGCG TGCTTACCGC CCGGGATATT CCCGGACACA ACGGCCAGGG TGTTCTTTTC CAGGATATGC CCGTCCTCGC CGGAAACGAG GTGCGCTCGG TTAACGACGT CGTAGCCCTG GTGGGGGCTA CCACCCCGGC GGCGGCCCGG GAAGGGGCCG CTATGGTAAA GGTGGACTAT GAGGAACTAC CGGCCCTCCT GGACCCGGTG GCCGCGATGC AACCGGGCGC GCCCCGGGTC CATCCCGACC GGGAGAATAT TATTTACCAC CTGCCCATCA GGAGGGGCGA CGTGGCGGCC GGTTTCGCCG CTGCCGACGT GGTTGTGGAA AACACCTACC GTACCCAGCT CCTGGACCAC GCCTTCCTCC AACCGGAAGC CGCAGTGGCC CGGCTGGACG AGCGCGGCCA CCTTATAATC TATGTGGCCA CCCAGTATGT CCACTGGGAT CGGACGGAAG TAGCACGGGT GCTGGGCTGG AACCAGGATC GCGTCCGCAT TGTGGCTCCG GCGGTGGGGG GTGCCTTCGG CGGCCGGGAA GATATGACCC TGCAGACCCT GGTGGCTTTG CTGGCCGTCC ATACCCGCCG GCCGGCCAAA ATGGTTCTCA GCAGGGAAGA ATCCTTTTTC GCCCACAGCA AACGGCATCC CATGATTATG CGCTATAAGA CCGGGGCTAC ACGCGAGGGG AAATTAACGG CCCTGGAAGC CGAAATTATC GGCGACAGCG GCGCCTATTG TTCCTGGGCC CCCAATGTAC TGCGTAAGGC GGCCATCCAT GCCACCGGGC CTTATGTCAT CCCCAACGTC AAGATCGATG CCTATGCCGT CTATACCAAC AACCCCTTTA CGGGGGCTAT GCGCGGCTTT GGCGCCACCC AGCCGCCCCT GGCTTATGAA AGCCAGATGG ACGAACTGGC TGCGCAGCTG GGCATTCACC CCTTTACCAT CCGCTGGCTC AACGCTTTCC GCCAGGGGGA TGTAACCGCT ACCGGCCAGG TCCTGGAAAG TAGCGTCGGT CTTACGGAAA CTATGCTCCA GGCAGCCCGG GCTGCCGGCT GGTCCCCTGA CAATTTGCTA CCGGGAGGGA AGCTAGGATG A
Protein sequence | MGVIGASPAR GDARAKVTGE AIYPADIVFP GMIYGQAIRS PHPHARIVNI DTAAALKVPG VLCVLTARDI PGHNGQGVLF QDMPVLAGNE VRSVNDVVAL VGATTPAAAR EGAAMVKVDY EELPALLDPV AAMQPGAPRV HPDRENIIYH LPIRRGDVAA GFAAADVVVE NTYRTQLLDH AFLQPEAAVA RLDERGHLII YVATQYVHWD RTEVARVLGW NQDRVRIVAP AVGGAFGGRE DMTLQTLVAL LAVHTRRPAK MVLSREESFF AHSKRHPMIM RYKTGATREG KLTALEAEII GDSGAYCSWA PNVLRKAAIH ATGPYVIPNV KIDAYAVYTN NPFTGAMRGF GATQPPLAYE SQMDELAAQL GIHPFTIRWL NAFRQGDVTA TGQVLESSVG LTETMLQAAR AAGWSPDNLL PGGKLG
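The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ only in which codons may act as starts). The sketch does not handle alternative start codons, which is harmless here because the listed gene sequence begins with ATG. Although the gene lies on the minus strand, the sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation and is not reverse-complemented here.

```python
# Standard codon table, built in TCAG order (amino-acid assignments are the
# same for NCBI translation tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGAGTGA TCGGTGCTTC CCCCGCCAGA"
print(translate(first_blocks))  # prints: MGVIGASPAR
print(f"{gc_fraction(first_blocks):.1%}")
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` should allow a direct string comparison between the two, and `gc_fraction` on the full gene sequence can be compared with the GC content field.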