Gene Moth_1226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1226 
Symbol 
ID3832861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1263698 
End bp1264978 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID637829161 
Productaldehyde oxidase and xanthine dehydrogenase, a/b hammerhead 
Protein accessionYP_430083 
Protein GI83590074 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.847916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGTGA TCGGTGCTTC CCCCGCCAGA GGTGATGCCC GAGCCAAGGT CACCGGTGAG 
GCCATCTACC CGGCTGATAT CGTCTTCCCG GGCATGATTT ACGGCCAGGC TATTCGCAGC
CCCCACCCCC ACGCCAGGAT TGTCAATATC GACACCGCCG CAGCCCTGAA GGTACCCGGG
GTCCTCTGCG TGCTTACCGC CCGGGATATT CCCGGACACA ACGGCCAGGG TGTTCTTTTC
CAGGATATGC CCGTCCTCGC CGGAAACGAG GTGCGCTCGG TTAACGACGT CGTAGCCCTG
GTGGGGGCTA CCACCCCGGC GGCGGCCCGG GAAGGGGCCG CTATGGTAAA GGTGGACTAT
GAGGAACTAC CGGCCCTCCT GGACCCGGTG GCCGCGATGC AACCGGGCGC GCCCCGGGTC
CATCCCGACC GGGAGAATAT TATTTACCAC CTGCCCATCA GGAGGGGCGA CGTGGCGGCC
GGTTTCGCCG CTGCCGACGT GGTTGTGGAA AACACCTACC GTACCCAGCT CCTGGACCAC
GCCTTCCTCC AACCGGAAGC CGCAGTGGCC CGGCTGGACG AGCGCGGCCA CCTTATAATC
TATGTGGCCA CCCAGTATGT CCACTGGGAT CGGACGGAAG TAGCACGGGT GCTGGGCTGG
AACCAGGATC GCGTCCGCAT TGTGGCTCCG GCGGTGGGGG GTGCCTTCGG CGGCCGGGAA
GATATGACCC TGCAGACCCT GGTGGCTTTG CTGGCCGTCC ATACCCGCCG GCCGGCCAAA
ATGGTTCTCA GCAGGGAAGA ATCCTTTTTC GCCCACAGCA AACGGCATCC CATGATTATG
CGCTATAAGA CCGGGGCTAC ACGCGAGGGG AAATTAACGG CCCTGGAAGC CGAAATTATC
GGCGACAGCG GCGCCTATTG TTCCTGGGCC CCCAATGTAC TGCGTAAGGC GGCCATCCAT
GCCACCGGGC CTTATGTCAT CCCCAACGTC AAGATCGATG CCTATGCCGT CTATACCAAC
AACCCCTTTA CGGGGGCTAT GCGCGGCTTT GGCGCCACCC AGCCGCCCCT GGCTTATGAA
AGCCAGATGG ACGAACTGGC TGCGCAGCTG GGCATTCACC CCTTTACCAT CCGCTGGCTC
AACGCTTTCC GCCAGGGGGA TGTAACCGCT ACCGGCCAGG TCCTGGAAAG TAGCGTCGGT
CTTACGGAAA CTATGCTCCA GGCAGCCCGG GCTGCCGGCT GGTCCCCTGA CAATTTGCTA
CCGGGAGGGA AGCTAGGATG A
 
Protein sequence
MGVIGASPAR GDARAKVTGE AIYPADIVFP GMIYGQAIRS PHPHARIVNI DTAAALKVPG 
VLCVLTARDI PGHNGQGVLF QDMPVLAGNE VRSVNDVVAL VGATTPAAAR EGAAMVKVDY
EELPALLDPV AAMQPGAPRV HPDRENIIYH LPIRRGDVAA GFAAADVVVE NTYRTQLLDH
AFLQPEAAVA RLDERGHLII YVATQYVHWD RTEVARVLGW NQDRVRIVAP AVGGAFGGRE
DMTLQTLVAL LAVHTRRPAK MVLSREESFF AHSKRHPMIM RYKTGATREG KLTALEAEII
GDSGAYCSWA PNVLRKAAIH ATGPYVIPNV KIDAYAVYTN NPFTGAMRGF GATQPPLAYE
SQMDELAAQL GIHPFTIRWL NAFRQGDVTA TGQVLESSVG LTETMLQAAR AAGWSPDNLL
PGGKLG