Gene Moth_1972 details

Gene Information

Locus tag: Moth_1972
Symbol:
ID: 3831154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2055244
End bp: 2057211
Gene Length: 1968 bp
Protein Length: 655 aa
Translation table: 11
GC content: 59%
IMG OID: 637829903
Product: carbon-monoxide dehydrogenase, catalytic subunit
Protein accession: YP_430813
Protein GI: 83590804
COG category: [C] Energy production and conversion
COG ID: [COG1151] 6Fe-6S prismane cluster-containing protein
TIGRFAM ID: [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit
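
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2055244
end_bp = 2057211
gene_length_bp = 1968
protein_length_aa = 655

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codon_count, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codon_count - 1

print(span_bp)                  # 1968
print(codon_count, remainder)   # 656 0
print(expected_protein_length)  # 655
```

Under these assumptions the computed span matches the listed gene length and the codon count minus one matches the listed protein length.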


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000154748
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATACT CCTATCCCAT ACTGAACGCC GAAATGCCTG GACGTGAAGA TGTGCTCCGA 
TTAACTCCCA ACCCGGCCAG TAAAGACCTC CTTGAGTACC TGCATCAAGA AGGCGTTGAA
ACCTGGTTGG ATAGGTACGA AGCCCAGCAA CCCATGTGTG GTTACGGCCT GCGGGGTCTC
TGCTGCCGCA TGTGCCAGTG GGGTCCCTGC CGCCTGGATA ATAAGCGGCA GAGGGGAATT
TGCGGCCGGG ATCTGAGCAC CGTCATCATG GCCAACCTGG TACGCTCCCT TGTGGCCGGC
CTGGCCGCCC ATGGCCGCCA CGCCCATGAA GTAATCCTTA CGATAATGGC GGCCGCCGAA
GGGAAGGCTA ACTTGCCTCT GAAGGGAGAG GAACGGGTGC TGGATGTGGC CAACCGCCTC
GGGCTTACAA CCGACGGGCG AACGATAAAG GAAATAGCCC GGGAAGTGGC GGAGGTACTC
CTGGAGGACC TGGGCCGTCT GACCATGACC CCCATCCGCA TCTTGACGGC CTACGCGCCC
CGGGAAAGGA TAGAAACCTG GCAAACCCTC GGGGTGCTGC CCCGCTCCGG AGCCTATGAA
ATTATGGAAA CCTTGCATAT GACAACCCTG GGCGGAACCA GTGACTGGAC TTCCCTGACA
GAGCAAGAGC TCCGGGCGGC CCTGGCCTAT TGCTACAGTA CTCTGTTCGG TAGTTCCCTG
GCTACAGAGA TGCTTTTCGG CATACCCCGG CCGAAGGTAG CCACGGTAAA TTACGGCATC
CTCAAGGAAG ATCACGTTAA CATCCTTATC CACGGTCATT CCCCGGTCAT GGTGGAAAAA
ATCCTGGAGA AAATCCGCAC CCCCGAAATC CAGGAGTTGG CCAGAAAGGC AGGGGCCAAG
GGTATAGTCG TCGGTGGCAT GTGCTGTACC GGTGAAGAAT TACTGGCCCG TTACGGTGTT
CCTACGGTGA CCAACATCAT GGGCCAGGAG CTAGCCCTGG GAACCGGGGC CGTTGACACT
GTAGTAGTAG ATATGCAGTG CGTCATTCCC GGTATGAAGA TAGTTGCCGA TTGCTTTGGC
ACCCAGGTAA TCACCACCTG CAACTCCAAC CGTATCCCCG GGGCCATCCA CATTCCCTTC
GACCCGGAAA ATCCCGAAGG GCTGGACGAA GACGCCCTCA AGGTGGCGCG CCTGGCGGTA
GAAGCCTTCG CCCACCGTGA CCGCAGTAAA ATGCATATTC CCCGGGAAAC CACAGAAGCC
ATGGTCGGCT GGAGTTACGA GGCGATAGTT GACACCTTCG GGGGCCTTAA AGGCCTGCTA
GAGCTTTTAC GGGAAGGAAA AATTAAAGGC ATCGCTACGG TGGTGGGTTG CAATACGCCG
AAGGTACCGT ATGAATTCAA CCATGTTACC ATCGTTCGGA GGCTTATCGA AAGCGACATC
CTGGTTACCA CTACCGGCTG CTGCTCCCAC GCCCTCCTGA ACGCCGGCCT GTGCTCACCC
GCAGCAGCCA GCCAGGCCGG TCCCGGCCTC CAGGAGGTTT GCCGGTCCAG GGGCATCCCG
CCGGTCCTGG CGGTGGGGGG CTGTGTGGAC AATACCAGAA CCCTCCGGCT TTTTATCGAC
CTGGCCGAGG AGGCTGGCGT AGCCATGCCA AAAATGCCCT TTGTCTTTGT CGGGCCGGAA
CCCGGCAATG AAAAGACGGT GGGACAAGGG GTAACCTTTC TGGCCCACGG TATATCCAAT
GTCATCGGTT TCCCCGGTCC CATCCCCGTT CCCCTGCCCC GGCCCGTGGC AGGGGCTGCC
CCCGACGAAT ACGACCGGGG TAGCAACCCG GTAGCTGACT TTTTTGCCGG CGACGGCTTA
TATGAAAAAG TAGGAGCACG GATCTATACC GAACCCTATC CCAAGCTGGC GGCCCAGACC
ATCCGCATGC TAATCAGGCG CCAGAGGCTG GCGCTGGGGT GGAAGTAG
 
Protein sequence
MKYSYPILNA EMPGREDVLR LTPNPASKDL LEYLHQEGVE TWLDRYEAQQ PMCGYGLRGL 
CCRMCQWGPC RLDNKRQRGI CGRDLSTVIM ANLVRSLVAG LAAHGRHAHE VILTIMAAAE
GKANLPLKGE ERVLDVANRL GLTTDGRTIK EIAREVAEVL LEDLGRLTMT PIRILTAYAP
RERIETWQTL GVLPRSGAYE IMETLHMTTL GGTSDWTSLT EQELRAALAY CYSTLFGSSL
ATEMLFGIPR PKVATVNYGI LKEDHVNILI HGHSPVMVEK ILEKIRTPEI QELARKAGAK
GIVVGGMCCT GEELLARYGV PTVTNIMGQE LALGTGAVDT VVVDMQCVIP GMKIVADCFG
TQVITTCNSN RIPGAIHIPF DPENPEGLDE DALKVARLAV EAFAHRDRSK MHIPRETTEA
MVGWSYEAIV DTFGGLKGLL ELLREGKIKG IATVVGCNTP KVPYEFNHVT IVRRLIESDI
LVTTTGCCSH ALLNAGLCSP AAASQAGPGL QEVCRSRGIP PVLAVGGCVD NTRTLRLFID
LAEEAGVAMP KMPFVFVGPE PGNEKTVGQG VTFLAHGISN VIGFPGPIPV PLPRPVAGAA
PDEYDRGSNP VADFFAGDGL YEKVGARIYT EPYPKLAAQT IRMLIRRQRL ALGWK
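
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as starts), that the input contains only A, C, G, and T, and that translation stops at the first stop codon. The function names `clean`, `translate`, and `gc_content` are hypothetical helpers written for this example, not part of any database tooling.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAATACT CCTATCCCAT ACTGAACGCC"))  # MKYSYPILNA

# Last 18 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GCGCTGGGGT GGAAGTAG"))  # ALGWK
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` applied to the same input gives a figure that can be compared with the GC content field.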