Gene Moth_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1203 
Symbol 
ID3832970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1240015 
End bp1242039 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content55% 
IMG OID637829136 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_430060 
Protein GI83590051 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGGT TCCGCGATCT CTCCCATAAT TGTAGGCCCT CAGAGGCACC ACGGGTCATG 
GAACCCAAAA ACAGGGACCG CACCGTAGAT CCGGCGGTCC TGGAAATGCT GGTTAAAAGT
AAGGATGACA AAGTCATCAC CGCTTTTGAC CGCTTCGTCG CCCAGCAACC CCAGTGTAAA
ATCGGGTATG AAGGTATTTG CTGCCGTTTC TGCATGGCCG GTCCCTGCCG TATCAAGGCA
ACCGATGGCC CTGGCAGCCG TGGTATTTGC GGCGCTTCTG CCTGGACCAT TGTCGCCCGT
AATGTAGGTT TAATGATCCT TACCGGTGCC GCCGCCCACT GCGAACACGG CAACCATATA
GCCCATGCCC TGGTAGAAAT GGCCGAAGGT AAAGCTCCTG ATTATAGCGT CAAGGACGAG
GCCAAGCTCA AAGAAGTCTG CCGACGGGTG GGTATTGAGG TAGAAGGCAA AAGCGTTTTG
GAACTGGCCC AGGAGGTAGG CGAGAAGGCC CTGGAAGACT TCCGCCGCTT GAAGGGTGAA
GGTGAAGCCA CCTGGCTGAT GACCACTATT AATGAGGGCC GGAAAGAAAA GTTCCGTACC
CACAATGTTG TTCCCTTTGG TATTCATGCC TCTATTTCCG AGCTGGTCAA TCAGGCCCAT
ATGGGTATGG ATAACGACCC TGTTAACCTG GTCTTCAGCG CCATCAGGGT AGCCCTGGCT
GACTATACGG GTGAACATAT AGCTACTGAT TTCTCCGACA TTCTCTTCGG TACTCCCCAA
CCGGTGGTCA GCGAAGCCAA CATGGGGGTC CTGGATCCGG ATCAAGTCAA CTTCGTCCTC
CATGGCCATA ATCCCTTGTT GAGTGAGATT ATTGTCCAGG CGGCGCGGGA GATGGAAGGA
GAGGCCAAGG CCGCCGGTGC CAAAGGCATC AACCTGGTGG GTATCTGCTG CACCGGTAAC
GAAGTCCTGA TGCGCCAGGG TATCCCCTTG GTTACTTCCT TCGCCTCCCA GGAACTGGCC
ATCTGCACCG GAGCTATTGA CGCCATGTGC GTCGACGTCC AGTGTATTAT GCCTTCCATC
AGCGCCGTAG CCGAGTGTTA TCATACCCGG ATCATCACTA CTGCCGATAA CGCCAAGATT
CCCGGTGCCT ACCATATCGA CTATCAAACG GCTACGGCTA TCGAAAGCGC GAAAACCGCC
ATCCGCATGG CCATCGAGGC ATTCAAGGAA AGAAAAGAAA GTAACCGTCC GGTTTACATC
CCCCAGATTA AGAACCGGGT AGTCGCCGGC TGGAGCCTTG AAGCCCTGAC CAAACTCCTG
GCTACCCAGA ATGCTCAAAA TCCCATCCGG GTACTCAACC AGGCCATCCT GGACGGTGAA
CTGGCTGGCG TAGCCTTAAT CTGCGGGTGT AACAACCTCA AAGGGTTCCA GGATAACTCC
CACCTGACGG TAATGAAAGA ACTGCTGAAA AATAATGTCT TTGTGGTGGC TACGGGTTGC
TCCGCCCAGG CCGCCGGAAA GCTTGGCCTC CTGGATCCGG CCAATGTGGA AACCTACTGC
GGCGATGGTC TCAAGGGCTT CCTGAAACGC CTGGGTGAAG GCGCCAACAT CGAAATCGGC
CTGCCGCCTG TGTTCCACAT GGGTTCCTGT GTGGATAACT CCCGGGCCGT CGACCTCTTG
ATGGCCATGG CCAACGATCT GGGCGTAGAT ACCCCGAAGG TGCCCTTCGT AGCCTCGGCC
CCGGAAGCCA TGAGCGGTAA GGCTGCCGCC ATCGGCACCT GGTGGGTATC CCTCGGCGTA
CCGACCCATG TCGGCACCAT GCCCCCGGTA GAAGGTAGCG ACCTCATTTA TAGTATTCTA
ACCCAGATAG CCAGCGACGT TTATGGTGGT TACTTCATCT TCGAAATGGA TCCCCAGGTA
GCTGCCCGGA AGATCCTTGA CGCCCTGGAA TACCGCACCT GGAAGCTGGG CGTACACAAA
GAGGTAGCTG AACGTTATGA AACCAAACTC TGCCAGGGTT ACTAG
 
Protein sequence
MPRFRDLSHN CRPSEAPRVM EPKNRDRTVD PAVLEMLVKS KDDKVITAFD RFVAQQPQCK 
IGYEGICCRF CMAGPCRIKA TDGPGSRGIC GASAWTIVAR NVGLMILTGA AAHCEHGNHI
AHALVEMAEG KAPDYSVKDE AKLKEVCRRV GIEVEGKSVL ELAQEVGEKA LEDFRRLKGE
GEATWLMTTI NEGRKEKFRT HNVVPFGIHA SISELVNQAH MGMDNDPVNL VFSAIRVALA
DYTGEHIATD FSDILFGTPQ PVVSEANMGV LDPDQVNFVL HGHNPLLSEI IVQAAREMEG
EAKAAGAKGI NLVGICCTGN EVLMRQGIPL VTSFASQELA ICTGAIDAMC VDVQCIMPSI
SAVAECYHTR IITTADNAKI PGAYHIDYQT ATAIESAKTA IRMAIEAFKE RKESNRPVYI
PQIKNRVVAG WSLEALTKLL ATQNAQNPIR VLNQAILDGE LAGVALICGC NNLKGFQDNS
HLTVMKELLK NNVFVVATGC SAQAAGKLGL LDPANVETYC GDGLKGFLKR LGEGANIEIG
LPPVFHMGSC VDNSRAVDLL MAMANDLGVD TPKVPFVASA PEAMSGKAAA IGTWWVSLGV
PTHVGTMPPV EGSDLIYSIL TQIASDVYGG YFIFEMDPQV AARKILDALE YRTWKLGVHK
EVAERYETKL CQGY