Gene Moth_1202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1202 
Symbol 
ID3832969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1237805 
End bp1239994 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content54% 
IMG OID637829135 
Productbifunctional acetyl-CoA decarbonylase/synthase complex subunit alpha/beta 
Protein accessionYP_430059 
Protein GI83590050 
COG category[C] Energy production and conversion 
COG ID[COG1614] CO dehydrogenase/acetyl-CoA synthase beta subunit 
TIGRFAM ID[TIGR00316] CO dehydrogenase/CO-methylating acetyl-CoA synthase complex, beta subunit 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATT TTGATAAAAT CTTCGAGGGT GCTATTCCAG AAGGTAAAGA GCCGGTAGCC 
CTGTTCCGGG AGGTTTACCA CGGCGCCATT ACAGCTACCA GTTACGCGGA AATCCTTTTA
AACCAGGCCA TCCGGACCTA TGGTCCCGAC CATCCCGTCG GTTATCCTGA TACAGCCTAT
TACCTGCCGG TTATTCGCTG TTTCAGCGGG GAAGAGGTCA AAAAACTGGG GGATTTACCA
CCTATTTTAA ACCGCAAGCG AGCGCAGGTA AGCCCTGTCC TGAATTTCGA GAATGCCCGC
CTGGCCGGGG AAGCCACCTG GTATGCGGCC GAGATCATTG AAGCCCTGCG TTACCTTAAA
TATAAGCCTG ATGAACCCCT CCTGCCCCCA CCCTGGACGG GTTTCATCGG CGACCCGGTT
GTCCGCCGTT TCGGTATCAA GATGGTCGAC TGGACCATTC CGGGTGAAGC TATTATCCTG
GGTCGAGCCA AAGACTCGAA GGCCCTGGCC AAAATCGTCA AGGAACTCAT GGGTATGGGC
TTTATGCTCT TCATCTGTGA TGAAGCGGTA GAACAGCTGC TGGAAGAAAA CGTCAAACTG
GGGATTGACT ATATCGCCTA TCCCCTGGGG AACTTCACCC AGATTGTTCA TGCCGCCAAC
TATGCCCTGC GGGCTGGTAT GATGTTCGGT GGCGTTACCC CGGGTGCCCG TGAAGAACAG
CGCGATTACC AGCGCCGCCG TATCCGCGCC TTTGTCCTTT ATCTCGGCGA GCATGACATG
GTCAAGACGG CTGCCGCCTT CGGGGCCATC TTTACCGGCT TTCCGGTAAT CACCGACCAG
CCCCTACCGG AGGACAAACA GATCCCGGAT TGGTTCTTCA GCGTTGAGGA CTATGATAAA
ATAGTCCAGA TAGCCATGGA GACCCGTGGG ATCAAGCTCA CCAAGATCAA GTTGGATCTG
CCCATTAACT TTGGCCCTGC CTTTGAGGGC GAGAGTATCC GTAAGGGCGA TATGTACGTA
GAAATGGGCG GCAACCGGAC GCCGGCCTTT GAGCTGGTAC GCACCGTTTC GGAATCCGAG
ATCACTGATG GTAAGATTGA AGTCATAGGT CCTGATATTG ACCAGATACC GGAAGGGAGC
AAACTGCCCC TGGGCATTCT GGTGGACATC TATGGCCGTA AAATGCAGGC CGATTTTGAA
GGAGTCCTCG AACGGCGCAT CCACGACTTC ATCAACTACG GTGAAGGTCT CTGGCACACC
GGCCAGCGTA ACATCAACTG GTTGCGGGTC AGCAAAGATG CCGTAGCCAA GGGTTTCCGT
TTCAAGAACT ACGGTGAAAT CCTGGTAGCC AAAATGAAAG AAGAATTCCC CGCCATTGTG
GACCGGGTCC AGGTAACCAT TTTTACCGAT GAAGCCAAGG TCAAAGAATA TATGGAGGTC
GCCCGGGAGA AATACAAGGA ACGTGACGAC CGCATGCGCG GCCTTACCGA TGAAACAGTG
GATACCTTTT ACTCCTGCGT CCTCTGCCAG TCCTTTGCCC CCAACCATGT GTGTATTGTC
ACCCCGGAAC GGGTGGGCCT GTGTGGAGCC GTAAGCTGGC TGGACGCCAA GGCGTCCTAT
GAAATCAACC ATGCCGGTCC TAACCAGCCC ATCCCTAAAG AAGGGGAAAT TGATCCCATT
AAGGGTATCT GGAAGAGTGT AAATGACTAT CTCTATACAG CTTCCAACCG TAACCTGGAA
CAGGTCTGCC TGTACACCCT TATGGAGAAT CCCATGACCT CCTGCGGTTG CTTTGAGGCC
ATTATGGCCA TCCTGCCGGA GTGCAACGGC ATCATGATTA CCACCAGGGA TCACGCCGGC
ATGACTCCTT CGGGGATGAC CTTCTCTACC CTGGCCGGGA TGATCGGCGG TGGCACCCAG
ACCCCGGGCT TTATGGGCAT CGGCCGCACC TATATCGTCA GCAAAAAGTT TATTTCCGCC
GATGGTGGTA TCGCCCGGAT CGTCTGGATG CCCAAATCTC TGAAGGATTT CCTCCACGAC
GACTTTGTAC GTCGTAGTGT TGAGGAGGGC CTGGGAGAGG ACTTTATCGA TAAAATAGCT
GATGAGACCA TCGGTACCAC CGTGGATGAA ATCTTGCCCT ACTTGGAGGA AAAGGGACAC
CCGGCCTTGA CCATGGATCC CATTATGTGA
 
Protein sequence
MTDFDKIFEG AIPEGKEPVA LFREVYHGAI TATSYAEILL NQAIRTYGPD HPVGYPDTAY 
YLPVIRCFSG EEVKKLGDLP PILNRKRAQV SPVLNFENAR LAGEATWYAA EIIEALRYLK
YKPDEPLLPP PWTGFIGDPV VRRFGIKMVD WTIPGEAIIL GRAKDSKALA KIVKELMGMG
FMLFICDEAV EQLLEENVKL GIDYIAYPLG NFTQIVHAAN YALRAGMMFG GVTPGAREEQ
RDYQRRRIRA FVLYLGEHDM VKTAAAFGAI FTGFPVITDQ PLPEDKQIPD WFFSVEDYDK
IVQIAMETRG IKLTKIKLDL PINFGPAFEG ESIRKGDMYV EMGGNRTPAF ELVRTVSESE
ITDGKIEVIG PDIDQIPEGS KLPLGILVDI YGRKMQADFE GVLERRIHDF INYGEGLWHT
GQRNINWLRV SKDAVAKGFR FKNYGEILVA KMKEEFPAIV DRVQVTIFTD EAKVKEYMEV
AREKYKERDD RMRGLTDETV DTFYSCVLCQ SFAPNHVCIV TPERVGLCGA VSWLDAKASY
EINHAGPNQP IPKEGEIDPI KGIWKSVNDY LYTASNRNLE QVCLYTLMEN PMTSCGCFEA
IMAILPECNG IMITTRDHAG MTPSGMTFST LAGMIGGGTQ TPGFMGIGRT YIVSKKFISA
DGGIARIVWM PKSLKDFLHD DFVRRSVEEG LGEDFIDKIA DETIGTTVDE ILPYLEEKGH
PALTMDPIM