Gene Moth_1201 details


Gene Information

Locus tag: Moth_1201
Symbol:
ID: 3832968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1236379
End bp: 1237719
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 56%
IMG OID: 637829134
Product: acetyl-CoA decarbonylase/synthase complex subunit gamma
Protein accession: YP_430058
Protein GI: 83590049
COG category: [C] Energy production and conversion
COG ID: [COG1456] CO dehydrogenase/acetyl-CoA synthase gamma subunit (corrinoid Fe-S protein)
TIGRFAM ID:
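
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


length = gene_length(1236379, 1237719)
print(length)                  # 1341
print(protein_length(length))  # 446
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.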


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTGA CGGGACTGGA GATTTACAAG CAGCTACCCA AAAAGAATTG TGGCGAGTGC 
GGGACACCCA CCTGTCTGGC CTTCGCCATG AACCTGGCCT CCGGAAAGGC CAGCCTTGAT
TCCTGTCCGT ATGTTTCAGA TGCCGCCCGG GAGGCCCTGG ACGCGGCCGC GGCACCACCC
ATTGCCAAGG TAGTCCTGGG CGCCGGGCCG ACTGCCGTAG AAATGGGGGA TGAGACGGAA
CTCTTCCGCC ATGATAAACG TTTTTACCAT GAAACCGCCA TTGCCATCCA GGTTAGCGAC
AACTTGAGCA GTGAAGAACT GAAGGCTAAA GTCGAAGCTA TAAATGGCCT GAACTTCGAC
CGGGTGGGCC AGCACTACAC CATCCAGGCC ATAGCCATCC GCCATGATGC CGATGACCCT
GCTGCTTTCA AGGCAGCGGT AGCCAGTGTA GCCGCCGCTA CCCAGTTAAA CCTTGTCCTT
ATGGCCGATG ATCCTGACGT ATTAAAGGAA GCCCTAGCAG GAGTAGCCGA CCGCAAGCCC
CTCTTATATG CCGCCACCGG CGCTAATTAC GAAGCCATGA CCGCCCTGGC CAAAGAAAAC
AACTGCCCCC TGGCCGTCTA TGGTAACGGT CTGGAGGAAC TGGCCGAACT GGTAGATAAA
ATCGTTGCCC TGGGCCACAA GCAGTTGGTC CTCGATCCCG GTGCCAGGGA GACCTCCAGG
GCCATCGCGG ATTTCACCCA GATCCGCCGC CTGGCCATTA AGAAACGTTT CCGTTCCTTC
GGTTATCCCA TTATCGCCCT TACTACTGCT GCCAATCCAT TAGACGAGGT ACTCCAGGCA
GTTAACTATG TGACCAAGTA TGCTAGCTTG GTGGTTTTAC GCACCGATGC CAAAGAACAC
CTGCTCCCCC TCTTGTCCTG GCGCCAGAAC CTCTACACCG ACCCCCAGGT TCCCATCAGG
GTAGAGGAGA AACTGAATGA AATCGGTGCC GTCAACGAGA ATTCGCCGGT CTACGTAACC
ACCAACTTCT CCCTGACCTA TTACTCCGTC GAGGGCGAGA TCGAGAGCAC CAAGATCCCC
AGTTACCTGC TCTCGGTGGA TACCGACGGA CTGTCAGTCT TGACGGCCTA TGCCGATGGT
AAATTTGAAG CCGAGAAAAT CGCCGCCGTT ATGAAAAAGG TGGACCTGGA CAATAAGGTT
AAACGCCACC GGATCATTAT TCCCGGGGCT GTCGCCGTCC TGAAGGGCAA ACTGGAAGAC
TTAACTGGAT GGGAAGTTAT CGTTGGCCCC AGGGAAGCCA GCGGCATCGT GGCCTTTGCC
CGGGCCAACC TGGCTTCATA G
 
Protein sequence
MPLTGLEIYK QLPKKNCGEC GTPTCLAFAM NLASGKASLD SCPYVSDAAR EALDAAAAPP 
IAKVVLGAGP TAVEMGDETE LFRHDKRFYH ETAIAIQVSD NLSSEELKAK VEAINGLNFD
RVGQHYTIQA IAIRHDADDP AAFKAAVASV AAATQLNLVL MADDPDVLKE ALAGVADRKP
LLYAATGANY EAMTALAKEN NCPLAVYGNG LEELAELVDK IVALGHKQLV LDPGARETSR
AIADFTQIRR LAIKKRFRSF GYPIIALTTA ANPLDEVLQA VNYVTKYASL VVLRTDAKEH
LLPLLSWRQN LYTDPQVPIR VEEKLNEIGA VNENSPVYVT TNFSLTYYSV EGEIESTKIP
SYLLSVDTDG LSVLTAYADG KFEAEKIAAV MKKVDLDNKV KRHRIIIPGA VAVLKGKLED
LTGWEVIVGP REASGIVAFA RANLAS
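
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon-to-amino-acid assignments, which, as far as I am aware, translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical, and only the first ten codons of the gene are translated here as an example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid string
# follows the same order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence above.
first_blocks = "ATGCCTTTGA CGGGACTGGA GATTTACAAG"
print(translate(first_blocks))  # MPLTGLEIYK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked in the same way.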