Gene Moth_1743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1743 
Symbol 
ID3832888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1795031 
End bp1796050 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content59% 
IMG OID637829667 
ProductCoA enzyme activase 
Protein accessionYP_430587 
Protein GI83590578 
COG category[I] Lipid transport and metabolism 
COG ID[COG1924] Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) 
TIGRFAM ID[TIGR00241] CoA-substrate-specific enzyme activase, putative 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.386364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.436212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTGCT TTTTGGGTAT TGATGTCGGC AGCGTAAGCG CTAAAATCGT CGCCCTCGAT 
GCCGGCAAAA ATTTGCTCTT TGAAACTTAT TTACGCACCC ACGGCAACCC CATAGAAGCC
CTGCAAGCCG GTTTTCAGCA ACTGCAGGAA CAATTACCGG ACCTGCAGAT CCTGGCCGTC
GGCACCACCG GCTCGGGCCG CCACCTGGCC GCGGCCCTGG TAGGGGCTGA TACCATCAAG
AATGAGATAA CCGCCCATGC CGTAGCCGCC AGGGAGGTAA ACCCCGATGT CCGGACGGTA
ATTGATATTG GCGGGCAGGA CTCAAAGATC ATCTTTTTGA AAGATGGGGT TTCCCGCGGC
TTTAACATGA ACAGTGTTTG CGCTGCTGGT ACCGGTTCTT TCCTGGATCA CCAGGCCAGC
CGCCTCAATG TTCCCATAGA AAAGTTCGGT GAATTGGCCT TGCGTTCTAC CAGCCCGGTC
CGCATCGCCG GGCGCTGCGG CGTTTTTGCC GAATCCGACC TTATCAGCAA GCAACAGATG
GGTTACAGCA AGGAAGATTT AATCGCCGGC CTGTGCCTGG CCCTGGCCCG CAACTACCTG
GCCAACGTCG CCCGGGGGAA AGAGATCCAG CCGGTGGTCC TCTTCCAGGG CGGCGTGGCG
GCCAACGTCG GGCTGCGGGC GGCCTTTGAG ACCCTCCTGG GTATTCCCAT TATAGTGCCC
CCCTATTACC GGGTCATGGG GGCCCTAGGG GCGGCCCTCC TCGCCCGGGA AAAGTGGCAG
AAAACCAAAG CCCCCAGCGC CTTCCGGGGG GTACGGGCCA TAGCCCAGTT TAAGTGCGCG
CCGCGGAGCT TTATTTGCAA CGATTGCGCC AATAGCTGTG AAATCAGCGA GCTGTATATC
TGTGGGGAAA TCGTCGGCCG CTGGGGAAGC CGCTGTGGCA AATGGGCCAA CCTGCGGCTG
TCGTCCGCCG ACCGTGAAGA TCAGCGCGAG AAACTCACAT TGATGCGCCT GGGAGCTTAA
 
Protein sequence
MECFLGIDVG SVSAKIVALD AGKNLLFETY LRTHGNPIEA LQAGFQQLQE QLPDLQILAV 
GTTGSGRHLA AALVGADTIK NEITAHAVAA REVNPDVRTV IDIGGQDSKI IFLKDGVSRG
FNMNSVCAAG TGSFLDHQAS RLNVPIEKFG ELALRSTSPV RIAGRCGVFA ESDLISKQQM
GYSKEDLIAG LCLALARNYL ANVARGKEIQ PVVLFQGGVA ANVGLRAAFE TLLGIPIIVP
PYYRVMGALG AALLAREKWQ KTKAPSAFRG VRAIAQFKCA PRSFICNDCA NSCEISELYI
CGEIVGRWGS RCGKWANLRL SSADREDQRE KLTLMRLGA