Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1743 |
Symbol | |
ID | 3832888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1795031 |
End bp | 1796050 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829667 |
Product | CoA enzyme activase |
Protein accession | YP_430587 |
Protein GI | 83590578 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1924] Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) |
TIGRFAM ID | [TIGR00241] CoA-substrate-specific enzyme activase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.386364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.436212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTGCT TTTTGGGTAT TGATGTCGGC AGCGTAAGCG CTAAAATCGT CGCCCTCGAT GCCGGCAAAA ATTTGCTCTT TGAAACTTAT TTACGCACCC ACGGCAACCC CATAGAAGCC CTGCAAGCCG GTTTTCAGCA ACTGCAGGAA CAATTACCGG ACCTGCAGAT CCTGGCCGTC GGCACCACCG GCTCGGGCCG CCACCTGGCC GCGGCCCTGG TAGGGGCTGA TACCATCAAG AATGAGATAA CCGCCCATGC CGTAGCCGCC AGGGAGGTAA ACCCCGATGT CCGGACGGTA ATTGATATTG GCGGGCAGGA CTCAAAGATC ATCTTTTTGA AAGATGGGGT TTCCCGCGGC TTTAACATGA ACAGTGTTTG CGCTGCTGGT ACCGGTTCTT TCCTGGATCA CCAGGCCAGC CGCCTCAATG TTCCCATAGA AAAGTTCGGT GAATTGGCCT TGCGTTCTAC CAGCCCGGTC CGCATCGCCG GGCGCTGCGG CGTTTTTGCC GAATCCGACC TTATCAGCAA GCAACAGATG GGTTACAGCA AGGAAGATTT AATCGCCGGC CTGTGCCTGG CCCTGGCCCG CAACTACCTG GCCAACGTCG CCCGGGGGAA AGAGATCCAG CCGGTGGTCC TCTTCCAGGG CGGCGTGGCG GCCAACGTCG GGCTGCGGGC GGCCTTTGAG ACCCTCCTGG GTATTCCCAT TATAGTGCCC CCCTATTACC GGGTCATGGG GGCCCTAGGG GCGGCCCTCC TCGCCCGGGA AAAGTGGCAG AAAACCAAAG CCCCCAGCGC CTTCCGGGGG GTACGGGCCA TAGCCCAGTT TAAGTGCGCG CCGCGGAGCT TTATTTGCAA CGATTGCGCC AATAGCTGTG AAATCAGCGA GCTGTATATC TGTGGGGAAA TCGTCGGCCG CTGGGGAAGC CGCTGTGGCA AATGGGCCAA CCTGCGGCTG TCGTCCGCCG ACCGTGAAGA TCAGCGCGAG AAACTCACAT TGATGCGCCT GGGAGCTTAA
|
Protein sequence | MECFLGIDVG SVSAKIVALD AGKNLLFETY LRTHGNPIEA LQAGFQQLQE QLPDLQILAV GTTGSGRHLA AALVGADTIK NEITAHAVAA REVNPDVRTV IDIGGQDSKI IFLKDGVSRG FNMNSVCAAG TGSFLDHQAS RLNVPIEKFG ELALRSTSPV RIAGRCGVFA ESDLISKQQM GYSKEDLIAG LCLALARNYL ANVARGKEIQ PVVLFQGGVA ANVGLRAAFE TLLGIPIIVP PYYRVMGALG AALLAREKWQ KTKAPSAFRG VRAIAQFKCA PRSFICNDCA NSCEISELYI CGEIVGRWGS RCGKWANLRL SSADREDQRE KLTLMRLGA
|
| |