Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1594 |
Symbol | |
ID | 3832740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1629442 |
End bp | 1631634 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829523 |
Product | AAA family ATPase, CDC48 subfamily protein |
Protein accession | YP_430443 |
Protein GI | 83590434 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | [TIGR01242] 26S proteasome subunit P45 family [TIGR01243] AAA family ATPase, CDC48 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0267229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGGG ATTTGGGTGT AAAACTCCGC GTGTGCGAGG GAATGGTTGA GGATGCCCGT AAAGGTATAG TACGGGTACT GACCCCGGTT ATGGATGAAT TGGGTTTAAA ACCCAACGAC GTCGTTGCTA TTACCGGCAA GCGGACTACG GTAGCCAGGA TAATGCCGGC TTTTCAAGAC GGTTGTCCCC CGGGCAACAT CCAGATGGAC GGCCTCCAGC GCCAGAATGC CCAAGTCGGC ATCGGCGAAG GAGTTACTCT ATCTCCCGTG GAATGGGAAA CAGCCAGGAC GGTGGTTTTG GCGCCGGTCC TGCCCGGCTG GACCCTCGGC GGCGAGCATG AGATTGTACA TTTAAAGAAG CACCTCATCG GCCGGGCGGT GGTACCGGGT GATCAGGTAA CCATCCCTCA GTTTAGCGGT GGTGATGAGG CCTTTACCGT TGAAGGGGCT GCACCCCGGG GAGCGGTGGT AATCACTCGT GATACAGCTG TACGCTTTAA AGGCGGGGAA GCCACCGAAG GCCGGGGCCA GCGGGTCACC TATGAGGATA TCGGCGGTCT GGCCAGGGAA GTCCAACGGG TCAGGGAAAT TATCGAATTG CCCTTAAAAT ACCCACAACT CTTTCAAAGG TTGGGAGTAG AGGCTCCCAA GGGCATCTTG ATGCACGGGG CACCCGGAAC GGGTAAAACC CTTATTGCCC GGGCCGTAGC CTCGGAAACG GAAGCCCACT TTATCCACGT CAACGGCCCG GAGATAATGC ATAAATACTA CGGTGAAAGC GAGGCCCGCC TGCGCCAGGT TTTTGATGAG GCCCGCAGGA AGGCACCGAG TATTATCTTC CTGGATGAGA TTGACGCCCT GGCTCCGCGC CGGGCCGACG TTCACGGCGA CGTGGAAAAA CGTGTTGTCG CCCAGTTGCT GGCCTTGATG GACGGGCTGG AATCCCGCGG CAACGTAATT GTGATAGCGG CCACTAATAT ACCCGACCTT GTCGATCCGG CCCTGCGCCG CCCGGGCCGT TTTGACCGGG AGATAGCCAT CAACGTCCCG GATCAAAGGG GCCGGCGGGA GATCCTGCAG ATCCATACCC GGGGCATGTC CCTGGCGGAG GACGTTTCCC TGGATCGCCT GGCAGCCATC ACCCACGGCT TTGTCGGTGC TGATTTGGCC GCCCTCTGCC GGGAAGCCGG CATGTATGCC CTGCGACGGG CCCTTAAAAG CTTCCAGCTG GGCAACGAGC GTACGGAAGA CCTGCAACTC CAGGTTACTA TGCGAGACTT TCTCGATGCC CTGACGGAGG TCGAGCCTTC GGCCACCAGG GAGTTCGCCA TGGAGATTCC TACGGCAACC TGGGAGGATA TCGGTGGCCT GGAGAAGATT AAAGAACGAC TGCAGGCTAT GGTCGAGTGG CCCCTACGCT ATCCAGAACT TTTCCAACAG TTTGGCCTGC AAACTCCCAA GGGCATTCTG CTCTCCGGTC CCCCCGGGAC AGGTAAGACC CTGGTAGCTA AAGCCCTGGC CCGGGAGAGC GGGATTAATT TCATACCGGT TAACAGTTCC CTCCTCTTTT CCCACTGGTG GGGAGAGGCG GAGAAAACCC TACATGAGGT TTTTCGCAAG GCCCGCCAGG CCTCTCCCTG CCTGCTGTTT TTTGACGAAC TGGACGCCCT GGTACCGGCC CGCAAAGCTG GCGAAGGTAG TAGCATTGGC AGCCGCCTGG TATCCCAGTT CCTGATGGAG TTAGATGGCC TGGAAGAATT GCGGGAGGTA ATCGTCCTGG GAGCTACCAA CCGTATTGAT ATGATTGACC CGGCCGTCCT GCGGCCCGGT CGCTTTGACC AGATTCTGGA GTTCCCGTAT CCGGACCAGG CAGCCAGGAA AGAGATTTTC CAGATTTACC TGCGCAACCG GCCGGTTGAC CCGGGCATTA ACCTGGATAG TCTGGCCGGT GCGGCTGAAG GGCTGGTGGG GTCGGAGATT GAAGCCCTGT GCAAGCGAGC GGCCCTGCTG GCCGTATCTG AAGTGATTAA CCATAAAGGT GCCGGAGCTT ACATTAAAAC GTGTCACCTG GAACAGGCCC TGGCCGAGAT CCAGGCCGAA AAACAACAGG CACGGACCGG GGCGGAGAAC CATACCCTGC GCCCCGTCTG GAATAATGTT GTCCCCGGAG CAATATCACA GGTGGGGAGG TGA
|
Protein sequence | MPGDLGVKLR VCEGMVEDAR KGIVRVLTPV MDELGLKPND VVAITGKRTT VARIMPAFQD GCPPGNIQMD GLQRQNAQVG IGEGVTLSPV EWETARTVVL APVLPGWTLG GEHEIVHLKK HLIGRAVVPG DQVTIPQFSG GDEAFTVEGA APRGAVVITR DTAVRFKGGE ATEGRGQRVT YEDIGGLARE VQRVREIIEL PLKYPQLFQR LGVEAPKGIL MHGAPGTGKT LIARAVASET EAHFIHVNGP EIMHKYYGES EARLRQVFDE ARRKAPSIIF LDEIDALAPR RADVHGDVEK RVVAQLLALM DGLESRGNVI VIAATNIPDL VDPALRRPGR FDREIAINVP DQRGRREILQ IHTRGMSLAE DVSLDRLAAI THGFVGADLA ALCREAGMYA LRRALKSFQL GNERTEDLQL QVTMRDFLDA LTEVEPSATR EFAMEIPTAT WEDIGGLEKI KERLQAMVEW PLRYPELFQQ FGLQTPKGIL LSGPPGTGKT LVAKALARES GINFIPVNSS LLFSHWWGEA EKTLHEVFRK ARQASPCLLF FDELDALVPA RKAGEGSSIG SRLVSQFLME LDGLEELREV IVLGATNRID MIDPAVLRPG RFDQILEFPY PDQAARKEIF QIYLRNRPVD PGINLDSLAG AAEGLVGSEI EALCKRAALL AVSEVINHKG AGAYIKTCHL EQALAEIQAE KQQARTGAEN HTLRPVWNNV VPGAISQVGR
|
| |