Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0530 |
Symbol | |
ID | 3830915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 549550 |
End bp | 551268 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828471 |
Product | AAA ATPase |
Protein accession | YP_429403 |
Protein GI | 83589394 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease [TIGR02902] ATP-dependent protease LonB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0252109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAT GGGGCCAGGG CCTGGGCGGA GTCCTGGGCT TTGTCCAGAT ATTCTTTGCC GTGGTAATTG GCCTTTACTT CTGGAACCTC CTGCGCAACC AGCAGGGCAG CCGGGTGGCT GTGGAAAGGG AATCTCGCAA AGAACTGGAG AAACTCCAGC GGATGCGGGA GATCTCCCTC ACTGAACCCC TGGCGGAAAA GACCCGGCCC AGCACCTTTG CCGAGATTAT CGGCCAGGAA GAGGGCTTGA AGGCCCTCCG GGCCGCCCTC TGCGGTCCCA ACCCCCAGCA TGTTATTATC TATGGCCCGC CGGGGGTGGG TAAAACAGCC GCCGCCCGGC TGGTCCTGGA GGAAGCCAAG GCCAACCCCC TTTCACCCTT TAAAGAAAAC GCTAAATTCG TCGAAGTCGA CGGTGCGACC TCCCGGTTTG ACGAGCGGGG CATTGCCGAC CCCCTGATTG GTTCGGTCCA TGACCCCATC TACCAGGGAG CGGGCCCCAT GGGCATGGCC GGTATTCCCC AGCCCAAACC GGGGGCCGTG ACCCGCGCCC ATGGTGGTAT CCTGTTTATC GATGAAATTG GCGAACTGCA CCCCATCCAG ATCAACAAGC TCCTCAAGGT CCTGGAAGAT CGCAAGGTTA TCCTGGAAAG CGCCTATTAC AGCAGTGAAG ATACTAATAT TCCCAGCCAC ATCCATGATA TTTTTCGCAA CGGTTTGCCG GCCGATTTTC GCCTGGTAGG GGCGACCACC CGCCTACCCC AGGAGATCCC GGCGGCCATT CGTTCCCGCT GCCAGGAGAT CTATTTTCGG CCCCTGCTGC CCCATGAAAT CGGGTTAATT GTCCAGAACG CCGCCCGGAA AATAAACTTA CACATTGCCC CGGAGGCCGT GGAGGTAATC AAACAGTATG CCACCAACGG CCGGGAAGCA GTGAACATGG TCCAGCTGGC CGCCGGCCTG GCCCTGACGG ACAGGCGCCA GGAGATTACC CGGGCCGATA TCGAGTGGGT GGTGACCAGC GGCCAGTATA CCCCGCGAAT GGATAGCAAA GTACCTCCTG AACCCCAGGT GGGGGTGGTC AACGCCCTGG CAGTCTATGG CCCCAATATG GGGCTGGTGG TGGAGCTGGA GGCCAGCGTG AACTTCGTCG GCCACCGGCG CGGGCAGGTT ATCGTCACCG GCGTGGTGGA AGAGGAAGAA ACCGGGAGCG GCGACGGCCG CCGCGTCAGG CGCAGGAGCA TGGCCCGTAA CGCCGTGGAC AATGTCCAGA CGGTTCTCAG GCGCCTGCTG CAGGTGGATC TACGCGACTA CGACATCCAC TTGAACTTTC CCGGCGGCGT ACCGGTGGAC GGTCCCTCGG CCGGGGTCAG CATGCTGACA GCCGTTTATT CCGCCCTCAC CGAGACACCG GTCAACAACC TGGTAGCCAT GACGGGAGAA GTAGCCATTC GCGGTGGTGT GCGACCGGTG GGAGGCGTAG TGGCCAAGGT GGAGGCCGCC CGCCTGGCCG GGGCGAAGAA AGTGCTCATC CCCAGGGACA ATTGGCAGGA GATCTTCCGT AGCCTGCCGG GTATCCAAGT GATCCCGGTC CAGAACGTCA ACGAAGTCCT GGCAGAGGCC CTGTTGCCCC CGGCCCAGGT GGCCAGCCCG GAAAGGTTCA AGGCCCGTCC CCAGATTTTA AGTGCGGCAC CAAACATTGG CCGGCCGGTG GGCTGCTAG
|
Protein sequence | MTEWGQGLGG VLGFVQIFFA VVIGLYFWNL LRNQQGSRVA VERESRKELE KLQRMREISL TEPLAEKTRP STFAEIIGQE EGLKALRAAL CGPNPQHVII YGPPGVGKTA AARLVLEEAK ANPLSPFKEN AKFVEVDGAT SRFDERGIAD PLIGSVHDPI YQGAGPMGMA GIPQPKPGAV TRAHGGILFI DEIGELHPIQ INKLLKVLED RKVILESAYY SSEDTNIPSH IHDIFRNGLP ADFRLVGATT RLPQEIPAAI RSRCQEIYFR PLLPHEIGLI VQNAARKINL HIAPEAVEVI KQYATNGREA VNMVQLAAGL ALTDRRQEIT RADIEWVVTS GQYTPRMDSK VPPEPQVGVV NALAVYGPNM GLVVELEASV NFVGHRRGQV IVTGVVEEEE TGSGDGRRVR RRSMARNAVD NVQTVLRRLL QVDLRDYDIH LNFPGGVPVD GPSAGVSMLT AVYSALTETP VNNLVAMTGE VAIRGGVRPV GGVVAKVEAA RLAGAKKVLI PRDNWQEIFR SLPGIQVIPV QNVNEVLAEA LLPPAQVASP ERFKARPQIL SAAPNIGRPV GC
|
| |