Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0531 |
Symbol | |
ID | 3830916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 551366 |
End bp | 553672 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828472 |
Product | Lon-A peptidase |
Protein accession | YP_429404 |
Protein GI | 83589395 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000877407 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.534279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGTCGAG CCCTGCCCCT GTTGCCCCTG CGCGGGGTTA TTGTTTTTCC CTACACGGTT ATTCACCTGG ATATAGGGCG GGAACGCTCG GTAAGTGCCA TTGAAGCTGC CATGCTTGGT GATCGGGTCA TTTTCCTGGC CATGCAAAAA GAGGCTCAGG ACGACGACCC CGGCGAAGAT GATATCTATA CTACCGGAAC CATAGCTGAG ATTAAACAAT TATTGAAATT ACCTGGAGGC ACCATCAGGA TCCTGGTCGA AGGTATCCGC AGGGGGGAAA TTAAGGAGTA TATCAGCCAT GATCCCTTCC TCAAGGTAGA AGTCGAAGAG GCTCCGGAGC CGGCAGAGAC CTCCCCGGAG ATCGAGGCCC TGATGCGTTG CCTGATTGAC GAGTTTGAAA CCTATGTCAA GATGGCCAAA AAGATACCTC CGGAAACGGT GGTAGCCGTC GTCAGCCTGG AGGAACCGGG GCGTCTGGCC GATGTGGTGG CCTCCCACCT CAACCTCAAA TTGACGGATA AACAGGCCGT CCTGGAGGCT GTGGATATCA AAACGCGGCT GAATATCCTC TGCGATATCC TGGCCAAGGA AAAAGAAATC CTGGAACTGG AGCGGAAGAT CAGCCTGCGG GTACGCAAGC AGATGGAAAA AGCCCAAAAG GAGTACTACC TGCGGGAGCA GATCAAGGCT ATCCAGAAGG AACTTGGCGA GAAAGACGAC CGTGTGGCCG AGGCCGAGGA ACTGCGGGAG AGGATAGCTA AAGCCAGGCT GCCTAAAGAG ATCCGGGAAC GCGCCCTGAA AGAGGTTGAA AGGCTGGAGA AAATGCCACC CATGGTGGCG GAGGTAACCG TCGTCCGCAA CTACCTGGAC TGGATCCTGG CCCTGCCCTG GCACAAGCAG ACCAGGGACC GCCTGGATAT CAAGGTAGCC GAGGAGATCC TGGACGAAGA TCACTACGGT TTAAAGGAAG TTAAGGAGCG TATCCTGGAA TACCTGGCCA TTCGCCAGCT GGCCAAAAAG ATGCGGGGCC CCATCCTCTG TTTTGTGGGA CCGCCTGGGG TGGGTAAGAC CTCCTTGGCC AAATCCATCG CCCGGGCCCT GCAGCGCAAG TTTGTCCGTA TCTCCCTTGG AGGTACCAGG GACGAAGCCG AGATTCGCGG CCACCGGCGG ACCTATGTGG GCGCCCTGCC CGGACGGATT ATCCAGGGCA TGAAACAGGC GGGGACGAAG AATCCGGTCT TTTTATTGGA TGAGATTGAT AAGTTGAGCA GCGATTTCCG GGGCGATCCC GCCTCGGCGC TGCTGGAAGT CCTGGACCCG GAACAAAACT ATATGTTTAG CGATCATTAT ATTGAAGCTC CCTTCGACCT CTCCAAGGTA ATGTTCATTA CCACGGCCAA TGTCGAATAC TCGATCCCCC GTCCCCTCCT GGATAGGATG GAAGTTATCC GTATTCCCGG CTACACCGAG GAAGAAAAGG TCAAGATTGC TGAACTGCAC CTGCTGCCCA AGCAGCTTGA GGAACACGGC CTTAAGAAGC AGCAACTGGA AGTATCGGAA AACGCCTTGC GGCGGATTGT CCGGGAGTAT ACCCGGGAGG CCGGCGTCCG AAACCTGGAA CGGGAAATCG CCACCATCTG CCGTAAGACC GCCCGGGACA TCGTCAGCGG TAAAACCAAA GCCGTTAAAG TAACGGCCAA CAATGTGGAG CAATACCTTG GTATTCCTCG TTTTCATCAT ACGCAAGCCA TCCGGAATGA GATGGTGGGT GTGGTCAACG GCCTGGCCTG GACGGAGGTT GGCGGCGAGG TCCTGAATGT CGAGGTGTCT ATCCTGAAGG GGAAAGGCAA CCTGACCCTG ACGGGAAAAC TGGGCGACGT CATGAAGGAA TCCGCCTACG CCGGTTTCAG CTACCTCCGC TCCCGGGCCG CCGAACTGGG CCTGGAGGAA GACTTCCACG AGAAGTTTGA CCTGCATATC CACGTTCCCG AGGGTGCCAT CCCCAAGGAC GGGCCTTCGG CGGGCATCAC CATGGCTACG GCCATGGCCT CAGCCCTAAA GGGCGTACCG GTGCGGAGCG ACCTGGCCAT GACCGGGGAA ATCACCCTGC GCGGCCGGGT ACTGCCGGTG GGAGGTATTA AAGAAAAGAT TTTGGCCGCC CACCGGGAAG GGATTAAAAA CATCATCTTG CCCCGGGAGA ACGAGAAAAA CCTGGAAGAC ATCCCGGCCA ACATCAAGCG CAAGATGAAC TTTATCCTGG TCGAGCACAT GGACGAAGTT CTGAAAGAAG CCCTGGGTAA TAACTAG
|
Protein sequence | MRRALPLLPL RGVIVFPYTV IHLDIGRERS VSAIEAAMLG DRVIFLAMQK EAQDDDPGED DIYTTGTIAE IKQLLKLPGG TIRILVEGIR RGEIKEYISH DPFLKVEVEE APEPAETSPE IEALMRCLID EFETYVKMAK KIPPETVVAV VSLEEPGRLA DVVASHLNLK LTDKQAVLEA VDIKTRLNIL CDILAKEKEI LELERKISLR VRKQMEKAQK EYYLREQIKA IQKELGEKDD RVAEAEELRE RIAKARLPKE IRERALKEVE RLEKMPPMVA EVTVVRNYLD WILALPWHKQ TRDRLDIKVA EEILDEDHYG LKEVKERILE YLAIRQLAKK MRGPILCFVG PPGVGKTSLA KSIARALQRK FVRISLGGTR DEAEIRGHRR TYVGALPGRI IQGMKQAGTK NPVFLLDEID KLSSDFRGDP ASALLEVLDP EQNYMFSDHY IEAPFDLSKV MFITTANVEY SIPRPLLDRM EVIRIPGYTE EEKVKIAELH LLPKQLEEHG LKKQQLEVSE NALRRIVREY TREAGVRNLE REIATICRKT ARDIVSGKTK AVKVTANNVE QYLGIPRFHH TQAIRNEMVG VVNGLAWTEV GGEVLNVEVS ILKGKGNLTL TGKLGDVMKE SAYAGFSYLR SRAAELGLEE DFHEKFDLHI HVPEGAIPKD GPSAGITMAT AMASALKGVP VRSDLAMTGE ITLRGRVLPV GGIKEKILAA HREGIKNIIL PRENEKNLED IPANIKRKMN FILVEHMDEV LKEALGNN
|
| |