Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2236 |
Symbol | |
ID | 3831282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2331989 |
End bp | 2334037 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637830156 |
Product | putative ATP-dependent Lon protease |
Protein accession | YP_431066 |
Protein GI | 83591057 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02653] conserved hypothetical protein [TIGR02688] conserved hypothetical protein TIGR02688 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000058517 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGCTTG ATGCCCTCGA TAAATTGGCA GCTTCTGTAT TTGACGGGTA TATAGTTCGG AAGGACCTGG TGCGCAAATA CAGCCGGCAG TATCCTGTAC CCACCTACGT GGTAGAATTC CTGCTCGGCC GGTATTGTGC AACTGTCGAT GAAAAGGAGA TCGAGGAAGG CCTGGAGATT GTTGAGAGGC AATTGCGGGA TCGCACTGTC AGGACCGGTG AAGAAGAACT TTTTAAGGCC CGGGCAAGGG AGATCGGGTC GATCAAAATA ATCGATCTCA TCAAGGCTCG ACTTGACACC AAAAACGATT GTTTTGTTGC CCAACTGCCG AGCCTGGGGC TGAAAGACGT CCGGATTGAC GACGCCCTGG TGCATGAAAA TGAGCGCATG CTTACTGATG GTTTTTATGC CGAAGTCACC CTCGTCTATG ATGCCACCAT TGCCCAGGAG AAGAACGGTC GTCCTTTCGC CATTGAAAAC TTGCGGGCCA TTCAACTTTC TAAAGTGGAT GCCCTGGCAG CGCTGCAGCG AGGGAGGAGC CAGTTTACCA CTGACGAATG GAAACGCCTC CTGATACGCT CTGTAGGTTT GGAACCGGAT ACCCTTTCAG AACGGGCTCA GGATATCGCC CTGCTGCGGA TGGTGCCTTT TGTAGAGCGA AACTACAACC TGGTGGAAAT AGGCCCCCGG GGCACGGGCA AAAGCCATTT GTTTCAACAA ATCTCCCCAT ACTCCCATCT GATTTCAGGC GGTAAAGCCA CAGTGGCCAA AATGTTTGTG AATAATGCCA CCGGACAGCG GGGGCTTGTC TGCCACTATG ATGTCGTGTG CTTTGATGAA GTGTCCGGCA TATCCTTCGA TCAGAAGGAC GGCGTCAACA TTATGAAGGG GTACATGGCA TCGGGCGAAT TCTCCCGCGG CAAGGAGAGT ATCCGTGCTT CCGGCGGCAT TGTAATGCTC GGGAACTTTG ATGTGGATGT CCAGCAACAG CAACGCATCG GCCACTTATT CAGTCCACTC CCGCCGGAGA TGCGGGATGA TACGGCTTTC ATGGACCGCA TTCACGCCTA TGTTCCAGGG TGGGAGTTTC CCAAACTCAA CCCCAATATC CATCTTACGG ATCATTTTGG CTTGGTCAGC GACTTTCTCT CCGAATGCTG GCATAGACTG CGTGATGGTA GCCGGGTTTC CGTGCTCCAG GGCCGGGTTA ACTGGGGTGG AGCCCTCAGC GGTCGCGATA TTGAAGCCGT TCATAAAACC GTTAGCGGCC TGATCAAGCT ACTTTTCCCC GATCCGGAGA TGCCGATACC TGATGAAGAG CTAGAAAAGA TCGTCCGTTT GGCCCTGGAA TCGCGTCGAA GGGTAAAGGA ACAGCAAAAG CGCTGCCTTA AGACGGAATT TCGCAATACT CACTTCAGCT TCTCTATGGG CGTGGAGGGG GTGGAACAGT TTGTTGCCAC GCCGGAACTC CACAGTGATG AAACCATCGA CAGCGATCCC CTGCCTCCCG GGCAGGTGTG GGCCATCAGT CCCGGCGGCC AGGACGCTTC ACCAGCTCTC TATCGAATAG AGGTTGCGGC GGGTCCTGGC AGTGGAGTAA AAATTCTTAA CGCACCTGTC CCGCCGGCAT TTCGGGAAAG CGTTCGTTAC GGAGAACAAA ACCTGTACGT CAGGGCTAAA GAGCTGGTTG GTGACCGCGA TCCGCGCGCC CGTGAGTTTT CAATCCAATT ACGCGCTATG GACGTGGAAC GTTCAGGCCA GGGACTTGGT TTACCAGTGC TGATTGCACT TTGCGGCGCT CTAATTGAGC GAAGCGTTAA GGGTGGATTG ATCATAGTCG GAGCCTTAAA CCTTGGTGGC TCAATTGAGA TGATACCAAA TCCGGTTGCT GTGGCCGAAC TGGCCCTCGA GAAAGGAGCA ACGACGCTGT TAATGCCTAT ATCTTCTCGA AGGCAATTGT TTGATCTTCC TGACGAAATG GCTACGAAGA TCAACATCGA ATTTTATGCT GATGCAACGG ATGCTTTTGT TAAGGCGATT GTTGACTAA
|
Protein sequence | MELDALDKLA ASVFDGYIVR KDLVRKYSRQ YPVPTYVVEF LLGRYCATVD EKEIEEGLEI VERQLRDRTV RTGEEELFKA RAREIGSIKI IDLIKARLDT KNDCFVAQLP SLGLKDVRID DALVHENERM LTDGFYAEVT LVYDATIAQE KNGRPFAIEN LRAIQLSKVD ALAALQRGRS QFTTDEWKRL LIRSVGLEPD TLSERAQDIA LLRMVPFVER NYNLVEIGPR GTGKSHLFQQ ISPYSHLISG GKATVAKMFV NNATGQRGLV CHYDVVCFDE VSGISFDQKD GVNIMKGYMA SGEFSRGKES IRASGGIVML GNFDVDVQQQ QRIGHLFSPL PPEMRDDTAF MDRIHAYVPG WEFPKLNPNI HLTDHFGLVS DFLSECWHRL RDGSRVSVLQ GRVNWGGALS GRDIEAVHKT VSGLIKLLFP DPEMPIPDEE LEKIVRLALE SRRRVKEQQK RCLKTEFRNT HFSFSMGVEG VEQFVATPEL HSDETIDSDP LPPGQVWAIS PGGQDASPAL YRIEVAAGPG SGVKILNAPV PPAFRESVRY GEQNLYVRAK ELVGDRDPRA REFSIQLRAM DVERSGQGLG LPVLIALCGA LIERSVKGGL IIVGALNLGG SIEMIPNPVA VAELALEKGA TTLLMPISSR RQLFDLPDEM ATKINIEFYA DATDAFVKAI VD
|
| |