Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0082 |
Symbol | |
ID | 4808777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 113271 |
End bp | 115718 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105491 |
Product | Lon-A peptidase |
Protein accession | YP_001036516 |
Protein GI | 125972606 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAAG CAAAGAAAGT TATAAAAAAA CAGGTCTTAC CGTTGCTGCC TCTCAGAGGG TTGACAGTGT TTCCCTATAT GATTTTGCAT TTTGACGTTG GAAGAATAAA GTCGATTAAA GCTTTGGAAG AAGCGATGAT AAACAACCAG TTGATTTTCC TGGTTGCCCA AAAGGATGCA AAGAATGATT CACCCGGACC GGAAGATATT TATACCATTG GTACAATATC AAAAGTAAAA CAGCTGTTAA AGCTTCCGGG AGACACGATA AGGGTTTTGG TGGAAGGAAT AAGCCGGGCT GAAATATGTG AGTTTACCCA GACAGAGCCC TTTTTCATGG CTGAGGTTGA AGAAAAAATA TATGTTGAAG AAGACAAAAA CAGCAAGACG GAAATAGAAG CCCTAAAGAG GAGGGTCCTG TCCACCTTTG AGGAGTATTC AAAGCTCAAT AACAAAGTTT CTCCCGAAAC CGTTCTGTCC ATCATGAACA TTGATGACCC TGACCAGTTG GCTGACATTA TAACTGCCAA CTTAATGCTG AAGGTGGAGC AAAAGCAGGA AATATTAAAT GAGTTTAAAA CCAAAATCAG GCTTCAGAAG CTTTTGGAAA CTCTTGTCAG AGAAATTGAA ATAATGCAGA TTGAAAGAGA GATTAATATA AAGGTCAGAA AACAAATTGA CAAGACTCAG AAAGAATACT ATTTGAGGGA ACAGCTAAAG GCCATACAGA GCGAATTGGG AGACAAGGAC GGCGTGGTCG GCGAGGTAGA AGAGTACAAG AGAAAGCTTG CAGAAGGCAA TTTTGGCGAG GAAGTTGAGA AAAAGGTGTT AAAGGAGCTG GATCGTCTCC TTAAGATGCC TCCGGGTTCT GCGGAAGGTT CAGTTATAAG GACGTACCTT GACTGGATAT TTGATTTGCC GTGGAACAAG AAAACGGAAG AGATTATAGA TTTGGACCGC GCTCAGCAGA TTCTTGACGA GGACCACTAT GGCCTGGAAA AGGTTAAGGA AAGAATAATT GAGTATCTTG CCATAAGAAA GCTTAAAAAA GATCTCAAAG GTCCGATTTT GTGCCTGGCC GGACCGCCGG GAGTAGGAAA AACCTCAATC GCAAAGTCTA TTGCCCGCGC ACTCAACAGA AACTATGTAC GAATGTCTTT GGGCGGAGTT CGGGATGAAG CTGAAATAAG AGGTCACCGC AGAACTTATG TGGGAGCCAT GCCCGGAAGA ATTATTTCCG CTTTGAAACA GGCGGGTTCC AAAAATCCTC TTATTCTGCT TGATGAGATT GACAAAATGA GCAGTGATTT CAGAGGAGAC CCTGCGGCGG CAATGCTTGA GGTATTGGAC AGCGAGCAGA ATTATGCTTT CAGGGACCAT TATCTGGAAC TTCCCTTTGA TTTGTCCGAT GTGTTGTTTA TAACTACGGC AAACAACCTT GACACGGTTC CGAGGCCTCT TTTGGACAGA ATGGAAGTAA TATCTTTGTC CAGCTATACT GAAGAGGAAA AGGTCCAGAT AGCAATGAAA TATCTTTTCC CGAAACAGAT TGAGGCTCAC GGCTTTAAGA AAAGCAATCT GAAAATAGAC GAACCGGCTG TGAGAGAAAT AATAAACTGC TATACAAGGG AAGCCGGAGT GAGGGAGCTT GAAAGACAGA TAGCCGGCGT TTGCAGAAAA GTTGCCAGAA AGCTGGTATC CTCAAATCAG AAGACGGTCA AAATTACTGC AGCCTCTATA GAAAAGTATT TGGGAACGAA AAAATACAGA TATGACATGG CAAATGAAAA GGATGAAGTG GGTGTTGCCA CAGGTCTTGC ATGGACGCCT GTGGGCGGAG ATACGTTGTC CATTGAGGTA ACACTTATGG AAGGAAAGGG CAGCCTCGAG CTTACAGGAC AGCTGGGAGA CGTCATGAAA GAATCTGCCC GGGCTGCAAT GAGTTATATT CGTTCAAGAG CGGAATATTA CGGAATAGAC AAGGATTTTT ACAACAAGTA TGATATTCAC ATACATGTAC CGGAGGGAGC CATTCCAAAG GACGGTCCTT CAGCCGGTAT AACCCTTGCA ACCGCAATGG TGTCTGCATT AACCGGAAAG CCGGTAAGAA AAAATGTGGC TATGACCGGG GAGATAACCT TAAGAGGCAG GGTTCTTCCG ATAGGCGGAG TCAAGGAAAA AGTGCTTGCC GCCCATAGAG CCGGAATAGA TACAATTATA ATTCCTGTGG AAAACAAGAA AGACCTTGAA GAGATACCTG AAAATGTAAG AAAGACAATA AAATTTGTTC TGGCAGACAA TATGGAAACG GTGCTCAATA CTGCATTGGT GAAAACCAAA CCGAAGGGCA GGCAAAAGAG CGTTTCCGGT GAAGAAAAAA CTGTTGTGCC GGAAGTTCCT CCGCAGTTGG AAGAATTGGA TCACGGAACC GCAACAATTG AACAGTAA
|
Protein sequence | MSEAKKVIKK QVLPLLPLRG LTVFPYMILH FDVGRIKSIK ALEEAMINNQ LIFLVAQKDA KNDSPGPEDI YTIGTISKVK QLLKLPGDTI RVLVEGISRA EICEFTQTEP FFMAEVEEKI YVEEDKNSKT EIEALKRRVL STFEEYSKLN NKVSPETVLS IMNIDDPDQL ADIITANLML KVEQKQEILN EFKTKIRLQK LLETLVREIE IMQIEREINI KVRKQIDKTQ KEYYLREQLK AIQSELGDKD GVVGEVEEYK RKLAEGNFGE EVEKKVLKEL DRLLKMPPGS AEGSVIRTYL DWIFDLPWNK KTEEIIDLDR AQQILDEDHY GLEKVKERII EYLAIRKLKK DLKGPILCLA GPPGVGKTSI AKSIARALNR NYVRMSLGGV RDEAEIRGHR RTYVGAMPGR IISALKQAGS KNPLILLDEI DKMSSDFRGD PAAAMLEVLD SEQNYAFRDH YLELPFDLSD VLFITTANNL DTVPRPLLDR MEVISLSSYT EEEKVQIAMK YLFPKQIEAH GFKKSNLKID EPAVREIINC YTREAGVREL ERQIAGVCRK VARKLVSSNQ KTVKITAASI EKYLGTKKYR YDMANEKDEV GVATGLAWTP VGGDTLSIEV TLMEGKGSLE LTGQLGDVMK ESARAAMSYI RSRAEYYGID KDFYNKYDIH IHVPEGAIPK DGPSAGITLA TAMVSALTGK PVRKNVAMTG EITLRGRVLP IGGVKEKVLA AHRAGIDTII IPVENKKDLE EIPENVRKTI KFVLADNMET VLNTALVKTK PKGRQKSVSG EEKTVVPEVP PQLEELDHGT ATIEQ
|
| |