Gene Cthe_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0082 
Symbol 
ID4808777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp113271 
End bp115718 
Gene Length2448 bp 
Protein Length815 aa 
Translation table11 
GC content44% 
IMG OID640105491 
ProductLon-A peptidase 
Protein accessionYP_001036516 
Protein GI125972606 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAG CAAAGAAAGT TATAAAAAAA CAGGTCTTAC CGTTGCTGCC TCTCAGAGGG 
TTGACAGTGT TTCCCTATAT GATTTTGCAT TTTGACGTTG GAAGAATAAA GTCGATTAAA
GCTTTGGAAG AAGCGATGAT AAACAACCAG TTGATTTTCC TGGTTGCCCA AAAGGATGCA
AAGAATGATT CACCCGGACC GGAAGATATT TATACCATTG GTACAATATC AAAAGTAAAA
CAGCTGTTAA AGCTTCCGGG AGACACGATA AGGGTTTTGG TGGAAGGAAT AAGCCGGGCT
GAAATATGTG AGTTTACCCA GACAGAGCCC TTTTTCATGG CTGAGGTTGA AGAAAAAATA
TATGTTGAAG AAGACAAAAA CAGCAAGACG GAAATAGAAG CCCTAAAGAG GAGGGTCCTG
TCCACCTTTG AGGAGTATTC AAAGCTCAAT AACAAAGTTT CTCCCGAAAC CGTTCTGTCC
ATCATGAACA TTGATGACCC TGACCAGTTG GCTGACATTA TAACTGCCAA CTTAATGCTG
AAGGTGGAGC AAAAGCAGGA AATATTAAAT GAGTTTAAAA CCAAAATCAG GCTTCAGAAG
CTTTTGGAAA CTCTTGTCAG AGAAATTGAA ATAATGCAGA TTGAAAGAGA GATTAATATA
AAGGTCAGAA AACAAATTGA CAAGACTCAG AAAGAATACT ATTTGAGGGA ACAGCTAAAG
GCCATACAGA GCGAATTGGG AGACAAGGAC GGCGTGGTCG GCGAGGTAGA AGAGTACAAG
AGAAAGCTTG CAGAAGGCAA TTTTGGCGAG GAAGTTGAGA AAAAGGTGTT AAAGGAGCTG
GATCGTCTCC TTAAGATGCC TCCGGGTTCT GCGGAAGGTT CAGTTATAAG GACGTACCTT
GACTGGATAT TTGATTTGCC GTGGAACAAG AAAACGGAAG AGATTATAGA TTTGGACCGC
GCTCAGCAGA TTCTTGACGA GGACCACTAT GGCCTGGAAA AGGTTAAGGA AAGAATAATT
GAGTATCTTG CCATAAGAAA GCTTAAAAAA GATCTCAAAG GTCCGATTTT GTGCCTGGCC
GGACCGCCGG GAGTAGGAAA AACCTCAATC GCAAAGTCTA TTGCCCGCGC ACTCAACAGA
AACTATGTAC GAATGTCTTT GGGCGGAGTT CGGGATGAAG CTGAAATAAG AGGTCACCGC
AGAACTTATG TGGGAGCCAT GCCCGGAAGA ATTATTTCCG CTTTGAAACA GGCGGGTTCC
AAAAATCCTC TTATTCTGCT TGATGAGATT GACAAAATGA GCAGTGATTT CAGAGGAGAC
CCTGCGGCGG CAATGCTTGA GGTATTGGAC AGCGAGCAGA ATTATGCTTT CAGGGACCAT
TATCTGGAAC TTCCCTTTGA TTTGTCCGAT GTGTTGTTTA TAACTACGGC AAACAACCTT
GACACGGTTC CGAGGCCTCT TTTGGACAGA ATGGAAGTAA TATCTTTGTC CAGCTATACT
GAAGAGGAAA AGGTCCAGAT AGCAATGAAA TATCTTTTCC CGAAACAGAT TGAGGCTCAC
GGCTTTAAGA AAAGCAATCT GAAAATAGAC GAACCGGCTG TGAGAGAAAT AATAAACTGC
TATACAAGGG AAGCCGGAGT GAGGGAGCTT GAAAGACAGA TAGCCGGCGT TTGCAGAAAA
GTTGCCAGAA AGCTGGTATC CTCAAATCAG AAGACGGTCA AAATTACTGC AGCCTCTATA
GAAAAGTATT TGGGAACGAA AAAATACAGA TATGACATGG CAAATGAAAA GGATGAAGTG
GGTGTTGCCA CAGGTCTTGC ATGGACGCCT GTGGGCGGAG ATACGTTGTC CATTGAGGTA
ACACTTATGG AAGGAAAGGG CAGCCTCGAG CTTACAGGAC AGCTGGGAGA CGTCATGAAA
GAATCTGCCC GGGCTGCAAT GAGTTATATT CGTTCAAGAG CGGAATATTA CGGAATAGAC
AAGGATTTTT ACAACAAGTA TGATATTCAC ATACATGTAC CGGAGGGAGC CATTCCAAAG
GACGGTCCTT CAGCCGGTAT AACCCTTGCA ACCGCAATGG TGTCTGCATT AACCGGAAAG
CCGGTAAGAA AAAATGTGGC TATGACCGGG GAGATAACCT TAAGAGGCAG GGTTCTTCCG
ATAGGCGGAG TCAAGGAAAA AGTGCTTGCC GCCCATAGAG CCGGAATAGA TACAATTATA
ATTCCTGTGG AAAACAAGAA AGACCTTGAA GAGATACCTG AAAATGTAAG AAAGACAATA
AAATTTGTTC TGGCAGACAA TATGGAAACG GTGCTCAATA CTGCATTGGT GAAAACCAAA
CCGAAGGGCA GGCAAAAGAG CGTTTCCGGT GAAGAAAAAA CTGTTGTGCC GGAAGTTCCT
CCGCAGTTGG AAGAATTGGA TCACGGAACC GCAACAATTG AACAGTAA
 
Protein sequence
MSEAKKVIKK QVLPLLPLRG LTVFPYMILH FDVGRIKSIK ALEEAMINNQ LIFLVAQKDA 
KNDSPGPEDI YTIGTISKVK QLLKLPGDTI RVLVEGISRA EICEFTQTEP FFMAEVEEKI
YVEEDKNSKT EIEALKRRVL STFEEYSKLN NKVSPETVLS IMNIDDPDQL ADIITANLML
KVEQKQEILN EFKTKIRLQK LLETLVREIE IMQIEREINI KVRKQIDKTQ KEYYLREQLK
AIQSELGDKD GVVGEVEEYK RKLAEGNFGE EVEKKVLKEL DRLLKMPPGS AEGSVIRTYL
DWIFDLPWNK KTEEIIDLDR AQQILDEDHY GLEKVKERII EYLAIRKLKK DLKGPILCLA
GPPGVGKTSI AKSIARALNR NYVRMSLGGV RDEAEIRGHR RTYVGAMPGR IISALKQAGS
KNPLILLDEI DKMSSDFRGD PAAAMLEVLD SEQNYAFRDH YLELPFDLSD VLFITTANNL
DTVPRPLLDR MEVISLSSYT EEEKVQIAMK YLFPKQIEAH GFKKSNLKID EPAVREIINC
YTREAGVREL ERQIAGVCRK VARKLVSSNQ KTVKITAASI EKYLGTKKYR YDMANEKDEV
GVATGLAWTP VGGDTLSIEV TLMEGKGSLE LTGQLGDVMK ESARAAMSYI RSRAEYYGID
KDFYNKYDIH IHVPEGAIPK DGPSAGITLA TAMVSALTGK PVRKNVAMTG EITLRGRVLP
IGGVKEKVLA AHRAGIDTII IPVENKKDLE EIPENVRKTI KFVLADNMET VLNTALVKTK
PKGRQKSVSG EEKTVVPEVP PQLEELDHGT ATIEQ