Gene Cthe_2742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2742 
Symbol 
ID4810244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3235055 
End bp3236731 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content46% 
IMG OID640108161 
Productendopeptidase La 
Protein accessionYP_001039134 
Protein GI125975224 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02902] ATP-dependent protease LonB 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTACCA CAACTCTATT TATCATACAA TTTTTTTTCT CAATTATAAT TGGACTGTAT 
TTTTTAAATC TCTTAAAATC GCAGCAGTGC AACAGAAGCG CCATTGACAA AGAGTCTAAA
AAGGAGCTGG ACAAACTAAG AAAACTGAAG GAAATAAAAC TTACGGAACC CTTGTCGGAA
AAAACAAGAC CCACATCGCT GAAGGAAATT GTGGGGCAAA GTCAGGGAAT CAAGGCACTG
AGGGCAGCCT TGTGCGGTCC AAACCCTCAA CATGTGATTA TTTACGGCCC TCCGGGGGTT
GGTAAGACAG CGGCGGCAAG GGTGATACTT GAAGAGGCAA AAAAAAGTGA ATTGTCTCCT
TTTAAAAAAG AGGCAAAATT CGTGGAAGTG GATGCAACTA CATTACGATT CGATGAAAGA
GGTATTGCCG ATCCGTTGAT TGGTTCGGTT CACGACCCGA TATATCAAGG AGCTGGAGCC
TATGGAGTGG CGGGAATTCC TCAGCCGAAA CCCGGCGCGG TGACCAAGGC TCACGGAGGA
ATACTTTTTT TGGACGAAAT AGGCGAGCTT CATCCGATAC AGATGAACAA GCTGCTGAAG
GTTTTGGAAG ACAGGGTTGT TTATTTGGAA AGTGCGTATT ACAGTTCCGA GGACAAAAAT
ATTCCTCCTC ATATACATGA GATATTCCAA AAAGGACTGC CTGCGGATTT CAGACTTGTG
GGCGCAACAA CCAGAACGGC GGATGAAATC CCTGCGGCAA TAAGGTCCCG TTGTGTTGAA
ATATATTTCA GGCCGCTGAC TCCTTCAGAG ATAGCGGAGA TTGCCAAAAA CGCCGCGCAA
AAGGGCGGAT TTGCCATGGA GGAAGGATGT GCCGAACTGG TGGCAAAGTA CGCCCAGAAT
GGAAGAGAAG CCGTAAATAT AGTCCAGATT GCAGGCGGAG TGGCAATTGT AGAGGGCAGG
AGGCTCATAG AAAGGAAAGA TATTGAATGG GTTATAGAAT TTGGCCATTA CAGTCCGCGT
ATTGACAAAA AAGTGACAAA AGGCGAGCAG GTTGGATGTA TAAACGGGCT TGCGGTGTTT
GGCAACTCAA CCGGTACGGT AATTGAGATA GAAGCCTGTG CCACAAGAAC TGTCAGCGGA
GAGGGTACCA TCAAAATATC CGGAATAGTC GAAGAAGAGG AAATGGACGG CAGAGGGCAC
AGACTCAGAA GAACAAGTTC GGCAAGGGCT TCGGTTGATA ATGTCCTCAC GGTTTTGAAG
AGGTTTTTGG GTGTTAATTA TGAGGATTAT GACATTCATC TGAATTTTCC GGGCGGTGTG
CCTGTAGACG GTCCGTCGGC AGGTATCGCC ATAGCTGCGG CGGTTTACTC CGCCATTAAG
AACCTGCCTA TAAGCAGTGA GATTGCGATG ACGGGCGAGA TTTCCATAAG GGGAAAAGTA
AGACCTGTCG GCGGTGTTGT GGCAAAGATT GAAGCGGCAA AAAATGCCGG AATAAAAAAA
GTGCTTATCG CAAAAGAAAA CTGGCAGGAT TTGTTTGAGG ACATGGATAT TGAGGTGGTG
CCTGTGGAGG ATATATTTGA CGTTATAGAG CAGGTATTTG GAAGAAAACA TGAAAAAGTT
GACAATATCC AGATAGACAG TAAATCCGTA AACGTGTTAA GTGCGTCGGG AGCATAA
 
Protein sequence
MLTTTLFIIQ FFFSIIIGLY FLNLLKSQQC NRSAIDKESK KELDKLRKLK EIKLTEPLSE 
KTRPTSLKEI VGQSQGIKAL RAALCGPNPQ HVIIYGPPGV GKTAAARVIL EEAKKSELSP
FKKEAKFVEV DATTLRFDER GIADPLIGSV HDPIYQGAGA YGVAGIPQPK PGAVTKAHGG
ILFLDEIGEL HPIQMNKLLK VLEDRVVYLE SAYYSSEDKN IPPHIHEIFQ KGLPADFRLV
GATTRTADEI PAAIRSRCVE IYFRPLTPSE IAEIAKNAAQ KGGFAMEEGC AELVAKYAQN
GREAVNIVQI AGGVAIVEGR RLIERKDIEW VIEFGHYSPR IDKKVTKGEQ VGCINGLAVF
GNSTGTVIEI EACATRTVSG EGTIKISGIV EEEEMDGRGH RLRRTSSARA SVDNVLTVLK
RFLGVNYEDY DIHLNFPGGV PVDGPSAGIA IAAAVYSAIK NLPISSEIAM TGEISIRGKV
RPVGGVVAKI EAAKNAGIKK VLIAKENWQD LFEDMDIEVV PVEDIFDVIE QVFGRKHEKV
DNIQIDSKSV NVLSASGA