Gene Information |
Locus tag | Cthe_2742 |
Symbol | |
ID | 4810244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3235055 |
End bp | 3236731 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640108161 |
Product | endopeptidase La |
Protein accession | YP_001039134 |
Protein GI | 125975224 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease; [TIGR02902] ATP-dependent protease LonB |
|
|
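The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3235055
end_bp = 3236731

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which is not translated into an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields.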
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTACCA CAACTCTATT TATCATACAA TTTTTTTTCT CAATTATAAT TGGACTGTAT TTTTTAAATC TCTTAAAATC GCAGCAGTGC AACAGAAGCG CCATTGACAA AGAGTCTAAA AAGGAGCTGG ACAAACTAAG AAAACTGAAG GAAATAAAAC TTACGGAACC CTTGTCGGAA AAAACAAGAC CCACATCGCT GAAGGAAATT GTGGGGCAAA GTCAGGGAAT CAAGGCACTG AGGGCAGCCT TGTGCGGTCC AAACCCTCAA CATGTGATTA TTTACGGCCC TCCGGGGGTT GGTAAGACAG CGGCGGCAAG GGTGATACTT GAAGAGGCAA AAAAAAGTGA ATTGTCTCCT TTTAAAAAAG AGGCAAAATT CGTGGAAGTG GATGCAACTA CATTACGATT CGATGAAAGA GGTATTGCCG ATCCGTTGAT TGGTTCGGTT CACGACCCGA TATATCAAGG AGCTGGAGCC TATGGAGTGG CGGGAATTCC TCAGCCGAAA CCCGGCGCGG TGACCAAGGC TCACGGAGGA ATACTTTTTT TGGACGAAAT AGGCGAGCTT CATCCGATAC AGATGAACAA GCTGCTGAAG GTTTTGGAAG ACAGGGTTGT TTATTTGGAA AGTGCGTATT ACAGTTCCGA GGACAAAAAT ATTCCTCCTC ATATACATGA GATATTCCAA AAAGGACTGC CTGCGGATTT CAGACTTGTG GGCGCAACAA CCAGAACGGC GGATGAAATC CCTGCGGCAA TAAGGTCCCG TTGTGTTGAA ATATATTTCA GGCCGCTGAC TCCTTCAGAG ATAGCGGAGA TTGCCAAAAA CGCCGCGCAA AAGGGCGGAT TTGCCATGGA GGAAGGATGT GCCGAACTGG TGGCAAAGTA CGCCCAGAAT GGAAGAGAAG CCGTAAATAT AGTCCAGATT GCAGGCGGAG TGGCAATTGT AGAGGGCAGG AGGCTCATAG AAAGGAAAGA TATTGAATGG GTTATAGAAT TTGGCCATTA CAGTCCGCGT ATTGACAAAA AAGTGACAAA AGGCGAGCAG GTTGGATGTA TAAACGGGCT TGCGGTGTTT GGCAACTCAA CCGGTACGGT AATTGAGATA GAAGCCTGTG CCACAAGAAC TGTCAGCGGA GAGGGTACCA TCAAAATATC CGGAATAGTC GAAGAAGAGG AAATGGACGG CAGAGGGCAC AGACTCAGAA GAACAAGTTC GGCAAGGGCT TCGGTTGATA ATGTCCTCAC GGTTTTGAAG AGGTTTTTGG GTGTTAATTA TGAGGATTAT GACATTCATC TGAATTTTCC GGGCGGTGTG CCTGTAGACG GTCCGTCGGC AGGTATCGCC ATAGCTGCGG CGGTTTACTC CGCCATTAAG AACCTGCCTA TAAGCAGTGA GATTGCGATG ACGGGCGAGA TTTCCATAAG GGGAAAAGTA AGACCTGTCG GCGGTGTTGT GGCAAAGATT GAAGCGGCAA AAAATGCCGG AATAAAAAAA GTGCTTATCG CAAAAGAAAA CTGGCAGGAT TTGTTTGAGG ACATGGATAT TGAGGTGGTG CCTGTGGAGG ATATATTTGA CGTTATAGAG CAGGTATTTG GAAGAAAACA TGAAAAAGTT GACAATATCC AGATAGACAG TAAATCCGTA AACGTGTTAA GTGCGTCGGG AGCATAA
|
Protein sequence | MLTTTLFIIQ FFFSIIIGLY FLNLLKSQQC NRSAIDKESK KELDKLRKLK EIKLTEPLSE KTRPTSLKEI VGQSQGIKAL RAALCGPNPQ HVIIYGPPGV GKTAAARVIL EEAKKSELSP FKKEAKFVEV DATTLRFDER GIADPLIGSV HDPIYQGAGA YGVAGIPQPK PGAVTKAHGG ILFLDEIGEL HPIQMNKLLK VLEDRVVYLE SAYYSSEDKN IPPHIHEIFQ KGLPADFRLV GATTRTADEI PAAIRSRCVE IYFRPLTPSE IAEIAKNAAQ KGGFAMEEGC AELVAKYAQN GREAVNIVQI AGGVAIVEGR RLIERKDIEW VIEFGHYSPR IDKKVTKGEQ VGCINGLAVF GNSTGTVIEI EACATRTVSG EGTIKISGIV EEEEMDGRGH RLRRTSSARA SVDNVLTVLK RFLGVNYEDY DIHLNFPGGV PVDGPSAGIA IAAAVYSAIK NLPISSEIAM TGEISIRGKV RPVGGVVAKI EAAKNAGIKK VLIAKENWQD LFEDMDIEVV PVEDIFDVIE QVFGRKHEKV DNIQIDSKSV NVLSASGA
|
| |
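The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method used to produce the record: it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not handle) and that the sequence contains only A, C, G and T. The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCTTACCA CAACTCTATT TATCATACAA"))  # MLTTTLFIIQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared in the same way.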