Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2680 |
Symbol | |
ID | 4808848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3160920 |
End bp | 3163157 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108095 |
Product | peptidase S41 |
Protein accession | YP_001039072 |
Protein GI | 125975162 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.20871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAC GCCTTTTACT GTTTATTTTA GTGCTTGGTG TCTGTCTTTT TACTTCATGC GGCAATTTTG TAAAAACCAA TATTTACTTT TCAGCAGCCG AATCGGCATT TGATTCAGGC AAATACGAAG ATGCAATAAA GTACTATGAC AAAGTTATAG AAGCAGATTC CGGCAATGCC ATGGCTTATC TCGGTAAAGG TCTTGCCCTG GATGCTTTGG GAAAATACGA AGAAGCCCTG GAGTTTTTCG ACAAAGCCAT TGAAATCAAC AAAGATTTGG CAAAAGCCTA TAATGCCAAA GGCACCACTT TAGCCAGTCT TGAGAGGTAT GAGGAATCTC TTGAAAATTT TAAAAAAGCA GCGGAATTGA AACCAAAAAA CAGTGCCTAT CAAAATGATG TGGCATATGG CTTAAACAAT CTCGGCAGAT TTGAGGAAGC AATTCAATAT GCCGAAAAGG CACTCAAACT TAATCCACGC AGCGGTGTTG CCTACTCAAA CAAAGGTTTT GCCCTTGACG CTCTGGGAAA ATTGGATGAA GCCATCGAAT GCTATGATAA AGCAATAGAA CTTAGTCCAA CCTATACCAA TGCCTACTAC AACAAGTCCA TTGCAGTTTT CAAAATGGGC AAAACAGAAG AGGCCATAGA ACTTTTGGAC AAAGTACTGG AAATTGACCC CGACGACTTA GATGCCATAA CTTCAAAAGG TTACTGTCTA AATGAACTTG GAAAATATGA AAAGGCAATA GAGTGCTTTG ACACTGCAAT CGAAAAATAT CCCAAAGATC CATACCCGTA TGTTTGCAAA GCCACTTCCC TTTATTATCT GGGAAAATAT GACAACGCTC TCGAAGAGTG CAACAAAGCC ATCAAGTTAG AGTACACCTT TCCTGATTCC TATATATGGA AAGCCAAGAT TCTTGTTGAA AAGGGAGACA TTGAAGAGGC CAGAAAATCG TGCGATGAAT TTCTGGCTAT TGCCGAGGAT GCTTCTGTTT ACGATATGAA AGGTCAAATA TATTTACACG AGTATAACTA CCCGGAAGCA ATAAAGCTCT TTGACAAAGC AATAGAAGTT GATCCATCCT ATGAGGACTC TTATATCAAT AAAATCTATT GCCTGTATCT GCAGAAAAAT TACAAAGAGT GCATAGAATT TGCCACAAAG GTGCAAACCA TTTTCCCAAA TTCCGCAGAC ATTCCCTGGT ATATCGGGGA TTGTTACAGC ATAATGATGG AACCGGAAAA GGCTATTGAA TACCTCAAGA AGGCCCACGA ACTAAACCCG AAAGATGTCG GCATTTTAAC CTCCATTGCG TGGGAATACT ATAGCCTTGA GGATTACGCA AAAGCATCCG AATATGCCGA AAAGGCTGCC GAAATATCTG CCGATGACGA AAGTGTAAAG TACATCAGGG AAAAACTGGA AAATCAAAAA CTTCCCGAAG CAGAGCAAAT AGTTGAGTTT GTTAAAAACA ATTACTTGTA CTATGACAAA ATAGCAAACT TTGAAGCCCT TGCAAATGAA TTCAAAGCCA AAGGCGAAGT TGGTGTAAAA GACATATGCA ACTTTATAGA AAGCATAAGG CAAAAAGATG ATATGTTTAC TTTCGTAATT CACGGTGACG ACTATGATTT GTTAAAGTAC GAGGAAAGTA TTTCCCAGGT AACTTCACAG CAACTTGAAC CAAACATACA CTATATAAAA ATTAAATCTT TCACTGCAAG CGTCAGCTGG GAATTCAAAG AAATCATTGA CTCAATTGAA AATCCCGAGG AAAAAGTTTT GGTTATTGAC TTAAGAGACA ACCTGGGAGG CCTTGCAACT TCGTCCGCCG ACATACTGGA CTACCTTTTG CCGGCATGTA CCACAAGCTA CATAGTCTAC AGAGACGGAT ACATGTATTC ATACTATTCT GATGCCGCCC AGACAAAGTT CAAGAAAATT CTCGTCCTGG TCAATGAATA TTCTGCGAGC AGCTCGGAAA TTCTTGCCTT AGGGCTTAAA AAACACTTAA ACAATGTTGT TATAATCGGC CGTCCCACCG TGGGTAAAGG CGTCGGACAA CTGGTTTATG AAAACAAATC CAAAAAATAC ATGATTTATC TGGTAAGCTT TTATTGGAAT GTCATGGAAG AAAACATATT GGGAAAAAGA ATCGAGCCTG ATGTGTATGT AAACAGTTCC AGTGACGCCG CATACATGAA CGAAGTAAAG CGCCAGGCTG CCAGGTAA
|
Protein sequence | MKKRLLLFIL VLGVCLFTSC GNFVKTNIYF SAAESAFDSG KYEDAIKYYD KVIEADSGNA MAYLGKGLAL DALGKYEEAL EFFDKAIEIN KDLAKAYNAK GTTLASLERY EESLENFKKA AELKPKNSAY QNDVAYGLNN LGRFEEAIQY AEKALKLNPR SGVAYSNKGF ALDALGKLDE AIECYDKAIE LSPTYTNAYY NKSIAVFKMG KTEEAIELLD KVLEIDPDDL DAITSKGYCL NELGKYEKAI ECFDTAIEKY PKDPYPYVCK ATSLYYLGKY DNALEECNKA IKLEYTFPDS YIWKAKILVE KGDIEEARKS CDEFLAIAED ASVYDMKGQI YLHEYNYPEA IKLFDKAIEV DPSYEDSYIN KIYCLYLQKN YKECIEFATK VQTIFPNSAD IPWYIGDCYS IMMEPEKAIE YLKKAHELNP KDVGILTSIA WEYYSLEDYA KASEYAEKAA EISADDESVK YIREKLENQK LPEAEQIVEF VKNNYLYYDK IANFEALANE FKAKGEVGVK DICNFIESIR QKDDMFTFVI HGDDYDLLKY EESISQVTSQ QLEPNIHYIK IKSFTASVSW EFKEIIDSIE NPEEKVLVID LRDNLGGLAT SSADILDYLL PACTTSYIVY RDGYMYSYYS DAAQTKFKKI LVLVNEYSAS SSEILALGLK KHLNNVVIIG RPTVGKGVGQ LVYENKSKKY MIYLVSFYWN VMEENILGKR IEPDVYVNSS SDAAYMNEVK RQAAR
|
| |