Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0605 |
Symbol | |
ID | 4808207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 740565 |
End bp | 741677 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106019 |
Product | PgdS peptidase. cysteine peptidase. MEROPS family C40 |
Protein accession | YP_001037033 |
Protein GI | 125973123 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000243263 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGT TTAACAAGGT TTTCATGTAC ATAACTGCTT CGGCTTTGTC TGTTTCTTTA TGGACATGTA CTTCTTTTGC ACAGCAGAAC AAAACGGGAG TAACTACTGC AAGCATGCTT AATATGCGTG AAAATCCCAG CACTTCGACG AAAATCATAG ACCAAATTCC AAATGGAACC AAGGTTGATA TAATAGAAAC TTCAAACGGT TGGTATAAAA TTTCGTACAA TGGCAAGACA GGATGGGTTT ACGGTTCTTA TGTTAAAGTA ACGGAGACAC CAAAACTGAC TGTGACGGAT GAAACGATAT TGGCGACTTT GAATAAAGGT TCTACCGGTA ATTCTTCAAC CAATTCTTCT ACAAATAATT CTGCTGTCAA CAATTCTTCC AGTCAAGTGG TCGATGAAAC CATTCAAAAA CCTGCTCAAA ATGCTGCATC CGGAGAAAAC ACTGAAAATA CCGTTGTAAA AACAGGAATT GTGAAAGCTT CGGCTCTGAA TGTAAGGCAG GGACCGGGTA CTTCCTATAG TATTATTAAT CAGCTTTCAA ACGGCGCAAA GGTAAATATA ATAAAAGAAG AGTCCGGCTG GTATCAAATC AAGCTGGCAA ACGGTTCTAC AGGTTGGGTT TCAGGTACAT ATGTGAATGT CAATACTACC ATTGCTTCAA GAGGAGGACT TTCTGAAAAT TCAGCTCCGG CAGCTTCAAA TAACAGCGAT GTGTCCGGTG TGAGACAGCA GGTTGTTGAA TATGCCAAGA AATTTTTAGG AGTCAAATAT GTATATGGAG GAAATTCGCC TTCTCAGGGA TTTGACTGTT CCGGTTTTGT AAAATATGTG TTCAGCAATT TTGGTATTAA TCTTGAAAGG GTTGCCGCAA GTCAGGCAAA GCAGGGTACA TGGGTTTCCA AGGATCAATT GCTGCCCGGC GACCTCGTAT TTTTCGATAC CGACGGAGGA CATAATTATA TTAACCATTC CGGCATATAC ATAGGAGATG GAAAGTTTAT TCATGCATCA TCGGGAAGCG GCAAGAAGAG TGTTGTTATA AGTGACCTTA CGAGCGGGTT TTATGCCAAC ACTTATATGA CCGCGAGAAG AGTTTTAAAT TAA
|
Protein sequence | MLKFNKVFMY ITASALSVSL WTCTSFAQQN KTGVTTASML NMRENPSTST KIIDQIPNGT KVDIIETSNG WYKISYNGKT GWVYGSYVKV TETPKLTVTD ETILATLNKG STGNSSTNSS TNNSAVNNSS SQVVDETIQK PAQNAASGEN TENTVVKTGI VKASALNVRQ GPGTSYSIIN QLSNGAKVNI IKEESGWYQI KLANGSTGWV SGTYVNVNTT IASRGGLSEN SAPAASNNSD VSGVRQQVVE YAKKFLGVKY VYGGNSPSQG FDCSGFVKYV FSNFGINLER VAASQAKQGT WVSKDQLLPG DLVFFDTDGG HNYINHSGIY IGDGKFIHAS SGSGKKSVVI SDLTSGFYAN TYMTARRVLN
|
| |