Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0608 |
Symbol | |
ID | 4808210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 745327 |
End bp | 746376 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106022 |
Product | peptidase M42 |
Protein accession | YP_001037036 |
Protein GI | 125973126 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0364864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGATAA AAGAATTGAC GGAGTTAAAC GGGGTATCCG GAAATGAGGA TGAGGTAAGA AAATTTATAA AAGAAGAGGC GCAAAAGTAT GCAGACAGCA TAACCGAGGA CTCAATGGGA AATTTGATTT GCTATAAAAA AGGCGGATCC TCAAAATACC GCGTAATGTT GTCCGCCCAC ATGGACGAGG TCGGATTTAT GGTAACGGGG TATGATGACG GTCTGATCAA ATTTGCAAGT ATTGGTGGAA TAGATGAGAG AATACTTCCG GGAAAGAGGG TTTTGGTCGG GGAAAAGCGG ATTCCCGGTG TTATAGGTTC AAAGCCCATT CATCTGCAGG AAAAGGCTGA AAGGGGAAAT AATATCAAGC TGAAAAACAT GTATATAGAC ATAGGTGCCG AAAAAAAGGA AGAAGCCGAA AAACTTGCTC CTTTAGGTGA ATACATAGCC TTTTACAGCA TGTATACTGA GTTTGGTGAC GGCTGTATAA AGGCAAAGGC TTTGGATGAC CGTGTCGGTT GTGCCATACT TCTTGAAATA TTGAAAGAGA GGTATGGATT TGATTTGTAT GTATGCTTTA CCGTTCAGGA GGAGATAGGA TTAAGAGGCG CAGGCGTTGC TGCGTTCAGG GTAAATCCTG ATATTGCAAT TGTTGTTGAA GGAACCACCT GCTCTGATGT GCCGGGAGCC CGCGAACATG AGTATTCAAC GGTAATGGGA AATGGAGCGG CACTAACTAT AATGGACAGA ACTTCCTATT CCAATAAAAA GCTGGTTGAC TTTATGTACA AGACGGCGAA AGATAAAAAC ATACCGGTCC AGTACAAGCA AACCGCAACC GGCGGAAATG ATGCCGGAAA GATACAGTTA ACCCGTGAGG GAGTTGTGGT GGCATCGGTA TCTGTTCCCT GCAGGTATAT ACATTCCCCT GTGTCGGTAA TGAACCGAAG AGACTATGAA AGTTGCTTGA ATCTCGTAAA AGCTGTGCTG GAGGAGTTTG ACAACAATGA GAGCTTAATT GAAAGTTTTA AATTGCACAA TGTGAAGTAA
|
Protein sequence | MLIKELTELN GVSGNEDEVR KFIKEEAQKY ADSITEDSMG NLICYKKGGS SKYRVMLSAH MDEVGFMVTG YDDGLIKFAS IGGIDERILP GKRVLVGEKR IPGVIGSKPI HLQEKAERGN NIKLKNMYID IGAEKKEEAE KLAPLGEYIA FYSMYTEFGD GCIKAKALDD RVGCAILLEI LKERYGFDLY VCFTVQEEIG LRGAGVAAFR VNPDIAIVVE GTTCSDVPGA REHEYSTVMG NGAALTIMDR TSYSNKKLVD FMYKTAKDKN IPVQYKQTAT GGNDAGKIQL TREGVVVASV SVPCRYIHSP VSVMNRRDYE SCLNLVKAVL EEFDNNESLI ESFKLHNVK
|
| |