Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0609 |
Symbol | |
ID | 4808211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 746423 |
End bp | 747400 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106023 |
Product | peptidase M42 |
Protein accession | YP_001037037 |
Protein GI | 125973127 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.489423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGACT TGCTCAAAAA GTTTACAGGA ATCGTTGGAG TATCCGGAAA CGAAGAAGAA ATAAGGGAAG CTATAATTGA GGAAATTAAA GAATGTGTTG ACGAAATAAA AGTTGATACT TTGGGAAACC TTATTGCCGT CAAAAAAGGC AAGGGCAAAA AAATCATGGT GGCGGCTCAT ATGGATGAGA TTGGCGTAAT GGTTACATAT ATAGACGACA AGGGCTTTTT AAGGTTTTCT GCCGTCGGAG GGGTCAGCCG CTATGACTGT ATAGGCCAGA GGGTGAAGTT TAAAAACGGA GTTGTCGGAG CTGTTTATTA CGAGGAAAAA CTTGAGGATA TGAAGAATCT CCAGCTTTCC AAAATGTATA TAGATATTGG AGCAAGAAGC AGAGAGGAAG CCCTGAAGAT GGTAAATATC GGAGATGTCG CCTGCTTTGT CGGAGATGCG GTGCTTCAGG GGGATACCGT GATATCGAAG GCATTGGACA ACAGAAGCGG CTGTGCGGTG GTTGTAAAGG CGATAAAAGA GTTGAAAAAG ACGGATAATG AAATATATTT TGTGTTTACG GTTCAGGAAG AGGTCGGTTT GAGAGGAGCA AAAACCGCGG CTTTCAGTAT AAAGCCTGAT ATAGCCATAG CTGTGGATGT TACAATGACG GGAGACACAC CGGAATCGCA TCCTATGGAG GTTAAGTGCG GCGGCGGGCC TGCAATTAAA GTAAAGGATC GTTCCGTCAT TTGTCATCCG GAGGTAAGAA AACTTTTGGA AGAGTCAGCA AAAAGGAATA ATATTCCTTA TCAGTTGGAA ATACTTGAAG CAGGAGGGAG CGACCCGGGC TCAATACATT TGACGGCGGG AGGAATACCT TCGGGTGCGA TATCCATACC GGTGAGGTAT GTTCACAGTC CGGTAGAGAC CGCCAGCATG TCGGATATTA ATAATGCCGT AAAATTGTTG GTTGAAGCCA TTTGCTGA
|
Protein sequence | MFDLLKKFTG IVGVSGNEEE IREAIIEEIK ECVDEIKVDT LGNLIAVKKG KGKKIMVAAH MDEIGVMVTY IDDKGFLRFS AVGGVSRYDC IGQRVKFKNG VVGAVYYEEK LEDMKNLQLS KMYIDIGARS REEALKMVNI GDVACFVGDA VLQGDTVISK ALDNRSGCAV VVKAIKELKK TDNEIYFVFT VQEEVGLRGA KTAAFSIKPD IAIAVDVTMT GDTPESHPME VKCGGGPAIK VKDRSVICHP EVRKLLEESA KRNNIPYQLE ILEAGGSDPG SIHLTAGGIP SGAISIPVRY VHSPVETASM SDINNAVKLL VEAIC
|
| |