Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1446 |
Symbol | |
ID | 4810596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1764431 |
End bp | 1766968 |
Gene Length | 2538 bp |
Protein Length | 845 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106868 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001037869 |
Protein GI | 125973959 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0617491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTTATT TTTGTCACGC TCCAAACGAC AGAGCTATTG CGGAAAAAAT ATATTTTTCG CTTAAGCAAA CGGGTCTTAC ATGTTGGATT CCGTCACAGG ATATTATGGC CGGTCAATAC TATGTGGAAG CTATAGCCAA CGCAATTGAA AAATCCGACA TCGTAGTTTT TATTTTTTCT TCACACTCAA ATACATCCAT ACAAGTAATT GATGAATTGC AGAAAGCATC ATCACTGAAT AAAACTATTA TTCCTTTCTG CGTTGACAGG GCAATGCCCT CAGAGCCAAT TGAACACTAT TTGAGCAGCC CGTACAAAGT CGATGCAACA ATCGGTCTTC CAGATGATAA TATTGCAAAG CTTCAGAACA TTATTAAGCA AATATATAAC CAATCTTCCG TACATAATGC TAATGACATT AATACTGTTA AAGATACATT TGCGCCATCC AATCCATTAG AAAATAATAC AAACATCATA AAAGATGCTT CGGATAACAA CAGCCAAGAG TCAATCCAAT TTAACGGGGT AATTAATTCA GCTAATTCTC AACCTCCACA AAACCCAAAT TTGCCTCCAC AACACCAAAT GAATCCTAAC TTTAACAACA CAGTCAATAA TATGGACAGA AATATGAACT ACCCAATAAA GCAAAATGTC CGTCCCAATC CTCAAGTTAA ATCCTCCAAT AAAATCAAAA TTCCCCTTAT TATTGGCGGA AGTATTGCAG GAGGAATCAT ATTATTGTCA ATTTTATTCA TGCTTGGGAA AAACCTATTA TTTTCGACAT TAATAAACTC GCACAGACCG TCTGACAATC CAGTAATAGG TTCAAACATT CGTATTGACG GCAGTAAATA TACCGAAGAA GAAATTATTG AATTTACAAA AAATGCTATT TTAGAGCTGG ATAGATTGCA AGCCTCTCGT GAATTTTATG ATACCTATGC ATATAACGCA CCGGAATATG ATCTAATGCG TATTCAGGAA AACGTATTCT TTTCACTTAT GGATTTGGAT GTGGTGAGAT TTGAGGAAGT AGCTATTTTA GGTAAAAACG AAAAAGAACA CGGCATAGAA TATCTCCTCA AAGTAGATTT TGTCTGCTAT GCGGATTATG AATATACGGA GAATAATGAA ACTGTTCACC GCAAAGGAGA AGCATTATTG TACAATAATG TGGTAATACT TGAAACACCA AATGACGGAT TAAAATACTT ATACATGGAA GGAATATTTG AAGATGAGTT AAATGCAAGA AATGAGAACT TGGAGTCTTC ATATCGTGAA ATCGCCGAAT CCGGTGATGT AAACACTGAA GACACCATCC AAACCTACTT GCCTGATTCA ACCGACGAAG ACACCATTAT GTCCGTAAAA GATATCGTTA AGAAAAATGA CCACAAAGTT GTTGCCGTGT ACGTTGATGT ACCGGGAGGA CAATCTCAAG GAAGCGGATT TTTTATAAAA GACGGTGTAA TAGTTACCAA TTATCATGTA ATCGAGGGCG GAAAAAGTGC AAAAATTCTC CTTTCCAACG GAAATTACGT GGATGTGGAA GGAGTTTTAT ACACGGATTC TGATGTTGAT ATTGCTGTAT TGAAATTGGT AAATGAAGTC GGCATCGAGC CGGTAACCAT AGGTCAGGCC CGCGATTCCG ATAAAGGAAG TATCGCCGTT GCTATTGGAT CACCTTTGGG ACTGTTTAAT ACAGTATCCA CAGGAATTAT ATCTAATTTC TGGGAGGCCA ATGGAGTTAA CTTAATTCAA ATTTCCATCC CTATTACCCA CGGTAACTCC GGAGGTGCCT TATTCAACGA GTCCGGTAAA TTAATCGGTA TCACATCCTC AGGAATAGGA GAGGCAAATT TAAACTTCGC AATTTCTTCA ACCCATATAA TACCTATTTG TGAGGATATA AAAAATATAC CCTATAATCA ATTAAATGCA GTTCCCCTTA GCAGTGCAGG CGGCAATATT AATTCCATTA CCAGGCAAGC CGGATCTTCC AATAGCAATC AAAGCAGTTC TTCCGGCAAG AGTCTTTTAT CTTCAAAATA CAGGTTTGTT AATTCCGATA AACCAATATC CAACGATGCA ACATATACCT CTGACTTACA GACAATATAT GACCTGGCTT TAAACAAGTA TTATGCCAAA GAAGATTTAT ACAGAGATCT TGATTACGCC TTTGACTATA TTTTGGATTA CGGCACAAAG TATTATCTGG AATATTTAGA GTATATACTT CATGAAACAT ATATAAAAGA AGATGTAGAA AACTTTAATA AGATAAATGA ATCTGTTCTG GAAGTGCTTA ACACCACAGG TGAATATATA GAATTTACAG CGGATGAAAT TGAAATAATA GGTGCCGGAA TAAAAAACGA CGGCACTATT GACGGTATTA TTGTGAGAGC TTTATTCTAT GAAAATTCGG AGCCTTATGT GAGATCAAAA TTTTATTTCA GAGCTAACTA TGACGCATGG AGTTATGTTG ATGGCTTCTT GTATGTAGGA GAAGTTACAG AAAAGTAA
|
Protein sequence | MIYFCHAPND RAIAEKIYFS LKQTGLTCWI PSQDIMAGQY YVEAIANAIE KSDIVVFIFS SHSNTSIQVI DELQKASSLN KTIIPFCVDR AMPSEPIEHY LSSPYKVDAT IGLPDDNIAK LQNIIKQIYN QSSVHNANDI NTVKDTFAPS NPLENNTNII KDASDNNSQE SIQFNGVINS ANSQPPQNPN LPPQHQMNPN FNNTVNNMDR NMNYPIKQNV RPNPQVKSSN KIKIPLIIGG SIAGGIILLS ILFMLGKNLL FSTLINSHRP SDNPVIGSNI RIDGSKYTEE EIIEFTKNAI LELDRLQASR EFYDTYAYNA PEYDLMRIQE NVFFSLMDLD VVRFEEVAIL GKNEKEHGIE YLLKVDFVCY ADYEYTENNE TVHRKGEALL YNNVVILETP NDGLKYLYME GIFEDELNAR NENLESSYRE IAESGDVNTE DTIQTYLPDS TDEDTIMSVK DIVKKNDHKV VAVYVDVPGG QSQGSGFFIK DGVIVTNYHV IEGGKSAKIL LSNGNYVDVE GVLYTDSDVD IAVLKLVNEV GIEPVTIGQA RDSDKGSIAV AIGSPLGLFN TVSTGIISNF WEANGVNLIQ ISIPITHGNS GGALFNESGK LIGITSSGIG EANLNFAISS THIIPICEDI KNIPYNQLNA VPLSSAGGNI NSITRQAGSS NSNQSSSSGK SLLSSKYRFV NSDKPISNDA TYTSDLQTIY DLALNKYYAK EDLYRDLDYA FDYILDYGTK YYLEYLEYIL HETYIKEDVE NFNKINESVL EVLNTTGEYI EFTADEIEII GAGIKNDGTI DGIIVRALFY ENSEPYVRSK FYFRANYDAW SYVDGFLYVG EVTEK
|
| |