Gene Information |
Locus tag | Cthe_1286 |
Symbol | |
ID | 4809538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1562982 |
End bp | 1564511 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106709 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001037711 |
Protein GI | 125973801 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
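The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated 1530 bp), the gene and protein lengths can be checked as below. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1562982
END_BP = 1564511


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1530 509
```

The strand field does not affect this arithmetic; it only determines which strand of the replicon the coordinates are read from.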
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAAT TAAACTATAA CGGTTTTAGC AATGAAGAAA ATGAAATAAA GAGTTTCGGA GAATTAAACA ATACAACCGG CGAAAATGTT GATGCAAATA TAAATGGAGA TGTGGTTGAA AATGCAGCAC AGTATGCAGC TGAGAATATA ATTGAAAATG TTGACGCGGA TGTCGGTGAA AATGCTGCCG AAGGTTTTGG CGAAAATGTG ATTGAAAATG CTGCCGTGGG CGTTGCAGAT AAACGAGTGG AAAATGAAAA GACTTATGAA GGAAGTTTCG TGGATTTAAA GTCGGCTTCA TATTATTCTG AAAGCTATAC AAAGCCAAAA AAACGTAAAA ACAGTGTTTT ACTGCAGATG ATACTTGTTG CTGTTATGAG TTCCATATTG GGCGGCTCGA TAGTCGGAGG TTTCTTTGTA TTTGGAGTTC CGGCCCTCAG TCCTTCGGTT CAGTCCATTT TCAGAAACAC CAATGTTCAG AACGGTTCGA ATGACGCAAC ATCGGGAGTG GATACGGATT ATTATAAAAA AGTTGTCATT GAGAACAACG CCGATTCTTC TGTGGTGGTT GCAATAGCTG AAAAGGTTGG ACCTTCAGTT GTAGGTATAA GTGTAAAATC AACGACAAGC ATCAGTGATT TCTGGTTCTT TACACCAAGA GACACAGAAT CCCAGGGTTC GGGCATTATA ATAAGAAGTG ACGGATATAT AATGACAAAC TACCACGTTA TTGAATCGGC TTTGAACGGA AGAACCAACA CTCTACTTCC GAATGCAAGT ATTAATGTTA TTTTGCCAAG TGATCCGGAC ACACCTCATC CAGCTACGGT TGTGGGAACG GATTCAAAGA CGGATTTGGC AGTGCTTAAA ATTGAAGCAA CCAACCTGCC CGTGATTGAA TTCGGGGATT CGGATAAAAT AAGAGTCGGT GAGCTTGCAG TTGCCATAGG CAATCCCGGA GGACTTGAAT ACATGGGTTC GGTTACCGTG GGTGTAATAA GCGGTCTTAA CAGGACAATA CCTATAACCG ACGGCAAGGA ACTGAAGCTG ATACAGACAG ATGCCGCAAT AAATCCCGGA AACAGCGGCG GTGCTCTCGT TAATGCCGAA GGAAAGTTAA TTGGTGTCAA TACTGCAAAA ATCGGCGGAC AGGGCTATGA AGGACTTGGT TTTGCAATAC CTGTAAACAA AGCAAAGGAA ATAACCGACA GCCTTATTCA GTACAAGTAT GTAAGAGGAA GACCGTCCCT CGGCATACAG ATAAACAGCG GTTACACCAA GGAAATAGCA GACCGTTACG GACTTCCTGA AGGAGTGCTT GTTTACAACG TTGAAATATT CAGTGCGGCT TACAAAGCCG GTATTCAAAA GGATGACATA ATTACGGAGT TTAACGGCGT GAGAGTAAAG AATTATGATG AATTGGAAGA ACAAAAGAAC AAATACAAAC CCGGAGACAA AGTGAAACTC AAAATACACA GGGACGGAAA AGATATTACC GTTGAAGTGA CGTTGGATGA GCAAAAATAA
Protein sequence | MDELNYNGFS NEENEIKSFG ELNNTTGENV DANINGDVVE NAAQYAAENI IENVDADVGE NAAEGFGENV IENAAVGVAD KRVENEKTYE GSFVDLKSAS YYSESYTKPK KRKNSVLLQM ILVAVMSSIL GGSIVGGFFV FGVPALSPSV QSIFRNTNVQ NGSNDATSGV DTDYYKKVVI ENNADSSVVV AIAEKVGPSV VGISVKSTTS ISDFWFFTPR DTESQGSGII IRSDGYIMTN YHVIESALNG RTNTLLPNAS INVILPSDPD TPHPATVVGT DSKTDLAVLK IEATNLPVIE FGDSDKIRVG ELAVAIGNPG GLEYMGSVTV GVISGLNRTI PITDGKELKL IQTDAAINPG NSGGALVNAE GKLIGVNTAK IGGQGYEGLG FAIPVNKAKE ITDSLIQYKY VRGRPSLGIQ INSGYTKEIA DRYGLPEGVL VYNVEIFSAA YKAGIQKDDI ITEFNGVRVK NYDELEEQKN KYKPGDKVKL KIHRDGKDIT VEVTLDEQK
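The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the gene sequence, so that the listed protein can be compared against it. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TAA as shown). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. All function names are illustrative.

```python
# Standard codon table in the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(text: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATGAAT TAAACTATAA CGGTTTTAGC"
print(translate(strip_blocks(first_blocks)))  # MDELNYNGFS
```

Passing the full gene sequence through `strip_blocks` and then `gc_fraction` or `translate` gives values that can be compared with the GC content, protein length, and protein sequence fields of the record.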