Gene Cthe_1286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1286 
Symbol 
ID4809538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1562982 
End bp1564511 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content42% 
IMG OID640106709 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001037711 
Protein GI125973801 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAT TAAACTATAA CGGTTTTAGC AATGAAGAAA ATGAAATAAA GAGTTTCGGA 
GAATTAAACA ATACAACCGG CGAAAATGTT GATGCAAATA TAAATGGAGA TGTGGTTGAA
AATGCAGCAC AGTATGCAGC TGAGAATATA ATTGAAAATG TTGACGCGGA TGTCGGTGAA
AATGCTGCCG AAGGTTTTGG CGAAAATGTG ATTGAAAATG CTGCCGTGGG CGTTGCAGAT
AAACGAGTGG AAAATGAAAA GACTTATGAA GGAAGTTTCG TGGATTTAAA GTCGGCTTCA
TATTATTCTG AAAGCTATAC AAAGCCAAAA AAACGTAAAA ACAGTGTTTT ACTGCAGATG
ATACTTGTTG CTGTTATGAG TTCCATATTG GGCGGCTCGA TAGTCGGAGG TTTCTTTGTA
TTTGGAGTTC CGGCCCTCAG TCCTTCGGTT CAGTCCATTT TCAGAAACAC CAATGTTCAG
AACGGTTCGA ATGACGCAAC ATCGGGAGTG GATACGGATT ATTATAAAAA AGTTGTCATT
GAGAACAACG CCGATTCTTC TGTGGTGGTT GCAATAGCTG AAAAGGTTGG ACCTTCAGTT
GTAGGTATAA GTGTAAAATC AACGACAAGC ATCAGTGATT TCTGGTTCTT TACACCAAGA
GACACAGAAT CCCAGGGTTC GGGCATTATA ATAAGAAGTG ACGGATATAT AATGACAAAC
TACCACGTTA TTGAATCGGC TTTGAACGGA AGAACCAACA CTCTACTTCC GAATGCAAGT
ATTAATGTTA TTTTGCCAAG TGATCCGGAC ACACCTCATC CAGCTACGGT TGTGGGAACG
GATTCAAAGA CGGATTTGGC AGTGCTTAAA ATTGAAGCAA CCAACCTGCC CGTGATTGAA
TTCGGGGATT CGGATAAAAT AAGAGTCGGT GAGCTTGCAG TTGCCATAGG CAATCCCGGA
GGACTTGAAT ACATGGGTTC GGTTACCGTG GGTGTAATAA GCGGTCTTAA CAGGACAATA
CCTATAACCG ACGGCAAGGA ACTGAAGCTG ATACAGACAG ATGCCGCAAT AAATCCCGGA
AACAGCGGCG GTGCTCTCGT TAATGCCGAA GGAAAGTTAA TTGGTGTCAA TACTGCAAAA
ATCGGCGGAC AGGGCTATGA AGGACTTGGT TTTGCAATAC CTGTAAACAA AGCAAAGGAA
ATAACCGACA GCCTTATTCA GTACAAGTAT GTAAGAGGAA GACCGTCCCT CGGCATACAG
ATAAACAGCG GTTACACCAA GGAAATAGCA GACCGTTACG GACTTCCTGA AGGAGTGCTT
GTTTACAACG TTGAAATATT CAGTGCGGCT TACAAAGCCG GTATTCAAAA GGATGACATA
ATTACGGAGT TTAACGGCGT GAGAGTAAAG AATTATGATG AATTGGAAGA ACAAAAGAAC
AAATACAAAC CCGGAGACAA AGTGAAACTC AAAATACACA GGGACGGAAA AGATATTACC
GTTGAAGTGA CGTTGGATGA GCAAAAATAA
 
Protein sequence
MDELNYNGFS NEENEIKSFG ELNNTTGENV DANINGDVVE NAAQYAAENI IENVDADVGE 
NAAEGFGENV IENAAVGVAD KRVENEKTYE GSFVDLKSAS YYSESYTKPK KRKNSVLLQM
ILVAVMSSIL GGSIVGGFFV FGVPALSPSV QSIFRNTNVQ NGSNDATSGV DTDYYKKVVI
ENNADSSVVV AIAEKVGPSV VGISVKSTTS ISDFWFFTPR DTESQGSGII IRSDGYIMTN
YHVIESALNG RTNTLLPNAS INVILPSDPD TPHPATVVGT DSKTDLAVLK IEATNLPVIE
FGDSDKIRVG ELAVAIGNPG GLEYMGSVTV GVISGLNRTI PITDGKELKL IQTDAAINPG
NSGGALVNAE GKLIGVNTAK IGGQGYEGLG FAIPVNKAKE ITDSLIQYKY VRGRPSLGIQ
INSGYTKEIA DRYGLPEGVL VYNVEIFSAA YKAGIQKDDI ITEFNGVRVK NYDELEEQKN
KYKPGDKVKL KIHRDGKDIT VEVTLDEQK