Gene Cthe_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1446 
Symbol 
ID4810596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1764431 
End bp1766968 
Gene Length2538 bp 
Protein Length845 aa 
Translation table11 
GC content36% 
IMG OID640106868 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_001037869 
Protein GI125973959 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0617491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTTATT TTTGTCACGC TCCAAACGAC AGAGCTATTG CGGAAAAAAT ATATTTTTCG 
CTTAAGCAAA CGGGTCTTAC ATGTTGGATT CCGTCACAGG ATATTATGGC CGGTCAATAC
TATGTGGAAG CTATAGCCAA CGCAATTGAA AAATCCGACA TCGTAGTTTT TATTTTTTCT
TCACACTCAA ATACATCCAT ACAAGTAATT GATGAATTGC AGAAAGCATC ATCACTGAAT
AAAACTATTA TTCCTTTCTG CGTTGACAGG GCAATGCCCT CAGAGCCAAT TGAACACTAT
TTGAGCAGCC CGTACAAAGT CGATGCAACA ATCGGTCTTC CAGATGATAA TATTGCAAAG
CTTCAGAACA TTATTAAGCA AATATATAAC CAATCTTCCG TACATAATGC TAATGACATT
AATACTGTTA AAGATACATT TGCGCCATCC AATCCATTAG AAAATAATAC AAACATCATA
AAAGATGCTT CGGATAACAA CAGCCAAGAG TCAATCCAAT TTAACGGGGT AATTAATTCA
GCTAATTCTC AACCTCCACA AAACCCAAAT TTGCCTCCAC AACACCAAAT GAATCCTAAC
TTTAACAACA CAGTCAATAA TATGGACAGA AATATGAACT ACCCAATAAA GCAAAATGTC
CGTCCCAATC CTCAAGTTAA ATCCTCCAAT AAAATCAAAA TTCCCCTTAT TATTGGCGGA
AGTATTGCAG GAGGAATCAT ATTATTGTCA ATTTTATTCA TGCTTGGGAA AAACCTATTA
TTTTCGACAT TAATAAACTC GCACAGACCG TCTGACAATC CAGTAATAGG TTCAAACATT
CGTATTGACG GCAGTAAATA TACCGAAGAA GAAATTATTG AATTTACAAA AAATGCTATT
TTAGAGCTGG ATAGATTGCA AGCCTCTCGT GAATTTTATG ATACCTATGC ATATAACGCA
CCGGAATATG ATCTAATGCG TATTCAGGAA AACGTATTCT TTTCACTTAT GGATTTGGAT
GTGGTGAGAT TTGAGGAAGT AGCTATTTTA GGTAAAAACG AAAAAGAACA CGGCATAGAA
TATCTCCTCA AAGTAGATTT TGTCTGCTAT GCGGATTATG AATATACGGA GAATAATGAA
ACTGTTCACC GCAAAGGAGA AGCATTATTG TACAATAATG TGGTAATACT TGAAACACCA
AATGACGGAT TAAAATACTT ATACATGGAA GGAATATTTG AAGATGAGTT AAATGCAAGA
AATGAGAACT TGGAGTCTTC ATATCGTGAA ATCGCCGAAT CCGGTGATGT AAACACTGAA
GACACCATCC AAACCTACTT GCCTGATTCA ACCGACGAAG ACACCATTAT GTCCGTAAAA
GATATCGTTA AGAAAAATGA CCACAAAGTT GTTGCCGTGT ACGTTGATGT ACCGGGAGGA
CAATCTCAAG GAAGCGGATT TTTTATAAAA GACGGTGTAA TAGTTACCAA TTATCATGTA
ATCGAGGGCG GAAAAAGTGC AAAAATTCTC CTTTCCAACG GAAATTACGT GGATGTGGAA
GGAGTTTTAT ACACGGATTC TGATGTTGAT ATTGCTGTAT TGAAATTGGT AAATGAAGTC
GGCATCGAGC CGGTAACCAT AGGTCAGGCC CGCGATTCCG ATAAAGGAAG TATCGCCGTT
GCTATTGGAT CACCTTTGGG ACTGTTTAAT ACAGTATCCA CAGGAATTAT ATCTAATTTC
TGGGAGGCCA ATGGAGTTAA CTTAATTCAA ATTTCCATCC CTATTACCCA CGGTAACTCC
GGAGGTGCCT TATTCAACGA GTCCGGTAAA TTAATCGGTA TCACATCCTC AGGAATAGGA
GAGGCAAATT TAAACTTCGC AATTTCTTCA ACCCATATAA TACCTATTTG TGAGGATATA
AAAAATATAC CCTATAATCA ATTAAATGCA GTTCCCCTTA GCAGTGCAGG CGGCAATATT
AATTCCATTA CCAGGCAAGC CGGATCTTCC AATAGCAATC AAAGCAGTTC TTCCGGCAAG
AGTCTTTTAT CTTCAAAATA CAGGTTTGTT AATTCCGATA AACCAATATC CAACGATGCA
ACATATACCT CTGACTTACA GACAATATAT GACCTGGCTT TAAACAAGTA TTATGCCAAA
GAAGATTTAT ACAGAGATCT TGATTACGCC TTTGACTATA TTTTGGATTA CGGCACAAAG
TATTATCTGG AATATTTAGA GTATATACTT CATGAAACAT ATATAAAAGA AGATGTAGAA
AACTTTAATA AGATAAATGA ATCTGTTCTG GAAGTGCTTA ACACCACAGG TGAATATATA
GAATTTACAG CGGATGAAAT TGAAATAATA GGTGCCGGAA TAAAAAACGA CGGCACTATT
GACGGTATTA TTGTGAGAGC TTTATTCTAT GAAAATTCGG AGCCTTATGT GAGATCAAAA
TTTTATTTCA GAGCTAACTA TGACGCATGG AGTTATGTTG ATGGCTTCTT GTATGTAGGA
GAAGTTACAG AAAAGTAA
 
Protein sequence
MIYFCHAPND RAIAEKIYFS LKQTGLTCWI PSQDIMAGQY YVEAIANAIE KSDIVVFIFS 
SHSNTSIQVI DELQKASSLN KTIIPFCVDR AMPSEPIEHY LSSPYKVDAT IGLPDDNIAK
LQNIIKQIYN QSSVHNANDI NTVKDTFAPS NPLENNTNII KDASDNNSQE SIQFNGVINS
ANSQPPQNPN LPPQHQMNPN FNNTVNNMDR NMNYPIKQNV RPNPQVKSSN KIKIPLIIGG
SIAGGIILLS ILFMLGKNLL FSTLINSHRP SDNPVIGSNI RIDGSKYTEE EIIEFTKNAI
LELDRLQASR EFYDTYAYNA PEYDLMRIQE NVFFSLMDLD VVRFEEVAIL GKNEKEHGIE
YLLKVDFVCY ADYEYTENNE TVHRKGEALL YNNVVILETP NDGLKYLYME GIFEDELNAR
NENLESSYRE IAESGDVNTE DTIQTYLPDS TDEDTIMSVK DIVKKNDHKV VAVYVDVPGG
QSQGSGFFIK DGVIVTNYHV IEGGKSAKIL LSNGNYVDVE GVLYTDSDVD IAVLKLVNEV
GIEPVTIGQA RDSDKGSIAV AIGSPLGLFN TVSTGIISNF WEANGVNLIQ ISIPITHGNS
GGALFNESGK LIGITSSGIG EANLNFAISS THIIPICEDI KNIPYNQLNA VPLSSAGGNI
NSITRQAGSS NSNQSSSSGK SLLSSKYRFV NSDKPISNDA TYTSDLQTIY DLALNKYYAK
EDLYRDLDYA FDYILDYGTK YYLEYLEYIL HETYIKEDVE NFNKINESVL EVLNTTGEYI
EFTADEIEII GAGIKNDGTI DGIIVRALFY ENSEPYVRSK FYFRANYDAW SYVDGFLYVG
EVTEK