Gene Cthe_2513 details

Gene Information

Locus tag: Cthe_2513
Symbol:
ID: 4809269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2982057
End bp: 2983235
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 47%
IMG OID: 640107929
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_001038908
Protein GI: 125974998
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
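
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names span_length and protein_length_from_cds are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 2982057
end_bp = 2983235
gene_length_bp = 1179
protein_length_aa = 392


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length_from_cds(computed_gene_length)

# Compare the computed values with the values listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length values.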


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGACA ATGGCAGAAG GAAAAACGGT TTTTTCACCA TGTTTTTAAC TTCACTTCTT 
ACATCTCTGG GTGTTGGAAT ATTGTTGATT TTTGCTGTTA CAGGTTTTGT AACGGGAAAC
CGCAGTACAG CTCCCCAAAC GCCTCAGGGG GGTTCGGGCA TTGAAATGCC CAGTCCGCAG
CAGATTTCGG CGGAACAATC CAATAATACG GGCGGACAAA ATACCATACA GAGTGTTGCG
GAAAATGCGT CAAAAGCTGT GGTAGGCATT TCCGTACTCA AAGTTGACAG CAGCTCAATA
TTCAATCCCA ATGCTGTCGA ACGGTGGGGT GTGGGTTCGG GAGTAATAGT AACCCCAAAT
GGTTATATTC TTACCAATCA CCATGTGGCT GGAGGAAAAA GCAAAAGGAT TGTTGTTTCG
CTGGTCGACG GGAAAAACCT GGACGGAGTG ACTGTCTGGT CAGATTCGGT ATTGGATTTG
GCGGTGGTAA AGATAGAAGC GGAAGGACTT CCTACAATAC CGCTGGGTGA TGCCACAAAA
CTTAAAGTAG GAGAGCCTGC CATTGCCATC GGCAATCCTC TTGGACTTCA ATTCCAGAGA
ACGGTCACAT CGGGTATTAT CAGTGCGCTT AACAGAACTA TAGAGGTTGA CACCGAACAG
GGCACAAATT ACATGGAAGG CCTTATTCAA ACCGATGCCA GCATAAATCC CGGAAACAGC
GGCGGACCGC TTTTGAACCT GAAAGGTGAA GTGGTGGGAA TTAATACGGT AAAAGTGGCC
AGTGCGGAGG GAATAGGCTT TGCCGTACCC ATTAATGTGG CAATTCCCAT AATCAACAAG
TTTGCAACTA CAGGAGAGTT TATTGAGCCA TACCTGGGGG TTTTTGCATA TGACAAAGAC
ATTATTCCTT ACCTTGACGG AAATGTCAAA GTGCAAAACG GGGTTTATGT GGCAAATGTG
GACGAAAACG GACCGGCTTA CAAGAGTGGA ATACGGGTGG GCTGCATCAT GACTCAGATT
GACGGCGAGG AGATAAGCAC GATGATGCAG CTAAGATGCG TCATTTATTC AAAAAAACCG
GGGGATGTGG TGACCATCAG ACATATCAGC AACGGCAAAC CCCAGACCGT CCAAGTCAGG
CTTGCCGCAA AAGAGAAAGA CGGACTTGTG ACAAGGTAG
 
Protein sequence
MFDNGRRKNG FFTMFLTSLL TSLGVGILLI FAVTGFVTGN RSTAPQTPQG GSGIEMPSPQ 
QISAEQSNNT GGQNTIQSVA ENASKAVVGI SVLKVDSSSI FNPNAVERWG VGSGVIVTPN
GYILTNHHVA GGKSKRIVVS LVDGKNLDGV TVWSDSVLDL AVVKIEAEGL PTIPLGDATK
LKVGEPAIAI GNPLGLQFQR TVTSGIISAL NRTIEVDTEQ GTNYMEGLIQ TDASINPGNS
GGPLLNLKGE VVGINTVKVA SAEGIGFAVP INVAIPIINK FATTGEFIEP YLGVFAYDKD
IIPYLDGNVK VQNGVYVANV DENGPAYKSG IRVGCIMTQI DGEEISTMMQ LRCVIYSKKP
GDVVTIRHIS NGKPQTVQVR LAAKEKDGLV TR