Gene Cthe_3136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3136 
Symbol 
ID4809699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3705344 
End bp3706471 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content37% 
IMG OID640108569 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001039524 
Protein GI125975614 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATAG GAGACCATGG CACACATTGT GCGGGAATAA TTGCAGGTTT TGGACCTAAT 
GTTAAGATTG CCTCACTTAA GCATCTTAGC GGAAATAAAT TTAAAGATTT GGACAATTGG
GTTTCTACTA TGATTAAGGC AATTAATTAT GCCGATGCAA TGGATATTAA AATTGTAAAC
GTTAGTTTGG GATTACATAA ATCTCAAATT GGAGATAGGC CATTTGATTC TCAGGCTCTA
AATGATGCAA TAAGTAATGC AGACTTGTTA TTCGTAACAG CTGCCGGAAA CTTCAATAAA
AATATTGACC TACCGGACGA TGTTATTTAT CCCGCAAGCT GTACCGCTGA AAATATTATT
ACTGTTGCAA ATACTGATAA AGATGATAAA CTTTATGAAT CTTCAAATTA TGGTGTTATT
TCGGTTGACC TTGCTGCCCC CGGGACAGAT ATTCGTAGTA CTATTCCAAC ACATCTTGCA
GGAGAAGGCG GACCTTACGA TATAAAAACA GGTACATCCA TGTCTGCGCC ACATGTAGCA
GGAGCAGCTG CTTTGTTGTT ATCTTCAAAT CCGTCTTTAA CTACACAACA ACTAAAAGAT
TTGATTTTAT CCAGTGTGGA TTTTCTACCG GACTTGCAGG GTAAAGTTGC CACAAGCGGT
AGGTTAAACG TTGCAAAAGC CTTGAGGAAG ATTAGAACTT CTGTCAAAAT TGGAGATATA
GACGGAAATG GAGAAATATC CTCCATTGAT TACGCCATAC TTAAATCACA TTTAATAAAT
TCAAACCTGA CATTTAAACA GTTAGCTGCC GCTGATGTAG ATGGGAATGG ATATGTAAAT
TCCATCGATC TTGCCATACT TCAAATGTAT TTATTAGGCA AAGGTGGCAC GTCAGATATC
GGGAAAAACC GCATATATAC GTATGGCGAC ATTGACAATA ACGGAATAGT AGACGAGAAT
GATTATATAC TGATATGCAA CCATATTAAC GGTACAGGAC AATTATCGGA TGCTAGTCTG
TTTGCTGCAG ATGCTGACGG AAATAATGTT ATAGACCAAA CCGATAGAAT TCTTATAGAA
AAATATATCA CAGGAAGAAT TACTCATCTA CCTGTCGGAA ATCAATAA
 
Protein sequence
MDIGDHGTHC AGIIAGFGPN VKIASLKHLS GNKFKDLDNW VSTMIKAINY ADAMDIKIVN 
VSLGLHKSQI GDRPFDSQAL NDAISNADLL FVTAAGNFNK NIDLPDDVIY PASCTAENII
TVANTDKDDK LYESSNYGVI SVDLAAPGTD IRSTIPTHLA GEGGPYDIKT GTSMSAPHVA
GAAALLLSSN PSLTTQQLKD LILSSVDFLP DLQGKVATSG RLNVAKALRK IRTSVKIGDI
DGNGEISSID YAILKSHLIN SNLTFKQLAA ADVDGNGYVN SIDLAILQMY LLGKGGTSDI
GKNRIYTYGD IDNNGIVDEN DYILICNHIN GTGQLSDASL FAADADGNNV IDQTDRILIE
KYITGRITHL PVGNQ