Gene Cthe_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1421 
Symbol 
ID4809082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1740053 
End bp1741033 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content41% 
IMG OID640106844 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_001037845 
Protein GI125973935 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000360798 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AACAGCTAAT CGGTTTAATT GTGGCAGGAG TAGTATTTGT TTTTGTTTGT 
TCTTCAAGTG TTTTGGTAAA CACCGTTTCA AAAAGACTCG GAACGTCTCT GAACTTAAGC
GACAGCGAAA GCAGTCTTCC TCTTACTCCC TATATAGGTG TTGTAAGTGT TGAAGGCACC
ATTATGGACA GCAACTCAAC CACAAGTTTC TTAAGCAACG GTTACAACCA CAAAGAAACG
TTGAAGCTCA TTGAGGATAT GAAAAATTCA GCAAGCAACA AAGGCATTCT TTTGTATGTG
AATTCCCCCG GCGGAGGCGT TTATGAAAGT GATGAATTGT ATTTGAAGTT GAAAGAATAC
AAAGAAGAAA CCGGAAGGCC GGTCTGGACC TATATGTCAA ATCAGGCATG TTCCGGTGGC
TATTATATTT CTATGGCATC CGACAAAGTA TTTTCAAACC GAAACGCATG GACCGGTTCC
ATCGGCGTCA TCATCTCCCT GACAAACCTC AAAGGCTTGT ACGATAACCT TGGAATTAAA
GGTATTTATA TTACAAGCGG CAGAAACAAA GCAATGGGTG CTGCCGATCT GGAATTGACA
GATGAGCAGC GTGATATACT TCAAAGCCTT GTGGATGAGG CATATGAGCA ATTTGTTGAA
ATTGTGGCGG AAGGCAGAAA AATGACAGTG GAAGAAGTAA AAAGAATTGC CGATGGAAGA
ATTCTTTCCG CAAAACAGGC ACTCGAGTTG AACCTCATTG ATGAAATTGC CACGTATGAT
GAAGTAAAAG AAGCTTTCAG CGCAGAGCTT GGAAATGTTA AAATATATAC ACCCAAAAAG
AAAGACCCGT TTGGACTTAG CTCTTTGTTC AGCTATATAA ACAGCTTGAA ACCTCGCTCT
GATACTGAAA TAATAGCCGA GTTGATAAAG GCTAAAGGAA ATGGGGTGCC GATGTATTAT
GCAATGCCGG GACAATACTA A
 
Protein sequence
MNKKQLIGLI VAGVVFVFVC SSSVLVNTVS KRLGTSLNLS DSESSLPLTP YIGVVSVEGT 
IMDSNSTTSF LSNGYNHKET LKLIEDMKNS ASNKGILLYV NSPGGGVYES DELYLKLKEY
KEETGRPVWT YMSNQACSGG YYISMASDKV FSNRNAWTGS IGVIISLTNL KGLYDNLGIK
GIYITSGRNK AMGAADLELT DEQRDILQSL VDEAYEQFVE IVAEGRKMTV EEVKRIADGR
ILSAKQALEL NLIDEIATYD EVKEAFSAEL GNVKIYTPKK KDPFGLSSLF SYINSLKPRS
DTEIIAELIK AKGNGVPMYY AMPGQY