Gene Cthe_1857 details

Gene Information

Locus tag: Cthe_1857
Symbol:
ID: 4809408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2201543
End bp: 2202832
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 41%
IMG OID: 640107276
Product: carboxyl-terminal protease
Protein accession: YP_001038271
Protein GI: 125974361
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
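
The Start bp, End bp, Gene Length and Protein Length fields are consistent with each other if the coordinates are read as inclusive on both ends. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, with hypothetical helper names:

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2201543
END_BP = 2202832


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1290 429
```

The two printed numbers match the Gene Length (1290 bp) and Protein Length (429 aa) fields.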


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000520735
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATAAAA ATATTTTTTC ATCGAAAATT CTGCCTTTAA TTTTGGTTGC TTTGCTTTCC 
TCGACTGTAA CTGCATACGG GTTTCTTCAG TGGTATGAAA GAAATCCGAA ATATGTTGTC
CTGTCCGAAG AGGAGGCAAA GGTTTTTGAA AAGAAATCAA ATAATAAATT TGATGCCGAT
TATGCAATAA CCTTTGACAA AAACAGCGTT GATATTGAGA ATATCAAGAA ATACAACAAA
GTAAAAAAGC TCCTCACTTC GCATTATTAT CAAGAAGTTG ACCAGAATCA AATGCTCGAA
GGCGCTATTG CAGGGATGGT AAGTGCTTTG AAAGATCCTT ATACTGTATA TTTTACAAAG
GATCAAATGC AAGTTTTCAC TGAAAGCACA TCCGGCAGTT ATGTGGGTAT AGGTGTGTCG
CTAAATATGG ACTCTGACGG CTTGATGACT GTGGTAGAGG CATTCAACGG ATCCCCGGCA
AAAGAAGCGG GAATAATGCC GGGAGACAAG ATAGTCAAAG TTGACGACCA GGATGTTACA
ACTATAAGTG ACCAGGACTA TATTGTAAGC ATAATTAAAG GTGAGGAAAA TACCAAGGTG
AAGATTACGG TATTTAGGCC TTCAGAAGGC ACATACGTGG ATTTTGACAT AATAAGAAAG
AAGATAAAGA TTGAGAACAT AACCAGTGAA TTAATAGACA AAGATATTGG ATACATAAAA
ATTAATATGT TTGACAGTGA AATTGCAAAA TATTTTGGAG ACCACCTAAA CGGGCTTCTT
GCCAAGAATA TCAAAGGATT GATAATAGAT TTGAGGGATA ATCCCGGCGG AGATTATAAG
CAGGTATGTG CGATAGCGGA CCGGCTGCTT CCGGAAGGAT TGATTGTTTA TACTGAAGAC
CGATTGGGCA ACAGAATTGA AGAAAAATCG GATTCAACGG AGCTTGGCAT GCCTCTGGCA
ATACTGGTGA ACGGCAATAG CGCCAGCGCT TCGGAAATTT TGGCCGGTGC CGTAAAGGAT
CACGATAAAG GAACCCTGAT AGGAACCAGA ACCTTTGGAA AAGGGCTTGT TCAGGCGGTG
GAGCCGCTTG AGGACGGGTC CGGCCTCAAG TTTACCATTG CAAGATACTT TACCCCATCC
GGCGTATGCA TACACCAGGA TGGGATAGAA CCGGACATAG AGGTTAAGCT GGATGAAAAG
TATTCAAACT TGCCTGTTTC ACAAGTGCCA AGAGAAGATG ACACCCAGCT TCAAAAAGCT
GTTGAGGTAA TACACGGACA GATCGACTGA
 
Protein sequence
MNKNIFSSKI LPLILVALLS STVTAYGFLQ WYERNPKYVV LSEEEAKVFE KKSNNKFDAD 
YAITFDKNSV DIENIKKYNK VKKLLTSHYY QEVDQNQMLE GAIAGMVSAL KDPYTVYFTK
DQMQVFTEST SGSYVGIGVS LNMDSDGLMT VVEAFNGSPA KEAGIMPGDK IVKVDDQDVT
TISDQDYIVS IIKGEENTKV KITVFRPSEG TYVDFDIIRK KIKIENITSE LIDKDIGYIK
INMFDSEIAK YFGDHLNGLL AKNIKGLIID LRDNPGGDYK QVCAIADRLL PEGLIVYTED
RLGNRIEEKS DSTELGMPLA ILVNGNSASA SEILAGAVKD HDKGTLIGTR TFGKGLVQAV
EPLEDGSGLK FTIARYFTPS GVCIHQDGIE PDIEVKLDEK YSNLPVSQVP REDDTQLQKA
VEVIHGQID
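
The gene sequence starts with TTG while the protein sequence starts with M. This is expected under translation table 11, where TTG is one of the alternative initiation codons and an initiator codon is read as methionine. The sketch below shows how the two sequences relate. The helper names are hypothetical, and the codon table and start-codon set are written out from NCBI genetic code 11 as an assumption, not taken from this record. It is demonstrated on the first and last 30 bases of the gene sequence only.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Table 11 assigns the same residues
# as the standard code; it differs in which codons may initiate translation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11 (assumption: per the NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS, reading an initiator codon as M and stopping at the first stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "TTGAATAAAA ATATTTTTTC ATCGAAAATT"
last_30 = "GTTGAGGTAA TACACGGACA GATCGACTGA"
print(translate_cds(first_30))  # MNKNIFSSKI
print(translate_cds(last_30))   # VEVIHGQID
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.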