Gene Cthe_2680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2680 
Symbol 
ID4808848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3160920 
End bp3163157 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content40% 
IMG OID640108095 
Productpeptidase S41 
Protein accessionYP_001039072 
Protein GI125975162 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.20871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAC GCCTTTTACT GTTTATTTTA GTGCTTGGTG TCTGTCTTTT TACTTCATGC 
GGCAATTTTG TAAAAACCAA TATTTACTTT TCAGCAGCCG AATCGGCATT TGATTCAGGC
AAATACGAAG ATGCAATAAA GTACTATGAC AAAGTTATAG AAGCAGATTC CGGCAATGCC
ATGGCTTATC TCGGTAAAGG TCTTGCCCTG GATGCTTTGG GAAAATACGA AGAAGCCCTG
GAGTTTTTCG ACAAAGCCAT TGAAATCAAC AAAGATTTGG CAAAAGCCTA TAATGCCAAA
GGCACCACTT TAGCCAGTCT TGAGAGGTAT GAGGAATCTC TTGAAAATTT TAAAAAAGCA
GCGGAATTGA AACCAAAAAA CAGTGCCTAT CAAAATGATG TGGCATATGG CTTAAACAAT
CTCGGCAGAT TTGAGGAAGC AATTCAATAT GCCGAAAAGG CACTCAAACT TAATCCACGC
AGCGGTGTTG CCTACTCAAA CAAAGGTTTT GCCCTTGACG CTCTGGGAAA ATTGGATGAA
GCCATCGAAT GCTATGATAA AGCAATAGAA CTTAGTCCAA CCTATACCAA TGCCTACTAC
AACAAGTCCA TTGCAGTTTT CAAAATGGGC AAAACAGAAG AGGCCATAGA ACTTTTGGAC
AAAGTACTGG AAATTGACCC CGACGACTTA GATGCCATAA CTTCAAAAGG TTACTGTCTA
AATGAACTTG GAAAATATGA AAAGGCAATA GAGTGCTTTG ACACTGCAAT CGAAAAATAT
CCCAAAGATC CATACCCGTA TGTTTGCAAA GCCACTTCCC TTTATTATCT GGGAAAATAT
GACAACGCTC TCGAAGAGTG CAACAAAGCC ATCAAGTTAG AGTACACCTT TCCTGATTCC
TATATATGGA AAGCCAAGAT TCTTGTTGAA AAGGGAGACA TTGAAGAGGC CAGAAAATCG
TGCGATGAAT TTCTGGCTAT TGCCGAGGAT GCTTCTGTTT ACGATATGAA AGGTCAAATA
TATTTACACG AGTATAACTA CCCGGAAGCA ATAAAGCTCT TTGACAAAGC AATAGAAGTT
GATCCATCCT ATGAGGACTC TTATATCAAT AAAATCTATT GCCTGTATCT GCAGAAAAAT
TACAAAGAGT GCATAGAATT TGCCACAAAG GTGCAAACCA TTTTCCCAAA TTCCGCAGAC
ATTCCCTGGT ATATCGGGGA TTGTTACAGC ATAATGATGG AACCGGAAAA GGCTATTGAA
TACCTCAAGA AGGCCCACGA ACTAAACCCG AAAGATGTCG GCATTTTAAC CTCCATTGCG
TGGGAATACT ATAGCCTTGA GGATTACGCA AAAGCATCCG AATATGCCGA AAAGGCTGCC
GAAATATCTG CCGATGACGA AAGTGTAAAG TACATCAGGG AAAAACTGGA AAATCAAAAA
CTTCCCGAAG CAGAGCAAAT AGTTGAGTTT GTTAAAAACA ATTACTTGTA CTATGACAAA
ATAGCAAACT TTGAAGCCCT TGCAAATGAA TTCAAAGCCA AAGGCGAAGT TGGTGTAAAA
GACATATGCA ACTTTATAGA AAGCATAAGG CAAAAAGATG ATATGTTTAC TTTCGTAATT
CACGGTGACG ACTATGATTT GTTAAAGTAC GAGGAAAGTA TTTCCCAGGT AACTTCACAG
CAACTTGAAC CAAACATACA CTATATAAAA ATTAAATCTT TCACTGCAAG CGTCAGCTGG
GAATTCAAAG AAATCATTGA CTCAATTGAA AATCCCGAGG AAAAAGTTTT GGTTATTGAC
TTAAGAGACA ACCTGGGAGG CCTTGCAACT TCGTCCGCCG ACATACTGGA CTACCTTTTG
CCGGCATGTA CCACAAGCTA CATAGTCTAC AGAGACGGAT ACATGTATTC ATACTATTCT
GATGCCGCCC AGACAAAGTT CAAGAAAATT CTCGTCCTGG TCAATGAATA TTCTGCGAGC
AGCTCGGAAA TTCTTGCCTT AGGGCTTAAA AAACACTTAA ACAATGTTGT TATAATCGGC
CGTCCCACCG TGGGTAAAGG CGTCGGACAA CTGGTTTATG AAAACAAATC CAAAAAATAC
ATGATTTATC TGGTAAGCTT TTATTGGAAT GTCATGGAAG AAAACATATT GGGAAAAAGA
ATCGAGCCTG ATGTGTATGT AAACAGTTCC AGTGACGCCG CATACATGAA CGAAGTAAAG
CGCCAGGCTG CCAGGTAA
 
Protein sequence
MKKRLLLFIL VLGVCLFTSC GNFVKTNIYF SAAESAFDSG KYEDAIKYYD KVIEADSGNA 
MAYLGKGLAL DALGKYEEAL EFFDKAIEIN KDLAKAYNAK GTTLASLERY EESLENFKKA
AELKPKNSAY QNDVAYGLNN LGRFEEAIQY AEKALKLNPR SGVAYSNKGF ALDALGKLDE
AIECYDKAIE LSPTYTNAYY NKSIAVFKMG KTEEAIELLD KVLEIDPDDL DAITSKGYCL
NELGKYEKAI ECFDTAIEKY PKDPYPYVCK ATSLYYLGKY DNALEECNKA IKLEYTFPDS
YIWKAKILVE KGDIEEARKS CDEFLAIAED ASVYDMKGQI YLHEYNYPEA IKLFDKAIEV
DPSYEDSYIN KIYCLYLQKN YKECIEFATK VQTIFPNSAD IPWYIGDCYS IMMEPEKAIE
YLKKAHELNP KDVGILTSIA WEYYSLEDYA KASEYAEKAA EISADDESVK YIREKLENQK
LPEAEQIVEF VKNNYLYYDK IANFEALANE FKAKGEVGVK DICNFIESIR QKDDMFTFVI
HGDDYDLLKY EESISQVTSQ QLEPNIHYIK IKSFTASVSW EFKEIIDSIE NPEEKVLVID
LRDNLGGLAT SSADILDYLL PACTTSYIVY RDGYMYSYYS DAAQTKFKKI LVLVNEYSAS
SSEILALGLK KHLNNVVIIG RPTVGKGVGQ LVYENKSKKY MIYLVSFYWN VMEENILGKR
IEPDVYVNSS SDAAYMNEVK RQAAR