Gene Cthe_2086 details


Gene Information

Locus tag: Cthe_2086
Symbol:
ID: 4810946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2478446
End bp: 2480965
Gene Length: 2520 bp
Protein Length: 839 aa
Translation table: 11
GC content: 44%
IMG OID: 640107493
Product: peptidase U32
Protein accession: YP_001038486
Protein GI: 125974576
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
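
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2478446
end_bp = 2480965

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and adds no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2520
print(protein_length_aa)   # 839
```

Under these assumptions the result agrees with the listed Gene Length (2520 bp) and Protein Length (839 aa).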


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAGAG ATTTTAAACT GGAATTGCTT GCCCCCTGCG GAGACTGGGA AGCATTTATG 
GCAGCCGTCG AAAACGGTGC GGACGCGGTA TACGTGGGGG GAAAGTTGTT TAATGCAAGG
CAGTATGCTT CAAACTTTGA TGAGGAAAAA ATCAAAGAGG TAATACACTA TGCCCATGTG
CGGGATGTAA ATGTATATCA GACCATGAAC ATCCTGATAA GCGACAGTGA GATGAGAGAG
GCGCTCAAAG CATTGGAGCG GTCGTACCTT GCAGGTATTG ACGGGGTGAT AGTCCAGGAT
ATCGGACTGG CGAGTTTGAT AAGAAAGCTG TATCCGGATC TTGCACTTCA CGCAAGCACA
CAGATGACAG TATACAATTT GCAGGGCGTA AAGCTGCTCG AGGAACTGGG ATTTAAAAGG
GTTGTGCTTG CAAGGGAGTT GTCGCTGGAG GAAATACAAT ATATTACTGA AAATACTTCA
CTGGAGGTGG AAGTGTTTGT TCATGGGGCG TTGTGTGTCT GCTATTCGGG ACAATGCCTT
ATGAGCAGTA TTATTGGAGG AAGAAGCGGA AACCGCGGAA AATGTGCCCA GCCCTGCAGG
CTTCCGTATC AGCTTCTGGA AGTTGGCGAA GGAAGCGGTC TGCCTCAAAG AAAAGCGAAC
AGAGGGTATT TTATGAGCCC AAAAGACCTG TGCTCTGTTG ATATTTTGGA TAAAATTATA
AAAAGTGGTG TAAAATCGCT TAAAATTGAA GGCAGAATGA AAAGCGCCGA GTATGTGGCC
ACCGTGGTGA GGATTTACAG AAAATATCTT GACAGGCTGT TTGAGAGTAC GGACAGTCGT
AATGAAGGTA TTGTGGAAAA GGATATGAAG GACCTTCTCC AGATATTCAA CCGGGGGGGC
TTCTCAAAAG GATATCTGGA AGGAAAAACG GGAAAAGATA TGATGAGCTT TGAGAAGCCT
AAAAACTGGG GAATATACGT GGGAAAAGTA ATGGCCTGTG ACAGGGCGCA AGGCAGCATA
AAAATAAAAC TTGAGGAACC TTTAAGCCTT GGTGACGGGA TAGAGGTGTG GAACGGTGAG
GATGAAAGCC CGGGAACAAT TGTAACGTCA ATCCGGGTAA ACGGCAAGGC AGTGACGGAA
GCACTGCCGC AGCAGGTGGT TGAGGTAAGA AACGTCAAAG GCAGGATAAA CAAGGGAAAC
AAAGTTTACA AGACGTCCGA CAAAAAGCTT AATGCTTCTG CCAGAGAATC TTTCACCGGA
AAATTCAAAA AGAGAATTCC CATTGAAGGA AGGATTACTG TGGCGGGAGG CAAACCTCTG
TCAATTATTG TGAAGGATTA TGAGGGAAAC AAAGTTGAAG TCAAGTCCTC ATACGTGCCT
GAGAAAGCTC TGACAAGTCC CGTTACCGAA GAGAAAGTTT TGAAACAGGC GGCAAAAACC
GGACAGACTC CTTTTGAATT TAAAGAATTG CTCGCCGATG TGGAAGACGG TTTGTCCGTA
CCCGTAAGTG AAATCAACAA TATTCGGCGT CATGCACTAA ATCAGCTGGA GATAAAAAGA
ACCGACAGAT ATCCCTTAAG AAAGCCGGGA AATTTGCAAG AAAAATTGGA GGATGTGATG
CATTTCCCGG GAAATAGTCG AAACGGGGAG GAAAAAAATT TAAAAATTTC GGCATGTTTT
TACAAAGACA TGGCCGGGCT TGAATATGAA AGCCTTGGAG TGGATCGCAT CTACCTTCCT
TTCAGCATGT TTGTAAAGGA AAACAAAGAA AGGATTTTGA GCATTAAAGA AAATGCAGAG
CTGTTTGTAT TTATTCCCCC GGTAACCAGG GGAAATTATG ACAAGCTGAT AAAATCCAGG
CTTGATGATA TTGTAAATAT GGGAATTGAC GGAATTCTTG CGGGGAACCC CGGCACTGTG
AAATATGCCG GAGCATACCC AAAAATCCGT ATTATGGGGG ACTTTTCTCT GAACATATTT
AACAGTGTTT CAATAAAAAC TCTCAAGGAT ATGGGGCTTA ACGGGGCGAC TTTGTCCTGC
GAGCTTAATT TGAATCAGAT AAGGGAGATG GGGAAGTTTC CGGATTTTGT GGAAGAAGTG
CTGGTATACG GAAGAATACC CCTTATGATC AGTGAGTATT GTCCGGTTGG GAGCATAAAA
GGCAATTTCG GCAAAAACTC CAGATGCAGC ATGCCTTGCA AAGACAAAGA TTTTTACCTT
GTGGACAGAA TGAACATGAA ATTTCCCGTC CTGTGCGACA GGATTGACTG CAGAAGCATG
ATTTTCAACG CAAAAGTATT GCTGCTTTCA GATACTGTTG ATAGAATTAA AACATTGGGT
ATTGATATGG TACGGCTTAA TTTTACGGAT GAAAATCCAA AAGAAGTAAA AGACATAGTG
AAAATGCACA GGGATCTTTT AAATAACGGT TCCGGGGCGT TAGACTCTTA TAAGCAGTTG
ATTGATAAAA TAAAAAGCAG AGGCTTTACA AAAGGGCATT TCCCAAGGGG TGTCCAGTAA
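
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as plain text in the 10-base blocks shown above; `gc_content` is a hypothetical helper name, and the short strings in the usage lines are toy inputs, not data from this record.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (block spacing, line breaks) is ignored and case is normalised.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy inputs to show the behaviour; paste the full gene sequence to
# compare against the GC content field of the record.
print(gc_content("ATGC"))        # 50.0
print(gc_content("GGCC AATT"))   # 50.0
```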
 
Protein sequence
MTRDFKLELL APCGDWEAFM AAVENGADAV YVGGKLFNAR QYASNFDEEK IKEVIHYAHV 
RDVNVYQTMN ILISDSEMRE ALKALERSYL AGIDGVIVQD IGLASLIRKL YPDLALHAST
QMTVYNLQGV KLLEELGFKR VVLARELSLE EIQYITENTS LEVEVFVHGA LCVCYSGQCL
MSSIIGGRSG NRGKCAQPCR LPYQLLEVGE GSGLPQRKAN RGYFMSPKDL CSVDILDKII
KSGVKSLKIE GRMKSAEYVA TVVRIYRKYL DRLFESTDSR NEGIVEKDMK DLLQIFNRGG
FSKGYLEGKT GKDMMSFEKP KNWGIYVGKV MACDRAQGSI KIKLEEPLSL GDGIEVWNGE
DESPGTIVTS IRVNGKAVTE ALPQQVVEVR NVKGRINKGN KVYKTSDKKL NASARESFTG
KFKKRIPIEG RITVAGGKPL SIIVKDYEGN KVEVKSSYVP EKALTSPVTE EKVLKQAAKT
GQTPFEFKEL LADVEDGLSV PVSEINNIRR HALNQLEIKR TDRYPLRKPG NLQEKLEDVM
HFPGNSRNGE EKNLKISACF YKDMAGLEYE SLGVDRIYLP FSMFVKENKE RILSIKENAE
LFVFIPPVTR GNYDKLIKSR LDDIVNMGID GILAGNPGTV KYAGAYPKIR IMGDFSLNIF
NSVSIKTLKD MGLNGATLSC ELNLNQIREM GKFPDFVEEV LVYGRIPLMI SEYCPVGSIK
GNFGKNSRCS MPCKDKDFYL VDRMNMKFPV LCDRIDCRSM IFNAKVLLLS DTVDRIKTLG
IDMVRLNFTD ENPKEVKDIV KMHRDLLNNG SGALDSYKQL IDKIKSRGFT KGHFPRGVQ