Gene Cthe_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2137 
Symbol 
ID4811184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2535141 
End bp2537513 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content42% 
IMG OID640107541 
Productcellulosome enzyme, dockerin type I 
Protein accessionYP_001038534 
Protein GI125974624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGTCA TGAGTGTCAT GCCTTTACCT AAGAGTTTTG CTGCAAACCA GGTTCTGACC 
GTAGATTTGG CAGCAGATAC AGGGGAAATT TGTTATGGTG CCATTGGTGG TCTTTATGCA
ATGGGTAGCC CGGGCGTACC TACAGATAAC GTAATTGTTC CTTTGGGAAT GAAGGCTATT
TCGCAAAAGG CTCCTGACGG ACTTCAACAT CCTACCGGTG ACGCGCTTAA GGTTGCTCCA
CAGTTTATTG AAGCCGGTGG CGAATATGTT ATGATAATGA TGCAGGATAT ATACAGGAAC
TGGCCCTATG AAGATCTTGG TATTAATGAC TATCTTGCGA AAATTGAGAC AATATGCAGA
AAAGTTGTTG CAGATCCATA CCGTCACAAG TATGTATATG TTCCGATTAA TGAGCCTGAA
TGGATTTGGT ACAGGGGAAA TATGACTAAG TTGTGTAACG AGTGGAAAAT GATGTACGAT
AAAATCCGCT CAATTGACCC CACGGCTAAG ATTGCAGGAC CTAACTATGC AGTATACAAC
AGTTCGGCTT ATCGTCAATT CATGACCTTC TGTAAAAACA ACAATTGTTT GCCGGATATA
GTGACATGGC ATGAATTGGA TGATGGATTC TTTTCAAACT GGTATAACCA CTATAATGAT
TACAGGAACA TTGAGAAAAG CCTCGGAATT TCGCCAAGAC CGATAAACAT AAACGAATAT
GGCAGAATCA ACGTAGACGG AGGTATTCCC GGAAATCTCG TGCAATGGAT AGCACGCTTT
GAAAACAGCA AAGTGTATGC TTGTCTTGCC TATTGGACGA CAGCAGGAAC CTTGAATGAC
CTTGTAACTC AGAACAATAA GGCAACCGGT GCATGGTGGC TGTATAAATG GTATGGAGAA
CTTACGGGAC ACACCGTACA GGTAACTCCG CCAAGCTTAA ACGGATCGCT TCAGGGCTTG
GCTGCTCTGG ACAGAAACAA AAAACAGGCC CGTGTTATTT TCGGTGGGTC ACTGAGAAGC
ACTGACGTAT TTAATACTGA TGTAGTAGTT AAAGGTTTCA ATTCTCACTC CTACTTTGGA
AACTCAGTTC ATGTAATTGT TTGGGGAGTG GACAATACCG GTACCAATCC TTCCAGTGGG
CCATACCTCG TACATGAAGG TGACTACAAC ATTTCCAACG GACAGATTAC GGTAACTGTC
AATAATATGA AAGCATTATC TGCATATCAT ATGATAATAA CTCCTAATAC AGACCTGTCG
CCCGCCAATA ATAGAAATCG CTATGAAGCA GAGTATGCAA GAATTTTGGG AACAGCAACC
GTTTCTCACG GCGGTCATTC CGGTTATTCC GGGACAGGCT TTGTTGAAGG ATATGCCGGA
AGCAATAATG CGAGCACCAA TTTTGTAGTA ACTGCCGAAA CAGACGGATA CTACAATGTA
ACCTTGAGAT ATTCTGCCGG TCCTTACCCG GGAGCACCTA AAACCAGATA TTTAAGGATG
GTTGTGAACG GCGGGCTCCA TAAGGATGTT GCTTGTATCC AGACTGCTAA TTGGGATACA
TGGGAAAGCA CCACTGTCAA GGTATTCTTG CAGGCAGGTA TCAACCGTCT GGATTTCAAG
GCTTTTGCTT CGGATGAATC AGACTGTGTA AATATAGACT ATATTGATGT TGAACCCACA
TCCGGAACCA TTAACGTTTA TGAAGCCGAG GATCCTGCAA ACACACTGGG TGGAGCGGCT
GTAAGACAAA GAGATAATGC TGCGTCAGGC GGACAATATG TAGGCTGGAT TGGCAATGGT
TCTAATAATT ATCTCCAATT CAACAACGTT TATGTTCCGC AGGCAGGTAC ATACAGGATG
GTAGTTCAAT TTGCAAATGC GGAAGTATTT GGTCAGCACT CTTATAACAA TAATGTAGTT
GACAGATATT GCAGTATTAG TGTAAACGGA GGACCCGAAA AAGGGCATTA TTTCTTCAAC
ACCCGTGGAT GGAATACATA TCGTACAGAT ATAATAGACG TATATTTGAA TGCCGGAAAC
AACACAATCA GATTTTATAA CGGCACATCG GGAAGTTATG CACCGAATAT TGATAAAATA
GCAATTGCCG CTCCCTTTGA AGGAGGAACC GAACCAACTC CACCAGAAGA GGATTTTGTA
TATGGAGATG TAGATGGAAA CGGCACGGTT AATTCAACAG ATGTAAACTA TATGAAACGG
TATTTATTAA GGCAAATTGA AGAGTTCCCC TATGAAAAAG CTTTAATGGC AGGAGATGTG
GATGGAAACG GCAATATTAA TTCGACAGAC TTGTCTTATT TGAAAAAATA TATATTAAAA
CTCATATCAG CATTCCCGGC AGAAACTAAC TAG
 
Protein sequence
MLVMSVMPLP KSFAANQVLT VDLAADTGEI CYGAIGGLYA MGSPGVPTDN VIVPLGMKAI 
SQKAPDGLQH PTGDALKVAP QFIEAGGEYV MIMMQDIYRN WPYEDLGIND YLAKIETICR
KVVADPYRHK YVYVPINEPE WIWYRGNMTK LCNEWKMMYD KIRSIDPTAK IAGPNYAVYN
SSAYRQFMTF CKNNNCLPDI VTWHELDDGF FSNWYNHYND YRNIEKSLGI SPRPININEY
GRINVDGGIP GNLVQWIARF ENSKVYACLA YWTTAGTLND LVTQNNKATG AWWLYKWYGE
LTGHTVQVTP PSLNGSLQGL AALDRNKKQA RVIFGGSLRS TDVFNTDVVV KGFNSHSYFG
NSVHVIVWGV DNTGTNPSSG PYLVHEGDYN ISNGQITVTV NNMKALSAYH MIITPNTDLS
PANNRNRYEA EYARILGTAT VSHGGHSGYS GTGFVEGYAG SNNASTNFVV TAETDGYYNV
TLRYSAGPYP GAPKTRYLRM VVNGGLHKDV ACIQTANWDT WESTTVKVFL QAGINRLDFK
AFASDESDCV NIDYIDVEPT SGTINVYEAE DPANTLGGAA VRQRDNAASG GQYVGWIGNG
SNNYLQFNNV YVPQAGTYRM VVQFANAEVF GQHSYNNNVV DRYCSISVNG GPEKGHYFFN
TRGWNTYRTD IIDVYLNAGN NTIRFYNGTS GSYAPNIDKI AIAAPFEGGT EPTPPEEDFV
YGDVDGNGTV NSTDVNYMKR YLLRQIEEFP YEKALMAGDV DGNGNINSTD LSYLKKYILK
LISAFPAETN