Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2137 |
Symbol | |
ID | 4811184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2535141 |
End bp | 2537513 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107541 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001038534 |
Protein GI | 125974624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGTCA TGAGTGTCAT GCCTTTACCT AAGAGTTTTG CTGCAAACCA GGTTCTGACC GTAGATTTGG CAGCAGATAC AGGGGAAATT TGTTATGGTG CCATTGGTGG TCTTTATGCA ATGGGTAGCC CGGGCGTACC TACAGATAAC GTAATTGTTC CTTTGGGAAT GAAGGCTATT TCGCAAAAGG CTCCTGACGG ACTTCAACAT CCTACCGGTG ACGCGCTTAA GGTTGCTCCA CAGTTTATTG AAGCCGGTGG CGAATATGTT ATGATAATGA TGCAGGATAT ATACAGGAAC TGGCCCTATG AAGATCTTGG TATTAATGAC TATCTTGCGA AAATTGAGAC AATATGCAGA AAAGTTGTTG CAGATCCATA CCGTCACAAG TATGTATATG TTCCGATTAA TGAGCCTGAA TGGATTTGGT ACAGGGGAAA TATGACTAAG TTGTGTAACG AGTGGAAAAT GATGTACGAT AAAATCCGCT CAATTGACCC CACGGCTAAG ATTGCAGGAC CTAACTATGC AGTATACAAC AGTTCGGCTT ATCGTCAATT CATGACCTTC TGTAAAAACA ACAATTGTTT GCCGGATATA GTGACATGGC ATGAATTGGA TGATGGATTC TTTTCAAACT GGTATAACCA CTATAATGAT TACAGGAACA TTGAGAAAAG CCTCGGAATT TCGCCAAGAC CGATAAACAT AAACGAATAT GGCAGAATCA ACGTAGACGG AGGTATTCCC GGAAATCTCG TGCAATGGAT AGCACGCTTT GAAAACAGCA AAGTGTATGC TTGTCTTGCC TATTGGACGA CAGCAGGAAC CTTGAATGAC CTTGTAACTC AGAACAATAA GGCAACCGGT GCATGGTGGC TGTATAAATG GTATGGAGAA CTTACGGGAC ACACCGTACA GGTAACTCCG CCAAGCTTAA ACGGATCGCT TCAGGGCTTG GCTGCTCTGG ACAGAAACAA AAAACAGGCC CGTGTTATTT TCGGTGGGTC ACTGAGAAGC ACTGACGTAT TTAATACTGA TGTAGTAGTT AAAGGTTTCA ATTCTCACTC CTACTTTGGA AACTCAGTTC ATGTAATTGT TTGGGGAGTG GACAATACCG GTACCAATCC TTCCAGTGGG CCATACCTCG TACATGAAGG TGACTACAAC ATTTCCAACG GACAGATTAC GGTAACTGTC AATAATATGA AAGCATTATC TGCATATCAT ATGATAATAA CTCCTAATAC AGACCTGTCG CCCGCCAATA ATAGAAATCG CTATGAAGCA GAGTATGCAA GAATTTTGGG AACAGCAACC GTTTCTCACG GCGGTCATTC CGGTTATTCC GGGACAGGCT TTGTTGAAGG ATATGCCGGA AGCAATAATG CGAGCACCAA TTTTGTAGTA ACTGCCGAAA CAGACGGATA CTACAATGTA ACCTTGAGAT ATTCTGCCGG TCCTTACCCG GGAGCACCTA AAACCAGATA TTTAAGGATG GTTGTGAACG GCGGGCTCCA TAAGGATGTT GCTTGTATCC AGACTGCTAA TTGGGATACA TGGGAAAGCA CCACTGTCAA GGTATTCTTG CAGGCAGGTA TCAACCGTCT GGATTTCAAG GCTTTTGCTT CGGATGAATC AGACTGTGTA AATATAGACT ATATTGATGT TGAACCCACA TCCGGAACCA TTAACGTTTA TGAAGCCGAG GATCCTGCAA ACACACTGGG TGGAGCGGCT GTAAGACAAA GAGATAATGC TGCGTCAGGC GGACAATATG TAGGCTGGAT TGGCAATGGT TCTAATAATT ATCTCCAATT CAACAACGTT TATGTTCCGC AGGCAGGTAC ATACAGGATG GTAGTTCAAT TTGCAAATGC GGAAGTATTT GGTCAGCACT CTTATAACAA TAATGTAGTT GACAGATATT GCAGTATTAG TGTAAACGGA GGACCCGAAA AAGGGCATTA TTTCTTCAAC ACCCGTGGAT GGAATACATA TCGTACAGAT ATAATAGACG TATATTTGAA TGCCGGAAAC AACACAATCA GATTTTATAA CGGCACATCG GGAAGTTATG CACCGAATAT TGATAAAATA GCAATTGCCG CTCCCTTTGA AGGAGGAACC GAACCAACTC CACCAGAAGA GGATTTTGTA TATGGAGATG TAGATGGAAA CGGCACGGTT AATTCAACAG ATGTAAACTA TATGAAACGG TATTTATTAA GGCAAATTGA AGAGTTCCCC TATGAAAAAG CTTTAATGGC AGGAGATGTG GATGGAAACG GCAATATTAA TTCGACAGAC TTGTCTTATT TGAAAAAATA TATATTAAAA CTCATATCAG CATTCCCGGC AGAAACTAAC TAG
|
Protein sequence | MLVMSVMPLP KSFAANQVLT VDLAADTGEI CYGAIGGLYA MGSPGVPTDN VIVPLGMKAI SQKAPDGLQH PTGDALKVAP QFIEAGGEYV MIMMQDIYRN WPYEDLGIND YLAKIETICR KVVADPYRHK YVYVPINEPE WIWYRGNMTK LCNEWKMMYD KIRSIDPTAK IAGPNYAVYN SSAYRQFMTF CKNNNCLPDI VTWHELDDGF FSNWYNHYND YRNIEKSLGI SPRPININEY GRINVDGGIP GNLVQWIARF ENSKVYACLA YWTTAGTLND LVTQNNKATG AWWLYKWYGE LTGHTVQVTP PSLNGSLQGL AALDRNKKQA RVIFGGSLRS TDVFNTDVVV KGFNSHSYFG NSVHVIVWGV DNTGTNPSSG PYLVHEGDYN ISNGQITVTV NNMKALSAYH MIITPNTDLS PANNRNRYEA EYARILGTAT VSHGGHSGYS GTGFVEGYAG SNNASTNFVV TAETDGYYNV TLRYSAGPYP GAPKTRYLRM VVNGGLHKDV ACIQTANWDT WESTTVKVFL QAGINRLDFK AFASDESDCV NIDYIDVEPT SGTINVYEAE DPANTLGGAA VRQRDNAASG GQYVGWIGNG SNNYLQFNNV YVPQAGTYRM VVQFANAEVF GQHSYNNNVV DRYCSISVNG GPEKGHYFFN TRGWNTYRTD IIDVYLNAGN NTIRFYNGTS GSYAPNIDKI AIAAPFEGGT EPTPPEEDFV YGDVDGNGTV NSTDVNYMKR YLLRQIEEFP YEKALMAGDV DGNGNINSTD LSYLKKYILK LISAFPAETN
|
| |