Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0729 |
Symbol | |
ID | 4810347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 885954 |
End bp | 887567 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106146 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001037157 |
Protein GI | 125973247 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGT TGAAAAAGTG TACAGGGTTT TTTCTATACG TACTTGTACT TCTCGTAAAT ATAATATCTG TAAGTGCATT AGAGCCACCA CCGATATATG GCGACTCGAA TTCGGACTGC AAGGTAAACT CAACGGACTT GACATTAATG AAAAGGTATC TTCTGCAGCA ATCCATTAGC TATATCAACC TGATTAACGC TGATCTGAAT GGGGATGGTA AAATAAACTC GAGCGACTAC ACATTGTTAA AAAGATATCT TTTGGGATAT ATTGATTCTT TCCCTGTGGA AAACCAGTAT CCTACAACAC CTGAGCCTTC ACCGACACCT ACTCCCGCTG TTGATGAAGA AGCATGGAAA AACAACACCG GTACAATTGA GTTGGGAGAT ACGATTAAAG TCAGCGGTGA AGGTATTTCG GTAAACGGTT CGGTCGTTAC CATTACAGCC GGAGGAGACC ACTTAGTTAC AGGTACTTTA AACAACGGCA TGATTTTTGT CAATACAACC GAAAGGGTTA AGCTGAGACT TAGCGGCGTA AATATAAAAA ATCCAAACGG CCCTGCCATC TACTTCTACA ACGTTGACAA AGGCTTTATC ACAATAGAAA AAGGTACGGT CAATTATCTC TCCGACGGCT CAACATATAC TGATCAGGAT GCAAAAGCAG CTCTTTTCAG TAATGACGAT TTGGAGCTGA AGGGAAAAGG CACTCTCTAC GTTACAGGTA ATTACAAGCA CGGTATTGCA AGTGACGATG ACCTTATTAT TGAAAACGGA GATATTTACG TAACAGCAGT TACCGACGGA TTACACGCAA ACAGCGGCAT AGAAATCAAG GGCGGAAACA TCACTGTTAC GGCAAAATCT GATGCCATTG AAAGCGAAAA AGATTTTGAA ATGACCGGCG GTACCCTCAA TCTCACTGCA GATGACGATG CGATACACTC AGAAAAAGAC CTTGTAATTG ACGATGGAGA AATAAATATA TTAAAATGTT ATGAGGGTAT TGAAAGCAAG ACTACTATTA CAATTAACGG TGGCAAAATA AATATAAACT CAAATGAAGA CGGTCTAAAT GCTGCAAGCG GCCTTTATAT CAATGGCGGT GAACTTTACA TAACTTCAGG ATATGATGGA ATTGACTCCA ACGGACCTAT ATATATCAAT GGAGGATATA TTTTCTCCTT TGGAGGCAAC ATTCCCGAAG GAGGTATTGA TTGTGACTGG AATCCTCTGA TAATCAATGG AGGAACCCTC ATTGCAGCGG GAGGTTCCAA CAGTACTCCT TCAACTTCAA GTACTCAGTG CTCGGTGCTT TTAGGCAGTG GAACGGCAAA CTCCGTTATC AGTATCCAAA GGAACGGGTC TGAAATAATC AGCTTTACGG CTCCAAAGAA TTATCAAAAC ATGGTATTCA GTTCACCGGA TCTCGTATTG AATGCAACTT ATGTTGTATA TAGAAACGGA GTCCAGTCGG TAACCTTTAC CACAAATTCA ATTGTAACCA ACGCCGGAGG TAGTTCCGGA GGATGGTTCC CCGGAGGAGG ATTCCCAGGA GGAGGATTCC CAGGAGGCGG TGGGGGATGG TTCCCAGGCG GCCCAGGATG GTAA
|
Protein sequence | MRMLKKCTGF FLYVLVLLVN IISVSALEPP PIYGDSNSDC KVNSTDLTLM KRYLLQQSIS YINLINADLN GDGKINSSDY TLLKRYLLGY IDSFPVENQY PTTPEPSPTP TPAVDEEAWK NNTGTIELGD TIKVSGEGIS VNGSVVTITA GGDHLVTGTL NNGMIFVNTT ERVKLRLSGV NIKNPNGPAI YFYNVDKGFI TIEKGTVNYL SDGSTYTDQD AKAALFSNDD LELKGKGTLY VTGNYKHGIA SDDDLIIENG DIYVTAVTDG LHANSGIEIK GGNITVTAKS DAIESEKDFE MTGGTLNLTA DDDAIHSEKD LVIDDGEINI LKCYEGIESK TTITINGGKI NINSNEDGLN AASGLYINGG ELYITSGYDG IDSNGPIYIN GGYIFSFGGN IPEGGIDCDW NPLIINGGTL IAAGGSNSTP STSSTQCSVL LGSGTANSVI SIQRNGSEII SFTAPKNYQN MVFSSPDLVL NATYVVYRNG VQSVTFTTNS IVTNAGGSSG GWFPGGGFPG GGFPGGGGGW FPGGPGW
|
| |