Gene Cthe_0729 details

Gene Information

Locus tag: Cthe_0729
Symbol:
ID: 4810347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 885954
End bp: 887567
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 42%
IMG OID: 640106146
Product: cellulosome enzyme, dockerin type I
Protein accession: YP_001037157
Protein GI: 125973247
COG category:
COG ID:
TIGRFAM ID:
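
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop and is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(885954, 887567)
print(length)                     # 1614
print(protein_length_aa(length))  # 537
```

Both results agree with the Gene Length and Protein Length values listed in the record.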


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGT TGAAAAAGTG TACAGGGTTT TTTCTATACG TACTTGTACT TCTCGTAAAT 
ATAATATCTG TAAGTGCATT AGAGCCACCA CCGATATATG GCGACTCGAA TTCGGACTGC
AAGGTAAACT CAACGGACTT GACATTAATG AAAAGGTATC TTCTGCAGCA ATCCATTAGC
TATATCAACC TGATTAACGC TGATCTGAAT GGGGATGGTA AAATAAACTC GAGCGACTAC
ACATTGTTAA AAAGATATCT TTTGGGATAT ATTGATTCTT TCCCTGTGGA AAACCAGTAT
CCTACAACAC CTGAGCCTTC ACCGACACCT ACTCCCGCTG TTGATGAAGA AGCATGGAAA
AACAACACCG GTACAATTGA GTTGGGAGAT ACGATTAAAG TCAGCGGTGA AGGTATTTCG
GTAAACGGTT CGGTCGTTAC CATTACAGCC GGAGGAGACC ACTTAGTTAC AGGTACTTTA
AACAACGGCA TGATTTTTGT CAATACAACC GAAAGGGTTA AGCTGAGACT TAGCGGCGTA
AATATAAAAA ATCCAAACGG CCCTGCCATC TACTTCTACA ACGTTGACAA AGGCTTTATC
ACAATAGAAA AAGGTACGGT CAATTATCTC TCCGACGGCT CAACATATAC TGATCAGGAT
GCAAAAGCAG CTCTTTTCAG TAATGACGAT TTGGAGCTGA AGGGAAAAGG CACTCTCTAC
GTTACAGGTA ATTACAAGCA CGGTATTGCA AGTGACGATG ACCTTATTAT TGAAAACGGA
GATATTTACG TAACAGCAGT TACCGACGGA TTACACGCAA ACAGCGGCAT AGAAATCAAG
GGCGGAAACA TCACTGTTAC GGCAAAATCT GATGCCATTG AAAGCGAAAA AGATTTTGAA
ATGACCGGCG GTACCCTCAA TCTCACTGCA GATGACGATG CGATACACTC AGAAAAAGAC
CTTGTAATTG ACGATGGAGA AATAAATATA TTAAAATGTT ATGAGGGTAT TGAAAGCAAG
ACTACTATTA CAATTAACGG TGGCAAAATA AATATAAACT CAAATGAAGA CGGTCTAAAT
GCTGCAAGCG GCCTTTATAT CAATGGCGGT GAACTTTACA TAACTTCAGG ATATGATGGA
ATTGACTCCA ACGGACCTAT ATATATCAAT GGAGGATATA TTTTCTCCTT TGGAGGCAAC
ATTCCCGAAG GAGGTATTGA TTGTGACTGG AATCCTCTGA TAATCAATGG AGGAACCCTC
ATTGCAGCGG GAGGTTCCAA CAGTACTCCT TCAACTTCAA GTACTCAGTG CTCGGTGCTT
TTAGGCAGTG GAACGGCAAA CTCCGTTATC AGTATCCAAA GGAACGGGTC TGAAATAATC
AGCTTTACGG CTCCAAAGAA TTATCAAAAC ATGGTATTCA GTTCACCGGA TCTCGTATTG
AATGCAACTT ATGTTGTATA TAGAAACGGA GTCCAGTCGG TAACCTTTAC CACAAATTCA
ATTGTAACCA ACGCCGGAGG TAGTTCCGGA GGATGGTTCC CCGGAGGAGG ATTCCCAGGA
GGAGGATTCC CAGGAGGCGG TGGGGGATGG TTCCCAGGCG GCCCAGGATG GTAA
 
Protein sequence
MRMLKKCTGF FLYVLVLLVN IISVSALEPP PIYGDSNSDC KVNSTDLTLM KRYLLQQSIS 
YINLINADLN GDGKINSSDY TLLKRYLLGY IDSFPVENQY PTTPEPSPTP TPAVDEEAWK
NNTGTIELGD TIKVSGEGIS VNGSVVTITA GGDHLVTGTL NNGMIFVNTT ERVKLRLSGV
NIKNPNGPAI YFYNVDKGFI TIEKGTVNYL SDGSTYTDQD AKAALFSNDD LELKGKGTLY
VTGNYKHGIA SDDDLIIENG DIYVTAVTDG LHANSGIEIK GGNITVTAKS DAIESEKDFE
MTGGTLNLTA DDDAIHSEKD LVIDDGEINI LKCYEGIESK TTITINGGKI NINSNEDGLN
AASGLYINGG ELYITSGYDG IDSNGPIYIN GGYIFSFGGN IPEGGIDCDW NPLIINGGTL
IAAGGSNSTP STSSTQCSVL LGSGTANSVI SIQRNGSEII SFTAPKNYQN MVFSSPDLVL
NATYVVYRNG VQSVTFTTNS IVTNAGGSSG GWFPGGGFPG GGFPGGGGGW FPGGPGW
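
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates space-grouped nucleotide rows using the codon assignments of translation table 11 (which are the same as the standard code; the tables differ in permitted start codons). It also includes a helper for GC percentage. The helper names are hypothetical, and only the first and last rows of the gene sequence are used here as sample input.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


first_row = "ATGAGAATGT TGAAAAAGTG TACAGGGTTT TTTCTATACG TACTTGTACT TCTCGTAAAT"
last_row = "GGAGGATTCC CAGGAGGCGG TGGGGGATGG TTCCCAGGCG GCCCAGGATG GTAA"

print(translate(first_row))  # MRMLKKCTGFFLYVLVLLVN
print(translate(last_row))   # GGFPGGGGGWFPGGPGW
```

The two outputs match the first 20 and last 17 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.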