Gene Cthe_0246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0246 
Symbol 
ID4808594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp297573 
End bp300035 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content47% 
IMG OID640105658 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001036678 
Protein GI125972768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CTCTTGTATT TTTAACGGCC TTGAGTCTGA TATTCACGCT GTTTATCAGT 
TATTCCCTGT CAGCAGGACC GGCTTCAACC AAGTATGGGG ATCTCAATGC CGATGGCAAG
ATCAATTCGA CAGATTACAA CTTGGGCAAG AGATTGATTC TGAGAACAAT TTCGGAGCTT
CCCATTTCCA ATGGATCTGT AGCCTTTGAC CTTAACGGTG ATTCAAAGGT TGATTCAACG
GACCTTACTG CGCTGAAAAG ATACCTGCTG GGTGTTATTG ACAAGTTTCC GGTGGGCACG
GATATACCAT CCCAAACACA AAAGACGAGA TATCAGGCTG AGGATGCGAT GTTGTACAAG
GCATTCGAGG AAACAATCCA TGCAGGTTAT GACGGGAGAA GTTATGTAAA TTACGACAAC
GAACCCGGAG GATATATTGA GTGGAATGTA AATGTATCCA GTTCAGGTAC ATATAAGCTT
ATTTTCAGAT ATGCAAACGG ATCAAACAAT AACAGACCTA TGGAAATAAG AGTAAATTCC
AATCTGGTTG CAGGTAGTCT GGACTTTTAT CCGACTTCAG CCTGGACTGT ATGGAATGAC
CAAAGCATAG TTGTAACTTT AAATGCGGGC AACAACGTTA TCAGGGCAAC GGGAATTGCC
TCGGACGGCG GACCGAATGT GGATTATCTT GAAGTAATTC CGACAAATGA ACCACCAGCA
CCCACCCCTT CACCGACGCC TACAGTTGGA CCTACACCTG CTGGTGCGCG TCAGATGGAG
AGACTGGACA GAGGGCTTGT GGCGGTAAAA GTAAACAACG GAGTATTTTT AAGCTGGAGA
ATGTTTGGTA CGGATCCTTC CAACATTGCA TTCAACTTGT ACCGCAACGG AACAAAGATA
AATTCCACAC CGATTACCGG TGCGACAAAC TATGTGGATA CCGGCGGAAC GACAAGTTCA
ACATACACGG TACGTGCGGT TATTAACGGA CAGGAACAGG AGGCATCAAA ACCTGTAAGT
GTCTGGGCTC AGAATTATCT TCAGATTCCC ATTCAGCCAC CGTCAAGCGC GTACGAGGCT
AATGACTGCA GTGCCGCAGA CCTTGACGGA GACGGAGAAT ATGAAATTGT GTTAAAGTGG
GAGCCAAATA ACGCAAAAGA CAATTCCCAA TCCGGATATA CCGATAATGT GTATTTGGAT
GCTTACAAGC TGAACGGCAC ACGTTTGTGG AGAATAGATC TTGGAAGAAA TATCCGTGCC
GGTGCCCACT ATACCCAGTT TATGGTTTAT GACCTTGACG GCGACGGCAA GGCAGAGGTT
GCATGCAAGA CAGCTGACGG AACAAGAGAC GGAAAAGGAA ATGTGATAGG CAATCCAAAT
GCGGATTATC GTAATTCAAG CGGATACATA CTTTCAGGAC CTGAATACCT GACAGTATTC
GATGGACAGA CAGGTGCCGC CATTACAACG GTGGATTATG ATCCTCCGAG AGGAAATGTC
TCTTCATGGG GTGACAATTA CGGAAACAGA GTGGACCGTT TCCTGGCGTG CATAGCATAC
CTTGACGGTC AAAGACCAAG CCTTGTCATG TGCCGCGGAT ATTATACAAG AAGCGTGCTT
GTGGCCTGGG ATTTCAGAAA CGGAAGGCTT ACAAAGAGAT GGGTATTTGA CGGCAACAAT
TACAGCGGAT ATAACGGACA GGGTAATCAC AACCTGAGTG TGGCCGATGT TGACGGCGAC
GGAAGAGATG AGATTATTTA CGGTGCATGT ACCATTGATG ACAACGGAAA AGGATTGTAT
ACTTCAGGAC TTGGCCATGG GGACGCTCTG CATGTGGGAG ATCTTAATCC CAACAGACCG
GGCCTTGAAA TTTGGAGCTG CTTTGAAAGC TCCGGCGGCG CTGCTTTGCG TGATGCAAGG
ACAGGAGAAG TGTTGTTCAG ATGGCATAGA TCCAGTGATA CAGGAAGGGC TTGTGCGGCT
GATATAACGG CATCATCTCC GGGAGCTGAG CTTTGGGCTG CAGGTTCTCC GCTGTTCAGC
TGTACCGGTC AGAATATAGG AACTGCTCCA AGCCAGATTA ACTTTGCTAT ATGGTGGGAC
GGAGACGAAC TCAGGGAGCT CCTTGACGGC ATTACAATAA GCAAATACGG TGTAGGAACA
TTGTTTACCG CGACCGGATG TGCTTCCAAC AACGGTACAA AATCAACTCC GTGCCTCCAG
GCAGACCTCC TTGGAGACTG GAGAGAAGAA GTAATCTTTA GAACTTCGGA CAACAGGTAT
TTGAGAATAT ACACCACAAC GGCAACAACA AACAGACGTA TTTACACATT AATGCATGAT
CCGGTTTACA GATTGGGTAT AGCCTGGCAG AATGTAGCAT ACAATCAGCC GCCGCACACA
AGCTTCTTTA TCGGAGCCGG CATGGCTGAG CCTCCGAAGC CAAATATTTA CCTTGTGCCG
TAA
 
Protein sequence
MKKTLVFLTA LSLIFTLFIS YSLSAGPAST KYGDLNADGK INSTDYNLGK RLILRTISEL 
PISNGSVAFD LNGDSKVDST DLTALKRYLL GVIDKFPVGT DIPSQTQKTR YQAEDAMLYK
AFEETIHAGY DGRSYVNYDN EPGGYIEWNV NVSSSGTYKL IFRYANGSNN NRPMEIRVNS
NLVAGSLDFY PTSAWTVWND QSIVVTLNAG NNVIRATGIA SDGGPNVDYL EVIPTNEPPA
PTPSPTPTVG PTPAGARQME RLDRGLVAVK VNNGVFLSWR MFGTDPSNIA FNLYRNGTKI
NSTPITGATN YVDTGGTTSS TYTVRAVING QEQEASKPVS VWAQNYLQIP IQPPSSAYEA
NDCSAADLDG DGEYEIVLKW EPNNAKDNSQ SGYTDNVYLD AYKLNGTRLW RIDLGRNIRA
GAHYTQFMVY DLDGDGKAEV ACKTADGTRD GKGNVIGNPN ADYRNSSGYI LSGPEYLTVF
DGQTGAAITT VDYDPPRGNV SSWGDNYGNR VDRFLACIAY LDGQRPSLVM CRGYYTRSVL
VAWDFRNGRL TKRWVFDGNN YSGYNGQGNH NLSVADVDGD GRDEIIYGAC TIDDNGKGLY
TSGLGHGDAL HVGDLNPNRP GLEIWSCFES SGGAALRDAR TGEVLFRWHR SSDTGRACAA
DITASSPGAE LWAAGSPLFS CTGQNIGTAP SQINFAIWWD GDELRELLDG ITISKYGVGT
LFTATGCASN NGTKSTPCLQ ADLLGDWREE VIFRTSDNRY LRIYTTTATT NRRIYTLMHD
PVYRLGIAWQ NVAYNQPPHT SFFIGAGMAE PPKPNIYLVP