Gene Cthe_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1257 
Symbol 
ID4809762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1525272 
End bp1528424 
Gene Length3153 bp 
Protein Length1050 aa 
Translation table11 
GC content39% 
IMG OID640106680 
Productcarbohydrate-binding, CenC-like protein 
Protein accessionYP_001037682 
Protein GI125973772 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACGGA GATTATCATA TAAGCAAAGT GCATTGCTTA TTGCAGTGTG TATATTAATA 
CAAATCATAT CTTTAGCTTC TTTTGGTTTG AATGTCTATG CAAATTCTGC AAACATAAGT
TTGGAATTCT ATAATGGAGA TTTTGGTGCT TCTGTTTCTT CCATTTCTAT GAATTTTAGA
ATTACCAACA ATGGTTCTTC TCAAATTTCA CTTTCGGACA TTAAATTAAG ATACTATTTT
ACTGATGACG GAGTGTCTCC AATTACTGTT TTTATCGATT ATGCCAACAA TAACGGCAGA
GGAATTAACA ACGATGTTAC TTATACTATA AAAGATATAA ATTCATCCGG AGCAAACAAA
TATATTGAAT TTGGATTTAA CGCTCAGGCG GGAAGTCTTG AACCCAATAC ATCCGTGTTA
ATGAGAGCCA GAGCCTATCA ATCCGAATAC AAACAGAGTT TTACACAAAC CAATGATTAT
TCTTTCTGTC AGTCTAATAA CGATTTTGCG GCATGGAACA AGGTTACTGG GTATTTGAAC
GGCGTTTTGT TCAGTGGAAC GGAACCGGTA ATGTATAGTC CCACCCCTTC TTTGTCAACT
CCGACTCCGC AAATCAGTCC GACGCCGGTT GCAACACCTT CGTCTGTTCC GACTTACCAG
GCCACGCCAA CACCTTCAGG TTTTGAGACT CCTGTAAATT GGGATGGAAA AGCAGTTCCC
AACGGAGATT TTGAGAGCGG ATCGGTATTT TGGAGTTTTT ACTGCGACAG CTTATCCGGT
GCGAATGCCA CAAATTTGAT CCATTCCGAG CCTTCCGGAA ATAAAATGTC CAAAACTTCT
ATTACGAATG CAGGCTCGAA TCACTGGGCA ATTCAATTGA AGCATGACGG AATTGTACTT
GAAAATTTAA AAACCTACAG GCTTACTTTT GATGCAAAGT CAACGGTTCC GAGAAATATA
AGGGTGTCAT TGCAGAATGC CACAAGCAGT ATGATTGAGT ATTTCGGCAA GATAGTGGAA
GTTGAGCCAA AGATGAAGAC TTATACCTGT GAATTCACAT TTAACAGCAC AACAGGTACA
AATGTGGCGA TTGTGTTTGA AATGGGGAAA ATCGGAACAG AGACGGATAA AGCCCATGAC
ATTGTTCTTG ACAATGTACA TATTGAAAAA ATAGCTTCGC CTTCCGTATC GCCCGTGCCG
TCGGAGGACC CTCAGGGAGC CGGTATCACC GCTTCGAGAA GTTCCGTATA TGAGGCGGAG
CTTGGAGAAG AAGTTGACAT AACATTATCC CAAAGCGGAG AAATTGCTTT GGAAGGCAGG
ATGGACACAG AAAAAGAAAT TGTGCTTGTA TTGGATAACT CAGGTGTGTT AAATTCTTAT
GTGGAAGATA TTTTATCACC GCTGGACTTT GGAATATATT CAAATCATAA TTTAACTGTA
CAGGGAAAAG ATGCTTCCAT TAACGGAAGT GTGCATGCAA ATGATGTATT TACTTCTACA
GCGGATAGTA TAAGTATTTC TCAAACTTGC TCAGCTGCCA GCTTCCACAT TACAAGTAAA
AACGTAAATA TAAATGAATA TAAAAACATT ACGATTCCAA TAGAAATGCC CAATTTCCAT
TCTAAACTTA TTGATGATGC GATGCGCAAT TCCATGGTTT TCAGACCCGA GGATTATTTT
TTGAGTTGGT TCCCCCAGCC AATGCCGGGT CAAGAAGATA TCTTTATATT TTATAACTTA
ATTGCCGGAC GGTTTGAAAT ATTTGGAGCA GGTACACTTG TTATAAATTC TTCAATGTAT
TTTATGGGCA ATGTTCTTAT ATCACTGACA AATACCAATA ACGTCGGTGA GGGATTTATT
GTTGCTGACG GAAATATAAT TATACAGGGA CAAAACTTGT ATCCCAACGG ACCTAATGAC
AAGTTGTATG TTTACTCCAT AGGGGGAAAT ATAGAGTTTC AGACCAGTAA CTCTACTATA
AACGGAATAG TTTATGCACC TGGAAATCCT GCAAACCCTA ATTCAGGAAA AATTTTCTTC
TCAGGTGACA AAAATACAAT TAACGGTTCT ATTGCAGCGA ACGAACTTGA TTTTTTTGCC
GGCGGCCTTG TGGTAAATCA TACTGAAGGG CAATTTGATA CTGTTGAGGA AAAGTATATT
GACAAATCCA CTTATTTAAA ATTGGTAAAA GATGCTGCAA AGAACTTTGT AGACAAGTTT
GCGGGTTCTA AGACCAAAAT GGCCGTTATT CAGTATTCCG ATTCTGCCAA TGATAATGAT
TTTAAAAAGT ATGATTTGTC TTTGCCTGAT AAAGGAGCTG CTTTGAAGGA GACTATTGAC
AAAATTAAGC CCGGAACATC AGGCCTGAGC AACATGGGTG ACGGAATGAG AAGAGCCTAT
CATATTCTTA ATGGTCCTCC TCCAAAAGGT CAGATTTCAA AATATATAGT CGTTATTACA
GGCTCTGTAC CGAACCGGTG GACGGCAGTG GACAATAAAA AGAATGAACC GAAGACAGAC
AACGGACGTG CTGATTTTAT AAAGGCTGAC AATGAGTCAT ACAATTCACT GGATTATGCA
AAGGATATGG GAAGAATCAT TACTTCAAAG GGTATCAATC TTGTGTTTAT AGACTTTTCA
GAAGAGGATA TTGGCGACGT ACTGGAAGAA ATTGCAGCAG AAAGCGGAGC AAAACCGTTA
GAAGGAACAG ACAGACACTA TTACAAGGCA AATAATTTCC TGGAACTTCT GGATATTTTA
AACAATATGA CTTTGAAGAT ATATTATGAT GTTGTGCTTG ACAAGGTTTT GTACGAAGAG
ATTCTTCCGC AGGGAGTGCT GCTGGTTGAG GCTCCTGAGT GGATAAGCAC GGAAAGTGTG
CCGATGGGCG GAGTTAACAG GATAAAGCTT ACCGGTGAGA TAAACAACAT TCCGTTTACG
TTTACAGGCA CGGGTTACAG TTTTGTAGTT GAAAGCTTCA AAATAAAAGT AAAATTCCTC
AAACCGGGCA CAATAGTATT TGACGGGGCA GATTCCAGAC TAAGATACAA TTTTAATTAC
GTTGACGGAG CAGGAAACAT TCATTCAAAG AGCGTGGACA AACATTTTGA CGACATGACG
GTGAATGTCA CGATGAAGGT TGATATAAAC TGA
 
Protein sequence
MLRRLSYKQS ALLIAVCILI QIISLASFGL NVYANSANIS LEFYNGDFGA SVSSISMNFR 
ITNNGSSQIS LSDIKLRYYF TDDGVSPITV FIDYANNNGR GINNDVTYTI KDINSSGANK
YIEFGFNAQA GSLEPNTSVL MRARAYQSEY KQSFTQTNDY SFCQSNNDFA AWNKVTGYLN
GVLFSGTEPV MYSPTPSLST PTPQISPTPV ATPSSVPTYQ ATPTPSGFET PVNWDGKAVP
NGDFESGSVF WSFYCDSLSG ANATNLIHSE PSGNKMSKTS ITNAGSNHWA IQLKHDGIVL
ENLKTYRLTF DAKSTVPRNI RVSLQNATSS MIEYFGKIVE VEPKMKTYTC EFTFNSTTGT
NVAIVFEMGK IGTETDKAHD IVLDNVHIEK IASPSVSPVP SEDPQGAGIT ASRSSVYEAE
LGEEVDITLS QSGEIALEGR MDTEKEIVLV LDNSGVLNSY VEDILSPLDF GIYSNHNLTV
QGKDASINGS VHANDVFTST ADSISISQTC SAASFHITSK NVNINEYKNI TIPIEMPNFH
SKLIDDAMRN SMVFRPEDYF LSWFPQPMPG QEDIFIFYNL IAGRFEIFGA GTLVINSSMY
FMGNVLISLT NTNNVGEGFI VADGNIIIQG QNLYPNGPND KLYVYSIGGN IEFQTSNSTI
NGIVYAPGNP ANPNSGKIFF SGDKNTINGS IAANELDFFA GGLVVNHTEG QFDTVEEKYI
DKSTYLKLVK DAAKNFVDKF AGSKTKMAVI QYSDSANDND FKKYDLSLPD KGAALKETID
KIKPGTSGLS NMGDGMRRAY HILNGPPPKG QISKYIVVIT GSVPNRWTAV DNKKNEPKTD
NGRADFIKAD NESYNSLDYA KDMGRIITSK GINLVFIDFS EEDIGDVLEE IAAESGAKPL
EGTDRHYYKA NNFLELLDIL NNMTLKIYYD VVLDKVLYEE ILPQGVLLVE APEWISTESV
PMGGVNRIKL TGEINNIPFT FTGTGYSFVV ESFKIKVKFL KPGTIVFDGA DSRLRYNFNY
VDGAGNIHSK SVDKHFDDMT VNVTMKVDIN