Gene Cthe_1472 details


Gene Information

Locus tag: Cthe_1472
Symbol:
ID: 4810622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1789679
End bp: 1792381
Gene Length: 2703 bp
Protein Length: 900 aa
Translation table: 11
GC content: 41%
IMG OID: 640106893
Product: carbohydrate-binding family 11 protein
Protein accession: YP_001037894
Protein GI: 125973984
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
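
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1789679
end_bp = 1792381

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 2703 bp
print(protein_length, "aa")   # 900 aa
```

Both results match the Gene Length and Protein Length fields of the record.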


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.980604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GGCTTTTAGT TTCTTTTTTG GTGTTAAGCA TAATTGTAGG ATTACTTTCT 
TTTCAGTCGC TTGGTAATTA CAACAGTGGT TTAAAAATCG GTGCTTGGGT GGGAACCCAG
CCGTCAGAAT CAGCAATTAA GAGTTTTCAG GAACTTCAGG GTAGAAAGCT TGATATTGTC
CACCAGTTTA TTAACTGGTC AACTGATTTT TCCTGGGTAA GACCTTATGC CGACGCTGTT
TATAATAACG GCTCAATATT AATGATTACC TGGGAACCTT GGGAATACAA CACTGTAGAT
ATCAAAAACG GTAAAGCGGA TGCTTACATA ACCAGAATGG CGCAAGATAT GAAAGCCTAT
GGCAAGGAAA TTTGGTTAAG ACCTCTTCAT GAAGCCAACG GAGACTGGTA TCCATGGGCC
ATAGGATATT CTTCAAGAGT AAACACAAAC GAAACTTACA TAGCCGCTTT CAGACATATT
GTCGATATTT TCCGTGCCAA CGGAGCCACC AACGTCAAAT GGGTGTTTAA TGTAAACTGC
GACAATGTAG GTAACGGCAC AAGTTATCTG GGTCATTATC CCGGAGATAA TTATGTAGAC
TACACCTCAA TTGACGGATA CAACTGGGGT ACCACTCAAA GCTGGGGAAG CCAATGGCAA
AGCTTTGATC AGGTTTTCTC CAGAGCCTAC CAAGCTTTGG CATCAATAAA CAAACCCATC
ATTATAGCAG AGTTTGCATC AGCTGAAATA GGCGGAAACA AGGCAAGATG GATTACAGAA
GCATATAACT CTATAAGAAC ATCCTACAAC AAGGTAATTG CTGCAGTATG GTTTCACGAG
AACAAAGAAA CCGACTGGAG AATCAACTCA AGTCCTGAAG CCCTTGCAGC ATACAGGGAG
GCAATAGGAG CCGGTTCATC AAATCCTACC CCTACTCCAA CTTGGACCTC TACTCCACCA
TCAAGCTCAC CAAAGGCTGT CGACCCCTTT GAAATGGTTA GAAAAATGGG TATGGGAACA
AACCTCGGAA ACACTCTCGA AGCTCCCTAT GAAGGCTCCT GGTCCAAGTC TGCCATGGAA
TATTATTTTG ATGATTTTAA AGCTGCAGGA TATAAAAACG TAAGAATCCC TGTAAGATGG
GACAACCATA CAATGAGGAC ATACCCGTAT ACCATTGACA AAGCCTTTTT GGACAGGGTT
GAGCAAGTGG TTGACTGGTC ACTTTCAAGA GGTTTTGTTA CAATTATAAA TTCTCACCAT
GATGACTGGA TCAAGGAAGA CTATAACGGA AACATAGAAC GGTTTGAAAA GATATGGGAA
CAGATTGCGG AAAGGTTTAA AAACAAATCC GAAAATCTTC TGTTTGAAAT CATGAATGAG
CCTTTCGGTA ACATTACAGA CGAACAAATA GACGACATGA ACAGCAGAAT ATTAAAAATA
ATCAGAAAGA CCAATCCAAC CCGTATTGTT ATAATAGGCG GAGGTTATTG GAACAGTTAT
AATACGCTTG TAAACATTAA AATTCCTGAT GACCCATACT TAATCGGAAC TTTCCATTAC
TATGACCCAT ATGAATTTAC TCACAAGTGG AGAGGTACAT GGGGTACTCA GGAAGACATG
GATACTGTAG TAAGAGTATT TGATTTTGTT AAGAGTTGGT CTGACAGAAA CAATATCCCG
GTATATTTTG GAGAATTTGC CGTAATGGCT TATGCCGACA GAACTTCCCG TGTAAAATGG
TATGATTTTA TAAGTGATGC GGCCCTGGAG CGCGGTTTTG CATGTTCCGT ATGGGATAAC
GGCGTTTTTG GTTCATTGGA TAATGACATG GCTATTTACA ACAGAGATAC CCGTACCTTT
GACACTGAAA TCCTCAATGC ACTATTTAAT CCCGGAACAT ATCCGTCTTA TTCTCCGAAA
CCTTCACCAA CTCCAAGACC GACCAAACCG CCCGTAACAC CGGCTGTCGG TGAAAAAATG
CTGGATGATT TTGAGGGTGT GTTAAATTGG GGTTCATACT CCGGTGAAGG TGCAAAAGTT
TCAACAAAAA TTGTGTCCGG AAAAACAGGA AACGGCATGG AAGTCAGCTA CACCGGGACA
ACGGACGGCT ACTGGGGAAC AGTATACAGT TTACCGGACG GCGATTGGTC AAAATGGCTT
AAAATCTCTT TTGACATTAA GTCCGTTGAC GGTTCTGCCA ATGAAATCAG ATTTATGATT
GCTGAAAAAA GCATAAACGG TGTGGGAGAC GGAGAACACT GGGTTTACTC AATAACTCCC
GACAGTTCGT GGAAAACTAT AGAAATACCG TTCTCCAGCT TTAGAAGAAG ACTTGATTAT
CAGCCGCCTG GACAGGATAT GAGCGGTACT TTGGATCTTG ACAATATAGA TTCAATTCAC
TTCATGTATG CCAACAACAA GTCGGGAAAA TTTGTCGTAG ACAATATCAA GCTGATTGGT
GCTACTTCCG ATCCGACTCC TTCAATAAAA CACGGAGATT TGAACTTCGA TAATGCAGTG
AATTCTACAG ACTTGTTAAT GCTTAAAAGG TATATCCTCA AATCTTTGGA ACTCGGTACA
TCTGAGCAGG AGGAAAAATT CAAAAAAGCG GCAGATTTAA ACAGGGACAA CAAGGTCGAC
TCCACTGACT TGACAATTTT GAAAAGATAC TTGCTGAAAG CCATCAGTGA AATACCCATA
TAA
 
Protein sequence
MKKRLLVSFL VLSIIVGLLS FQSLGNYNSG LKIGAWVGTQ PSESAIKSFQ ELQGRKLDIV 
HQFINWSTDF SWVRPYADAV YNNGSILMIT WEPWEYNTVD IKNGKADAYI TRMAQDMKAY
GKEIWLRPLH EANGDWYPWA IGYSSRVNTN ETYIAAFRHI VDIFRANGAT NVKWVFNVNC
DNVGNGTSYL GHYPGDNYVD YTSIDGYNWG TTQSWGSQWQ SFDQVFSRAY QALASINKPI
IIAEFASAEI GGNKARWITE AYNSIRTSYN KVIAAVWFHE NKETDWRINS SPEALAAYRE
AIGAGSSNPT PTPTWTSTPP SSSPKAVDPF EMVRKMGMGT NLGNTLEAPY EGSWSKSAME
YYFDDFKAAG YKNVRIPVRW DNHTMRTYPY TIDKAFLDRV EQVVDWSLSR GFVTIINSHH
DDWIKEDYNG NIERFEKIWE QIAERFKNKS ENLLFEIMNE PFGNITDEQI DDMNSRILKI
IRKTNPTRIV IIGGGYWNSY NTLVNIKIPD DPYLIGTFHY YDPYEFTHKW RGTWGTQEDM
DTVVRVFDFV KSWSDRNNIP VYFGEFAVMA YADRTSRVKW YDFISDAALE RGFACSVWDN
GVFGSLDNDM AIYNRDTRTF DTEILNALFN PGTYPSYSPK PSPTPRPTKP PVTPAVGEKM
LDDFEGVLNW GSYSGEGAKV STKIVSGKTG NGMEVSYTGT TDGYWGTVYS LPDGDWSKWL
KISFDIKSVD GSANEIRFMI AEKSINGVGD GEHWVYSITP DSSWKTIEIP FSSFRRRLDY
QPPGQDMSGT LDLDNIDSIH FMYANNKSGK FVVDNIKLIG ATSDPTPSIK HGDLNFDNAV
NSTDLLMLKR YILKSLELGT SEQEEKFKKA ADLNRDNKVD STDLTILKRY LLKAISEIPI
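
The two listings above are related by translation table 11. As a minimal sketch of how the blocked gene sequence could be joined and translated, the Python below builds a codon table from the amino acid string that NCBI publishes for table 11 (the same assignments as the standard code) and translates the first line of the gene sequence. The helper names `join_blocks`, `translate` and `gc_percent` are hypothetical and not part of the source page. Table 11 also permits alternative start codons; that detail is ignored here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listing above.
first_line = "ATGAAAAAAA GGCTTTTAGT TTCTTTTTTG GTGTTAAGCA TAATTGTAGG ATTACTTTCT"
dna = join_blocks(first_line)
print(translate(dna))  # MKKRLLVSFLVLSIIVGLLS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence through `join_blocks` and then `gc_percent` would give a value to compare against the GC content field.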