Gene Cthe_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2197 
Symbol 
ID4811062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2622821 
End bp2625607 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content44% 
IMG OID640107603 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001038592 
Protein GI125974682 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.71663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCTATT TTTTTGCCGG TATATGGTAC ATTTTTAGAG TAGCATTAAC AAAAACCTTT 
ACCGTACCTT CAGATTATTC CGGCAAGAAA GTTTTTATAC AATTCGACGG AGCTTATATG
AACAGCCAGG TATGGATAAA CGGGACATAC TTGGGAATTC GTCCATATGG ATACAGCTCT
TTTGAATATG ACTTGACTCC ATACCTTAAC ATAGGCGGGA AAAACGTAAT TGCAGTCAAA
ATCAACAACA ACCAGCCCAA CAGTCGCTGG TATTCGGGAA GCGGCATTTA CCGCAACGTG
TGGCTGACAG TTTTGGACCC GGTTCATGTG GATTATTGCG GAATGTTTAT AACCACTCCG
AATGTAAGCA GAGATTCGGC TACAGCCAAT GTCAGCACGA AAGTGGTAAA CCAGGGCAAT
TCGGAAAAAA CAGTTTCTTT AAAAACCATA ATTATGGATG CAAATGGCAA CCAGGTTGCT
TCTGATACAT CTTCAGCAGT TAACATATCA GCCGGTAGTG ACTATACATT TAACCAAAAC
CTTACAGTAT CAAATCCCAA TTTATGGTCT CCTGATTCCC CGTATCTTTA CATGGTTCAA
ACTCAAGTAA TTGTTGATGG AAAAGTGGCT GATACCTATA AGTCAACCAT GGGATTTCGT
TATCTTAATT TTAGCAGCAC TACCGGTTTT TCTTTAAACG GCGTTAAAAC GAAAATAAAG
GGAGTATGTA TGCATCATGA CTTGGGCGCT TTAGGAGCGG CAGTTAATTA CCGTGCTATT
GAAAGGCAGC TTCAGATTAT GAAAGAGATG GGCTGCAATG CTATCCGCAC CGCGCATAAT
CCTCCTGATC CGCAAGTGTT GGAAATATGC GACAGATTGG GTCTGATGGT TATGGATGAA
GCCTTTGACT GCTGGGAAAC CGGAAAGACT GCCAATGACT ATCATCTGTA TTTCAAAGAC
TGGGCCAAAA GGGACCTTCA GGATATGGTT AAAAGAGACC GCAATCATCC GTCGGTTATT
ATGTGGAGCA TAGGCAATGA GATTCCCAAT GCTACCGTTG AAACTGCCAC AAAGCTGAAA
AACTGGGTGA AGGAAATAGA TCCCACCCGA CCGGTAACAT GGGGTTGTTT TGCTATAAAT
ATGTCGGACG ATACATACAA ACGGATTGCA AGTGTCCTTG ATTTGGTCGG ATACAACTAT
TTCCCCTTTA TGTATGACCA GGGACACAAG GAACATCCCG AATGGATAAT GTTCGGCAGT
GAAACAAGCT CGGCGGTAAG AAGCCGGGGT GTATATAAAA CTCCCACCAA CCAGAATATA
CTGACCGGCA ATGACAACCA GTGCTCATCT TATGACAACA GCGTGGTTGC CTGGGGTAAC
AGCGCAGAAT CGTCATATTA TGAAATCAAC AGACGGGATT ACATGCTTGG GGAATTTGTT
TGGACGGGAT TTGACTATAT TGGTGAACCG ACACCGTACA AATGGCCGTC GAAAAGCTCA
TATTTCGGAA TAGTTGACAC ATGCGGATTC CCCAAAGATA TATATTATTT CTATCAAAGC
AAATGGAGCG ACAAGCCGAT GGTGCATATC CTGCCCCATT GGAACTGGTC GAACGGTACT
ACCGTAGAGG TGTGGGCTTA TAGCAACTGC GATACGGTGG AGCTTTTCTT AAACGGCACT
TCCCTTGGAG TAAAGAGTAT GGGAAATAAC GGGCATGTTT CGTGGAATGT TCCCTGGGTT
CCGGGTACAC TCAGAGCAAA AGCTGTCAAA GGAAATATAG TGGTTTATGA CGAGGTAACC
ACTGCCGGTA ATCCTGCAAA AATTCAGTTA AAACCGGACA GGACAACTAT TACGGCTGAC
GGCAAGGACT TGGTATTTAT AGAAACTGAT ATTGTAGACA GTAACGGTGT TCTTGTCCCG
ACGGCAAGCA ATACTGTGAA CTTTTCCATA TCCGGACCGG GAGTAATTGT CGGAGTTGAC
AATGGAAATG CTGCAAGCCT GGAACCTTAC AAGGCAAACA GCAGGCAGGC TTTTAACGGC
AAGTGCCTCG TGATAGTCCA GGCAACCAAA ACCAACGGGA CTATTATAGT AACGGCCAGT
TCGAACGGAT TGGAATCTGA CAGAGTGATT ATTAAGACAA CCGGAGGGGA ACCTGAACCG
ACTCCTGTGC CAAGGTCTGC TTTTACACGA ATCGAAGCGG AAAGCTATGA TGCTCAGTCA
GGAATCCAGA CTGAAGATTG CAGCGAAGGC GGTAAGGATG TGGGATATAT TGAAAACGGA
GATTTTGTCG TCTACAAGGC TATTGATTTT GGCAGAGGAG CAGCAAGTTT TAAAGCGAGA
GTAGCCAGCG CTACAAGCGG AGGCAATATT GAACTTAGGA TTGACAGTAT TGACGGACCT
GTAGTTGGCA TTTGTCCGGT TGCCGGCACC GGCGGTTGGC AGGAATGGGC TGATGCGACG
TGTGAGGTAA GTGACCTGAA GGGAGTCCAT GATCTTTATC TGAAATTTAC CGGAGGCAGC
GGTTATCTGC TTAATGTGAA TTGGTTCACC TTTGTTGAAG GAAACAGTGA TGAGGATCTG
GGTGATTTAA ACGGTGACGG AAAAGTAAAC TCGACAGACC TTCAGCTAAT GAAAATGCAC
GTACTCAGGC AAAGACAGCT TACAGGAACA AGCCTCTTAA ATGCAGATGT AAACAGGGAC
GGCAAAGTGG ATTCTACCGA TGTCGCATTA TTAAAAAGAT ATATATTGAG ACAAATATCT
TCTTTTGATG ATTATGCTCG GTCTTAA
 
Protein sequence
MAYFFAGIWY IFRVALTKTF TVPSDYSGKK VFIQFDGAYM NSQVWINGTY LGIRPYGYSS 
FEYDLTPYLN IGGKNVIAVK INNNQPNSRW YSGSGIYRNV WLTVLDPVHV DYCGMFITTP
NVSRDSATAN VSTKVVNQGN SEKTVSLKTI IMDANGNQVA SDTSSAVNIS AGSDYTFNQN
LTVSNPNLWS PDSPYLYMVQ TQVIVDGKVA DTYKSTMGFR YLNFSSTTGF SLNGVKTKIK
GVCMHHDLGA LGAAVNYRAI ERQLQIMKEM GCNAIRTAHN PPDPQVLEIC DRLGLMVMDE
AFDCWETGKT ANDYHLYFKD WAKRDLQDMV KRDRNHPSVI MWSIGNEIPN ATVETATKLK
NWVKEIDPTR PVTWGCFAIN MSDDTYKRIA SVLDLVGYNY FPFMYDQGHK EHPEWIMFGS
ETSSAVRSRG VYKTPTNQNI LTGNDNQCSS YDNSVVAWGN SAESSYYEIN RRDYMLGEFV
WTGFDYIGEP TPYKWPSKSS YFGIVDTCGF PKDIYYFYQS KWSDKPMVHI LPHWNWSNGT
TVEVWAYSNC DTVELFLNGT SLGVKSMGNN GHVSWNVPWV PGTLRAKAVK GNIVVYDEVT
TAGNPAKIQL KPDRTTITAD GKDLVFIETD IVDSNGVLVP TASNTVNFSI SGPGVIVGVD
NGNAASLEPY KANSRQAFNG KCLVIVQATK TNGTIIVTAS SNGLESDRVI IKTTGGEPEP
TPVPRSAFTR IEAESYDAQS GIQTEDCSEG GKDVGYIENG DFVVYKAIDF GRGAASFKAR
VASATSGGNI ELRIDSIDGP VVGICPVAGT GGWQEWADAT CEVSDLKGVH DLYLKFTGGS
GYLLNVNWFT FVEGNSDEDL GDLNGDGKVN STDLQLMKMH VLRQRQLTGT SLLNADVNRD
GKVDSTDVAL LKRYILRQIS SFDDYARS