Gene Cthe_2195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2195 
Symbol 
ID4811060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2618279 
End bp2621176 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content44% 
IMG OID640107601 
Productcarbohydrate-binding family 6 protein 
Protein accessionYP_001038590 
Protein GI125974680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.862232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAATG GAATTATAGG AATTATGACC AAAAGACATA TGATAGTGAT AATGGCTTTA 
CTGTTTACGG TATCAGTTCT TTCGGCCGGA CTATTATTCA TAAATACGGT AAACGCAGCG
GAACCGATAA CCTATTATGT ATCTCCCACC GGTAGTGACA GCAATACGGG TACAATAGAT
GCACCCTTTA AGACGATTGC AAAAGCCCGG GACGTGGTGA GAACCGTCAA CGGCAATATG
AAAAGTGATA TTTATGTATA TCTGAGGGGC GGCACTTATA ATATAACCGA AACAATCACG
TTTGGCCCAC AGGATTCGGG AACAAACGGA TATAGGATTT ACTATATGGC GTATCCCGGA
GAAACGCCTG TATTAAGCGG TGCAACAAAG GTTACAGGCT GGACGAGGCA TAACGGCAAT
ATATACAAGG CAAAGTTAAA TCGTTCGACT AAACTGCGAA ACCTGTATGT AAATGACCAA
AGAGCTTCGA TGACCAGCAA GAGAGTAACC GCCAGAGGGG GACACGGAAC TTACACCGTT
ACTGCCGGGC AGGCTCCCTG GGCGTGGACC AGCGGAAGCA AAAGCGACGG TGTTCGGTAT
GATATGTCGG AAGTACCGGA AATTACCCGC AATAAAGATG ACCTTGAGAT AGTAAACGGT
ACTACATGGA ATGAAAATAT TGTGTGTACC CGCGATGTAA TTACAGCCAA CGGCTACAGG
GTGCTTCTTT TGCAACAGCC TTACGGCGCC ATAGCGCAGA CCCCCGGTTG GGGTGCGGCT
TTTACTACTT CCGGTACCCA TACAATTTAT AATGCCTTTG AATTTTTAAA TTCTCCGGGG
CAATTCTATT TTGACAAAAC CGAACAAATG CTTTATTACT ATCTCCGTCC CGGAGAAAAT
ATAGAGACGA TTGACGTTCA GGCCCCAATG GTTGAAAAAC TCATTGAGAT TGCCGGAACA
TCAACTTCAA ACAGGGTAAA GAATATAACC TTCCAGGGCA TTACCTTTGC GTATACCGAT
TACAACCTTG TCGAGGTCGG AGGTTCGCGG GGTAAATCGA CATGCCAGGC TGCCCAAGGC
TTTATAGCTT TTTTCAACGA TAATTGGCAC TACACCAAAT ATGATCTTGT TGATACATTG
CCGGGAATGA TCAACCTAAG AAACTGCGAT TCCATTGATT TTATTGAAAA TGTAATTAAG
CATAGCGGAG CCGACGGAAT TTCCATGGTA AACGATGTTA TAAACTGCAA AATCATCGGC
AACTATATTA CAGATATAAC ATCAAGCGGC ATAACGGTAG GCCATCCGCA GCATGTTTAC
ATTGGAGACG GCGGGAGCCG TGCAAAATTT CCTTCCGGAG TAGAAGGTGT TTGCAAGAAC
AATACCATTT CAAACAATGT GTTGTACGAC ATAAGTATGG TTCCGGGATT TGGCGGATGT
GCCGGCATTA CAGCATACTT TGTGGAAGGT CTGGAAATAA CTCACAACCA TGTCCAGAAG
ACGGCCTACA ACGGTATACA TTTGGGCTGG GGATGGTGCA ATTTTAAAGA CTCCACAACG
TGCAAAAACA ACACAATAAG CTACAACAGG GTTGTTGATA CCTTGTCCAG GCTACATGAC
AGCGGAGCAA TATATACCAT AGGCCAGATG CCGGGTACAA ATATCAACGA GAATTATGTA
AAGGGTATTC CACCGGCAAC ATATGGCCCT ACTTATGGCT TGCATAATGA CGAAGGCACT
GCATATATAA TTGAAAACGA CAACGTCCTG AATATCGACC CGGGAGTAAA ATATACCATC
AACTGCGAAG ATTTCGGAGA AAAACACGAT CTGACAATCC TGAGGACCTA TGCAACGGTG
AGCAAAATGG GAAAAAATCC TCCAAACAGC AGAATTGACC CTCCCGTTGC CGTCCCGGAT
AATGTATGGC CTTTACGGCA GTATAATGTG TGCCTGAATT CGGGAATTCA GGATGAATAC
AGAAAAATTA TGCCTGAGAG CTTACTTTCA ACGCCGGATT ATGTATTCCC GGCAAGCTGT
GCTGCGGAAG CTGCGTCCAT TATAAATATA AGAAGCAGCG GAGATCCTTC AAACACGGTA
TGGTTTGCAC CTCCCGGGAC AACAACCTTT GTTGAAGGAG CTACCATGAC CAAGGCGGCA
GGAGACGCAA CTTCCATTAT TGCTCCATAC ACAGCCGGAA CATACAAGCT GTACATAGTT
AATTCCCAGG GTGTAAAAAT CGGAGAGTCG GAATCAATAT TGAGAGTGAG CGGCTCTGTC
AATCCTCCGC CTAAGGAACC GCGTTCGGCC TTTACCCGGA TTGAGGCCGA GAGCTACAAC
GGACAATCGG GAATCCAGAC CGAAAACTGC AGCGAAGGCG GAATGGATGT AGGGTATATT
GAGAACGGAG ATTATGTTGT TTATAAGAAT ATAGATTTTG GAAAAGGGGC AGCAAGTTTT
AAAGCGAGAG TAGCCAGCGC TACAAGCGGA GGCAATATTG AACTTAGGAT TGACAGTATT
GACGGACCTG TAGTGGGTAT CTGCCCGGTT GCAGGAAGCG GTGGCTGGCA GCAGTGGGTT
GATGCCACAT GTGAGGTCAG CGGGCTTAAG GGAGTCCATG ATCTCTACTT AAAATTTACC
GGTGGCAGCG GTTACCTGCT TAATATAAAT TGGTTTACCT TTGTTGAAGG AAACAATGAT
GAGAATTTGG GTGATTTAAA CGACGATGGA AAAGTAAACT CGACAGACTT TCAGATATTG
AAAAAGCATC TGCTTCGCAT AACTTTGCTT ACGGGAAAAA ATCTTTCAAA TGCGGATTTA
AACAAAGACG GCAAAGTAGA TTCGAGCGAT TTGAGTTTGA TGAAAAGATA TCTGCTTCAA
ATTATACCTA CTTTTTAA
 
Protein sequence
MVNGIIGIMT KRHMIVIMAL LFTVSVLSAG LLFINTVNAA EPITYYVSPT GSDSNTGTID 
APFKTIAKAR DVVRTVNGNM KSDIYVYLRG GTYNITETIT FGPQDSGTNG YRIYYMAYPG
ETPVLSGATK VTGWTRHNGN IYKAKLNRST KLRNLYVNDQ RASMTSKRVT ARGGHGTYTV
TAGQAPWAWT SGSKSDGVRY DMSEVPEITR NKDDLEIVNG TTWNENIVCT RDVITANGYR
VLLLQQPYGA IAQTPGWGAA FTTSGTHTIY NAFEFLNSPG QFYFDKTEQM LYYYLRPGEN
IETIDVQAPM VEKLIEIAGT STSNRVKNIT FQGITFAYTD YNLVEVGGSR GKSTCQAAQG
FIAFFNDNWH YTKYDLVDTL PGMINLRNCD SIDFIENVIK HSGADGISMV NDVINCKIIG
NYITDITSSG ITVGHPQHVY IGDGGSRAKF PSGVEGVCKN NTISNNVLYD ISMVPGFGGC
AGITAYFVEG LEITHNHVQK TAYNGIHLGW GWCNFKDSTT CKNNTISYNR VVDTLSRLHD
SGAIYTIGQM PGTNINENYV KGIPPATYGP TYGLHNDEGT AYIIENDNVL NIDPGVKYTI
NCEDFGEKHD LTILRTYATV SKMGKNPPNS RIDPPVAVPD NVWPLRQYNV CLNSGIQDEY
RKIMPESLLS TPDYVFPASC AAEAASIINI RSSGDPSNTV WFAPPGTTTF VEGATMTKAA
GDATSIIAPY TAGTYKLYIV NSQGVKIGES ESILRVSGSV NPPPKEPRSA FTRIEAESYN
GQSGIQTENC SEGGMDVGYI ENGDYVVYKN IDFGKGAASF KARVASATSG GNIELRIDSI
DGPVVGICPV AGSGGWQQWV DATCEVSGLK GVHDLYLKFT GGSGYLLNIN WFTFVEGNND
ENLGDLNDDG KVNSTDFQIL KKHLLRITLL TGKNLSNADL NKDGKVDSSD LSLMKRYLLQ
IIPTF