Gene Cthe_1911 details


Gene Information

Locus tag: Cthe_1911
Symbol:
ID: 4810769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2273311
End bp: 2277183
Gene Length: 3873 bp
Protein Length: 1290 aa
Translation table: 11
GC content: 42%
IMG OID: 640107328
Product: carbohydrate-binding family 6 protein
Protein accession: YP_001038323
Protein GI: 125974413
COG category:
COG ID:
TIGRFAM ID:
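The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2273311, 2277183)
print(length)                  # 3873
print(protein_length(length))  # 1290
```

Both values agree with the Gene Length and Protein Length fields listed for Cthe_1911.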


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.374482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGGG GAAAACTACG GAAGATAATT GCTTTAGTGC TGGTTTTGGG TTTCATTGCC 
GGCAACTTTG CAGTAACCAC CTTTGCGGCG GAAACTGTGA AACCTTATTG TACCAGGACT
TCAACGCCGG TCATGGCGGT AAATCTCGGA AGCAATGAAG ATGTCAGCCA TGGCGGTACC
AATTTTAAAA AGCAGTCGGC TTACAGCAAC CTTAAAGTGG AGGTCAAGGG GCCTTCTGAT
TCCGCATATC CCACCGGAAC AATCAATGCT CTCTCGGTTA TTCAGGCAGA GAATTGCGAT
GAAAACCATG GACTGGAAAT CGAAGACTGC CCGGATGAGG GAGGCACAAA GAACCTGGCG
TATATAGCCA ATGGAGACTA TACAGCTTAT TATAATGTAT ATTTTCCAAA AGGAACAAAA
GGTTTTATAG CAAGGGTTTC AAGTGACACT GAAGGAGGAT ATATTGAACT TCGCCTTGAC
TCAATTTCGG CGGAAGTGGT TGGACGATGC CGCGTAGAAA ACACCGGAGG ATGGGAAAAG
TATGAAGAAG TTTACTGCGA ACTAAATAAA AGTGTTGAAG GCGTACATAC TTTGTATATG
GGCTTTGCCG GAGAAAGGGA CGGTCTTTTC AACGTCAACT GGTTCAGGTT TACAAAGAGC
CCGTATGAGC CGGTAATGAC AAAAAGCCGT GACGGAGCGG TTGACGGAGC CTATTTGTAC
AAGTTCATTG ATTTTGGGAA GGAAAGTATT CCAACCAGGT TTAAAATGCA CCTTTCAGGA
AGCATAAGCG GAATAATAAA TGTAAGGCTT GACAGTCCAT CAGGAGATGT TATAGCAACT
GTTGACGGCA CAAACGGCGC CGGAGAAGTG GAAGGGCGCG TTGAAAAGCC TGTTACGGGA
ATTCATGAGT TATATTTGAC CGATGATAAG AACGGTACTT TAATATCCAA AGTTGACTGG
TTTGTATTTG AACCGGAAGA ATCAAAAGAA ATAACCGACA GAAATCTTCA GAATTTTGCC
AAAACAAGTG TAAAAGGATA CAATGTAAAT CTTGAGGCCT CTCTTGATAA AAACAATGTC
TACGAAGTTT ATATCCATCC TGTAGAGTTC CGCAAGAAAA ATCGTCAGAT ATTTGACCTT
TATATCAATG GTACGCTGGT GGATACCATT GACACCGAGG AGTCGGGACT TGATTGGGAG
AAAAAAGGAC CGTACATCAC AAAGGTTTTG GATGACGGAA AGCTGAAAAT TGAATGCAAA
TCCAGACAAG GTATTGTATC TCTGGCCGGT TTGGAAATCA ACAAGATAAC GTATTCCAAA
GCTTTTTCCG ATGTAAAAAT CAAAGACTGG TTTTATATTC CTGTAATGGA GCTCGCAAGC
CGGGGAGTTA TATTCGGCAA GGGCAATGAC ATGTTTAAGC CGCAGGACCA TATAATCGGC
GAGCATGTGG CATATATGAT GTTTAATGTC ATGAAGGTGT CCATTGCAGA AAATGACAAG
GAATTCAATC CGGAAAAGTA CAGAAACCTG TCGGATGTGC CTCCGAGTTT TTGGGCTTAT
CCCTATATGA GCGCTTATTA CAACTATTTC TTTAAAGAAA AAATGCTAAG ATATGATGTT
AATACCCGTG TTCCTTACAG TGCAAAGGAG TACGAGGAGA AAAAGAAAGT AAGACGCGAA
GAGTTTGCTA TGGCGATTAT AGGGGCAAGG CGTCTTGACT ATAATGAAGA CGGCAAGGTG
TTTGTACTGG ATCCGTATCT TGAACCTTCA GCAATGTTGA ACAAGTATAA GGACAAAGAT
GCCGACAAGA TTACGGATTC CTTCAGGTAT TTTGTAGAAT TGGCTTTGGA AAAGGGTCTG
ATGAAAGGTG ACCAGTTTGG TTACTTAAAT CCTCAAAATC CTGTAACGAG AGGAGAGGCG
GCGGCATTTA TATACAATGC CTTAAATCTT GACGAAAACA ATTTTGTAAA GCCGAAAAGG
GGAGAGAAAA TTCCTGTACC GAGGATAACA GCCAGGAAGA GAAATATAAA TGTGGGAATT
CTTATTTTGC CGGCACCGGC ATGGGATTCA ATAAACAATA TACCGGTGAA TGACCCAAAT
CCTGACTTTA CTTTAATGGA GCTTTTGAAC AGGAACATAA ACAAGCCGAT GGATTGGGAG
TTGGTGAATC CTCATCCGCC TGCCTTTGAC AAGAGTGAAT ACAAAGATAT AATGCACCTT
AACTCATCCA AGATTCCGGG AATTGACAAC CAAAGCCACA GTGATTTCTG CGCATATTTC
AACGACCTCA GGAGCGTGGC CAGGGCACAA ACCGATCTTG AAGCCGATAT AACTTATCTT
GGCACGGTCG GATACAGTGA AAATATAAAT AAGTCGAAGT TCTTCAAATA TTGGGAGGTT
CATCTGGACG ACCCGAATTT GACACCGGAA AAGATTGCAA AAGACTATGA CCTTCTGTTC
CAGACATCCC ATGGTAAAAT AACATATTCA AAGGATGTCC AGGACAAAGT CAAGGCGTTC
CTGAAAGCCG GCGGCCAGTT ATGGTGGGAA AACTGCAAAG GGCTTGAAAT TGAATCCGGA
GACGGTTTTA CGGAAGAAGT TAAGTTTGTG TCGCTGCATC CGGGTCATAA CCGCAAGTAT
CCTCAGATAC CTGTTTTAGA CGACGAAGGG AAAATGCATC CGTTGTTTGA CAATATTTTC
AGAATCAATC CGGAGAAAAC ATCCCGTGTA TTTGCACCGG GTATATACAA CAAGAACAGC
GAGATATCAA TGCTGGGCGA CGGTGAGGAA TGGCTCAACG ATGACAACAG GTATCTTGAT
GAATTGCAGC CTGATGATAT TGTAATTCTC AACATAGAGA ATACCGACAC AGGCGAAATA
CTTCCCAACA TGGCGGTAAG AAATATAGAG AATGAGGATG CGCCTGACGG AAGAATTGTA
ATTAGCACAA ATGATATTGG ATGCGGTATA ACAAAATTTG TGGACCGCGG TGGCGGAAAA
GCCGTTGAAG ACTACAAATT CTGCTACAAT CTTTTGGGTT GGATGTCCAA GATAGATGTA
AGCTTCGATG AAACAACTGT CAACCAGTGG GACGGAGGCA GTGAGTTTTC CGTGGAAGCC
ACATTCACAA ACAATGGAGC AAAGAAACAG ATTTATGACG TTACATATGA ATATGATCCT
AAACTTTGGA ATCTTGTACC AACGAGCGAC TTTAAGAATT ACAAACAGAC TCATCCATGG
ATTAAGGCTT TGGATGAAAA CGGATATCCG AAGAAAATTG AACTTGAGCC CAATCAGACG
GAAGTAGTGA CATATAAATT CAACATCAAG AGAACAAACC TCCGCTGCTA TGACTTTACG
ATAAAAGCAA GTGAATCGGG TGTGAAGTAT ACCCGCGACA TGGCTGAAAC GTTGTACAGA
CTGAATAACG TAAGGGTTGA GGAGCCGATA TTCTCAGGAC GGAGGAATAA CGGAAGTGAA
GCTTCTTTCG ATGTGACAAT CAACGCACCG GAGGAACCGG ACAGTGACCT TAGAACCGAG
GATTATGAGC TTAATATAAA AATTAAAAAG GATGGAAGCT TTATTGATCC GGAAACCGTT
ATAGACAATA TTGAGCTTCT GACGGACGGA AATACACCGC CGCTGGAAGG TTACAATTAC
AAGTACTTGG TTGACAATAA GGGTGTTCTG TATCTGAAGG TTATTATAGA AGACACGCTG
ATTACGAAGC CAACTGAAAA AATCAGGCTG AATATATCTT TAAAGAACCT TGACAGTGGC
AGTTATGAGG TTGCCGGAAA GATAGAGGTA ATTGATCCGG TTTCCCGTAG AAGGCTTGCA
TTCTCAGATG AGGCAATATA TAAAATAAAA TAG
 
Protein sequence
MRRGKLRKII ALVLVLGFIA GNFAVTTFAA ETVKPYCTRT STPVMAVNLG SNEDVSHGGT 
NFKKQSAYSN LKVEVKGPSD SAYPTGTINA LSVIQAENCD ENHGLEIEDC PDEGGTKNLA
YIANGDYTAY YNVYFPKGTK GFIARVSSDT EGGYIELRLD SISAEVVGRC RVENTGGWEK
YEEVYCELNK SVEGVHTLYM GFAGERDGLF NVNWFRFTKS PYEPVMTKSR DGAVDGAYLY
KFIDFGKESI PTRFKMHLSG SISGIINVRL DSPSGDVIAT VDGTNGAGEV EGRVEKPVTG
IHELYLTDDK NGTLISKVDW FVFEPEESKE ITDRNLQNFA KTSVKGYNVN LEASLDKNNV
YEVYIHPVEF RKKNRQIFDL YINGTLVDTI DTEESGLDWE KKGPYITKVL DDGKLKIECK
SRQGIVSLAG LEINKITYSK AFSDVKIKDW FYIPVMELAS RGVIFGKGND MFKPQDHIIG
EHVAYMMFNV MKVSIAENDK EFNPEKYRNL SDVPPSFWAY PYMSAYYNYF FKEKMLRYDV
NTRVPYSAKE YEEKKKVRRE EFAMAIIGAR RLDYNEDGKV FVLDPYLEPS AMLNKYKDKD
ADKITDSFRY FVELALEKGL MKGDQFGYLN PQNPVTRGEA AAFIYNALNL DENNFVKPKR
GEKIPVPRIT ARKRNINVGI LILPAPAWDS INNIPVNDPN PDFTLMELLN RNINKPMDWE
LVNPHPPAFD KSEYKDIMHL NSSKIPGIDN QSHSDFCAYF NDLRSVARAQ TDLEADITYL
GTVGYSENIN KSKFFKYWEV HLDDPNLTPE KIAKDYDLLF QTSHGKITYS KDVQDKVKAF
LKAGGQLWWE NCKGLEIESG DGFTEEVKFV SLHPGHNRKY PQIPVLDDEG KMHPLFDNIF
RINPEKTSRV FAPGIYNKNS EISMLGDGEE WLNDDNRYLD ELQPDDIVIL NIENTDTGEI
LPNMAVRNIE NEDAPDGRIV ISTNDIGCGI TKFVDRGGGK AVEDYKFCYN LLGWMSKIDV
SFDETTVNQW DGGSEFSVEA TFTNNGAKKQ IYDVTYEYDP KLWNLVPTSD FKNYKQTHPW
IKALDENGYP KKIELEPNQT EVVTYKFNIK RTNLRCYDFT IKASESGVKY TRDMAETLYR
LNNVRVEEPI FSGRRNNGSE ASFDVTINAP EEPDSDLRTE DYELNIKIKK DGSFIDPETV
IDNIELLTDG NTPPLEGYNY KYLVDNKGVL YLKVIIEDTL ITKPTEKIRL NISLKNLDSG
SYEVAGKIEV IDPVSRRRLA FSDEAIYKIK
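
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way such a listing could be cleaned, checked for GC content, and translated. It is a minimal example, not the method the source database uses: the helper names are hypothetical, and the codon table is written out by hand in the NCBI base order T, C, A, G. Translation table 11 assigns the same amino acids to codons as the standard code during elongation and differs only in the permitted start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above (60 nt, 20 codons).
first_line = "ATGAGAAGGG GAAAACTACG GAAGATAATT GCTTTAGTGC TGGTTTTGGG TTTCATTGCC"
print(translate(first_line))  # MRRGKLRKIIALVLVLGFIA
```

The printed residues match the first two blocks of the protein sequence (MRRGKLRKII ALVLVLGFIA). Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.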