Gene Cthe_0736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0736 
Symbol 
ID4810354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp893987 
End bp897904 
Gene Length3918 bp 
Protein Length1305 aa 
Translation table11 
GC content40% 
IMG OID640106153 
Productcellulosome anchoring protein, cohesin region 
Protein accessionYP_001037164 
Protein GI125973254 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.4308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG GTATAAGCTT TATTTTGGTT ATTGCTATAA TAATGGCAAT GACCTCATCC 
TTTGCTGTGT CTGCATATCC AGCGACTACT CTGTCAGCTG CCGGTGTGGA GAGTGGTAGC
ATATCATTGG AATTTGACAA AACTACCGCT CAAGTAGGAG ACATAATCAA AGCTTATGTC
AAAATTAGTA ATATTAAGAA TTTTGCCGGA TATCAGGTAA ACATCAAGTA TGATCCAACC
GTATTACAGG CAGTAAATCC CGATACAAAA GTACCTTTGA ATAAAAATAC AATGCCTAAA
AGCGGAAATT TGCTGTCAAA TCCTGAATAT GGTTCTATAT ATGGTGTACT CAATAAAATC
GAAGAAGGTA TTTTGAATTT TGGAAAAGCG TATACATATC TTAATGATTA CAAACTCAGC
AATTCACCAG AAGAAACAGG TATTTTGGCG GAAATAGGCT TTAAAGTGCT GAAAGTGCAG
CCCACCACCG TAAAATTTGA AAATACATCT TCAATGCCAG GCAGCCTTTC AGGCACAATG
CTCTTTGACT GGAACGGTGA GGTTATTACA GATTATACCG TAGTTCAATC TGCTGTTATC
AATTCCTCCG TGGTAAATCC TTCACCGTCG GCTGCACCAT CCAAGGGAAT TGTAAAAATG
GAACTTAATA AAAATACCGC TTTTGTCGGC GATATAATAA TTGCTGAAAT AAAGGTGGAT
AATTTTGATA ATATTGCAGG ATACCAGTTT AATATAAAAT ATGATCCCCA AGTATTGCAG
CCTATAGATC CTGATACGAA TGTTCCCTAT GGAAAATCTA CAATGCCAAA GGACGGCACC
ATACTTGTAA ATCCTGAATT TGGTGCAATT TCGGCAGTAG CGAATAAAGT GGAAGAAGGT
ATATTGAATT TCGGAAAATC TTATACATAC CTTGCTGCCT ACAAAGCTTC CGGAATGGCT
GAAAAGAGCG GTACAATCGC AAAAATTGCG TTTAAGGCAC TTAAAGCAGC TTCTTCAACA
ACCATTAAAT TTGAAGAAAC ACTTTCAATG CCTGGAAGCA TTGAAGGTAC TATGATTTTT
GACTGGAATG GAGACAATGT TCTGGGATAT CAAGTTATTC AGGCCGGGGC TGTCAGTATT
TCAGGACAAA CGGTAACGCC TTCACCGTCT CCTACACAGA TTCCGGTTTC ACCTACTCCA
ATTCCATCGC AAAAACCAAC ACCTTCAAGT ACACCGGTTT CCAACGCTTC AATAAGCATT
GAAGTGGATA AGAATACGGT AAAAGTAGGG GAAATGGTAA AAGCATTTGT GAAGGTTGAC
GGTTTTGACA GTTTGGCCGG TTTCCAGGTA AATATTAAGT ACAATCCTGA TTTGTTACAG
GCTGTAAATC CTGATACCGG AGAGCCTTTA AAAATAAACA GTATGCCAAA GAGCGGCGAT
TTGATCTCAA ATAATGAGTA CGGAGTAATC TCTATTGCTG TGAACAAACC TTCAGAAGGC
GTATTGAACT TTGCGAAAAC TTACACATAT GTTGGAGATT ACAAAGACAG CGGTAAACCT
GAAAAATCAG GTACTCTTGC AATTATTGGA TTTAAGGCTC TCAATGAGGG AGATGCAACT
GTCAGATTCG AAGATGCAAT ATCAATGCCA AGCAGCCTGA GTGGTACAAT ACTGCTTGAC
TGGGATTTGA ACAGAATATC AGATTATAAA GTTGTGCAGC CGGACGTAAT TAAAATAACA
GGTTCAACCA AGCCTTCGCC GTCTCCTACA TCAACGCCTG TTGGTCCAAG TCCGACGGCT
ACTCCGACCG GAGGACCGGT TTCAGACGGA CAAATAGAGC TCAAGCTGGA TAAAGAACAA
GCAAAAGTAG GAGATATCAT CAAGGCTGCA ATCAATATCA GTGACATCAA TAATTTCGCA
GGCTATCAGG TAAATATAAA GTATGATCCT GCAGTGCTTC AAGCTGTAAA TCCTGTTACG
GGAGAACCAA TGTCAGACAA GTCGATGCCT GCTGACGGAA CTATTCTTGT AAACACAGAA
TACGGTATTA TATCAGCTGT GGCGAATAAA ACTTCCGAAG GCATATTAAA CTTTGGTAAA
GCCTATACAT ATCTGGATGC ATATAAACTT TCAAATAATC CGGAAAAAAC CGGAACACTG
GCTGTAATTG GCTTTAAGGT TCTTAAGGCG CAGGATACGT ATATTGGCTT TGAGAACTCC
ATTACAATGC CTTCAAGTGT GTTAGGAACA TACTTGTTTG ACTGGAACGG AGATACGATA
ACCGGCTACA AAGTTGTGAA TCCGGGTGTC ATAAAAATAT CTTCGTCCAC TGTGACAACA
CCTTCGCCGA CACCTACAAC AACACCTACT TCAACGCCAA AGCCGACTAA TCCGGTATCA
ACGGACAGCT ACATAAAATT GGAGCTTGAC AAGAATACTG CGGCTGTGGG AGAAATAATA
AAGGCTACTG TAAAGGTAAA TAACATAAAA GAGTTGGCCG GTTATCAGAT AAATATAAAG
TATGATCCTA ACGTATTACA ACCGGTTAAT CCGTATACCG GAGCGGAATA CACCTCAAAA
ACTCCGCTTG CAAACGGCGA GCTGATTGTA AACAGTGAAT ATGGTGCAAC TTCAATGGTC
GTGCATGATT TAACAAAAGG TGTATTGAAT TTTGCACAAA TATACGTATT TATGGAAGAC
TACAGAAATT CAGGCAAGGC AGAAGAAACA GGAGTTTTAG GGGTAATTGG ATTTAAAGTG
TTGAAAAATG AAAAAACAAC TATCAAATTT GAAGAGCCTG CTTCAATGCC CGCAAGCATT
TCGGGAACAT ACTTGATAGA CTGGAACGGC AATAAGAAGA CAGACTATAA GGTTATTCAG
CCTGAACCTG TCAATGCCGA TGCCGTATCA TCGGGCAGTT ATATAAAATT GGAGTTTGAC
AAAAACACAG CATCAGAGGG AGAAATAATA AGGGCTACAG TTAAGGTAAA TAATGTAAAA
AATTTAGCCG GCTATCAAAT ATGCATCAAA TATGACCCGA ATGTATTACA GCCGGTAAAT
CCAAATACCG GAGCGGCATA TACAACAACA ACTCACCTTG TAGACGGTGA ACTGATTGTA
AAACAGGAAT ATGGATCGAC GTCAATGGCT GCGCATAGAT TGTCCAATGG TATACTCAAT
TTTGCGCGTA CTTACTTGTA CGTAAGTGAC TACAAAGAAG ACGGAAAGCC GGAAGAAACA
GGAATTTTGG GTGTAATTGG CTTTAAGGTA TTGAAAAAAG AGAAGACAAC AGTCAGCTTT
TATGCGGATG AAGCTTTAAT GCCAAATAGT GTTTCAGGAA CGTATTTGAT AGACTGGAAC
AGCAATAAGA AAACAGATTA CAAAGTTATC CAGCCCGAAC CAATCAACGG CGGTGCTTTA
CCGGAGAATT ATATTGCGTT GGAGTTGAAT AAGAATAAAG CGGCAGTTGG TGAAACAATA
AAGGCTACGG TAAGAGTGAA CAATATAAAG AATCTCGCCG GTTACCAGGT AAACATAGTT
TACGATCCGA ATGTATTGCA GCCTATTGAT CCTGTTACTG GAGCACCGTT TACTACAAGG
TCTACTTTTG CAAACTGTGA ATTGCTCAAC AACGATGAAT ACGGTCCGAC AAATATTACC
GCTCATGATT TGACAAAAGG AGCATTAAAT TTCGCGAGAG GATACTCATA CCTGAATGAG
TACAGGAAAA ACGGTGTGCC TGAAACTACC GGAGTTTTGG GGGAAATAAC CTTTAAGGTA
TTGAAGTCAC AAACTACAAA AATAAGATTT GAGGAACCAG CTGCAATGCC GGGAAGTATT
TCAGGAACAT ATCTGTTTGA CTGGTACGGC AATCAAATCA GCAATTATTC GGTAATACAG
CCGGACAGCA TCAACTAA
 
Protein sequence
MKKGISFILV IAIIMAMTSS FAVSAYPATT LSAAGVESGS ISLEFDKTTA QVGDIIKAYV 
KISNIKNFAG YQVNIKYDPT VLQAVNPDTK VPLNKNTMPK SGNLLSNPEY GSIYGVLNKI
EEGILNFGKA YTYLNDYKLS NSPEETGILA EIGFKVLKVQ PTTVKFENTS SMPGSLSGTM
LFDWNGEVIT DYTVVQSAVI NSSVVNPSPS AAPSKGIVKM ELNKNTAFVG DIIIAEIKVD
NFDNIAGYQF NIKYDPQVLQ PIDPDTNVPY GKSTMPKDGT ILVNPEFGAI SAVANKVEEG
ILNFGKSYTY LAAYKASGMA EKSGTIAKIA FKALKAASST TIKFEETLSM PGSIEGTMIF
DWNGDNVLGY QVIQAGAVSI SGQTVTPSPS PTQIPVSPTP IPSQKPTPSS TPVSNASISI
EVDKNTVKVG EMVKAFVKVD GFDSLAGFQV NIKYNPDLLQ AVNPDTGEPL KINSMPKSGD
LISNNEYGVI SIAVNKPSEG VLNFAKTYTY VGDYKDSGKP EKSGTLAIIG FKALNEGDAT
VRFEDAISMP SSLSGTILLD WDLNRISDYK VVQPDVIKIT GSTKPSPSPT STPVGPSPTA
TPTGGPVSDG QIELKLDKEQ AKVGDIIKAA INISDINNFA GYQVNIKYDP AVLQAVNPVT
GEPMSDKSMP ADGTILVNTE YGIISAVANK TSEGILNFGK AYTYLDAYKL SNNPEKTGTL
AVIGFKVLKA QDTYIGFENS ITMPSSVLGT YLFDWNGDTI TGYKVVNPGV IKISSSTVTT
PSPTPTTTPT STPKPTNPVS TDSYIKLELD KNTAAVGEII KATVKVNNIK ELAGYQINIK
YDPNVLQPVN PYTGAEYTSK TPLANGELIV NSEYGATSMV VHDLTKGVLN FAQIYVFMED
YRNSGKAEET GVLGVIGFKV LKNEKTTIKF EEPASMPASI SGTYLIDWNG NKKTDYKVIQ
PEPVNADAVS SGSYIKLEFD KNTASEGEII RATVKVNNVK NLAGYQICIK YDPNVLQPVN
PNTGAAYTTT THLVDGELIV KQEYGSTSMA AHRLSNGILN FARTYLYVSD YKEDGKPEET
GILGVIGFKV LKKEKTTVSF YADEALMPNS VSGTYLIDWN SNKKTDYKVI QPEPINGGAL
PENYIALELN KNKAAVGETI KATVRVNNIK NLAGYQVNIV YDPNVLQPID PVTGAPFTTR
STFANCELLN NDEYGPTNIT AHDLTKGALN FARGYSYLNE YRKNGVPETT GVLGEITFKV
LKSQTTKIRF EEPAAMPGSI SGTYLFDWYG NQISNYSVIQ PDSIN