Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0736 |
Symbol | |
ID | 4810354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 893987 |
End bp | 897904 |
Gene Length | 3918 bp |
Protein Length | 1305 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106153 |
Product | cellulosome anchoring protein, cohesin region |
Protein accession | YP_001037164 |
Protein GI | 125973254 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.4308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAG GTATAAGCTT TATTTTGGTT ATTGCTATAA TAATGGCAAT GACCTCATCC TTTGCTGTGT CTGCATATCC AGCGACTACT CTGTCAGCTG CCGGTGTGGA GAGTGGTAGC ATATCATTGG AATTTGACAA AACTACCGCT CAAGTAGGAG ACATAATCAA AGCTTATGTC AAAATTAGTA ATATTAAGAA TTTTGCCGGA TATCAGGTAA ACATCAAGTA TGATCCAACC GTATTACAGG CAGTAAATCC CGATACAAAA GTACCTTTGA ATAAAAATAC AATGCCTAAA AGCGGAAATT TGCTGTCAAA TCCTGAATAT GGTTCTATAT ATGGTGTACT CAATAAAATC GAAGAAGGTA TTTTGAATTT TGGAAAAGCG TATACATATC TTAATGATTA CAAACTCAGC AATTCACCAG AAGAAACAGG TATTTTGGCG GAAATAGGCT TTAAAGTGCT GAAAGTGCAG CCCACCACCG TAAAATTTGA AAATACATCT TCAATGCCAG GCAGCCTTTC AGGCACAATG CTCTTTGACT GGAACGGTGA GGTTATTACA GATTATACCG TAGTTCAATC TGCTGTTATC AATTCCTCCG TGGTAAATCC TTCACCGTCG GCTGCACCAT CCAAGGGAAT TGTAAAAATG GAACTTAATA AAAATACCGC TTTTGTCGGC GATATAATAA TTGCTGAAAT AAAGGTGGAT AATTTTGATA ATATTGCAGG ATACCAGTTT AATATAAAAT ATGATCCCCA AGTATTGCAG CCTATAGATC CTGATACGAA TGTTCCCTAT GGAAAATCTA CAATGCCAAA GGACGGCACC ATACTTGTAA ATCCTGAATT TGGTGCAATT TCGGCAGTAG CGAATAAAGT GGAAGAAGGT ATATTGAATT TCGGAAAATC TTATACATAC CTTGCTGCCT ACAAAGCTTC CGGAATGGCT GAAAAGAGCG GTACAATCGC AAAAATTGCG TTTAAGGCAC TTAAAGCAGC TTCTTCAACA ACCATTAAAT TTGAAGAAAC ACTTTCAATG CCTGGAAGCA TTGAAGGTAC TATGATTTTT GACTGGAATG GAGACAATGT TCTGGGATAT CAAGTTATTC AGGCCGGGGC TGTCAGTATT TCAGGACAAA CGGTAACGCC TTCACCGTCT CCTACACAGA TTCCGGTTTC ACCTACTCCA ATTCCATCGC AAAAACCAAC ACCTTCAAGT ACACCGGTTT CCAACGCTTC AATAAGCATT GAAGTGGATA AGAATACGGT AAAAGTAGGG GAAATGGTAA AAGCATTTGT GAAGGTTGAC GGTTTTGACA GTTTGGCCGG TTTCCAGGTA AATATTAAGT ACAATCCTGA TTTGTTACAG GCTGTAAATC CTGATACCGG AGAGCCTTTA AAAATAAACA GTATGCCAAA GAGCGGCGAT TTGATCTCAA ATAATGAGTA CGGAGTAATC TCTATTGCTG TGAACAAACC TTCAGAAGGC GTATTGAACT TTGCGAAAAC TTACACATAT GTTGGAGATT ACAAAGACAG CGGTAAACCT GAAAAATCAG GTACTCTTGC AATTATTGGA TTTAAGGCTC TCAATGAGGG AGATGCAACT GTCAGATTCG AAGATGCAAT ATCAATGCCA AGCAGCCTGA GTGGTACAAT ACTGCTTGAC TGGGATTTGA ACAGAATATC AGATTATAAA GTTGTGCAGC CGGACGTAAT TAAAATAACA GGTTCAACCA AGCCTTCGCC GTCTCCTACA TCAACGCCTG TTGGTCCAAG TCCGACGGCT ACTCCGACCG GAGGACCGGT TTCAGACGGA CAAATAGAGC TCAAGCTGGA TAAAGAACAA GCAAAAGTAG GAGATATCAT CAAGGCTGCA ATCAATATCA GTGACATCAA TAATTTCGCA GGCTATCAGG TAAATATAAA GTATGATCCT GCAGTGCTTC AAGCTGTAAA TCCTGTTACG GGAGAACCAA TGTCAGACAA GTCGATGCCT GCTGACGGAA CTATTCTTGT AAACACAGAA TACGGTATTA TATCAGCTGT GGCGAATAAA ACTTCCGAAG GCATATTAAA CTTTGGTAAA GCCTATACAT ATCTGGATGC ATATAAACTT TCAAATAATC CGGAAAAAAC CGGAACACTG GCTGTAATTG GCTTTAAGGT TCTTAAGGCG CAGGATACGT ATATTGGCTT TGAGAACTCC ATTACAATGC CTTCAAGTGT GTTAGGAACA TACTTGTTTG ACTGGAACGG AGATACGATA ACCGGCTACA AAGTTGTGAA TCCGGGTGTC ATAAAAATAT CTTCGTCCAC TGTGACAACA CCTTCGCCGA CACCTACAAC AACACCTACT TCAACGCCAA AGCCGACTAA TCCGGTATCA ACGGACAGCT ACATAAAATT GGAGCTTGAC AAGAATACTG CGGCTGTGGG AGAAATAATA AAGGCTACTG TAAAGGTAAA TAACATAAAA GAGTTGGCCG GTTATCAGAT AAATATAAAG TATGATCCTA ACGTATTACA ACCGGTTAAT CCGTATACCG GAGCGGAATA CACCTCAAAA ACTCCGCTTG CAAACGGCGA GCTGATTGTA AACAGTGAAT ATGGTGCAAC TTCAATGGTC GTGCATGATT TAACAAAAGG TGTATTGAAT TTTGCACAAA TATACGTATT TATGGAAGAC TACAGAAATT CAGGCAAGGC AGAAGAAACA GGAGTTTTAG GGGTAATTGG ATTTAAAGTG TTGAAAAATG AAAAAACAAC TATCAAATTT GAAGAGCCTG CTTCAATGCC CGCAAGCATT TCGGGAACAT ACTTGATAGA CTGGAACGGC AATAAGAAGA CAGACTATAA GGTTATTCAG CCTGAACCTG TCAATGCCGA TGCCGTATCA TCGGGCAGTT ATATAAAATT GGAGTTTGAC AAAAACACAG CATCAGAGGG AGAAATAATA AGGGCTACAG TTAAGGTAAA TAATGTAAAA AATTTAGCCG GCTATCAAAT ATGCATCAAA TATGACCCGA ATGTATTACA GCCGGTAAAT CCAAATACCG GAGCGGCATA TACAACAACA ACTCACCTTG TAGACGGTGA ACTGATTGTA AAACAGGAAT ATGGATCGAC GTCAATGGCT GCGCATAGAT TGTCCAATGG TATACTCAAT TTTGCGCGTA CTTACTTGTA CGTAAGTGAC TACAAAGAAG ACGGAAAGCC GGAAGAAACA GGAATTTTGG GTGTAATTGG CTTTAAGGTA TTGAAAAAAG AGAAGACAAC AGTCAGCTTT TATGCGGATG AAGCTTTAAT GCCAAATAGT GTTTCAGGAA CGTATTTGAT AGACTGGAAC AGCAATAAGA AAACAGATTA CAAAGTTATC CAGCCCGAAC CAATCAACGG CGGTGCTTTA CCGGAGAATT ATATTGCGTT GGAGTTGAAT AAGAATAAAG CGGCAGTTGG TGAAACAATA AAGGCTACGG TAAGAGTGAA CAATATAAAG AATCTCGCCG GTTACCAGGT AAACATAGTT TACGATCCGA ATGTATTGCA GCCTATTGAT CCTGTTACTG GAGCACCGTT TACTACAAGG TCTACTTTTG CAAACTGTGA ATTGCTCAAC AACGATGAAT ACGGTCCGAC AAATATTACC GCTCATGATT TGACAAAAGG AGCATTAAAT TTCGCGAGAG GATACTCATA CCTGAATGAG TACAGGAAAA ACGGTGTGCC TGAAACTACC GGAGTTTTGG GGGAAATAAC CTTTAAGGTA TTGAAGTCAC AAACTACAAA AATAAGATTT GAGGAACCAG CTGCAATGCC GGGAAGTATT TCAGGAACAT ATCTGTTTGA CTGGTACGGC AATCAAATCA GCAATTATTC GGTAATACAG CCGGACAGCA TCAACTAA
|
Protein sequence | MKKGISFILV IAIIMAMTSS FAVSAYPATT LSAAGVESGS ISLEFDKTTA QVGDIIKAYV KISNIKNFAG YQVNIKYDPT VLQAVNPDTK VPLNKNTMPK SGNLLSNPEY GSIYGVLNKI EEGILNFGKA YTYLNDYKLS NSPEETGILA EIGFKVLKVQ PTTVKFENTS SMPGSLSGTM LFDWNGEVIT DYTVVQSAVI NSSVVNPSPS AAPSKGIVKM ELNKNTAFVG DIIIAEIKVD NFDNIAGYQF NIKYDPQVLQ PIDPDTNVPY GKSTMPKDGT ILVNPEFGAI SAVANKVEEG ILNFGKSYTY LAAYKASGMA EKSGTIAKIA FKALKAASST TIKFEETLSM PGSIEGTMIF DWNGDNVLGY QVIQAGAVSI SGQTVTPSPS PTQIPVSPTP IPSQKPTPSS TPVSNASISI EVDKNTVKVG EMVKAFVKVD GFDSLAGFQV NIKYNPDLLQ AVNPDTGEPL KINSMPKSGD LISNNEYGVI SIAVNKPSEG VLNFAKTYTY VGDYKDSGKP EKSGTLAIIG FKALNEGDAT VRFEDAISMP SSLSGTILLD WDLNRISDYK VVQPDVIKIT GSTKPSPSPT STPVGPSPTA TPTGGPVSDG QIELKLDKEQ AKVGDIIKAA INISDINNFA GYQVNIKYDP AVLQAVNPVT GEPMSDKSMP ADGTILVNTE YGIISAVANK TSEGILNFGK AYTYLDAYKL SNNPEKTGTL AVIGFKVLKA QDTYIGFENS ITMPSSVLGT YLFDWNGDTI TGYKVVNPGV IKISSSTVTT PSPTPTTTPT STPKPTNPVS TDSYIKLELD KNTAAVGEII KATVKVNNIK ELAGYQINIK YDPNVLQPVN PYTGAEYTSK TPLANGELIV NSEYGATSMV VHDLTKGVLN FAQIYVFMED YRNSGKAEET GVLGVIGFKV LKNEKTTIKF EEPASMPASI SGTYLIDWNG NKKTDYKVIQ PEPVNADAVS SGSYIKLEFD KNTASEGEII RATVKVNNVK NLAGYQICIK YDPNVLQPVN PNTGAAYTTT THLVDGELIV KQEYGSTSMA AHRLSNGILN FARTYLYVSD YKEDGKPEET GILGVIGFKV LKKEKTTVSF YADEALMPNS VSGTYLIDWN SNKKTDYKVI QPEPINGGAL PENYIALELN KNKAAVGETI KATVRVNNIK NLAGYQVNIV YDPNVLQPID PVTGAPFTTR STFANCELLN NDEYGPTNIT AHDLTKGALN FARGYSYLNE YRKNGVPETT GVLGEITFKV LKSQTTKIRF EEPAAMPGSI SGTYLFDWYG NQISNYSVIQ PDSIN
|
| |