Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1238 |
Symbol | |
ID | 7310035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1522660 |
End bp | 1526028 |
Gene Length | 3369 bp |
Protein Length | 1122 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643608159 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505574 |
Protein GI | 220928665 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000137164 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AGATTTTATC TGTTCTTTTG CTTGTCACAA TGACTACAGC ATTGTTTTCA GCTACACCGA TGAATACCGC TTCAGCCGCA AGTACGGACT TTGTACTAGA CGGAAACAAT ATCAAAGCGG GCAACATCAA CGGTCTCACA TTTAAGGGGT TCGGCGTCCT CAGTGGAAAC AGCTCAAGTG CACTGTTGAT GGATTACAAG TCGGAGCATC CTGAGAAATA TACAGAATTA CTGCAAATTC TGTTCGGTGG AAAAAATCCG ATTATGACAC ATGTCAAAAT TGAGATGGGT AATGACCGTA ACAACTCCAC CGGACCAGAT CCCTCAACAA TGCGTTGGGA AAATGAGACG GCTAATGTCA AAAGACACCC CGGATTTCAA CTTGCGGCTG ATGCCAAGAA GGTGAACCCC AATCTTAAAG TCAGCATATT ACGCTGGAAT GCCCCCGGTT GGGCAAATAG CAATGATAAG ATTTATACAT GGTATAAGAA CACCATATTA GCAGCATACC GTCAATATGG TTATATGATT GATTACGTAA ACCCGGGGGT CAACGAACAA ACACCGAATT TAACCTGGAC TAAGCAATAC GCCCAGCGTA TCAAAACAGA CAGTACAGGT TTTAACAATG CTGAAGAACG GGCACTTTAC AACAATATTA AGGTGGTGAT TTCCGACGAA GTTTCCGTCG GTTCCTTCGG GGATGATATG GTCAGTGATT CCACCCTTCG TGATGCCGTA TCTGTCGCTG CATATCACTA TAATACTGAT GACAATAGCT TGGGAAGTTT CAAACAGCTT GCCGAATCCT TTGACAAAGA GGTGTGGAAT AGTGAAGCAC AGGCCACTTT TAGTAATTCG TCCTTTCGTC CCAATAACAA TATGAAAGAT CCAACAGTAG CAGGAACCGG CATAGGAGGC ACAAATGGGC CACTGGAAAT GGGAAATACT GTTATAAAGG GGTTTGTAAA TTCAAGGAGG ACACATTTTA TCTATCAGCC AGTCATTGGG TCCTTCTATG AGGGTGGGCA GTACTCTTTT AAGGAATTGG TATCCGCACG TGATCCTTGG TCCGGGTGGA TTCACTATGA TGCCGGTCTT GTCATACTTC GGCATTTCAG TTGGTTTTCA AAGGCTGGCT GGGAAAATGA GAGTAATACC GCAGGGGTTT GGAGGGCTGT ACCTCAGGCG AGTTTCACAG GTGCGACGGG TACGAATCCT GTTAATGGGC GTAATGGCAC TCCCAGTTAT ATGACACTAG CTTCTCCGGA CAAGCATGAT TTTTCAACTA TCTTCATTAA CGATAGTGAA TATTCCAAAA CCTACACGTT TAAGACCATC AATATGGCTT ATTCGGGAAA CCCCTCGCTG GAAGTATGGG AAACTAGAGC AGCTGATAAG GGAGCATCTT TTAATAGTAA TTACATGAAG TACACTGGAA CGGTTTCCAC AAATAGCAGC GGAGTTTATA CAGTAAACGT AAAACCTTAT TCGATTGTTA CAGTTACAAC ATTAAGTAAC AGTGGAAAAG CAGAGTTCAA TACGCCTCTT CCGGTGGAGG GGGACCGCCC GGTTCTGGAT ACAGACAAGA CAGGTTCAAT GCAGGATACC AGGGATAATA TATTATATGC TGACAACTTT GATTATTCCA GCAAAACTGC CCCTGTCATA GCGGAGGGGG GACAAATCAC AGGAACCCAA AGTTATATAG ATTCCCGAGG AGGCTCAAAA AGTGCTATAC CACGTTATGC CAGCGACAGA AACGGTGCAT TTGAAGTGTA TCTTCCTGAC GGGTCCAGCA ACTATATCCT TCGCCAACAG GTGAATCAGT CAAGTATGGG GCTTGGGGGC ACATGGAATA ATGGTAATCC AATCACCGGC ATTGGAGATA ACCGTTGGAT GAACTATAAG GCAAGTGTGG ATGTTTCATT TGAACACAAC AGTACAGAGG GCGGTAACAA TTATGCTGCA ATCGGTGCCA GACAGCAGGG TGGTGAAAAT TCACACTACT TAAATGGTAC TCCTTATATA CTGAAATTCT GGTTTGACGG CGGTTGGTCG CTACTAGTAA ATGGAAGTTC CGTGGCAAAT GGTAATGTAG CAAGCGGCTC GGGTGGAGTG AAAATCAGTG GTTTTAATAC AGCTTATAAT GCATGGCATA ACATCTCTAT CATGGTTGCA GACAATAAGG TGACTGCGTA TCTGGACAAT ACCATCCTTT ATACCTATAC AGATACTACC CAAAGATTGT CCGGGCGTAT TGATCTGGCA AGCGGCTATT ACAATACTTG TTTTGACAAT TTGAAGGTTG AAACAATAGA CGGTTATGCA CCTTACTACT CTGAAATGCT GGACAATCTG GAAATGTACG ATTTGTCTTC TGTTTCTGCT ACAAAGCTTG TTTACGGCGG TTCTTGGGCA CATGAAAACG GCAAATCCAT GTACAATTAC CAACGATCAC TTTCCACGAG CCAGGGAATA GGTGCTACTA TTCAGTATTC ATTCACTGGC ACCGGGCTGG ACATTCTTGG AGCCAACAAC GGTTCTGCTA AGTTAGAGGT AACTGTTGAT GGAAGAGTTG TTAATTCCTC AGTGGGAACC ATGGTTTCAG GGAATTTACA CCAAAACTTT ACGCTTCACG GTCTTGAGTA CGGTAAGCAT ACGGTTTGTT TGAAGGTGTT AAGTGGTACT ATGGTTGTCG ATGCTGTTGG GGTTGTTGCA AACATAGCCG GTGCTTCGGA GATACCCGTT GAACAATCTG CGTATTCAAG GATAGAAGCA GAGAGCTACA GCAACCAGTC AGGAATCCAG ACAGAAACCT GTTCGGAAGG CGGAGAGGAT GTGGGCTTTG TTGAAAACGG CGACTATACT GTTTACAACA ATGTGGATTT CGGCGATGGT GTCGGAGGCT TCCAAGCAAG AGTAGCAAGT GCAACCAGTG GAGGCAATAT TGAGATCAGG CTTGACAGCT CTACCGGGAC TTTGATAGGA ACTTGTCCTG TTGCCGGAAC AGGGGATTGG CAGACTTATA CTGATGCAAA ATGTACTGTC AGCGGGGTAA CAGGAAAACA TGATGTATAC CTTGTATTTA AAGGAGATAG CGGATATTTA TTTAATCTTA ACTGGTTTAC ATTTAGTGAG AAAACTGTCA TAGGGAATTT GGGTGATATA AACTCGGACG GACAAGTAGA TGCAATAGAT TTACAGGTAT TGAAAAAGTA TCTTTTGCAA CTAGGGGAAA TTGGAGATAC GAAGCTGGCA GATTTGGATG CCAACGGAGA AATTAACGCA ATCGATTTTT CATTACTCAA ACAATTTTTA CTGGGTACTA TTATTAGTTT TCCGGGAGAA GCACTATAA
|
Protein sequence | MKRKILSVLL LVTMTTALFS ATPMNTASAA STDFVLDGNN IKAGNINGLT FKGFGVLSGN SSSALLMDYK SEHPEKYTEL LQILFGGKNP IMTHVKIEMG NDRNNSTGPD PSTMRWENET ANVKRHPGFQ LAADAKKVNP NLKVSILRWN APGWANSNDK IYTWYKNTIL AAYRQYGYMI DYVNPGVNEQ TPNLTWTKQY AQRIKTDSTG FNNAEERALY NNIKVVISDE VSVGSFGDDM VSDSTLRDAV SVAAYHYNTD DNSLGSFKQL AESFDKEVWN SEAQATFSNS SFRPNNNMKD PTVAGTGIGG TNGPLEMGNT VIKGFVNSRR THFIYQPVIG SFYEGGQYSF KELVSARDPW SGWIHYDAGL VILRHFSWFS KAGWENESNT AGVWRAVPQA SFTGATGTNP VNGRNGTPSY MTLASPDKHD FSTIFINDSE YSKTYTFKTI NMAYSGNPSL EVWETRAADK GASFNSNYMK YTGTVSTNSS GVYTVNVKPY SIVTVTTLSN SGKAEFNTPL PVEGDRPVLD TDKTGSMQDT RDNILYADNF DYSSKTAPVI AEGGQITGTQ SYIDSRGGSK SAIPRYASDR NGAFEVYLPD GSSNYILRQQ VNQSSMGLGG TWNNGNPITG IGDNRWMNYK ASVDVSFEHN STEGGNNYAA IGARQQGGEN SHYLNGTPYI LKFWFDGGWS LLVNGSSVAN GNVASGSGGV KISGFNTAYN AWHNISIMVA DNKVTAYLDN TILYTYTDTT QRLSGRIDLA SGYYNTCFDN LKVETIDGYA PYYSEMLDNL EMYDLSSVSA TKLVYGGSWA HENGKSMYNY QRSLSTSQGI GATIQYSFTG TGLDILGANN GSAKLEVTVD GRVVNSSVGT MVSGNLHQNF TLHGLEYGKH TVCLKVLSGT MVVDAVGVVA NIAGASEIPV EQSAYSRIEA ESYSNQSGIQ TETCSEGGED VGFVENGDYT VYNNVDFGDG VGGFQARVAS ATSGGNIEIR LDSSTGTLIG TCPVAGTGDW QTYTDAKCTV SGVTGKHDVY LVFKGDSGYL FNLNWFTFSE KTVIGNLGDI NSDGQVDAID LQVLKKYLLQ LGEIGDTKLA DLDANGEINA IDFSLLKQFL LGTIISFPGE AL
|
| |