Gene Information |
Locus tag | Ccel_1237 |
Symbol | |
ID | 7310034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1520816 |
End bp | 1522630 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608158 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505573 |
Protein GI | 220928664 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0649618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTCGTACAGT CAGTACAATT ATCTTTTTAG GACTTATGAT TACCGTAATT ATGGTTTTGA ATACTGGTGC ATGGGATAAC GGTCTTGCAA AAACACCACC GATGGGATGG AACAGTTGGA ACATATTCCA TGGAGACATT AATGAAACTA AAATCAAACA GATTGCTGAT ACCATGGTAA GCTCGGGTAT GAAGGAAGCT GGCTATGTAT ACCTGAATCT GGATGATAAC TGGATGGCAA ATCCTGCAAG AGATTCCAAT GGGAACTTGC GGGCCGATCC TACACGATTC CCAAGTGGGA TTAGGGCTTT AGCTGATTAT GTACATGCAA AAGGTCTCAA ACTAGGGATA TACGGATGTC GTGGAACAAT GACCTGTATG AATATTCCTC AAAGCGGAAG CAAGGGTTAT GAGGACAAGG ATGCAAAGAC ATTTGCTTCA TGGGGAATTG ATTACCTTAA ATATGATAAC TGCAATATAC CTAACGGAAG TGACATGAAA ACCGATTACC AGAAAATGCA GACCGCTCTT GCAAATTGCG GAAGACCAAT AGTATTCAGT ATATGTGCAT GGGGATATCA GAGCTGGATG CCTGCAACAG GTAATTTATG GCGTACTACC GGGGATATCG CTGATAAGTG GGATAACGGA AACGAATGGT TCAAAGGTAT TATAAATGCA ATTGATGGTA ATGCACAATA CACAAGTTCA GCCGCACCCG GTGCATGGAA TGATCCTGAT ATGCTTGAAA TCGGAAACGG TGGATGTACA ACAGAGGAAT ACCGTACACA GATGAGTATG TGGAGTATGA TGGCTTCTCC CCTTATTGCA GGAAATGATA TAAGGACCAT GTCACAGACA ACAAAGGATA TTCTATTGAA TAAGGAAGTA ATAGCAATAG ACCAGGATCC TGCAGGAGTT CAGGGAAAAA GAGTTAAAAG TGCAAATGGT CTTGAGATTT GGGTAAAACC ACTGGGTACG AATGGTACAA CTAAGGCAGT TGCTTTATTG AACAGGAATT CGGCAACATC CAATATTACA GTTAATTGGT CAGATATAGG TGTAAGTGGA AGTGTTACGG TCAGGGATTT GTGGGCTAAA TCTGACAAAG GCAGTTTTAC GGGCTCATAC ACAGCGTCTG TTCCTTCACA TGGAACTGTT TTGATTAAGA TTTCCACTGA GCCGCCGGCA CCTGTTGATG CAACAAAGCA AATAGAAGCA GAGAGTTATA GCAATCAGTC AGGAATCCAG ACAGAAACCT GTTCGGAAGG CGGAGAGGAT GTAGGCTTTA TCGAAAACGG GGACTATACT GTTTACAGCA ATGTGGATTT CGGTGATGGA GTCGGAGGCT TCCAGGCAAG AGTAGCAAGT GCGACCAGCG GAGGCAATAT TGAGATTAGA CTTGACAGCC CTGCCGGGAC TTTAATTGGA ACCTGTCCGG TTGCCGGAAC AGGGGATTGG CAGACTTATA CTGATGTAAA ATGTACTGTC AGCGGGGCAA CAGGAAAACA TGATGTATAC CTTGTATTTA AAGGAGATAG CGGATATTTA TTCAACCTTA ATTGGTTCAC ATTTACTCCA GGAAGTGTCA ATACGGGTAC ATTGGGTGAT TTAAATTCCG ACGGACAAGT AGACGCGATA GATTTACAGT TATTGAAAAA GTATATTTTA GGACTGGGAG CAATCGAAAA TACAAAACTG GCAGATTTGG ATGCCAACGG AGATATCAAT GCAATAGATT TTTCACTGCT GAAACAATTC TTACTAGGCA TAAGGACCAG CTTTCCGGGG 
CAGGGGGCAG CATAA
|
Protein sequence | MKKVRTVSTI IFLGLMITVI MVLNTGAWDN GLAKTPPMGW NSWNIFHGDI NETKIKQIAD TMVSSGMKEA GYVYLNLDDN WMANPARDSN GNLRADPTRF PSGIRALADY VHAKGLKLGI YGCRGTMTCM NIPQSGSKGY EDKDAKTFAS WGIDYLKYDN CNIPNGSDMK TDYQKMQTAL ANCGRPIVFS ICAWGYQSWM PATGNLWRTT GDIADKWDNG NEWFKGIINA IDGNAQYTSS AAPGAWNDPD MLEIGNGGCT TEEYRTQMSM WSMMASPLIA GNDIRTMSQT TKDILLNKEV IAIDQDPAGV QGKRVKSANG LEIWVKPLGT NGTTKAVALL NRNSATSNIT VNWSDIGVSG SVTVRDLWAK SDKGSFTGSY TASVPSHGTV LIKISTEPPA PVDATKQIEA ESYSNQSGIQ TETCSEGGED VGFIENGDYT VYSNVDFGDG VGGFQARVAS ATSGGNIEIR LDSPAGTLIG TCPVAGTGDW QTYTDVKCTV SGATGKHDVY LVFKGDSGYL FNLNWFTFTP GSVNTGTLGD LNSDGQVDAI DLQLLKKYIL GLGAIENTKL ADLDANGDIN AIDFSLLKQF LLGIRTSFPG QGAA
|
| |
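The Start bp, End bp, Gene Length, and Protein Length fields in the Gene Information table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information table for Ccel_1237.
start_bp = 1520816
end_bp = 1522630
gene_length_bp = 1815
protein_length_aa = 604


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Report whether the computed values agree with the table.
print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the table: the coordinate span gives 1815 bp, and 1815 / 3 = 605 codons, one of which is the stop codon, leaving 604 amino acids.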