Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1232 |
Symbol | |
ID | 7310029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1510665 |
End bp | 1512137 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608153 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002505568 |
Protein GI | 220928659 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2382] Enterochelin esterase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.122742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA ATATATTAAC TAAAAAAACG ATTTGGGGAA TGGTGGCATT TGTTTTTGCT TTAACTCTAG TTTTTACAGT GCCTGATTCC AAGGTGCAGG CTGCAGCACT GCCTACCACA CCACCGACAG GCTATGACCG AGTCCAGAGT AACATTCCGC ACGGTCAGGT TAGCTATATT AATTACCAAT CTAAGGCAAC AAACAGTCAA AGAAGAGCAA GGATTTACCT GCCTCCAAAT TATTCAGCAG ACAAAAAATA CAGTGTAATG TATTTACTGC ATGGTATCGG TGGAAATGAA GACGAGTGGT ACAATAACGG TGCACCTAAC GTTATTCTTG ACAATCTTAT AGCTGCAGGT AAAATTCAGC CATTTATCGT TGTATTACCA AATGGCAATG CAACAGGAAC TGGTGTATCT GATGGTTGGA CTAATTTTAC AAAAGATTTA ATCGAAAGTC TTATCCCATA TATAGAATCA AACTACTCCG TTTATACGGA TCGTAATCAC AGGACTGTTT GCGGTCTCTC AATGGGTGGC GGACAATCTT TCAATATCGG ACTTCCTAAC TTGAACCTGT TCCCATATGT TGGAGCATTT TCACCGGCAC CTAATACACT CCCTAATTCC CAACTTTTCC CTAATGACGG AGCCTCAGCT AAGCAGCTGT TGAAATTCTT GTTTATTTCT TACGGAACTA CCGACAGTCT GATTAGTTTT GGTACAGGAG TACATAATTA CTGTGATTCC CAGAGTATTC CAAATACTTA CTTCCTTATT CAGGGTGCAG GTCATGATTG GAACGTATGG AAGCAGAGCC TTTGGAATTA TTCTCAAATG ATATGTGAAA AGGGTTTTAC AGACTATGGC CCTGTTGCAC CTGTATCAGC ATTTACACAA ATTGAGGCAG AAAGTTTCAC CAGTCAGTCC GGCGTTCAGA CAGAAACCTG TACTGAAGGA GGTCTGAATG TAGGGTATAT TGAGAATGGT GATTATGTAG TTTATAACAA TGTAGACTTT GGCAGTGGTG CAACAAGCTT TCAGGCAAGA GTAGCAAGTG CTGGGAATGG AGGTAATATC GAAATCAGAT TGGATAGTAT AACAGGTCCG TTGGTAGGAA CTTGTGCAGT TAAAGGCACA GGCGATTGGC AGACTTGGAC TGATGCAAAG TGTACTGTAA GCGGAGTAAC CGGAAAACAT GACTTGTATC TGAAATTTAC AGGTGGAAGC GATTATCTGA TGAATTTTAA CTGGTTCAAA TTTGGTAATG CGACACAGAC TCTTACAGGT GATCTTAATG GAGATGCCAG TGAAGATGCA ACAGACTATG CCTTGCTGAA AAAGTATCTT CTTGGTCAGA TTAATGACTT CCCTGTCGAA GACGACATTA ATGCTGGAGA TATGAATAAG GACGGTGTAA TTGACGCTCT CGACTTTGCT GTCTTTAAGA AAATCCTTTT GGGCACAATC TAA
|
Protein sequence | MIKNILTKKT IWGMVAFVFA LTLVFTVPDS KVQAAALPTT PPTGYDRVQS NIPHGQVSYI NYQSKATNSQ RRARIYLPPN YSADKKYSVM YLLHGIGGNE DEWYNNGAPN VILDNLIAAG KIQPFIVVLP NGNATGTGVS DGWTNFTKDL IESLIPYIES NYSVYTDRNH RTVCGLSMGG GQSFNIGLPN LNLFPYVGAF SPAPNTLPNS QLFPNDGASA KQLLKFLFIS YGTTDSLISF GTGVHNYCDS QSIPNTYFLI QGAGHDWNVW KQSLWNYSQM ICEKGFTDYG PVAPVSAFTQ IEAESFTSQS GVQTETCTEG GLNVGYIENG DYVVYNNVDF GSGATSFQAR VASAGNGGNI EIRLDSITGP LVGTCAVKGT GDWQTWTDAK CTVSGVTGKH DLYLKFTGGS DYLMNFNWFK FGNATQTLTG DLNGDASEDA TDYALLKKYL LGQINDFPVE DDINAGDMNK DGVIDALDFA VFKKILLGTI
|
| |