Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2289 |
Symbol | |
ID | 7310967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2668998 |
End bp | 2670302 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643609216 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_002506606 |
Protein GI | 220929697 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCAT TCAGTTGGAT GAAGCCCGGT GCCAGAATCA AAAGGTGGAT AGCATTACTT TCAGCAGGTA TAGCCTTAAT ATGTTTCAGC ATATTATTTA CCATATATAA TTACAACAAG GGAATCCCGT ATATTATTGC AATGTCTATT TTGGCATTCT TGGGCCTTGC AGGTACTTTT GCCGCTTTCA GGCTGCTTGT TAGAAATTTT GCAGGTAAAC TAATGAACGG TCAGGATTTA AGCAGCCTTC TCCATGAAAA AAAAATTTCT GTAAAGGGCC CCAAAATAGT TGCAATAGGA GGCGGTACAG GGCTTTCTAC AATGCTTAGA GGCTTAAAAC AATATTCCTC AAATCTTACA GCTTTAGTTA CGGTAGCCGA TGACGGAGGC GGTTCAGGCA TATTAAGAGA GGACCTTGGA ATGTTACCTC CGGGAGATAT TCGTAACTGT ATTCTTGCAT TGGCAAATAC GGAGCCCATA ATGCAAAAGC TGCTTCAATA CAGATTCCAG GATGGAATGC TTAAAGGACA AAGCTTTGGA AACCTGTTTC TTGCTGCTAT GGATGGAATT TCAGACAGCT TTGAGGAGGC TGTAAAAAAA ATGAGCGATG TTCTGGCGGT AACAGGCACG GTATTGCCTA TTACACTTGA AGATGTAAGG CTTTGTGCAG AAACAGACAA TGGAAATACA ATTCTTGGCG AATTTAACAT AGGTCACAGA TGTAAAAACG ACAAATCACG AATAAACAGA GTTTTTCTGA ATCAGACAAA GGTCAAACCT TTAAATGAGG CAATAGAAGC CATAATGGAA GCAGATATTG TTGTACTGGG ACCCGGGAGT CTTTATACAA GTATAATACC TAATCTTTTA GTTGATGGGG TATGTGATGC TTTGGGCAAA ACAAGAGCTG TTATAGTGTA CGTATGTAAT GTAATGACAC AGCCCGGAGA AACAGAAGGA TATAGCCTCA GCGACCATAT AAAGGCAATA GAAAAGCATT CTCACAGGGG TTTGATTGAC TATTGTATTG TAAACACATC AATAATTCCG GAGGATATGA AGGAAAGATA CCGCAAAGAC GGTGCGGAGC TTGTCAAGGT TGACTTTGAC ATCGTCAAGA AAATGGGAAT TGAGATAATT ACCGGAGATT TTAAAAGCAT CAATAATGGT TATGTAAGGC ATAATTCAAA AAGGTTGGCA AAAAAAATAA TGGAACTGGT AACCGAACTT GTTCTGGCAA ACGACGGGGA CAGAATACTG GACTATTATT ATGCACAAAA TAAAATCAAT AAAATCGGAG GCTGA
|
Protein sequence | MSAFSWMKPG ARIKRWIALL SAGIALICFS ILFTIYNYNK GIPYIIAMSI LAFLGLAGTF AAFRLLVRNF AGKLMNGQDL SSLLHEKKIS VKGPKIVAIG GGTGLSTMLR GLKQYSSNLT ALVTVADDGG GSGILREDLG MLPPGDIRNC ILALANTEPI MQKLLQYRFQ DGMLKGQSFG NLFLAAMDGI SDSFEEAVKK MSDVLAVTGT VLPITLEDVR LCAETDNGNT ILGEFNIGHR CKNDKSRINR VFLNQTKVKP LNEAIEAIME ADIVVLGPGS LYTSIIPNLL VDGVCDALGK TRAVIVYVCN VMTQPGETEG YSLSDHIKAI EKHSHRGLID YCIVNTSIIP EDMKERYRKD GAELVKVDFD IVKKMGIEII TGDFKSINNG YVRHNSKRLA KKIMELVTEL VLANDGDRIL DYYYAQNKIN KIGG
|
| |