Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1236 |
Symbol | |
ID | 7310033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1517840 |
End bp | 1520707 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608157 |
Product | protein of unknown function DUF1680 |
Protein accession | YP_002505572 |
Protein GI | 220928663 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00269962 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAAA CTATGAGTAT ACTTCTACCA TGCTTATTGA TATTTTCACT TATATTCAGT GTACAAATAC CTTTATCAGC TTCAGCAGCA AATGTTGAAC TCCTAAAGCA GTTCGACATG GAACAGGTAA AAATAACAGA TACATATTAT GTAAATGCAC TTAATAAAGA GGTTGCCTAC TTGCAGGCAA TTGATCCAAA CCGTTTGTTG GTGGGTTTTA AGAAAACAGC TGGCTTATCA ACAACTTATA GCTATTATGG AGGGTGGGAA AACAATACCC TGATTCAAGG CCATACCATG GGACATTACA TGTCGGCACT TGCTCAGGCT TATAAAAACA CTAAGTCCGA CCCGACAGTA AATGCAGATT TGAAAAGCCG TATCGATTTG ATTATATCCG AATTGCAGGC TTGTCAGAAT AAAAACGGCA ATGGATATTT GTTTGCAACT CCGGCTACCC AATTTGATGT TGTTGAAGGA AAGGCGTCCG GTTCAAGCTG GGTACCGTGG TATACCATGC ACAAAATCAT GTCCGGTCTT CTTGACATTT ATAAATTTGG AGGCAACCAA ACCGCATTGA CAATAGCAAC CAACTTGGGA AATTGGATTT ACAAAAGAGT AAACGCTTGG GATTCTGCAA CACAGTCAAG GGTATTGGGT GTTGAGTATG GAGGAATGAA TGACTGTCTC TATGAATTGT ATAAGCTGAC TGGTAATGGC AACCATTTAA CAGCAGCACA TAAATTTGAC GAAAATTCAC TATTTAACAC CATCGCTGCA GGCACAAACG TTTTACCCGG AAAACATGCC AATACAACTA TCCCGAAATT CATCGGTGCT TTGAATCGCT ACAGCACTCT AGGAACATCA GAATCATCAT ACTTAAAAGC GGCACAGCAG TTCTGGGCCA TAGTTTTGAA AGACCATACA TATGTAACAG GGGGCAACAG CGAAGATGAG CGTTTCAGGG ACGCTGGCAA ACTGGATGCA TACAGGGATA ATGTAAATAA TGAAACTTGT AATGTAAATA ATATGCTGAA GCTGACTAAA GAGCTGTTCA AGGCAACGGG CGACGTTAAA TATGCAGATT ACTATGAGAA TGCATTGATA AACGAAATCA TGGCTTCACA GAATCCGGAA ACCGGGATGG CTACGTACTT CAAGGCTATG GGAACTGGAT ATTTCAAGGT ATTCAGTTCC CAATTCAATC ATTTCTGGTG CTGTACGGGA ACGGGAATGG AGAATTTCAC AAAGCTGAAT GACAGCCTGT ATTATAATAA TGGTTCCGAC CTGTATGTAA ACATGTATCT GAGTTCTACC CTGAACTGGA GCGAAAAGGG TCTTTCACTG ACACAGCAGG CCAATCTGCC ATTATCAGAT AAAGTAACCT TTACTATCAA CAGTGCTTCT TCATCAGAAG TGAAAATTAA ATTCAGGTCA CCAGCATGGA TTGCTGCAGG ACAAAATATT ACGGTCAAAG TTAACGGTAC TCCAATTAAT GTTGACAAGG CGAATGGCTA TCTTGACGTC AGCAGAGTGT GGCAGACAGG AGATACGGTT GAGTTGACCC TGCCCACCGA AGTAAGGGTA TCCAGACTGA CTGACAGCCC CAATACGGTA GCCTTTACAT ATGGTCCCGT AGTATTGAGT GCAGGTCTTG GAACTGAAAG CATGACAACT CAATCACACG GGGTCCAGGT TTTAAAAGCA ACGAAAAATG TGACTATCAA AGAGACTATT AATATTAATA CCGCCGCCAG TCCCAGCATT GACAATTGGC TTGCCAATAT AAAGAACAAT TTGGTTCAAA CGCCTGGGAA GCTGGAATTT ACATTGAAAA ATACCGACGA GGATAACCAT TTGGTATTCA CACCTCATTA TCAAAGATAC AAGGACAGGT ATGGTATCTA CTTCAAGCTG GGAACGTATG AGGGTAAACA ACCCACGGAT AATTTGCTTG ACAATCCGGA TATTGAGTCA GGGAACACCA CAGGATGGAC TGTGAATGGT GCGGGTACAA TTGCCTCTTC AACAGTACAA AAGCACTCGG GAAGCTATAG TCTGCTGCAT ACAGGCAGGA CAGGAGCCTG GAACGGGCCT ATTCAGAACA TTACAACAAA AGTTCAGAAT GGTAACACGT ATACTTGTTC CGGCTGGGTA ATACTGGACA ACACTGCCAG TGCCCCGATA ACAATGACTA TCAGAAAAAC GGATGACAAC GGAACTTCCT ATGTCAATAT TGCCACTGCT ACCGGAAGCA ATAGTTCCTG GGTTCAATTG TCAGGTAACT ATACCTTAAA TGTTACAGGT GCATTGACTG ACCTGAGTAT ATATTTTGAA GGACCGGACA GCGGCACCAA TTTTTATGTG GATGATGCCT TAGTTAAGGT TTATGGCAAA ACTACCTTCT ATCAGAATAC TTCTTTTGGC GGTACTGCGG TGTCGCTGAA TCCAGGCAGC TATACTACTG CTCAGCTCAC TGCTGCAGGT ATTTCTGATA ACTGGGCATC ATCAATCAAA ATACCTGAAG GCTATACGGT TGAGATTTAT GATGATGACA ATTTCACTGG TACAAAGTGG TCTTTTAGTG CAGATAATTC GAACTTTATA GAAGCCGGAT GCAATGACAA AATGTCTTCC GTGAAAATTT TCCCCACTCT GAGTCAAGTG AAGTATGGGG ATATTAACAG GGACGGTACT GTAGATACTA TTGACTTTGC ACTTTTAAAG CAGTTTTTGT TGGGTGCTCA GGTCACAATT GATTCGGTAG CGGCTGATTT GGACGGCGAT GAATCTGTGA CGGCAATGGA TTTTGCGGTA TTTAAGAAGT ATCTGCTGGG ACAAATAACA GAGTTGCCTG CTTTTTGA
|
Protein sequence | MLKTMSILLP CLLIFSLIFS VQIPLSASAA NVELLKQFDM EQVKITDTYY VNALNKEVAY LQAIDPNRLL VGFKKTAGLS TTYSYYGGWE NNTLIQGHTM GHYMSALAQA YKNTKSDPTV NADLKSRIDL IISELQACQN KNGNGYLFAT PATQFDVVEG KASGSSWVPW YTMHKIMSGL LDIYKFGGNQ TALTIATNLG NWIYKRVNAW DSATQSRVLG VEYGGMNDCL YELYKLTGNG NHLTAAHKFD ENSLFNTIAA GTNVLPGKHA NTTIPKFIGA LNRYSTLGTS ESSYLKAAQQ FWAIVLKDHT YVTGGNSEDE RFRDAGKLDA YRDNVNNETC NVNNMLKLTK ELFKATGDVK YADYYENALI NEIMASQNPE TGMATYFKAM GTGYFKVFSS QFNHFWCCTG TGMENFTKLN DSLYYNNGSD LYVNMYLSST LNWSEKGLSL TQQANLPLSD KVTFTINSAS SSEVKIKFRS PAWIAAGQNI TVKVNGTPIN VDKANGYLDV SRVWQTGDTV ELTLPTEVRV SRLTDSPNTV AFTYGPVVLS AGLGTESMTT QSHGVQVLKA TKNVTIKETI NINTAASPSI DNWLANIKNN LVQTPGKLEF TLKNTDEDNH LVFTPHYQRY KDRYGIYFKL GTYEGKQPTD NLLDNPDIES GNTTGWTVNG AGTIASSTVQ KHSGSYSLLH TGRTGAWNGP IQNITTKVQN GNTYTCSGWV ILDNTASAPI TMTIRKTDDN GTSYVNIATA TGSNSSWVQL SGNYTLNVTG ALTDLSIYFE GPDSGTNFYV DDALVKVYGK TTFYQNTSFG GTAVSLNPGS YTTAQLTAAG ISDNWASSIK IPEGYTVEIY DDDNFTGTKW SFSADNSNFI EAGCNDKMSS VKIFPTLSQV KYGDINRDGT VDTIDFALLK QFLLGAQVTI DSVAADLDGD ESVTAMDFAV FKKYLLGQIT ELPAF
|
| |