Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0108 |
Symbol | |
ID | 7309021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 123349 |
End bp | 124686 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607037 |
Product | Kelch repeat protein |
Protein accession | YP_002504476 |
Protein GI | 220927567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000075462 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACTAA GCAATAAATT AAAGATTTTT AGTGCAATGG TTTTAATGTT AGTTATTGTA GTATCAAGCA GTATGGTATT TGCGGCAGAC CCAAATACAT GGACAACAAA AGCACCTATG GCGACTGCAA GGTATAACCA TGAAGCAGTA GTTTTGAACG GCCAGATATA CGCTATTGGT GGCCAAACAA CGGGTGCTGC TACATTAAAG TCAGTAGAGC AATATGATCC TGCGACCGAT AAATGGATCA CAAAAGCACC TATGACGTAT GCAAAACATG CTCATCAAGT AGTTGTAATA AATGGTAAGA TTTACACAAT AGGAGGTCTT GGCGACGTCA GTGGCTGTAT GTACTCCTTG GAAGAATATA ATCCTGAAAC TGATACATGG AAAACAAAGG CCTCTATGTC TACAGCAAGG GGTCATTTTG GGGCAACCGT GGTAAACGGA AAAATATATG CTATGGGCGG AAGTTCGGTT AAATCTATGG AGGAATATGA TCCGGCAAAC AATATATGGG TTACAAAGGC CTCAATGTCG GTTGATAGGA TGTTGTTTAA AGTTGCTGTA GTAAACGGAA AAATCTATGC AATAGGTGGT TATAATAGTA CCGGATATCT CAATTCTGTA GAAGAGTATG ACCCTGCGAC TGACAAATGG ACACCAAAAG CACCTATGAA CATAGGTAGG TCAGCCTTTG AAATAGCTGT ATTGAGTGGT AAAATATATG TAATGGCAGG TGCTAATACT AGGAGCACTG AGGTTTCCGA GTCTGTAGAA GTATATGACC CTACCACAGA CACTTGGACA ACAAAGGCAT CTATGCCGAC GCCAATAGCA GGTAAAGCGG TAACATTAAA CGGGAAAATA TATATGGTTG GGGCTGGTAC TGGCCGCAAT ATAGTTGAAG AATACGACCC TGCTACAGAT AAATGGACTT ATGATGCACC TCTTACCACA GGAAGAGCTT ACGACCAGTC TGTTGTGGCA AATGGGAAGA TTTATCACAT AGGAGGAAGT ATTACAAACT CCGTAGAAGA ATATACTCCT ACCAATACAG GCGGTTCCAG TGAAGGCGGT AATGAGAATC CACCTGTTAT TACAGGAAAC AGTGCAATAC TTGAATTAAC TATGGTTAAC GGAGTCATAA AGGAATACGA TTTAAATGCT GCCGAACTTG ATAGTTTTTT AACTTGGTAT GATAACAGCT CAGAAGGCAG TGCACCCTCG TATTACATAT TTAATAAAAA GAACAATATC AAACCTTTCC TAAGCAGAAA GGAATATATC GCATATAGCA AGATTGCAAG CTTTGAAGTG AAGGAATATG CAGAATAA
|
Protein sequence | MRLSNKLKIF SAMVLMLVIV VSSSMVFAAD PNTWTTKAPM ATARYNHEAV VLNGQIYAIG GQTTGAATLK SVEQYDPATD KWITKAPMTY AKHAHQVVVI NGKIYTIGGL GDVSGCMYSL EEYNPETDTW KTKASMSTAR GHFGATVVNG KIYAMGGSSV KSMEEYDPAN NIWVTKASMS VDRMLFKVAV VNGKIYAIGG YNSTGYLNSV EEYDPATDKW TPKAPMNIGR SAFEIAVLSG KIYVMAGANT RSTEVSESVE VYDPTTDTWT TKASMPTPIA GKAVTLNGKI YMVGAGTGRN IVEEYDPATD KWTYDAPLTT GRAYDQSVVA NGKIYHIGGS ITNSVEEYTP TNTGGSSEGG NENPPVITGN SAILELTMVN GVIKEYDLNA AELDSFLTWY DNSSEGSAPS YYIFNKKNNI KPFLSRKEYI AYSKIASFEV KEYAE
|
| |