Gene Ccel_0108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0108 
Symbol 
ID7309021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp123349 
End bp124686 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content40% 
IMG OID643607037 
ProductKelch repeat protein 
Protein accessionYP_002504476 
Protein GI220927567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000075462 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACTAA GCAATAAATT AAAGATTTTT AGTGCAATGG TTTTAATGTT AGTTATTGTA 
GTATCAAGCA GTATGGTATT TGCGGCAGAC CCAAATACAT GGACAACAAA AGCACCTATG
GCGACTGCAA GGTATAACCA TGAAGCAGTA GTTTTGAACG GCCAGATATA CGCTATTGGT
GGCCAAACAA CGGGTGCTGC TACATTAAAG TCAGTAGAGC AATATGATCC TGCGACCGAT
AAATGGATCA CAAAAGCACC TATGACGTAT GCAAAACATG CTCATCAAGT AGTTGTAATA
AATGGTAAGA TTTACACAAT AGGAGGTCTT GGCGACGTCA GTGGCTGTAT GTACTCCTTG
GAAGAATATA ATCCTGAAAC TGATACATGG AAAACAAAGG CCTCTATGTC TACAGCAAGG
GGTCATTTTG GGGCAACCGT GGTAAACGGA AAAATATATG CTATGGGCGG AAGTTCGGTT
AAATCTATGG AGGAATATGA TCCGGCAAAC AATATATGGG TTACAAAGGC CTCAATGTCG
GTTGATAGGA TGTTGTTTAA AGTTGCTGTA GTAAACGGAA AAATCTATGC AATAGGTGGT
TATAATAGTA CCGGATATCT CAATTCTGTA GAAGAGTATG ACCCTGCGAC TGACAAATGG
ACACCAAAAG CACCTATGAA CATAGGTAGG TCAGCCTTTG AAATAGCTGT ATTGAGTGGT
AAAATATATG TAATGGCAGG TGCTAATACT AGGAGCACTG AGGTTTCCGA GTCTGTAGAA
GTATATGACC CTACCACAGA CACTTGGACA ACAAAGGCAT CTATGCCGAC GCCAATAGCA
GGTAAAGCGG TAACATTAAA CGGGAAAATA TATATGGTTG GGGCTGGTAC TGGCCGCAAT
ATAGTTGAAG AATACGACCC TGCTACAGAT AAATGGACTT ATGATGCACC TCTTACCACA
GGAAGAGCTT ACGACCAGTC TGTTGTGGCA AATGGGAAGA TTTATCACAT AGGAGGAAGT
ATTACAAACT CCGTAGAAGA ATATACTCCT ACCAATACAG GCGGTTCCAG TGAAGGCGGT
AATGAGAATC CACCTGTTAT TACAGGAAAC AGTGCAATAC TTGAATTAAC TATGGTTAAC
GGAGTCATAA AGGAATACGA TTTAAATGCT GCCGAACTTG ATAGTTTTTT AACTTGGTAT
GATAACAGCT CAGAAGGCAG TGCACCCTCG TATTACATAT TTAATAAAAA GAACAATATC
AAACCTTTCC TAAGCAGAAA GGAATATATC GCATATAGCA AGATTGCAAG CTTTGAAGTG
AAGGAATATG CAGAATAA
 
Protein sequence
MRLSNKLKIF SAMVLMLVIV VSSSMVFAAD PNTWTTKAPM ATARYNHEAV VLNGQIYAIG 
GQTTGAATLK SVEQYDPATD KWITKAPMTY AKHAHQVVVI NGKIYTIGGL GDVSGCMYSL
EEYNPETDTW KTKASMSTAR GHFGATVVNG KIYAMGGSSV KSMEEYDPAN NIWVTKASMS
VDRMLFKVAV VNGKIYAIGG YNSTGYLNSV EEYDPATDKW TPKAPMNIGR SAFEIAVLSG
KIYVMAGANT RSTEVSESVE VYDPTTDTWT TKASMPTPIA GKAVTLNGKI YMVGAGTGRN
IVEEYDPATD KWTYDAPLTT GRAYDQSVVA NGKIYHIGGS ITNSVEEYTP TNTGGSSEGG
NENPPVITGN SAILELTMVN GVIKEYDLNA AELDSFLTWY DNSSEGSAPS YYIFNKKNNI
KPFLSRKEYI AYSKIASFEV KEYAE