Gene Ccel_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2289 
Symbol 
ID7310967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2668998 
End bp2670302 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content39% 
IMG OID643609216 
Productprotein of unknown function UPF0052 and CofD 
Protein accessionYP_002506606 
Protein GI220929697 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCAT TCAGTTGGAT GAAGCCCGGT GCCAGAATCA AAAGGTGGAT AGCATTACTT 
TCAGCAGGTA TAGCCTTAAT ATGTTTCAGC ATATTATTTA CCATATATAA TTACAACAAG
GGAATCCCGT ATATTATTGC AATGTCTATT TTGGCATTCT TGGGCCTTGC AGGTACTTTT
GCCGCTTTCA GGCTGCTTGT TAGAAATTTT GCAGGTAAAC TAATGAACGG TCAGGATTTA
AGCAGCCTTC TCCATGAAAA AAAAATTTCT GTAAAGGGCC CCAAAATAGT TGCAATAGGA
GGCGGTACAG GGCTTTCTAC AATGCTTAGA GGCTTAAAAC AATATTCCTC AAATCTTACA
GCTTTAGTTA CGGTAGCCGA TGACGGAGGC GGTTCAGGCA TATTAAGAGA GGACCTTGGA
ATGTTACCTC CGGGAGATAT TCGTAACTGT ATTCTTGCAT TGGCAAATAC GGAGCCCATA
ATGCAAAAGC TGCTTCAATA CAGATTCCAG GATGGAATGC TTAAAGGACA AAGCTTTGGA
AACCTGTTTC TTGCTGCTAT GGATGGAATT TCAGACAGCT TTGAGGAGGC TGTAAAAAAA
ATGAGCGATG TTCTGGCGGT AACAGGCACG GTATTGCCTA TTACACTTGA AGATGTAAGG
CTTTGTGCAG AAACAGACAA TGGAAATACA ATTCTTGGCG AATTTAACAT AGGTCACAGA
TGTAAAAACG ACAAATCACG AATAAACAGA GTTTTTCTGA ATCAGACAAA GGTCAAACCT
TTAAATGAGG CAATAGAAGC CATAATGGAA GCAGATATTG TTGTACTGGG ACCCGGGAGT
CTTTATACAA GTATAATACC TAATCTTTTA GTTGATGGGG TATGTGATGC TTTGGGCAAA
ACAAGAGCTG TTATAGTGTA CGTATGTAAT GTAATGACAC AGCCCGGAGA AACAGAAGGA
TATAGCCTCA GCGACCATAT AAAGGCAATA GAAAAGCATT CTCACAGGGG TTTGATTGAC
TATTGTATTG TAAACACATC AATAATTCCG GAGGATATGA AGGAAAGATA CCGCAAAGAC
GGTGCGGAGC TTGTCAAGGT TGACTTTGAC ATCGTCAAGA AAATGGGAAT TGAGATAATT
ACCGGAGATT TTAAAAGCAT CAATAATGGT TATGTAAGGC ATAATTCAAA AAGGTTGGCA
AAAAAAATAA TGGAACTGGT AACCGAACTT GTTCTGGCAA ACGACGGGGA CAGAATACTG
GACTATTATT ATGCACAAAA TAAAATCAAT AAAATCGGAG GCTGA
 
Protein sequence
MSAFSWMKPG ARIKRWIALL SAGIALICFS ILFTIYNYNK GIPYIIAMSI LAFLGLAGTF 
AAFRLLVRNF AGKLMNGQDL SSLLHEKKIS VKGPKIVAIG GGTGLSTMLR GLKQYSSNLT
ALVTVADDGG GSGILREDLG MLPPGDIRNC ILALANTEPI MQKLLQYRFQ DGMLKGQSFG
NLFLAAMDGI SDSFEEAVKK MSDVLAVTGT VLPITLEDVR LCAETDNGNT ILGEFNIGHR
CKNDKSRINR VFLNQTKVKP LNEAIEAIME ADIVVLGPGS LYTSIIPNLL VDGVCDALGK
TRAVIVYVCN VMTQPGETEG YSLSDHIKAI EKHSHRGLID YCIVNTSIIP EDMKERYRKD
GAELVKVDFD IVKKMGIEII TGDFKSINNG YVRHNSKRLA KKIMELVTEL VLANDGDRIL
DYYYAQNKIN KIGG