Gene Ccel_0333 details

Gene Information

Locus tag: Ccel_0333
Symbol:
ID: 7309221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 383282
End bp: 384544
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 38%
IMG OID: 643607263
Product: glycosyl transferase family 2
Protein accession: YP_002504700
Protein GI: 220927791
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
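
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above
length = gene_length(383282, 384544)
print(length)                           # 1263
print(expected_protein_length(length))  # 420
```

Both results agree with the Gene Length and Protein Length fields of the record.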


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAATA TATTTTTTAA TATAATTTAT TATATTAGTG AATTTATTCA AATACTGATA 
TTCATAGCAG GCTGTTATTT CTTTGGAATC TCTATTTTTG GCTGGGTGAA AAGGAGACAA
AAGTCCCCAC AAAATATTAT TCCAACAAAA CGGTTTGCCC TTGTAGTAGC TGCCCATAAT
GAGGAGCTCG TAATTGGCCA TATAGTAGAC AGTCTTTTTA AACTGAATTA TCCTAAAAAC
TTGTATGACG TATTTGTTAT AGCAGATAAT TGTACAGACA ATACTGCCGG AATTGCCCGA
AGATTCGGTG CAAAGGTACA TATCCGTGAG GATGCCTCTA AAAAGGGTAA AGGACATGCA
CTTGAATGGA TGTTTCACAG AATTTTTCAT ATGGATACGA GCTATGATGC CATTGCAGTT
TTTGATGCGG ATAATTTGGT ATCTCAGAAT TTCCTGTTAG AAATGAATAA ACAAATGTGC
AAGGGTTTCA AGGTAGTTCA GGGTTACATT GATAGTAAAA ACCCATATGA CAGCTGGATA
ACCTGTTCCT ATTCAATTGC TTTCTGGCTT TCAAACAGAA TTTATCAACT CCCCAGATAC
TATCTGAAGC TAAGCTGCGG CTTATGCGGA ACCGGGTTTT GTATAGATAC TTCCATTCTC
AAAACTTTAA AATGGGGAGC TACCTGCCTG ACCGAAGATC TGGAATACAC CATGAAGATG
GCCTTAAACG GAGTTAAAAT AGGATGGGCA CACGAAGCCG TAGTATATGA TGAAAAACCT
ATTACACTCA AACAGTCATG GCACCAGCGA AAAAGATGGA TGCAGGGTCA TGCGGAATGT
GCACAGAAAT ACCTTGGGGC TTTATTTAAG AAAGCTCTTT TTAAAGGAGA TCTTACCTCC
CTTGATTGTG CCTTATATTT GTTTCAACCT ATAAGATTCA TTTTCGTGGG ATTAATGACT
GTTATGATGT GGGTGCAAAC AGTTTATCCC CAATTTCCTC TTTACAGCGT ACAATACGTA
TTTCCGGTTC AAGTATGGTA TTTAATGGGG CTCTTTGAGA TGTTTTACGG GCCGCTGGTT
ATTCTGGCAG AGAAAAAATT CAGCTTGAAG GTGATACTTG GGTTTATTAT TTACCCCTAC
TATTGCCTGA CTTGGATTCC AATTACCATA CAAGGCATCC TGGAGAAAAA TAACAAGGAA
TGGAACCACA CTGTTCATAC AAGACAGATT AGTATAAATG AACTGGAGAA CAGCAATGGG
TAA
 
Protein sequence
MQNIFFNIIY YISEFIQILI FIAGCYFFGI SIFGWVKRRQ KSPQNIIPTK RFALVVAAHN 
EELVIGHIVD SLFKLNYPKN LYDVFVIADN CTDNTAGIAR RFGAKVHIRE DASKKGKGHA
LEWMFHRIFH MDTSYDAIAV FDADNLVSQN FLLEMNKQMC KGFKVVQGYI DSKNPYDSWI
TCSYSIAFWL SNRIYQLPRY YLKLSCGLCG TGFCIDTSIL KTLKWGATCL TEDLEYTMKM
ALNGVKIGWA HEAVVYDEKP ITLKQSWHQR KRWMQGHAEC AQKYLGALFK KALFKGDLTS
LDCALYLFQP IRFIFVGLMT VMMWVQTVYP QFPLYSVQYV FPVQVWYLMG LFEMFYGPLV
ILAEKKFSLK VILGFIIYPY YCLTWIPITI QGILEKNNKE WNHTVHTRQI SINELENSNG
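
The GC content and the protein sequence can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, the codon table is built from the standard NCBI amino-acid string (translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons), and it assumes the sequence contains only A, C, G and T.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two display blocks of the gene sequence, plus the terminal stop codon
print(translate("ATGCAAAATA TATTTTTTAA T TAA"))  # MQNIFFN
```

Passing the full gene sequence to `translate` should give a string that can be compared against the protein sequence listed above, and `gc_content` returns a fraction that can be compared with the GC content field.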