Gene Ccel_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1111 
Symbol 
ID7309924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1368575 
End bp1369855 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content41% 
IMG OID643608034 
Productglycoside hydrolase family 18 
Protein accessionYP_002505449 
Protein GI220928540 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAATTC ATGTTGTAAA GTCAGGTGAA AGCATATATT CAATTGCACA GCAATACAGG 
GTTTCACCGC AGAAAATAAT ATCCGATAAT GAACTGAACA ATCCGGATCA GCTGGTAGTA
GGACAAACTT TAGTAATACT GGAGGGAACC CGACGACATG TTGTAGCCCC CGGTGAATCA
GTATATTCCA TAGCAAGGTC ATATAGAATA AGTGTAAATG AACTTTTGGC AGCCAACCCA
CAGATATCTG ATCCCTCCAG AATACAGCCG GGAATGGTTA TTACCATACC ACCTGTAACC
TATAATTACG GACCTATGGA AGTAAATGGA TATGCATTTC CCAATATTGA CATGGAAGTG
TTACGGAAAA CGTTGCCTAA TCTGACTTAC CTCAGTATTT TCAGCTATCA GGTAAGTCCT
GACGGCAACC TACAGTCAAT ACCTGATGAG CCACTTATTC AAGCTGCAAG AGCCGCAAGG
GTAGCTCCGC TTATGGTTAT AACCAATATA AAAGAAGGTG GGGGCTTTGA CAGCGATATA
GCACATTCAA TACTAACAAA CGAGACAGCA CAGACAAACC TTCTTAATAA TGTTACCAGA
ATTTTAAGAC AAAAAAATTA TTTTGGACTG GACATTGACT TTGAGTATAT ATACCCATAT
GACAGGGAAA GCTATAATAA CTTCCTGAGA AGAGTTGTAA GGACTCTCAG GCCACTGGGT
TACACCATTA CTACAGCACT TGCTCCGAAA ACATCGGCTA CTCAGAAAGG AAAACTTTAT
GAAGCACATG ATTATCCTGT TCATGGAGCA TTGGTTGATC ACGTTATACT TATGACTTAT
GAGTGGGGAT TCACATACAG TGCTCCTATG GCAGTATCTC CTATAACCGG TGTAAGAAGT
GTTCTTGACT ATGCTGTAAC AGCAATCCCA AGACGTAAAA TATTCATGGG AATATCGAAT
TATGGTTATG ACTGGACTCT TCCGTATACT CCCGGAACTG CTGCCAGAAC GGTTACCAAT
ACAGGTGCAG TGGACCTTGC CAGAAGAAGA GGGGCTGAAA TCCAATATGA TGTAATATCT
CAGGCACCTT TCTTCTATTA CTATGCTGAT GACAGAAAAC AGCATGTAGT CTGGTTTGAG
GATGCCAGAA GTATTTTTGC CAGACTGACT CTGGCACATG AATACAGGCT CGGCGGAGTA
AGTTACTGGA CAATTAACAG TTACTTCCCA CAGAATTGGT TGGTTTTAAG CTCTATATTC
AATATAAGAA AGGTGCTTTA G
 
Protein sequence
MRIHVVKSGE SIYSIAQQYR VSPQKIISDN ELNNPDQLVV GQTLVILEGT RRHVVAPGES 
VYSIARSYRI SVNELLAANP QISDPSRIQP GMVITIPPVT YNYGPMEVNG YAFPNIDMEV
LRKTLPNLTY LSIFSYQVSP DGNLQSIPDE PLIQAARAAR VAPLMVITNI KEGGGFDSDI
AHSILTNETA QTNLLNNVTR ILRQKNYFGL DIDFEYIYPY DRESYNNFLR RVVRTLRPLG
YTITTALAPK TSATQKGKLY EAHDYPVHGA LVDHVILMTY EWGFTYSAPM AVSPITGVRS
VLDYAVTAIP RRKIFMGISN YGYDWTLPYT PGTAARTVTN TGAVDLARRR GAEIQYDVIS
QAPFFYYYAD DRKQHVVWFE DARSIFARLT LAHEYRLGGV SYWTINSYFP QNWLVLSSIF
NIRKVL