Gene Ccel_1258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1258 
Symbol 
ID7310051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1559728 
End bp1560858 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content41% 
IMG OID643608179 
Productglycoside hydrolase family 8 
Protein accessionYP_002505594 
Protein GI220928685 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAG GAGCTTATTT TACAAAACAG TACCCTAATT TATTTGCCGA ACTGGGTATT 
TCTGACGAAC AAATTAATAA AAAGGTTAAT GACACTTTTA ACACTATGTT CTTCGATCCC
GAGGAGAAAA TTTATTTTGA AATAGGTAAA GACATGGGAT ATATGATGGA TACAGGTAAT
AATGATGCAC GTACAGAGGG TATGAGCTAT GGGATGATGA TGACTCTGCA AATGGATCGG
AAAGACATTT TTGACCGCTT GTGGTTATTT TCCAAAACAT ATATGTATCA AAACGAGGGA
AAGTATCAGG GCTATTTTGC ATGGTCGGTA TCTACCGACG GAAAGAAAAA TGCCGAAGGG
CCTGCACCTG ACGGAGAAGA GTATTTCGCT ATGGCTCTTT TCTTTGCAGG CAAAAGATGG
GGTGACGGTA AGCCGCCCTT TGACTATAGT ATTCAAGCCA GGGATATTTT AAAACATTGT
ATACACCAGT CGGAGATTGT TGAAGGTGGA GAACCTATGT GGGATAGTAC CAACCATTAT
ATAAAATTTG TTCCTGAAAC GCCTTTCTCT GATCCGTCTT ACCATCTGCC CCATTTCTAT
GAGCTTTTTG CGCTTCTGGC TAATGAAGAG GATAAAGACT TCTGGAAAAA AGCTGCTGAG
GCAAGTCGTA ATTACCTGCA TATTTCATGC GACAGGGACA CTGGGATGGC ATCGGAATAT
GCTGAATTTG ACGGTACTCC CAAAAAGCTG TTCCGTGATT TTCAGTTTTA TTCTGATTCA
TACCGCGTTG CAATGAATAT AGGATTGGAT GCGGCGTGGT TCAGTAAGGA CGAGTCATTA
GGGGATATCG TTGACAAGCT TCAGTCCTTC TTTAGTGAAA ATACGGTGTT AGGCGAATAT
AAGGCCTATA CTGTTAAAGG TGAGCCTTTT GATGCTCCTG CCATGCACCC CGTTGCAATT
ATCGCTACAA ATGCCGCCGG TTCACTTGCT GCTAAAGGGA AATACAGAGA TCAGTGGGTA
AAGGATTTCT GGGAGCTTCC ATTAAGAAAA GGAGTTCATA GGTATTATGA TAACTGTCTG
TACTTTTTCA GTTTACTGAT GCTGGCAGGA AAATATAAAA TTTACATCTA A
 
Protein sequence
MSKGAYFTKQ YPNLFAELGI SDEQINKKVN DTFNTMFFDP EEKIYFEIGK DMGYMMDTGN 
NDARTEGMSY GMMMTLQMDR KDIFDRLWLF SKTYMYQNEG KYQGYFAWSV STDGKKNAEG
PAPDGEEYFA MALFFAGKRW GDGKPPFDYS IQARDILKHC IHQSEIVEGG EPMWDSTNHY
IKFVPETPFS DPSYHLPHFY ELFALLANEE DKDFWKKAAE ASRNYLHISC DRDTGMASEY
AEFDGTPKKL FRDFQFYSDS YRVAMNIGLD AAWFSKDESL GDIVDKLQSF FSENTVLGEY
KAYTVKGEPF DAPAMHPVAI IATNAAGSLA AKGKYRDQWV KDFWELPLRK GVHRYYDNCL
YFFSLLMLAG KYKIYI