Gene Ccel_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3001 
Symbol 
ID7311610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3552745 
End bp3553755 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content42% 
IMG OID643609905 
Productglycosidase PH1107-related 
Protein accessionYP_002507275 
Protein GI220930366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.345013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATTA ACGGTAAGAG TTTAAAGAAT ATTCCGTGGC AAGATAAACC TCTCGGCTGT 
AATAGCGTTA TATGGAGACA CAAAGGGAAT CCCATTATCG TCTGGAATCC TACACCTAAA
ACAGCAAGGA TATATAATAG TTCTGTAGTT CCCTGGAACT CAGGCTTTGC AGGTATTTTC
AGAGCAGACC ACAAAGACGG TAAAGCTCAA ATCCATGTTG GATTCAGCAG TGATGGGGTT
AATTGGAACA TTGAGGATGA GCCTATAGTA TGGCATGATG AGGATGGTAA TCTGTATCAG
CCTAACTATT CGTATGACCC CCGGATTGTG GAATTGGAAG GTATTTATTA TATTGTCTGG
TGCACAGACT TCGGTGGGGC CTCTCTGGGC CTAGGTGTCA CAAAAAACTT TAAACAGTTT
ACACGTCTTG AAAATCCTTT TATACCTTTT AATCGTAATG GTGTTTTGTT TCCACGCAAG
GTAAATGGTA AATACTTACT TTTAAGCAGA CCCAGCGATA CAGGTCATAC ACCTTTCGGA
GATATTTTCA TAAGCGAGAG TCCCGATCTT GTTCACTGGG GACGTCACAG ACGTGTAATG
CAAAAAGGAG GTTCAGGGTG GTGGCAAAGT GTAAAAATAG GAGCAGGTGC GGTTCCTATC
GAAACAACGG AAGGCTGGCT TCTCTTTTAC CATGGTGTTT CAGGAACCTG TAATGGCTTC
GTATACAGTT TTGGTGCGGC AATTCTGGAC ATTGAAATCC CTTCTAAAGT TCTTTACCGC
ACAAGAGATT ATCTTCTCAC CCCCGAAATG TCATATGAAA CATCAGGTTT TGTACCTAAT
GTGGTGTTCC CTTGTGCTGC ACTGCACGAT TCTGAGACTG GCAGAATCGC TATTTATTAC
GGTGCCGCCG ACACATATTC CGCTCTTGCA TATGCAAAGG AAGATGAATT AATAAACTTT
ATTAAATCAA ATTCCGAGTT GCTGCCAGGC GATGCGGAGG AATATAGATA G
 
Protein sequence
MSINGKSLKN IPWQDKPLGC NSVIWRHKGN PIIVWNPTPK TARIYNSSVV PWNSGFAGIF 
RADHKDGKAQ IHVGFSSDGV NWNIEDEPIV WHDEDGNLYQ PNYSYDPRIV ELEGIYYIVW
CTDFGGASLG LGVTKNFKQF TRLENPFIPF NRNGVLFPRK VNGKYLLLSR PSDTGHTPFG
DIFISESPDL VHWGRHRRVM QKGGSGWWQS VKIGAGAVPI ETTEGWLLFY HGVSGTCNGF
VYSFGAAILD IEIPSKVLYR TRDYLLTPEM SYETSGFVPN VVFPCAALHD SETGRIAIYY
GAADTYSALA YAKEDELINF IKSNSELLPG DAEEYR