Gene Ccel_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0335 
Symbol 
ID7309223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp386508 
End bp388442 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content40% 
IMG OID643607264 
Productglycoside hydrolase 15-related 
Protein accessionYP_002504701 
Protein GI220927792 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID[TIGR01577] oligosaccharide amylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAT CATATTATAA CAACGCAATT ACCGGAAATT CTTCAATGCT GGCATGTTTT 
AGTGAAAGAG CTGAACTTTT AAGACTTTTT TGGCCCGATA TTGATTATAT CCAGAATTTG
GATAAAATGT TTCTTGGACT ATTTGAGAAA AATAAAACAG GAAGCACTGT CTGGCTTAAT
GACATCCGGT GTGAACATCA TCAGGAATAC CTTCCTGATT CTAATATAAT TAAAAACATG
GTTACAAATT TTTTTGACGG ATACAAGGTA GTACTATATG ACTTTGTACA TCCTGAAATG
GATGTTTTGG TACGAAGATT TGAAATAGAG AATTTACGCG GCGAGAGCAG GGAATTGGGA
CTAATGAGTT TTTCAGCCGC CACCAGCAGT GATTCAGAGG TGGCATGCAG CTTGTTTGAT
TTCATGAATG AAGCACTGGT TCATTATAAG CCGGACAGCT ATATTGCCGT TACATCAGAT
ATTCCTGTAT ACCAGTTCCA AATCGGTAAT AATGCCAATG ATGCCGCTGT TAATACATAT
CTGTATGGCA AGGACGATAT AGGAATGATG AAGGATGCGG CCATATCATG GGATCTGGGA
GTTTTTCAGC CTCATGCTGT AAAGACTACA AATGTATATC TATGTGCGGC AGATACCCTG
AAATCCTGTA AAGCTCTTGT AAGAAGAGTA AAAACAGTAG GAGGGCTTAC AGCCTTCAGA
GAGACAGGGC GGTACTGGAA GGATTATCTG GAGAAAACAA CTAAATTAAA ATCAGGTAAC
ACTCTTTTGG ATGACTTATA TAAAAGATCC CTGCTTGTAT TCAGACTGAT GTATAGTAAA
AAAAGCGGCG GATTGATGGC TGCACCTGAA GTTGATGAAT ATTTTACAAA ATGCGGGAAA
TATGCCTATT GCTGGGGAAG GGATGCGGCC TTTATAACAG GTGCACTGGA CATTGGAGGA
TTGTGCGAAA GTGTTGACCA TTTTTATAAA TGGGCTGTAA ACGTTCAGGA TGAGGACGGG
AGCTGGCAGC AGAGATATCA TATGAACGGT AATTTAGGTC CCTGCTGGGG GCTTCAGGTG
GATGAGACAG GGACAATAAT CTGGGGAATG TTGAACCACT ATAACTATAC AAAAAATACA
GACTTTCTGA AATCCGTGTG GGATAGTGTA AAAGCGGCCG CAGATTTCCT TGTGAGGTTT
ATAGACAGTG AAACAGGTCT CCCAAGGCCC AGCTTTGACT TATGGGAAGA GAGATATGGA
GAACATGCAT ATTCCTCGGC TTCCGTATGT GCAGGACTCA AGTCTGCATC AGAAATGGCA
CGTATACTGG GAAAACCTTC CCAAGAATAT ATTCAATGGG AGACAACAGC AGACAGTATT
AAAAAGGCAA TAGTTAAATA CTTTTGGAAA GAAGATTACA GACGTTTTAT CAGAAGCATA
CGGGTAAAAT TAAACGGCTT CGGGCAGGAG CCTTCTTCTG ATACTATGCT GATTAAGGTA
AATCCAAAGG GCTATGTAAG GGATGTAACA AAAGAGGATT GGATTGTAGA TGTAAGCCTT
GTTGGATTGG GTATTCCCTT TGAAATTTTT GAGTTGAATG ATCCAATGTT GAGGGATACA
GTTTCATTAA TTGAACAAGT CCTTACGGCA CAAGGAGTTG GCGGAATAAA AAGATATGAA
AACGACACAT ATATAGGCGG AAATCCGTGG ATTCTTACCA CCCTTTGGAT AGCATTGTAC
CATGCTAAAT CAGGAAACTA TAAAAAAGCA AAGGAATATC TGATATGGGC TGCAAGTGGA
AAAACAGAAC TGGGTCTGCT GCCGGAACAG ATTAACAGGG ATACGGGAAA ACCAGAATGG
ATAATTCCGC TTACATGGTC TCACGCAATG TACGTGCACG TTTATTCAGA GCTTATAAAT
GCGGGTGTAC TGTAA
 
Protein sequence
MQKSYYNNAI TGNSSMLACF SERAELLRLF WPDIDYIQNL DKMFLGLFEK NKTGSTVWLN 
DIRCEHHQEY LPDSNIIKNM VTNFFDGYKV VLYDFVHPEM DVLVRRFEIE NLRGESRELG
LMSFSAATSS DSEVACSLFD FMNEALVHYK PDSYIAVTSD IPVYQFQIGN NANDAAVNTY
LYGKDDIGMM KDAAISWDLG VFQPHAVKTT NVYLCAADTL KSCKALVRRV KTVGGLTAFR
ETGRYWKDYL EKTTKLKSGN TLLDDLYKRS LLVFRLMYSK KSGGLMAAPE VDEYFTKCGK
YAYCWGRDAA FITGALDIGG LCESVDHFYK WAVNVQDEDG SWQQRYHMNG NLGPCWGLQV
DETGTIIWGM LNHYNYTKNT DFLKSVWDSV KAAADFLVRF IDSETGLPRP SFDLWEERYG
EHAYSSASVC AGLKSASEMA RILGKPSQEY IQWETTADSI KKAIVKYFWK EDYRRFIRSI
RVKLNGFGQE PSSDTMLIKV NPKGYVRDVT KEDWIVDVSL VGLGIPFEIF ELNDPMLRDT
VSLIEQVLTA QGVGGIKRYE NDTYIGGNPW ILTTLWIALY HAKSGNYKKA KEYLIWAASG
KTELGLLPEQ INRDTGKPEW IIPLTWSHAM YVHVYSELIN AGVL