Gene Ccel_0374 details


Gene Information

Locus tag: Ccel_0374
Symbol:
ID: 7309258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 426951
End bp: 428303
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 41%
IMG OID: 643607302
Product: beta-galactosidase
Protein accession: YP_002504739
Protein GI: 220927830
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
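
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the last codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = feature_lengths(426951, 428303)
print(gene_len, protein_len)  # 1353 450
```

Under those assumptions the result matches the listed gene length of 1353 bp and protein length of 450 aa.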


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTCA AAGAAGGTTT CGTTTGGGGT ACGGCAACAG CATCATATCA AATTGAAGGA 
GCTGTTAACG AAGGCGGAAG AGGTGAGTCC GTTTGGGATG AATTTTGCAG GATGAAAGGC
AAAATTGATG ACGATGATAA CGGAGATTCT GCATGTGACA GTTATCACAG ATATTCTGAG
GACATACAGC TTATGAAGGA AATCGGTATT AAGGCATATA GGTTTTCCAT AAGCTGGACC
AGGATTTTAC CCGATGGTAT AGGTGAAATC AACATGGAAG GCGTAAACTA CTACAATAAT
CTTATTAATG GGCTGCTGGA AAATGGTATA GAGCCATATG TAACTCTATT TCACTGGGAC
TACCCTATGG AGCTTCAATA TAAGGGAGGA TGGCTGAATC CTGAAAGTCC TCTTTGGTTT
GAAAATTATG CAGCCATATG CTCAAGACTA TTTTCCGACA GAGTAAAGTA CTGGATAACC
AGCAACGAAT CCCAGTGTTA CATTGGATTT GGTTACGGCA CAGGCTGGCA TGCACCGGGC
TTTAAGCTTC CGGTAAACCA GGTGGTAAGA GCTTGGCATC ATAATTTAAA GGGATTGGGA
CTGGCTGCGA AAGCAATACG GGAAAATGCA AAGGGAGAAG TCAAAGTAGG GCTGGTAGCC
TGCGGAGAGG TTGGAATTCC TGCATCAGAC AGTGAGGCAG ATATGCAGGC TGCACGTAAT
GTACTTTTTG ACAGAGAGCA TTCCGAGGAT TCAATCGATT TTGGATATGG GGACCTTTTC
GAGCCTGCAT TAAAGGGAGA GTATCCGAAA AGCCTAATCC CATATCTTCC TAAAGGCTGG
CAGGAGGATA TGAAAGACAT TTGTGTTCCT CTTGATTTTT TAGGCGTGAA CGCTTATATA
GGTTCTATTG TAGAAGCATG TGAAAATAAA AAATACAGAC ACCTTAAATT GCCTGTTGGT
ATAGGCAAAA CTTCCATGGA ATGGCCGTTT AAACCGGAAA CTCTGTACTG GGTAACTAGA
TTTATATCCG AGAGATATAA ATTGCCAGTA TACATTACAG AAAATGGCAT GGCGAATAAT
GACTGGATAA GCACTGACGG AAAAATCAAT GATACTCAGA GAGAAGACTA TTTGAACCAA
TATCTTTCTG CACTGTCAAA GTCTATAGAT GACGGAGCCG ACGTAAGAGG ATATTTTTAC
TGGTCACTCC TTGACAATTT TGAGTGGGCA TACGGATATG CAAAGAGGTT TGGACTTGTA
TATGTAGATT ACAGCAATTT CAGCCGAACT CTAAAACAGT CTGCATTAAG GTATAAGAAA
ATTATCGAAT TAAATGGCGA AGTATTAAAA TAA
 
Protein sequence
MAFKEGFVWG TATASYQIEG AVNEGGRGES VWDEFCRMKG KIDDDDNGDS ACDSYHRYSE 
DIQLMKEIGI KAYRFSISWT RILPDGIGEI NMEGVNYYNN LINGLLENGI EPYVTLFHWD
YPMELQYKGG WLNPESPLWF ENYAAICSRL FSDRVKYWIT SNESQCYIGF GYGTGWHAPG
FKLPVNQVVR AWHHNLKGLG LAAKAIRENA KGEVKVGLVA CGEVGIPASD SEADMQAARN
VLFDREHSED SIDFGYGDLF EPALKGEYPK SLIPYLPKGW QEDMKDICVP LDFLGVNAYI
GSIVEACENK KYRHLKLPVG IGKTSMEWPF KPETLYWVTR FISERYKLPV YITENGMANN
DWISTDGKIN DTQREDYLNQ YLSALSKSID DGADVRGYFY WSLLDNFEWA YGYAKRFGLV
YVDYSNFSRT LKQSALRYKK IIELNGEVLK
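
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below, under stated assumptions, strips the block formatting, computes GC content as a percentage, and translates a coding sequence up to the first stop codon. It assumes the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which start codons they allow) and that the sequence starts with ATG, as this gene does. The helper names `clean`, `gc_content`, and `translate` are illustrative only.

```python
# Compact codon table: first, second, and third base each iterate over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCATTCA AAGAAGGTTT CGTTTGGGGT"))  # MAFKEGFVWG
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first block of the protein sequence. Running `gc_content` and `translate` on the full gene sequence would allow the listed GC content and protein sequence to be cross-checked in the same way.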