Gene Ccel_1002 details


Gene Information

Locus tag: Ccel_1002
Symbol:
ID: 7309829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1244410
End bp: 1245879
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 44%
IMG OID: 643607929
Product: glycoside hydrolase family 4
Protein accession: YP_002505344
Protein GI: 220928435
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
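
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the 3 bp stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1244410
END_BP = 1245879


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields listed in the record.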


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTA AAGTTGCTTT TATAGGAGCG GGAAGTCTTG TGTTTACGAG AACACTGTTT 
ACAGACATAA TGTCGGTTCC TGAATTCAGG GATATTAAAA TTGCATTTAC CGACATTAAC
GGAGATAATC TTCAAAAGGT GGCGGAATTG TGCCAGAGAG ATCTTGAGGC CAATGGTATC
ACTACTAAAA TCCAGGCTAC CACTGACAGG CGGGAGGCTT TCAAGGATGC AAAATATATT
GTGAATTGTG TTCGTATAGG AGGCCTGGAA GCTTTTGAAA CAGATATAGA CATACCGTTA
AAATACGGTG TTGACCAATG TGTGGGAGAT ACTCTCTGTA CAGGTGGGAT TATGTATGGA
CAGCGTGTTA TAGCCGCAAT GTTGGATTTT TGTAAAGACA TAAGAGAAGT TTCGGCACCC
GGAGCAATTC TGCTGAACTA CTCAAATCCT AATGCTATGG CAACCTGGGC CTGCAACAAG
TACGGTGGAG TTCGCACCAT AGGGCTTTGC CACGGTGAAA TTCATGGCGA GGATCAGATT
GCCCAGGTGC TGGGAATACC AAGAAACGAA CTTGACATCA TCTGTGCTGG TATAAACCAC
CAAACGTGGT ATATTTCAGT AAAACACAAG GGAAAAGAGC TGTTGGACAA AATACTTCCC
GGATTTGAGG CACACCCCAA GTTCAGCGAG GAAGAAAAGG TCAGAATTGA TGTACTAAAG
CGCTTTGGTT ACTATTCCAC AGAATCAAAC GGTCATCTTT CGGAATACGT GGCATGGTAC
CGTAAGAGAC CCCAGGAAAT AATGAAATGG ATAAACCTTG ATAGCTGGAT TAACGGTGAA
ACAGGCGGCT ATTTGAGAAT TACAAGGGAA GAACGAAACT GGTTTGAGAC TGATTACCCA
AAGATACTTG CTGAACCTCC AAAAAAATAT GACGGTTCGT CCAGAGGGAG GGAACATTGT
TCCTACATAA TCGAGTCTTT GGAAACAGGA AGAAAATACA GAGGACATTT TAATGTAATG
AATGAAGGCT GTATTACGAA CCTTCCGTAT GAGTCGGTGG TTGAAGTTCC CTGTTACGTG
GACGGCAACG GTATATCTGT CCCGAAGGTA GGAGATTTGC CACTGGGCTG TGCCGCAGTT
TGTTCACAAT CCATATGGGT ACAGCGGCTT GCTGTTGAGG CGGCGGTATC AGGGAATGTA
ACTCTGCTGA AACAGGCAGC TTTGATGGAC CCACTCACCG GAGCCGTTTG CAATCCGCCG
GAAATATGGC AAATGATTGA CGAAATGCTA ATAGCTCAGG AAAAGTGGCT TCCTCAGTAT
GTTGAAGGTA TCAAAGCTGC AAAAGAGAGA TTTGCCAAGG GAAACCTTAT TCCTATAAAT
GAAGGATATC GTGGTGCAGT AAGACAAAGG GTCAAAACTC CTTCCGAGGT AGCTGCCGAA
CGTGAATCCA GAAGTATTAC AGCTGATTAA
 
Protein sequence
MSFKVAFIGA GSLVFTRTLF TDIMSVPEFR DIKIAFTDIN GDNLQKVAEL CQRDLEANGI 
TTKIQATTDR REAFKDAKYI VNCVRIGGLE AFETDIDIPL KYGVDQCVGD TLCTGGIMYG
QRVIAAMLDF CKDIREVSAP GAILLNYSNP NAMATWACNK YGGVRTIGLC HGEIHGEDQI
AQVLGIPRNE LDIICAGINH QTWYISVKHK GKELLDKILP GFEAHPKFSE EEKVRIDVLK
RFGYYSTESN GHLSEYVAWY RKRPQEIMKW INLDSWINGE TGGYLRITRE ERNWFETDYP
KILAEPPKKY DGSSRGREHC SYIIESLETG RKYRGHFNVM NEGCITNLPY ESVVEVPCYV
DGNGISVPKV GDLPLGCAAV CSQSIWVQRL AVEAAVSGNV TLLKQAALMD PLTGAVCNPP
EIWQMIDEML IAQEKWLPQY VEGIKAAKER FAKGNLIPIN EGYRGAVRQR VKTPSEVAAE
RESRSITAD
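
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the tool that produced this record: the helper names are hypothetical, and the codon table is the amino acid assignment of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence as printed in this record.
first_blocks = "ATGAGTTTTA AAGTTGCTTT TATAGGAGCG"
print(translate(clean_sequence(first_blocks)))  # MSFKVAFIGA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a comparison against the GC content field, and `translate` against the full protein sequence.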