Gene Ccel_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1004 
Symbol 
ID7309831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1246895 
End bp1249270 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content44% 
IMG OID643607931 
Productglycoside hydrolase family 31 
Protein accessionYP_002505346 
Protein GI220928437 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.667554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGGAAG CAATCAAACC CAAACAGGCA AAAGTCACCG AAGTTCAACG AAGCGAGGGT 
GCTCTGGTTC TCAACACCGA AAATGGTTTA CTAAAGATAG AACCAGTTAA TGCCAATATA
ATCAGGGTAG TGTACACACT GGAAGACAAA TTCTCTGCCG TTCGGGGGCT TGGCATTGAA
CCGCAAAAAG TATTTGAAGG CTGGGAGTTC AAGGAGAATG AACAAACAGT AATGCTTGAT
ACTGGAGAGC TGCAGCTTTA CATTAGGAAA TCCGACAGCC GTATGACTTA TTTTGATGGT
AAAGGCAGAC GCCTGACAGG GGAACCTGAA AAGAACGGCA AGGAGCTGGT TCCCTTTGAT
TCCTTCAAAA CAGTTCTGGA TGATACGGCA GTTGTTGAAA AAATTGAAAC TCCGGATGGT
ATAAAGGAAG TAGTGCTGGA TTCCAAGAAG AAATATGACC GTCAGCTATA CCATACAAGA
CTTAATTTTG AATGGGACAA AGACGAAGCC CTTTATGGCA TGGGACAGAA TGAAGAAGGC
TATCTGAACC TGAGAGGAAC CCGCCAGTAT ATTCATCAGG CCAATATGAA GATAGCCATG
CCGTTTCTCA TGTCAACTAA CGGCTACGGA ATACTGTTGG ATACGTACGC CCCGGTTATA
TTCAATGATA ATGCCTTTGG ATCATACCTG TACGGTGAAG CTTCGGCAGA GCTGGATTAT
TATTTTATAC GGGGAGAAAG CTTTGACCAA ATTATAGGAG GCTACCGCCA ATTGACTGGC
CGGGCATCCA TGCTTCCACT CTGGGCCTTT GGATTTATGC AGTCTCAGGA GAGGTATGAA
ACCCAACAGG AGATTATTGA CACGGTCAAG CGTTATCGTG AACTTGGTGT TCCACTGGAC
GGTATTGTCC TTGACTGGCA GTCCTGGGAA GAAGGTATGT GGGGGCAGAA AACCTTTGAT
ACAGAGCGTT TTCCAGATGC TAAAAACATG ATGGAGCAAG TACATGAGCT GGGAGCTCAC
CTTATGATAT CTATATGGCC CAATATGGTC GAGCATTGTG AAAATTACAA GGAAATGAAA
GCTAATAACG GTTTGTTCCA GCGTTCTGAA ATATATAATG CATTTAGTTC CGAAGCAAGG
AAATTGTATT GGAAACAGGC CAAGGAAGGA TTATTTTCAA AGGGAGTGGA TGCTTGGTGG
TGCGATTCCA GTGAACCATT TACTCCTGAA TGGAACAATC CTGTAAAACC GGAGCCAGAT
CAGAACTTGA CGGCCTTTCA CAACACTTGC AGAACATATA TGGATGAGGT ATATACAAAT
GCGTACCCAT ATATGCATGC CAAGACAATA TATGAAGGAC AGCGGGAGAC CGATGGGCAA
AAAAGAGTTG TCAATCTGAC AAGAAGCGGT TATACAGGTA TACAAAAGTA TGGAACCATT
CTGTGGTCGG GTGATACCAG TGCCAAATGG TCAACTCTTA AAAATCAGAT TGCTGCAGGC
TTGAATTATT GTGCATCAGG CTTACCGTAC TGGACTATGG ATATAGGAGC ATTCTTTGTA
AAGCAGGGAC ACATGTGGTT CTGGGACGGA GACTATGAAG GCGGTTGCAG CGATCTGGGC
TATAGAGAGC TTTATACAAG GTGGTATCAG CTGGGTGCAT TCCTGCCGGT GTTCAGATCT
CATGGTACCG ATTGCCGGAG GGAAATATGG AATTATGGGA AGAAGGGTGA ATTTTTTTAC
GACGCCATAG AGAAGATAAC ACACTTGAGA TATCAGCTCA TGCCGTATAT ATATTCACTT
GCAGGTATGG TTTCACAGAA GCATGGCACA ATTTTAAGAT TGCTTGCATT TGATTTTATT
AATGATGCAA AAGTGTATGA TATCGACGAC CAGTTTATGT TTGGTCCCAG TCTGATGGTA
TGTCCGGTGA CTGCTCCTAT GTACTATGAA GCGGATAGCA AGCCTATTGA GGGAGTAGCA
AAGACAAGAA AAGTATATCT GCCTGCCGGC AGTGACTGGT ATGATTTCTG GACTGAAAAG
CGGTTTAAGG GAGGTCAGTC CATTGAAGCC GAGGCACCTA TTGACAGGAT TCCAATATAC
GTTAAGGCCG GTTCAATTCT GCCTATGTCT GAGCAGATAC AGCATACCGG GCAAATGAAG
GATGATTGTT TCATGCTGGT TGTATACCCG GGAGAGGACG GAAGCTTCAC ACTTTATCAG
GATGAACGGA ATGGATATGG TTATGAAGAA GGCAAATTTA CAACAACGGA ATTAACCTGG
TCCGACAGCG AAAAGAAGCT TACAATCCAT CCGCACAAGG GAGAATATCC GAGTATGCCT
GAAAAGGTAA CCTTTTTAAA AAGAATTGTA GGATAA
 
Protein sequence
MLEAIKPKQA KVTEVQRSEG ALVLNTENGL LKIEPVNANI IRVVYTLEDK FSAVRGLGIE 
PQKVFEGWEF KENEQTVMLD TGELQLYIRK SDSRMTYFDG KGRRLTGEPE KNGKELVPFD
SFKTVLDDTA VVEKIETPDG IKEVVLDSKK KYDRQLYHTR LNFEWDKDEA LYGMGQNEEG
YLNLRGTRQY IHQANMKIAM PFLMSTNGYG ILLDTYAPVI FNDNAFGSYL YGEASAELDY
YFIRGESFDQ IIGGYRQLTG RASMLPLWAF GFMQSQERYE TQQEIIDTVK RYRELGVPLD
GIVLDWQSWE EGMWGQKTFD TERFPDAKNM MEQVHELGAH LMISIWPNMV EHCENYKEMK
ANNGLFQRSE IYNAFSSEAR KLYWKQAKEG LFSKGVDAWW CDSSEPFTPE WNNPVKPEPD
QNLTAFHNTC RTYMDEVYTN AYPYMHAKTI YEGQRETDGQ KRVVNLTRSG YTGIQKYGTI
LWSGDTSAKW STLKNQIAAG LNYCASGLPY WTMDIGAFFV KQGHMWFWDG DYEGGCSDLG
YRELYTRWYQ LGAFLPVFRS HGTDCRREIW NYGKKGEFFY DAIEKITHLR YQLMPYIYSL
AGMVSQKHGT ILRLLAFDFI NDAKVYDIDD QFMFGPSLMV CPVTAPMYYE ADSKPIEGVA
KTRKVYLPAG SDWYDFWTEK RFKGGQSIEA EAPIDRIPIY VKAGSILPMS EQIQHTGQMK
DDCFMLVVYP GEDGSFTLYQ DERNGYGYEE GKFTTTELTW SDSEKKLTIH PHKGEYPSMP
EKVTFLKRIV G