Gene Ccel_2451 details


Gene Information

Locus tag: Ccel_2451
Symbol:
ID: 7311121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2959323
End bp: 2961383
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 44%
IMG OID: 643609381
Product: Glycoside hydrolase family 42 domain protein
Protein accession: YP_002506760
Protein GI: 220929851
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
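
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either point explicitly, although both are consistent with its numbers. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length_bp(2959323, 2961383)
    print(length, protein_length_aa(length))  # prints: 2061 686
```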


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.987943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAAT ATATGCCTAT TGGTATGAAT TTTCCACATA TTCTGCATGG TGGAGACTAT 
AATCCGGATC AGTGGTTGAA GGCGCCAAAG ATATTGGATG AAGATTTTCG TCTTATGAAG
CTTGCACATT GTAATGTAAT GACTATAGGA ATTTTTTCGT GGATGGCATT GGAGCCGGAA
GAGGGGAAGT TTGATTTTAG TTGGCTGGAC GAAATTATGG ATAAACTTTC TAAAAACGAA
ATGACCGCCA TTCTGGCTAC CCCAAGCGGT GCAAAACCCT CGTGGATGTC GCAAAAGTAC
CCGGAAATAT TGCGTACTCA GGCGGACAGG CTAAAGGATT TCCATGGCGG ACGTCATAAC
CATTGCCTTA CATCCCCCAT TTACCGTAAA AAGGTAAGCC AAATCAACCG TTTGCTGGCA
GAGCGGTATG CAAGGCATCC TGCACTAATT TTATGGCATA TCTCAAATGA GTACAGCGGG
GAATGTCATT GTCCATTATG TCAGGAAGCG TTCCGCAGTT TTCTTATGAA AAAGTACGAG
GGTGACCTCG AGCGGCTTAA CCATGAGTGG TGGACGGGTT TCTGGGCACA CCGATATACA
GATTGGTCAC AAATAGAGTC ACCATCTCCC CGGGGTGAGC ATGAGACTCA TGGACTTATT
CTGGACTGGA AGAGGTTTGT CTCACACCAG ACAATTGATT TTTTAAAAAA CGAGATTAAA
CCTCTGCGGG AGCTGACACC GGAAATTTCC GTCACTACCA ACCTGATGGG AACCTTTCCG
GGTTTAGAAT ACCGGGAACT TTCAAAAGAG ATTGACGTAG TTTCATGGGA TAATTACCCG
ACTTGGCATA CCTCGGAGCA GCCGGATTGG AAAATAGGCA TACGAACCTC CTTTGTCCAT
GATATCAACA GGTCTCTTAA GCAGGGCAAG CCGTTTATGA TGATGGAAAG TACACCCAGT
ACGGTTAACT GGCAAAAAGT AAACAAACTC AAACGTCCCG GTATGCATAT GTTATCTTCA
CTTCAGGCAG TTGCTCATGG CAGCGATACA GTTCAGTATT TTCAGTGGAG AAAATCAAGA
GGAAGTTTTG AAAAATTTCA TGGTGCGGTG GTAGATCATG TTGGTCACGA GAATACCCGT
GTTTTCCGGG AGGTTGCACA AGTAGGGCAA GTTCTTGAAA AGCTGGATGA GGTGGTGGGA
ACAACGGTTC ATCCAAAGGT TGCTATCATA TTCGACTGGG AAAATCGGTG GGCTATGGAG
GAGGCACAGG CTCTTGCACG TGACAGGATA AAGTACGAGG AAACCTGTGT CGCACATTAT
ACAGCCTTCT GGAAACGAGG TGTATCTGTA GACATTATAG GTGCAAGAGA TAATTTATCA
CAGTATAAAC TAGTTATTGC ACCAATGCTC TATCTGACCC ATCCCGGTGT CGGAGAGCGT
ATAGAGACAT TTGTAAATGA GGGAGGTACC TTTGTTGCCA CCTATTGGAG TGGAATTGTA
AATGAAAATG ACCTATGTCA TTTGGGCGGA TTTCCGGGTC CGCTGCGTGC AGTTACAGGT
ATTTGGAGTG AGGAAATAGA TACTATGCAC CCGGATGAAA ACAATAGTCT CATACTGAGT
GATAACTCTC TGGGTCTCTC AGGGACTTAT AGGGTACAAG ATTTCTGTGA TCTGGTTCAT
GCCGAATCCG CAGAGGTTCT GGCTCGGTAT GGAGAGGAGT TTTATGCGGG TAGGCCTGCA
CTGACAGTTA ACCGTTTTGG AAAGGGGAAT GCCTACTATA TGGCAGCCCG TACAGGAGAA
GATTTCCTTG AGGATTTTTA TAATGCCCTT ATGAAAAAAC TAACACTATC ACGTACACTG
GACGTTGAAC TACCATGCGG AGTGACAAGC CAGCTTCGTA CAGATGGGCA AAACCGCTAC
ATATTTCTTA TGAACTTCAA TCACGAAAAA CAAGATATTC AGCTTACTAA AGGTTTAACG
GATATGTTCA CTGGAGAAAT CAGGCAGGGG ATGCTGGAAC TTGCTCCTTT TGACGTAAAA
GTGCTAAAAG CTAAAATATA G
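
The gene sequence above is laid out in space-separated groups of 10 bases, so it needs to be joined before any analysis. A minimal sketch of that cleanup, together with a GC fraction calculation that could be compared against the reported GC content of 44%, might look like this. The helper names are hypothetical, and the GC calculation assumes the sequence contains only A, C, G and T.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence block, copied as displayed.
    first_line = ("ATGGATAAAT ATATGCCTAT TGGTATGAAT "
                  "TTTCCACATA TTCTGCATGG TGGAGACTAT ")
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full block to clean_sequence should yield a string whose length matches the Gene Length field.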
 
Protein sequence
MDKYMPIGMN FPHILHGGDY NPDQWLKAPK ILDEDFRLMK LAHCNVMTIG IFSWMALEPE 
EGKFDFSWLD EIMDKLSKNE MTAILATPSG AKPSWMSQKY PEILRTQADR LKDFHGGRHN
HCLTSPIYRK KVSQINRLLA ERYARHPALI LWHISNEYSG ECHCPLCQEA FRSFLMKKYE
GDLERLNHEW WTGFWAHRYT DWSQIESPSP RGEHETHGLI LDWKRFVSHQ TIDFLKNEIK
PLRELTPEIS VTTNLMGTFP GLEYRELSKE IDVVSWDNYP TWHTSEQPDW KIGIRTSFVH
DINRSLKQGK PFMMMESTPS TVNWQKVNKL KRPGMHMLSS LQAVAHGSDT VQYFQWRKSR
GSFEKFHGAV VDHVGHENTR VFREVAQVGQ VLEKLDEVVG TTVHPKVAII FDWENRWAME
EAQALARDRI KYEETCVAHY TAFWKRGVSV DIIGARDNLS QYKLVIAPML YLTHPGVGER
IETFVNEGGT FVATYWSGIV NENDLCHLGG FPGPLRAVTG IWSEEIDTMH PDENNSLILS
DNSLGLSGTY RVQDFCDLVH AESAEVLARY GEEFYAGRPA LTVNRFGKGN AYYMAARTGE
DFLEDFYNAL MKKLTLSRTL DVELPCGVTS QLRTDGQNRY IFLMNFNHEK QDIQLTKGLT
DMFTGEIRQG MLELAPFDVK VLKAKI
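
The protein sequence above can be reproduced from the gene sequence by codon translation. The sketch below is one way to do that without external libraries. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special handling of alternative start codons is included. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a CDS (spaces and newlines allowed) up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("ATGGATAAAT ATATGCCTAT TGGTATGAAT"))  # prints: MDKYMPIGMN
```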