Gene Ccel_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0231 
Symbol 
ID7309133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp260554 
End bp262701 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content44% 
IMG OID643607161 
Productglycoside hydrolase family 9 
Protein accessionYP_002504598 
Protein GI220927689 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0413621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAA AATTCCTTAA AACAACCAGT TTCATGCTAA TCATGTCACT GCTGGTGTCA 
TTGTTTGTCT ATGCTCCGAC AACTGTAAGA GCAGAAACTT CGCCGCGTGT GGGAGGCTCA
TTTTACAATT ATGGCGAAGC CATGCAAAAG TCAATTCTTT TTTATAAGGC AAACCGTCTT
GGGGATTTAC CAGACAACTA CGTTTTGCCC TATAGAGGTG ATGCTGCAAT GACTGACGGA
AAGGATGTAG GTCTTGATCT AACCGGAGGA TGGGCAGATG CCGGAGACGG AATAAAATTT
ACGCACCCTA TGTCATACGC CGCAGGACAA CTGGGATGGG CTGTTTACGA ATATCGTGAT
GCCTTTGAAA AGTCTGGGCA ACTGGACGAT ATACTTGACG AAATAAAGTG GGCTACAGAC
TTCTTTATAA AGGCACACCC AAGTCCGGAT GTATTGTACT ATATGTGCGG CTACGAAGAA
TCAGACCATT CTGTATGGAT ACCACATGAG TTATTGGATT ATAAAACTGA CAGAAAGTCT
TTCAAAGTTG ATTCTACAAC ACCGGGTTCA GATGTAGCAG GCCAAACTGC GGCAGCATTG
GCTATTGCAT CAATTATATT TGAGCCAACA GACCCCGAGT ATGCAGAAAC CTGTCTTACA
CATGCAAAGC AGCTGTTTAA GTTTGGTGAT ACATACAGGG GAAAAAATCC TCTAAAAACC
CTGTATCCGT CAGGAGGCTA TCTGGATGAT CTGGCATGGG GTGCAATATG GCTGTATATT
AAAACACAGG ATGCAACATA TCTTGAAAAG GCAAAGTCAA TTCTCCCTGT GACTGTTTTA
GGCGGACAGC ACACGCACTG CTGGGATGAT GTAAGCTATG GTGCAGCCCT AAAAATAGCT
CAGGCAGGTC GTGATGAAAG TTACGCTGCA ATGGTTGAAA AGAACCTTGA TTATTGGATG
CCGGGGACAG GAATAAAATA CACTCCCGGA GGACTTGCAT GGCTTTCACA ATGGGGTTCC
CTCCGTTATG CAACAACTGC GGCATTTTTA GCATTTGTAT GGTCGGATGA CAAAACAATA
GGAACAGCTT CCAAAAAGCA GACTTACCAT GACTTTGCAG AAAGACAGAT AAACTATGCA
CTTGGAGATA ACCCGCGTGG AGGTAGCTAT GAAGTAGGTT TCGGAGTAGA TGCACCTGAA
CATCCTCATC ATCGAACTGC TCACGGTTCA TGGACAAGTA TGCTTAACGT CCCGACCTTC
CACAGACATA TTCTCTATGG AGCATTAGTG GGAGGACCTT CTTCAGACGA CAGTTGGAAA
GATGATATCA AAGATTATAC GCTGAATGAA GTAGCTACCG ACTATAATGC GGGTTTTGTA
GGCTGTCTGG CTAAGATGTA CAGTATGTAT GGAGGGAATC CACTGGAAAA CTGGCCAAAG
GCTGAGGATT TCAGATCACC TCAGGATAAT CTGACGGAGT ATTTCACAAG AGGCTGGATA
ATTTATGAGG GCTACGGCAA GCTGAAAGTT ATGTTCCAGA TTAATAACCG CTCAGCTTGG
CCTGCAACAA TGAAGGATAA AATGTCTACC CGATACTATA TGGATTTATC AGAAATATTT
GAAGCAGGGG GAACGGTAGA TGACGTGCAA TTAACCCTTG AGGATAGTCA GGGGGCAAAG
CTTATAGGAC TCAAGCAGTA CAAGGATAAT ATATATTACT TTACAGTTGA TTTTACGGGT
ACACAGATAA TGCCGGCAGA GTGGGAAATG TGTGAAAAGG ATGCAACTGT ACAGATTGAA
TACAAAAATG GCGTAGGTTC CAATGAAAAT GACTGGTCAT ACCAGAACAT AAGCGGCCCG
CCGGACTTTG ATGCAGTATC CTTTGCAGGA ATGTCCAAAT ACATACCTGT ATACGACAAC
GGTAAGCTTC TTTGGGGAGA GGAACCAGCT GGGAAGGAAC CGGAAGTCAT GTATGGCGAT
ATAAATAATG ACGGAAATAT TGATGCGATA GATTTTGCAC TGCTCAAAAA AATACTTATG
GGCGACACAT CAGGCAATGT CAATTTGACT GCCGCCGATT TTAACAAGGA CGGAGATATA
AATGCTATTG ACTATGCGGC GTTAAAGAGC TATTTGCAAC GTGGATAA
 
Protein sequence
MRKKFLKTTS FMLIMSLLVS LFVYAPTTVR AETSPRVGGS FYNYGEAMQK SILFYKANRL 
GDLPDNYVLP YRGDAAMTDG KDVGLDLTGG WADAGDGIKF THPMSYAAGQ LGWAVYEYRD
AFEKSGQLDD ILDEIKWATD FFIKAHPSPD VLYYMCGYEE SDHSVWIPHE LLDYKTDRKS
FKVDSTTPGS DVAGQTAAAL AIASIIFEPT DPEYAETCLT HAKQLFKFGD TYRGKNPLKT
LYPSGGYLDD LAWGAIWLYI KTQDATYLEK AKSILPVTVL GGQHTHCWDD VSYGAALKIA
QAGRDESYAA MVEKNLDYWM PGTGIKYTPG GLAWLSQWGS LRYATTAAFL AFVWSDDKTI
GTASKKQTYH DFAERQINYA LGDNPRGGSY EVGFGVDAPE HPHHRTAHGS WTSMLNVPTF
HRHILYGALV GGPSSDDSWK DDIKDYTLNE VATDYNAGFV GCLAKMYSMY GGNPLENWPK
AEDFRSPQDN LTEYFTRGWI IYEGYGKLKV MFQINNRSAW PATMKDKMST RYYMDLSEIF
EAGGTVDDVQ LTLEDSQGAK LIGLKQYKDN IYYFTVDFTG TQIMPAEWEM CEKDATVQIE
YKNGVGSNEN DWSYQNISGP PDFDAVSFAG MSKYIPVYDN GKLLWGEEPA GKEPEVMYGD
INNDGNIDAI DFALLKKILM GDTSGNVNLT AADFNKDGDI NAIDYAALKS YLQRG