Gene Ccel_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1229 
Symbol 
ID7310026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1505662 
End bp1507269 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content41% 
IMG OID643608150 
ProductCarbohydrate binding family 6 
Protein accessionYP_002505565 
Protein GI220928656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.166566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTAAAA AAATTAAAAA GTTTTCAGTT TTTATGGTAG CTCTTTTATT TTTGTCTGTT 
TATAGTTTTA ACTTAGTATT TGCAGACTAT CCCATATTTT ACCAGAGGTA TACGGCCGAT
CCTTCAGGTT TAGAAGCCAA TGGGAGACTT TATCTGTATT CTTCTCATGA TGTATATGAC
CCTAACAAAC CGGGTTATAT AATGAATGAC ATTACATGTA TATCTACCGA TGACTTGAAG
AATTGGACAG ACCATGGAGA GGTTTTTAAA GCTTCTGGCT GGGCATCATT ATCTTGGGCA
CCGGTAGTTG TTGCAAAAAA CAATAAATAT TATATGTATT TTGGAAACGG TGCTGGAGGT
ATAGGTGTTT CGGTAAGCGA CAGTCCTACA GGTCCTTTCA AGGATGCACT GGGAAAAGCT
TTGATAAATG GGAGCACACC CGGGGTAAAT CCCCCCAGCG GATTTTGGTG CTTTGATCCG
GGAGCCTTTG TGGACGATAA CGGTCAGGCA TATTTGTATT TCGGAGGAAA CGGGGAAGGT
AATACACGCG TTATAAAGCT CAACAGCGAC ATGATAAGTC TTAATGGTTC TGCATCTGGT
ATTACAGCTC CAATCTTTTT TGAGGATTCG TGGATACACA AGTATAACGG CAAGTACTAT
TATTCCTATT CAACAAATTT CTCAAAGGGT GCTGCTACCA TAGATTATAT GATGAGCGAC
AATCCAATAA CCGGATTTCA GTACAAGGGC ACCGTTCTGG CAAATCCCCC TTTAAACGAA
GGCAACAATA ATCACCATAC TATATTTCCA TTCAAAGGGG ATTGGTATAT TGCTTATCAC
AACAGGGCTC TTGCAATAGC CAATGGTGCT GCCAGCGGTG ATGCACGGAC GTATCAAAGA
AGTGTATGTA TAGATAAACT CAATTACAAT GCAGATGGAA CCATGCAGAA GGTTACAATT
ACGACAGATG GTCTTAAACA GCTTAAGTAT GTAAATCCTT ATGTAACAAA CGAGGCCGAA
ACCATGGCAC AGGGAAGCGG GATCAACACG GAAGAATGTA CCGAAGGAGG CCGTGATGTT
GCCTTTATTG AAAACGGAGA CTGGATCAAG GTGAGAGGTG TTGATTTTGG TACTGCTGGA
GCAGCCTCCT TTGACGCAAG GGTAGCATCA TCAACCAGCG GAGGAAATAT TGAAATCCGT
CTCGACAGCC TTACAGGAAA GCTTGTAGGG ACTTGTGTTG TTGAGAATAC AAGTGGTTGG
CAGAATTGGA CTACAAAGAC CTGTTCTGTA AGCGGTACTA CAGGTGTCCA CGATCTGTAC
CTGAAGTTTA CCGGCGACAG CGGGTATCTG TTTAATCTAA ACTCATGGAG ATTCAATACT
TCAGGAGCAA AAACAGTATA TGGCGATCTT GATGGCAGCG GTGATATTAA TGCTATTGAC
TTCTCACTTA TGAAGCAATA TCTGCTTGGT TCAATAACCA AATTTCCTAT AGAAGATGGG
ATAATTGCTG CGGATTTGGA TGCCAGCGGT ACAATTGATG CGATCGACTA TGTACTTCTC
AAAGAATATT TACTTGGTAA GAGGACCCAA TTTCCTGCTG AATTATAA
 
Protein sequence
MFKKIKKFSV FMVALLFLSV YSFNLVFADY PIFYQRYTAD PSGLEANGRL YLYSSHDVYD 
PNKPGYIMND ITCISTDDLK NWTDHGEVFK ASGWASLSWA PVVVAKNNKY YMYFGNGAGG
IGVSVSDSPT GPFKDALGKA LINGSTPGVN PPSGFWCFDP GAFVDDNGQA YLYFGGNGEG
NTRVIKLNSD MISLNGSASG ITAPIFFEDS WIHKYNGKYY YSYSTNFSKG AATIDYMMSD
NPITGFQYKG TVLANPPLNE GNNNHHTIFP FKGDWYIAYH NRALAIANGA ASGDARTYQR
SVCIDKLNYN ADGTMQKVTI TTDGLKQLKY VNPYVTNEAE TMAQGSGINT EECTEGGRDV
AFIENGDWIK VRGVDFGTAG AASFDARVAS STSGGNIEIR LDSLTGKLVG TCVVENTSGW
QNWTTKTCSV SGTTGVHDLY LKFTGDSGYL FNLNSWRFNT SGAKTVYGDL DGSGDINAID
FSLMKQYLLG SITKFPIEDG IIAADLDASG TIDAIDYVLL KEYLLGKRTQ FPAEL