Gene Ccel_1655 details

Gene Information

Locus tag: Ccel_1655
Symbol:
ID: 7310403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1995887
End bp: 1997194
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 41%
IMG OID: 643608583
Product: cellulosome protein dockerin type I
Protein accession: YP_002505986
Protein GI: 220929077
COG category:
COG ID:
TIGRFAM ID:
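
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record above.
length_bp = gene_length(1995887, 1997194)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Both results agree with the Gene Length and Protein Length fields of this record.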


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000388378
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAAA AAGCATTATT GTTAACTATT GTGATTAGCA ATATAGTTGT TGGGTCAATG 
ACCACGGGGA CAATGGCTGT TACGCAAATG GTAAAATCTA TGGGTGAAGG AACAGCTTCA
GCGGCAGCGA CGACTACAAA TGATATTAAA TATGGTGATG TTAATATGGA TAATGCAGTT
GATTCTGTAG ATCTGGCATT ACTAAAGGCT TATATCTTAG CTATAACAAG TACTCTGCCA
AATATCGCAG CTGCGGACGT TACTGGTGAC GGTACCCTTG ATGCACTTGA TTACGCTGTA
CTTAAAAAAT ACCTTTTGGG ACTAATCACC ATTTTACCTG CTGATGACAA TGGGAATGGG
AAAATACTGA TTCCACATAA ATCATGGACG TGTGGAATGG CTGATGGCAT ACCCAAGCCC
GAAACCGGAG TACTTGTTTT TGAAACTACT ATGAAGCTAC AAAACAGTTA TGATCTGGGG
AAAACCCAAT ATGGACTGAG AAAAGTTTTT GTAGTTCAAA ATGGCAGTAT AACCGCTACA
AAAATACAAG GTTCAGTTAT GTCGGGGGGC CTTGATTTTC AGTTGACTCT TTCAAATGGT
GCAATGGAAA TTGAACAATT ATTAATGATT AAGACGAATG ACGGGAATTA TATCTATCTA
AGAAGTGCCG GAACAGCCGT AAACCAGAAT GATGTGAGGA TGGTGTGGGA TTTTGAAGCT
CCAAACTCAA GCTCATACAA TTGGCTTAAC TCTGGCAAAT ATGTGGGCAG GCGTATTATA
GACTCAGTTG CCGGAACAAT GAAGATAAGT GTTTATGACG TATCCGGCAT AAATTTTACA
CCGGATTCCA CAAATTCATT AATAGTAACT GAACCGGACG ATGTGCCGGA CCAGCCATGG
GACTATAGAA AGGCATCCTA TGAGAGAAAC GGCAGTAAGT TTATAACTGA GGCGGTCAGT
CTTGGGGCGA GTCAATCTGT AGGAGCAAGC AAGAGAGGTA GCAGGAACAT TATTCCCATA
ACTGGCGGAT CTGTGACCGG GAATTTAACC GCTAAGATTT TAGCGGCAGG TGCGGATTAC
CAGAACCTAT CAAACCCTAT AACAATTGAT GCCAGATATC TTTGGCAAAC TGATGACGGA
GAAATTATTA TTGTTCGAAA TGGGGGACAA TTCGGATCTC TTGTACCTAC ATTCGAAGTT
AGGGCAGACA GTAAATACTC ATACCTGAAC CAAAAGTTAT ATCTAAGCTC AGATCCGGGT
GGTGGAGCAG GCGGTGTTAC AATTACGTTC TATGAAAGTA TAAAGTAG
 
Protein sequence
MFKKALLLTI VISNIVVGSM TTGTMAVTQM VKSMGEGTAS AAATTTNDIK YGDVNMDNAV 
DSVDLALLKA YILAITSTLP NIAAADVTGD GTLDALDYAV LKKYLLGLIT ILPADDNGNG
KILIPHKSWT CGMADGIPKP ETGVLVFETT MKLQNSYDLG KTQYGLRKVF VVQNGSITAT
KIQGSVMSGG LDFQLTLSNG AMEIEQLLMI KTNDGNYIYL RSAGTAVNQN DVRMVWDFEA
PNSSSYNWLN SGKYVGRRII DSVAGTMKIS VYDVSGINFT PDSTNSLIVT EPDDVPDQPW
DYRKASYERN GSKFITEAVS LGASQSVGAS KRGSRNIIPI TGGSVTGNLT AKILAAGADY
QNLSNPITID ARYLWQTDDG EIIIVRNGGQ FGSLVPTFEV RADSKYSYLN QKLYLSSDPG
GGAGGVTITF YESIK
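
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names are hypothetical, the codon assignments are the standard ones (which translation table 11 shares for elongation codons), and alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG.

```python
# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty one)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the listing above.
first_bases = "ATGTTTAAAA AAGCATTATT GTTAACTATT"
print(translate(first_bases))  # MFKKALLLTI
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.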