Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1655 |
Symbol | |
ID | 7310403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1995887 |
End bp | 1997194 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608583 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002505986 |
Protein GI | 220929077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000388378 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAA AAGCATTATT GTTAACTATT GTGATTAGCA ATATAGTTGT TGGGTCAATG ACCACGGGGA CAATGGCTGT TACGCAAATG GTAAAATCTA TGGGTGAAGG AACAGCTTCA GCGGCAGCGA CGACTACAAA TGATATTAAA TATGGTGATG TTAATATGGA TAATGCAGTT GATTCTGTAG ATCTGGCATT ACTAAAGGCT TATATCTTAG CTATAACAAG TACTCTGCCA AATATCGCAG CTGCGGACGT TACTGGTGAC GGTACCCTTG ATGCACTTGA TTACGCTGTA CTTAAAAAAT ACCTTTTGGG ACTAATCACC ATTTTACCTG CTGATGACAA TGGGAATGGG AAAATACTGA TTCCACATAA ATCATGGACG TGTGGAATGG CTGATGGCAT ACCCAAGCCC GAAACCGGAG TACTTGTTTT TGAAACTACT ATGAAGCTAC AAAACAGTTA TGATCTGGGG AAAACCCAAT ATGGACTGAG AAAAGTTTTT GTAGTTCAAA ATGGCAGTAT AACCGCTACA AAAATACAAG GTTCAGTTAT GTCGGGGGGC CTTGATTTTC AGTTGACTCT TTCAAATGGT GCAATGGAAA TTGAACAATT ATTAATGATT AAGACGAATG ACGGGAATTA TATCTATCTA AGAAGTGCCG GAACAGCCGT AAACCAGAAT GATGTGAGGA TGGTGTGGGA TTTTGAAGCT CCAAACTCAA GCTCATACAA TTGGCTTAAC TCTGGCAAAT ATGTGGGCAG GCGTATTATA GACTCAGTTG CCGGAACAAT GAAGATAAGT GTTTATGACG TATCCGGCAT AAATTTTACA CCGGATTCCA CAAATTCATT AATAGTAACT GAACCGGACG ATGTGCCGGA CCAGCCATGG GACTATAGAA AGGCATCCTA TGAGAGAAAC GGCAGTAAGT TTATAACTGA GGCGGTCAGT CTTGGGGCGA GTCAATCTGT AGGAGCAAGC AAGAGAGGTA GCAGGAACAT TATTCCCATA ACTGGCGGAT CTGTGACCGG GAATTTAACC GCTAAGATTT TAGCGGCAGG TGCGGATTAC CAGAACCTAT CAAACCCTAT AACAATTGAT GCCAGATATC TTTGGCAAAC TGATGACGGA GAAATTATTA TTGTTCGAAA TGGGGGACAA TTCGGATCTC TTGTACCTAC ATTCGAAGTT AGGGCAGACA GTAAATACTC ATACCTGAAC CAAAAGTTAT ATCTAAGCTC AGATCCGGGT GGTGGAGCAG GCGGTGTTAC AATTACGTTC TATGAAAGTA TAAAGTAG
|
Protein sequence | MFKKALLLTI VISNIVVGSM TTGTMAVTQM VKSMGEGTAS AAATTTNDIK YGDVNMDNAV DSVDLALLKA YILAITSTLP NIAAADVTGD GTLDALDYAV LKKYLLGLIT ILPADDNGNG KILIPHKSWT CGMADGIPKP ETGVLVFETT MKLQNSYDLG KTQYGLRKVF VVQNGSITAT KIQGSVMSGG LDFQLTLSNG AMEIEQLLMI KTNDGNYIYL RSAGTAVNQN DVRMVWDFEA PNSSSYNWLN SGKYVGRRII DSVAGTMKIS VYDVSGINFT PDSTNSLIVT EPDDVPDQPW DYRKASYERN GSKFITEAVS LGASQSVGAS KRGSRNIIPI TGGSVTGNLT AKILAAGADY QNLSNPITID ARYLWQTDDG EIIIVRNGGQ FGSLVPTFEV RADSKYSYLN QKLYLSSDPG GGAGGVTITF YESIK
|
| |