Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1543 |
Symbol | |
ID | 7310307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1878421 |
End bp | 1880769 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643608472 |
Product | cellulosome anchoring protein cohesin region |
Protein accession | YP_002505875 |
Protein GI | 220928966 |
COG category | [R] General function prediction only |
COG ID | [COG3401] Fibronectin type 3 domain-containing protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000430269 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACA AGAAAGTACT GTTTACGTGT TTTTTTCTCC TAGCATTAGT TATTCCAGTA TCAGCTAATG CCACTACGGA AAAAAATATA ATTGATTTAT CTAAATGGCA AAATGTAGGT ATAAATTATA GTGGCTCAGA GCCAGCACCA ATTTGGAAAG TGAATTCGAC AAATACAGCT GTAACGCAGA CTTTAAATGC CAGACCGGCA GTTTTTATGG GTGATATGGA GTGTGCGAAT AATTCAGTGC GTGGAAGTTT TAGTGTCGAT ACAACTTCAG ATAACGACTT TGTAGGTTTT GTTTTTGGCT ATAAGGATTC AGGACATTTC TATTTGTTTG ATTGGAAGCA ATCGGATGAA GATTTTCATG GACTAGGAAA ACAAGGTATG AGTATTAAGA AAGTAAATAC GAATACATCT CTTAGTGACA GTGACTTGTG GCCGACTTCT AATACTGATA AGGTTAAAGT TTTATATCAT AATAACATTA CATATAAGGA TCAAATGCTT TATAATTTTA CATTAAATTT TACAGATAAA GGTAGCTTTA ATATTGTAAT AAAGCAGGGG AATACAGTTT TGGACAGTAT CACCATAAAG GATAGCACAT ACACTTCTGG TAAATTCGGT TTTTATAATT ACTCTCAAGA AATGGTTACA TACAGTGTAT CTGAATTTGA AAAACTCCCC CCTGTGGTGA AAGTTACTTC AACTCAAAAT GAAAAGGTTG ATTTATCCTG GACAGCTGTA GAAGGTGCGA CCAGCTACAA TATAAAACGT GATACTACTT CTGGTGGCCC ATATACAACA ATCGGACAAA GCACTTCGAC GACATATACT GACACAACAG TAGCCAATGG AACAACATAC TACTATGTTG TAACTGCTGT AAATACCGGA GGAGAGAGTG AAAATTCCAA TGAAGTATCT GCAAAGCCAA TAGCACCTGC AAAAGCACCA ATAAACCTTG TAGCAAAAGC TAATAATGCA AAAGTTGATT TAGTTTGGTC TGCATCTCAA TCTGCGACCA GCTACAATAT AAAACGTGCT ACTACTGCTG GTGGCCCATA TACAACAATC GGACAAAGCA CTTCGACAAC ATATACTGAC ACAACAGTAG CCAATGGAAC AACATACTAC TATGTTGTAA CTGCTGTAAA TGCTGGAGGA GAGAGTGAAA ATTCCAATGA AGTATCTGCA AGGCCAATAG CACCTGCAAA AGCACCAATA AACCTTGTAG CAAAAGCTAA TAATGCAAAA GTTGATTTAG TTTGGTCTGC ATCTCAATCT GCGACCAGCT ACAATATAAA ACGTGCTACT ACTGCTGGTG GCCCATATAC AACAATCGGA CAAAGCACTT CGACAACATA TACTGACACA ACAGTAGCCA ATGGAACAAC ATACTACTAT GTTGTAACTG CTGTAAATGC CGGAGGAGAG AGTGAAAATT CCAATGAAGT ATCTGCAAAG CCAATAGCAC CTGCAAAAGC ACCAATAAAC CTTGTAGCAA AAGCTAATAA TGCAAAAGTT GATTTAGTTT GGTCTGCATC TCAATCTGCA ACCAGCTACA ATATAAAACG TTCTACCACT GCTGGTGGCC CATATACAAC AATCGGACAA AGCACTTCGA CAACATATAC TGACACAACA GTAGCCAATG GAACAACATA CTACTATGTT GTAACTGCTG TAAATGCCGG AGGAGAGAGT GAAAATTCTA ATGAAGTATC AGCAACTCCC ACCAATCCAA CAGTAACCCT TGAAGTAACT TCAGTTGACA AAGCAAAGCT GGACGATGAA ATAACTGCAA ACATAGTAAT TCATAACGCT GTAAACATAT GTGCCGAAGA CCTAAAAATA TCTTATGATA CTTCCAAACT ACAATTCATA AATGCGGAAA ATGCAGATGG CATGAAAATT TATAAAGAGG ATGATATCGC TGCCGGAGTA AAAAGATACA TAACGGCTTG TCTCGGTAAA GCCAATGCTG CAAATGGCGA TAAAATATTA CTAAAATTGA AGTTCAAAGC TATTGATAAG GGTGAAGCAA AGATTGATAT AACAAACGGA CGTATTGCAG ATAATGTAAC TCTGGAAATG GATGTTTCAC AGGAAAACTG TGGCGAAAAG ACAATTTTGA TAGAAGGGGT AAAAGACGTT AACCGTACAG GTGAATACAC TCTTTTGGAT TTAGGTATTG ATGCATGGTA TTACGGGTAT GCCGCAGTTG ACACAGATAC CAGCAAATAT GACGCCGACC AGATAATCAA TGGATCAATT GACGATGATG ACTTAACTGA AATAGTAGCT CAAATACTTG CTAATACGAA CTATTCTGCG AATAAATAA
|
Protein sequence | MNYKKVLFTC FFLLALVIPV SANATTEKNI IDLSKWQNVG INYSGSEPAP IWKVNSTNTA VTQTLNARPA VFMGDMECAN NSVRGSFSVD TTSDNDFVGF VFGYKDSGHF YLFDWKQSDE DFHGLGKQGM SIKKVNTNTS LSDSDLWPTS NTDKVKVLYH NNITYKDQML YNFTLNFTDK GSFNIVIKQG NTVLDSITIK DSTYTSGKFG FYNYSQEMVT YSVSEFEKLP PVVKVTSTQN EKVDLSWTAV EGATSYNIKR DTTSGGPYTT IGQSTSTTYT DTTVANGTTY YYVVTAVNTG GESENSNEVS AKPIAPAKAP INLVAKANNA KVDLVWSASQ SATSYNIKRA TTAGGPYTTI GQSTSTTYTD TTVANGTTYY YVVTAVNAGG ESENSNEVSA RPIAPAKAPI NLVAKANNAK VDLVWSASQS ATSYNIKRAT TAGGPYTTIG QSTSTTYTDT TVANGTTYYY VVTAVNAGGE SENSNEVSAK PIAPAKAPIN LVAKANNAKV DLVWSASQSA TSYNIKRSTT AGGPYTTIGQ STSTTYTDTT VANGTTYYYV VTAVNAGGES ENSNEVSATP TNPTVTLEVT SVDKAKLDDE ITANIVIHNA VNICAEDLKI SYDTSKLQFI NAENADGMKI YKEDDIAAGV KRYITACLGK ANAANGDKIL LKLKFKAIDK GEAKIDITNG RIADNVTLEM DVSQENCGEK TILIEGVKDV NRTGEYTLLD LGIDAWYYGY AAVDTDTSKY DADQIINGSI DDDDLTEIVA QILANTNYSA NK
|
| |