Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3367 |
Symbol | |
ID | 5741649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4103996 |
End bp | 4106953 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641294470 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_001560459 |
Protein GI | 160881491 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TAATAAGTCT TTTATTAGTG ATAACACTTC TGATATCCAT GGCACCATCG AAAGCTGACG CAGCGGAAAC CAATTATAAT TACGGAGAAG CTCTTCAAAA ATCAATCATG TTTTATGAGT TTCAACGTTC TGGTAAACTG CCAAGTACCA TTCGGAATAA TTGGAGAGGT GACTCTGGTT TAACCGATGG AGCAGATGTT GGTTTGGATC TAACTGGTGG CTGGTATGAT GCTGGTGATC ATGTAAAATT TAATCTTCCT TTGGCTTATA CTGTAACAAT GTTAGCATGG GCAGTATATG AAGAAGAGGC TACTCTTTCA AAGGCAGGCC AATTAAGTTA TTTATTAGAT GAAATTAAGT GGTCTAGTGA TTACCTAATT AAATGTCATC CACAAGCAAA TGTATTTTAT TATCAGGTTG GTAATGGAAA TACAGATCAC TCTTGGTGGG GACCTGCTGA AGTTATGCAG ATGGCTAGAC CGTCCTATAA GGTTGATTTA AATAACCCAG GTTCTACTGT AGTAGGAGAA GCAGCAGCAG CTCTTGCAGC AACAGCACTT ATATATAAGA CAAAAGACCC TACTTATTCA GCAACTTGCC TTCGTCATGC AAAAGAGCTT TTTAATTTTG CAGATACAAC AAAAAGCGAT GCTGGATATA CAGCAGCAAG TGGGTTCTAT ACTTCCTATA GTGGATTTTA TGATGAATTA TCCTGGGCAG CTACATGGAT TTACCTTGCA AGTGGAGAAG CGACCTATTT GGATAAGGCA GAATCTTATG TAGCCAAATG GGGAACAGAA CCTCAATCTT CCACATTAAG TTATAAGTGG GCACAAAACT GGGATGATGT TCACTATGGT GCAGCTTTAT TATTAGCAAG AATTACAAAT AAAGCAATTT ATAAGAACAA TATTGAAATG CATCTTGACT ATTGGACTAC AGGGTATAAT GGTAGTCGTA TTACTTATAC ACCAAAAGGA CTTGCTTGGT TAGATTCCTG GGGTGCATTA AGATATGCGA CGACAACAGC ATTTCTAGCA AGTGTTTATG CTGATTGGAG CGGATGTAGT GCTGGAAAAG TTAGTACTTA CAATGCATTT GCGAAACAGC AGGTAGATTA TGCATTAGGA AGTACCGGAA GAAGTTTTGT GGTTGGATAT GGTGTAAATT CTCCAACAAG ACCTCATCAT AGAACTGCTC ATAGTTCATG GGCAGACAGT CAGACGGAGC CAAATTACCA TAGACACACC ATTTATGGTG CTTTAGTAGG TGGACCTGGT AATAATGATA GTTATGAGGA TAACATTAAT AATTATGTAA ACAATGAAAT CGCTTGTGAC TATAATGCAG GTTTTGTTGG CGCATTGGCT AAAGTTTATA AAACATATGG CGGAACACCA ATTGCAAACT TTAAGGCAAT CGAAACAGTA ACAAACGATG AGTTATTTAT TCAAGCTGGT ATTAATGCCT CTGGTCCATC TTTTATCGAA GTAAAGGCAT TGGTTTTCAA TGAGACAGGT TGGCCAGCTC GTGTTACCGA TAAATTATCC TTTAAGTATT TTATTGATAT CTCGGAATAT GTAGCAAAGG GATATACAAA GAATGATTTT ACGGTATCGA CAAATTATAA CAATGGAGCA ACCACATCGG CATTGCTTCC TTGGGATGCT GCGAATAATA TCTATTATGT GAATGTAGAC TTCTCTGGAA CTAAGATTTA TCCTGGTGGA CAGTCTGCAT ATAAGAAAGA AGTACAATTT AGAATTGCTG GTCCACAAAA CGTTAATATA TGGGACAATT CCAATGACTA CTCCTTTACA CAAATTGCTA ATGTTAGTTC AGGAAATACC GTAAAGACCA CATATATACC ATTGTATGAT AATGGTAAAT TAGTATTTGG TAATGAGCCA AAGACGGGTG TTCCTTCTGC AAGTCTTGAT AAGACTACAG CAAACTTTGA CAAAAACCCA GCTGTATCCG CAGATATACC AGTAACCATT AACTATAATG GTAATACATT AACAGCGGTT AAGAATGGAA CAACGGTTTT AACGAAAGGT ACTGATTATA CTGTATCTGG TAATGTAGTA ACGTTATCTA AGAATTATTT CTTAGCACAG AGCGCTAGTA CGGTTACTTT AACATTTGTA TTTAGTGGCG GTAACGATGC AACATTAAAA GTGACTTTAG TAGATACTTC TCCAAGTGCA TCCATTAATC CAAATTCTGC TGTCTTTGAT AAGGCTAGCG GAAAACAGGA AAATATAGTT ATTACGCTTA CACCAAATGG CAATACCTTA GCTGGACTTA AGAATGGGTC TAAGAGCCTG GTAACTGGAA CTGATTATAC CGTTTCCGGA ACAACAGTGA CGATTCTATC TTCTTATTTA AGTCAATTTG CAGTAGGAAG TCAATCTATT GTATTTGAAA TGAATAAAGG GACAAATCCA GTCTTAGCAG TTACCATTAA GGATTCTTCT GTTGTTACTC CAACAGGAAA TATTAAACTT CAAATGTTTA ATGGAAATTC TTCTGCAACA ACGAATGGCA TTGCACCAAG AATTAAATTA ATTAACACCG GAACTACTGC AATCAACTTA TCCGATGTTA AGATTCGCTA TTATTATACA ATCAATGGCG AAAAGGATCA GGCATTCTGG TGTGATTATT CGACGATTGG TAGTTCCAAT GTAAATGGTA CTTTCGTAAA GATGAGTACA CCAAAAACAA ATGCAGATTA CTATCTAGAA TTTTCATTTA AGTCCGCTGC CGGAACTTTA AACGCAGGGC AAAGTATTGA AGTTCAAGGA AGATTTTCTA AGGTAGACTG GACAAACTAT ACACAAACAG ATGATTATTC GTTTGGTGAT AGTAACTCAA GTTATGCTGA TTGGAATAAG ACAACAGTAT ATATCTCTGA TGTTTTGGTT TGGGGAGTCG AACCATAA
|
Protein sequence | MKKIISLLLV ITLLISMAPS KADAAETNYN YGEALQKSIM FYEFQRSGKL PSTIRNNWRG DSGLTDGADV GLDLTGGWYD AGDHVKFNLP LAYTVTMLAW AVYEEEATLS KAGQLSYLLD EIKWSSDYLI KCHPQANVFY YQVGNGNTDH SWWGPAEVMQ MARPSYKVDL NNPGSTVVGE AAAALAATAL IYKTKDPTYS ATCLRHAKEL FNFADTTKSD AGYTAASGFY TSYSGFYDEL SWAATWIYLA SGEATYLDKA ESYVAKWGTE PQSSTLSYKW AQNWDDVHYG AALLLARITN KAIYKNNIEM HLDYWTTGYN GSRITYTPKG LAWLDSWGAL RYATTTAFLA SVYADWSGCS AGKVSTYNAF AKQQVDYALG STGRSFVVGY GVNSPTRPHH RTAHSSWADS QTEPNYHRHT IYGALVGGPG NNDSYEDNIN NYVNNEIACD YNAGFVGALA KVYKTYGGTP IANFKAIETV TNDELFIQAG INASGPSFIE VKALVFNETG WPARVTDKLS FKYFIDISEY VAKGYTKNDF TVSTNYNNGA TTSALLPWDA ANNIYYVNVD FSGTKIYPGG QSAYKKEVQF RIAGPQNVNI WDNSNDYSFT QIANVSSGNT VKTTYIPLYD NGKLVFGNEP KTGVPSASLD KTTANFDKNP AVSADIPVTI NYNGNTLTAV KNGTTVLTKG TDYTVSGNVV TLSKNYFLAQ SASTVTLTFV FSGGNDATLK VTLVDTSPSA SINPNSAVFD KASGKQENIV ITLTPNGNTL AGLKNGSKSL VTGTDYTVSG TTVTILSSYL SQFAVGSQSI VFEMNKGTNP VLAVTIKDSS VVTPTGNIKL QMFNGNSSAT TNGIAPRIKL INTGTTAINL SDVKIRYYYT INGEKDQAFW CDYSTIGSSN VNGTFVKMST PKTNADYYLE FSFKSAAGTL NAGQSIEVQG RFSKVDWTNY TQTDDYSFGD SNSSYADWNK TTVYISDVLV WGVEP
|
| |