Gene Information |
Locus tag | Cphy_3217 |
Symbol | |
ID | 5741995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3921799 |
End bp | 3923274 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641294317 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001560310 |
Protein GI | 160881342 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
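The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon, minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(3921799, 3923274)
print(length, protein_length_aa(length))  # 1476 491
```

Both results agree with the Gene Length (1476 bp) and Protein Length (491 aa) fields. The strand field does not affect this calculation, because the length is the same on either strand.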
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTA TAATACTGTT AAAGATATAC GGAAAATATT ATACAAGAGA TAAAGCTTAC TCTGATGATG TTTATGCCAA ATTATATTGC GAGTGTCGGA GTAGAAAATC TACTTTAGAA GGAGAGAGTG ATATGGTTAG AAAAGTCTAT AATATATTAG ATTATGGAGC TGTAGCAGAC GGTGTAACTA ACAATGCGGC TACAATTCAA AAGGCTATTG ATGAAGCAAC AATCCATGGA GGACAAGTAG TTGTTCCTGC GGGTAATTAC TTAAGTGGAA CAATTATCTT AAAAAGTAAT ATAGATTTTC ATTTGGAGAT GGGAGCTGTA TTAATTAGTA GTCTGAAGGA AGAAGATATC CTAGATTTTG CAAAGCTATT TGAAGATGAT AATCAAACGA CTGGATGGGA TGGAGGATGT TTTATATTCG CTTGTCATGA AGAAAATATT ACAATATCGG GACAAGGAAC TATCTACGGA CAGGGAGATA AGGTATTCTT TGATGACAAT GCAGATAATG GTGCACATGA ATGCCCATTA AATGTATCGG CCTTTCGTCC AAGAACAACG TTTTTAGAAG ACGTTACAAA TCTAACGGTA AAGGATATTA CAATCAGAGA TGCAGCTTTC TGGACCTTAC ATATGGCTGG TTGCCGCCAC GTTCTTGTAA AAGATATTAA GATATTAAAT GATATTCGTG GTGCTAATAA TGATGGCATT GATCCAGATT GCTGTCAGGA TGTTATGATT AGCGGATGCT TAGTAAAGAC AGGGGATGAT GCAATCGTAG TTAAAGCGAC GAAACCGATG TCTCAAAAGT ATGGAGCTTG TGAAAATATA GTGATTAATA ACTGTATTCT ATATTCACGT GATTCGGGAT TAAAGATTGG AACAGAGACT CATGGAGATA TCCGAAATGT TATGCTTAGT GACTGTGTAA TTAAGGAGTG CTCTAGAGGT GTTGGTATCT GGGTAAGAGA TGGGGCTACA ATAGAAGACA TACACGTGCA TCATGTTACT GGAAGCGTCT TAAAGTATGC AGATGGAGAA CGCTCAGAAG GCCCTACCAT GTGGTGGGGG AATGGAGAGC CTATCTTTAT TAATGCAACT TACCGAAATG AAAATCGTAA CTATCCAGGA AAGATAAGAA ATATAACCTT TGACCATATA TATATGAAGG CAGAATCCAG TGTATTCTTA GCAGGAGAAG AGGATGCAAG GATTGAAAAT ATTACGATCT CTAATCTTGA GGTTACAATG TGCAGTCAAG GAACTCAAAA TTCTGGCTTT TTTGATGAAC AACCATCCCT CCGTCATGTA TATCCTCATA GTATTCCAGC TGTTTACGCT AGGAGTGTTG ATGGGCTTCG TGTTAGTGGC AGAGTTCGTT ATGAAGGGCC ATATCACATC TCTAAGAATA AATTATATGA GTCTGAGGAT TGTACCATGG AAGAAGTTAA TCTTAAAGAG AGATAA
|
Protein sequence | MSFIILLKIY GKYYTRDKAY SDDVYAKLYC ECRSRKSTLE GESDMVRKVY NILDYGAVAD GVTNNAATIQ KAIDEATIHG GQVVVPAGNY LSGTIILKSN IDFHLEMGAV LISSLKEEDI LDFAKLFEDD NQTTGWDGGC FIFACHEENI TISGQGTIYG QGDKVFFDDN ADNGAHECPL NVSAFRPRTT FLEDVTNLTV KDITIRDAAF WTLHMAGCRH VLVKDIKILN DIRGANNDGI DPDCCQDVMI SGCLVKTGDD AIVVKATKPM SQKYGACENI VINNCILYSR DSGLKIGTET HGDIRNVMLS DCVIKECSRG VGIWVRDGAT IEDIHVHHVT GSVLKYADGE RSEGPTMWWG NGEPIFINAT YRNENRNYPG KIRNITFDHI YMKAESSVFL AGEEDARIEN ITISNLEVTM CSQGTQNSGF FDEQPSLRHV YPHSIPAVYA RSVDGLRVSG RVRYEGPYHI SKNKLYESED CTMEEVNLKE R
|
| |
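The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one possible way to strip the spacing, translate the gene sequence, and compute GC content using only the Python standard library. It rests on a few assumptions: the gene sequence is already given on the coding strand (it begins with ATG even though the strand field is "-"), it contains only A, C, G, and T, and translation table 11 assigns the same amino acids to internal codons as the standard code. The helper names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering. For in-frame codons,
# translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCGTTTA TAATACTGTT AAAGATATAC"))  # MSFIILLKIY
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.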