Gene Information |
Locus tag | Cphy_2058 |
Symbol | |
ID | 5743758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2542002 |
End bp | 2543411 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641293155 |
Product | cellulase |
Protein accession | YP_001559165 |
Protein GI | 160880197 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
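The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2542002
end_bp = 2543411
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                          # 1410 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 469 True
```

Under these assumptions the listed Gene Length (1410 bp) and Protein Length (469 aa) agree with the coordinates.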
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.33329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTGTTG AATTGATGAG AAAGTTTCCA ATTATAGTAT CAGGTATTTT AGTATGTGCA ACATTATTAA GTGGCTGTAG CCAAAATATG TCAAATGTTG ATAAATCCAC AGTGAATTTG AATTCGCAGA ATACAGACAA TCAAAATCAA GATGCAAATA ATAAAGACTC GGATAATCAA GGCGCAAATA ATCAAGAGTC AAATAATCAA GAATCACAAA ACCCTGAATT AAAAAATACA CCAACACCTC AAATCATAGT ACTACCAGGT GACAGCACAA TTGAAGCAAA CGAAATTGTA GCCCCAACCA TCACATTTGA ACAAAAAGAA ATACCATCGA ATCCAGCAAT AACTTTTGTA CATAATATGA AAATAGGATG GAACCTTGGA AATACATTTG ACGCAGTGAG TGATTCCAAT CTAATGGATG AACTTAATTA TGAAAGCTCA TGGTGTGGTG TAAAAACAAC AGAAGAGATG ATGAAAGCAA TTAAAGATGC TGGGTTTCAG TCGATTAGAA TACCAGTATC GTGGCACAAT CATGTTTCTG GTGATGATTT TATTATAAGC GAAGTATGGC TTAACCGAGT ACAAGAAGTG GTCGATTATG CTATCAATAA TGATATGTAT GTGATATTAA ATACTCACCA TGATGTAAGT AAAAATTTTT ATTATCCAAG TAATGAAAAT TTAGAATCTT CTAAAAAATA TATCAACGCA GTATGGACAC AAGTAAGTGA ACGATTTTCT TCCTATGGAG AAAAGTTATT ATTTGAAGGG ATGAACGAAC CAAGGCTTGC AGGTTCTAAT TACGAATGGT GGTTAGATTT ATCAAAGCCT GAGTGTAAAG AAGCAATCGA ATGTATTAAT CAATTAAATC AGGAATTTGT TGATACCGTT CGCAAATCGG GAGGAGAGAA TACTTCTAGG TATCTTCTGA TACCAGGGTA TGATGCATCG TCTCAATATG CACTTATTAA TGATTATAAG TTACCAAAAG ATAATATAAA TGATCGTTTA ATTGTATCAG TACATGCATA TCTACCATAT GACTTTGCCC TAAAAAGTCC AAAGGAAAGT GGCAGTATAT CAGAATGGAA TTCAAAAATA GCTGGATGTA CTAAGGAGAT AGATTCTTTT TTAAATAGTT TATATATGAA GTTTATAAAA AATGGAGTTC CCGTAATTAT CGGTGAATTT GGTGCCAGAG ATAAAGAAAA TAATTTAGAA TCTCGGGTAG AGTATGCTAC TTATTATATA GGTGCTGCAA AAGCGAATGG AATCACATGC TTCTGGTGGG ATAATCATGC ATTTAAAGGG GACGGGGAAA ACTTCGGTCT TTTTGATAGA AAGAGTTGTA CTATAAAATA TCCTGAGATA TTGCAAGGTT TAATGAAATA CGCAGAATAA
|
Protein sequence | MCVELMRKFP IIVSGILVCA TLLSGCSQNM SNVDKSTVNL NSQNTDNQNQ DANNKDSDNQ GANNQESNNQ ESQNPELKNT PTPQIIVLPG DSTIEANEIV APTITFEQKE IPSNPAITFV HNMKIGWNLG NTFDAVSDSN LMDELNYESS WCGVKTTEEM MKAIKDAGFQ SIRIPVSWHN HVSGDDFIIS EVWLNRVQEV VDYAINNDMY VILNTHHDVS KNFYYPSNEN LESSKKYINA VWTQVSERFS SYGEKLLFEG MNEPRLAGSN YEWWLDLSKP ECKEAIECIN QLNQEFVDTV RKSGGENTSR YLLIPGYDAS SQYALINDYK LPKDNINDRL IVSVHAYLPY DFALKSPKES GSISEWNSKI AGCTKEIDSF LNSLYMKFIK NGVPVIIGEF GARDKENNLE SRVEYATYYI GAAKANGITC FWWDNHAFKG DGENFGLFDR KSCTIKYPEI LQGLMKYAE
|
| |
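The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a hedged sketch, not the method used to build the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
# Codon table built from the standard base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence from the record.
gene_prefix = "ATGTGTGTTG AATTGATGAG AAAGTTTCCA"
print(translate(gene_prefix))  # MCVELMRKFP
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.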