Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3494 |
Symbol | |
ID | 5743606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4309695 |
End bp | 4311578 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641294606 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001560584 |
Protein GI | 160881616 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000299529 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGGT ATATAAAGGA TAAACAGTTA TTGCTTCGAA GATTATGTTT GCTTTTGATA GATATAGGAG CTGTTTGCTT AATCAGTATT TTATCTCTAT TAATTCGATA CGATTTTAGA TATAAAAATA TAGATACGAT ATACCTTGAT ACTATATGGA CATATTTCCC TATTAACATC ATTACAGTAC TACTAATATT CTATATTTTT CGCTTGTATC ACAGTTTGTG GTTATTTGCA GGTATTACTG AGTTAGAGAA TATTATTGCT GCATCAATCG GGGTTTCAGC CTTACAAGTA ATTGGTTTCA TACTTTTAAA GCTTCCAATT CCAAGAAGCT ATTTCTTCAT ATTTCCGGTT TTATTAATTA TAGCTACCAT GGCTTCGAGA TTTACTTATC GCGCTGTTCG GGTGAAGTTA CGGAAGAGAG AGAATGGGAA GTCCTGTGAG AATGTGATGG TAATTGGTGC TGGAGAAGCA TCCAATATGA TAATCAAAGA AATAATAAAT AGTGACCATA TACATAAAGT AGTTAAGTGC ATCATAGATG ATGCAGAAGA TAAATTAGGT CGCTATATCC ATGGAATTAA AGTTATTGGT AGTCGAGACA CCATCATAGA TAATGTGATG AAGTATGAGA TAAATGAGAT TATCATAGCG ATGCCATCGG TATCAAGAAA AGAAATTAGT AAAATCTTAG AAATCTGCAA AGAAACAGAT TGTAAGTTAT ATATATTACC AGGAATGTAT CAATTTTTAA ACGGTGAAGT TGGTGTATCA GAACTTCGCG GTGTTGAGGT TGAAGATTTA CTGGGACGAG ATCCTATAAG AGTAGATTTA GACTCGATTA TGGGTTATGT AAGTGACAAG GTTGTTTTAG TTACTGGCGG CGGTGGTTCT ATTGGAAGTG AACTTTGTAG GCAAATAGCA GGGCATAAAC CAAAGCAATT AGTTATCGTA GATATCTATG AAAATAATGC TTATGATATA CAGCAGGAGC TACAGAAAAG ATATCCAAAT TTAGATATTG TAACACTGAT TGGTTCGGTA CGTAATGAGA AACGACTTGA CAAAATATTT GACACTTATC GCCCGAACAT TGTTTATCAC GCTGCAGCCC ATAAGCATGT ACCTTTAATG GAAGATAGCC CAAATGAAGC TGTTAAAAAC AATGTTTTTG GAACATTTAA AACTGCCCAA GCTGCCGATA AATATGGTGT TGAAAAGTTT GTATTAATTT CATCTGACAA AGCAGTTAAT CCTACAAACA TCATGGGTGC AACGAAGCGT ATGTGTGAGA TGATTGTACA AACATTTAAT AGAAAATCTA AAACTGAATT TGTAGCGGTT CGATTTGGTA ATGTACTCGG AAGTAACGGT AGTGTAATTC CACTTTTTAA AAAGCAAATT GAAGCAGGTG GGCCTGTTAC CGTAACACAT CCGGATATTA TCCGATATTT TATGACAATA CCAGAAGCGG TATCTTTAGT ATTACAAGCA GGTGCCTATG CAAAGGGTGG AGAGATATTT GTACTGGATA TGGGAGAGCC AGTTAAAATT CTTGATTTAG CTACAAACCT CATTCGCCTG TCTGGGTATA TACCAGATGT AGATATAAAA ATTAAGTTTA CTGGATTAAG ACCTGGTGAA AAATTATATG AAGAGCTTCT TATGGAAGAG GAGGGATTAG GTGAAACCGA AAACTCACAG ATATTTATTG GGAAGCCTTT AAAGATTGAT GATGAAAAGT TTCATCATCA ATTAGAGGAG TTATACACCG CATGTAATAA CGAGACAGAG TATATTCGAG AAATGGTCGG AGAAATCGTA GATACCTATT CTTATAATAT TGAAGCAAGA GAAGAAGTTG CAGCTTCTGA ATAG
|
Protein sequence | MKRYIKDKQL LLRRLCLLLI DIGAVCLISI LSLLIRYDFR YKNIDTIYLD TIWTYFPINI ITVLLIFYIF RLYHSLWLFA GITELENIIA ASIGVSALQV IGFILLKLPI PRSYFFIFPV LLIIATMASR FTYRAVRVKL RKRENGKSCE NVMVIGAGEA SNMIIKEIIN SDHIHKVVKC IIDDAEDKLG RYIHGIKVIG SRDTIIDNVM KYEINEIIIA MPSVSRKEIS KILEICKETD CKLYILPGMY QFLNGEVGVS ELRGVEVEDL LGRDPIRVDL DSIMGYVSDK VVLVTGGGGS IGSELCRQIA GHKPKQLVIV DIYENNAYDI QQELQKRYPN LDIVTLIGSV RNEKRLDKIF DTYRPNIVYH AAAHKHVPLM EDSPNEAVKN NVFGTFKTAQ AADKYGVEKF VLISSDKAVN PTNIMGATKR MCEMIVQTFN RKSKTEFVAV RFGNVLGSNG SVIPLFKKQI EAGGPVTVTH PDIIRYFMTI PEAVSLVLQA GAYAKGGEIF VLDMGEPVKI LDLATNLIRL SGYIPDVDIK IKFTGLRPGE KLYEELLMEE EGLGETENSQ IFIGKPLKID DEKFHHQLEE LYTACNNETE YIREMVGEIV DTYSYNIEAR EEVAASE
|
| |