Gene Cphy_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3494 
Symbol 
ID5743606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4309695 
End bp4311578 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content35% 
IMG OID641294606 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001560584 
Protein GI160881616 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000299529 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGT ATATAAAGGA TAAACAGTTA TTGCTTCGAA GATTATGTTT GCTTTTGATA 
GATATAGGAG CTGTTTGCTT AATCAGTATT TTATCTCTAT TAATTCGATA CGATTTTAGA
TATAAAAATA TAGATACGAT ATACCTTGAT ACTATATGGA CATATTTCCC TATTAACATC
ATTACAGTAC TACTAATATT CTATATTTTT CGCTTGTATC ACAGTTTGTG GTTATTTGCA
GGTATTACTG AGTTAGAGAA TATTATTGCT GCATCAATCG GGGTTTCAGC CTTACAAGTA
ATTGGTTTCA TACTTTTAAA GCTTCCAATT CCAAGAAGCT ATTTCTTCAT ATTTCCGGTT
TTATTAATTA TAGCTACCAT GGCTTCGAGA TTTACTTATC GCGCTGTTCG GGTGAAGTTA
CGGAAGAGAG AGAATGGGAA GTCCTGTGAG AATGTGATGG TAATTGGTGC TGGAGAAGCA
TCCAATATGA TAATCAAAGA AATAATAAAT AGTGACCATA TACATAAAGT AGTTAAGTGC
ATCATAGATG ATGCAGAAGA TAAATTAGGT CGCTATATCC ATGGAATTAA AGTTATTGGT
AGTCGAGACA CCATCATAGA TAATGTGATG AAGTATGAGA TAAATGAGAT TATCATAGCG
ATGCCATCGG TATCAAGAAA AGAAATTAGT AAAATCTTAG AAATCTGCAA AGAAACAGAT
TGTAAGTTAT ATATATTACC AGGAATGTAT CAATTTTTAA ACGGTGAAGT TGGTGTATCA
GAACTTCGCG GTGTTGAGGT TGAAGATTTA CTGGGACGAG ATCCTATAAG AGTAGATTTA
GACTCGATTA TGGGTTATGT AAGTGACAAG GTTGTTTTAG TTACTGGCGG CGGTGGTTCT
ATTGGAAGTG AACTTTGTAG GCAAATAGCA GGGCATAAAC CAAAGCAATT AGTTATCGTA
GATATCTATG AAAATAATGC TTATGATATA CAGCAGGAGC TACAGAAAAG ATATCCAAAT
TTAGATATTG TAACACTGAT TGGTTCGGTA CGTAATGAGA AACGACTTGA CAAAATATTT
GACACTTATC GCCCGAACAT TGTTTATCAC GCTGCAGCCC ATAAGCATGT ACCTTTAATG
GAAGATAGCC CAAATGAAGC TGTTAAAAAC AATGTTTTTG GAACATTTAA AACTGCCCAA
GCTGCCGATA AATATGGTGT TGAAAAGTTT GTATTAATTT CATCTGACAA AGCAGTTAAT
CCTACAAACA TCATGGGTGC AACGAAGCGT ATGTGTGAGA TGATTGTACA AACATTTAAT
AGAAAATCTA AAACTGAATT TGTAGCGGTT CGATTTGGTA ATGTACTCGG AAGTAACGGT
AGTGTAATTC CACTTTTTAA AAAGCAAATT GAAGCAGGTG GGCCTGTTAC CGTAACACAT
CCGGATATTA TCCGATATTT TATGACAATA CCAGAAGCGG TATCTTTAGT ATTACAAGCA
GGTGCCTATG CAAAGGGTGG AGAGATATTT GTACTGGATA TGGGAGAGCC AGTTAAAATT
CTTGATTTAG CTACAAACCT CATTCGCCTG TCTGGGTATA TACCAGATGT AGATATAAAA
ATTAAGTTTA CTGGATTAAG ACCTGGTGAA AAATTATATG AAGAGCTTCT TATGGAAGAG
GAGGGATTAG GTGAAACCGA AAACTCACAG ATATTTATTG GGAAGCCTTT AAAGATTGAT
GATGAAAAGT TTCATCATCA ATTAGAGGAG TTATACACCG CATGTAATAA CGAGACAGAG
TATATTCGAG AAATGGTCGG AGAAATCGTA GATACCTATT CTTATAATAT TGAAGCAAGA
GAAGAAGTTG CAGCTTCTGA ATAG
 
Protein sequence
MKRYIKDKQL LLRRLCLLLI DIGAVCLISI LSLLIRYDFR YKNIDTIYLD TIWTYFPINI 
ITVLLIFYIF RLYHSLWLFA GITELENIIA ASIGVSALQV IGFILLKLPI PRSYFFIFPV
LLIIATMASR FTYRAVRVKL RKRENGKSCE NVMVIGAGEA SNMIIKEIIN SDHIHKVVKC
IIDDAEDKLG RYIHGIKVIG SRDTIIDNVM KYEINEIIIA MPSVSRKEIS KILEICKETD
CKLYILPGMY QFLNGEVGVS ELRGVEVEDL LGRDPIRVDL DSIMGYVSDK VVLVTGGGGS
IGSELCRQIA GHKPKQLVIV DIYENNAYDI QQELQKRYPN LDIVTLIGSV RNEKRLDKIF
DTYRPNIVYH AAAHKHVPLM EDSPNEAVKN NVFGTFKTAQ AADKYGVEKF VLISSDKAVN
PTNIMGATKR MCEMIVQTFN RKSKTEFVAV RFGNVLGSNG SVIPLFKKQI EAGGPVTVTH
PDIIRYFMTI PEAVSLVLQA GAYAKGGEIF VLDMGEPVKI LDLATNLIRL SGYIPDVDIK
IKFTGLRPGE KLYEELLMEE EGLGETENSQ IFIGKPLKID DEKFHHQLEE LYTACNNETE
YIREMVGEIV DTYSYNIEAR EEVAASE