Gene Cphy_3854 details

Gene Information

Locus tag: Cphy_3854
Symbol:
ID: 5744806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4724145
End bp: 4726538
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 38%
IMG OID: 641294966
Product: glycosyltransferase 36
Protein accession: YP_001560940
Protein GI: 160881972
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
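
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the gene length includes the terminal stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 4724145
end_bp = 4726538

gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 2394 797
```

Under those assumptions the result agrees with the listed gene length of 2394 bp and protein length of 797 aa.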


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATACG GACATTTTGA TCAAGAAAAA AAAGAGTATG TCATCGACCG CGTTGACCTT 
CCAACCTCAT GGACCAATTA TCTAGGGGTT AAAGACACTT GTGTGGTTGT AAATCAAACC
GCAGGTGGCT ACATGTTTTA TAAATCACCA GAATACCACC GTGTTACAAG ATTTCGCGGA
AATAGTGTAC CAATGGATCG CCCTGGGCAT TATGTATACC TTAGGGACAA CGAGACGAAG
GATTACTGGA GTGTTTCATG GCAACCAGTA GGTAAGCCAT TAGATGAAGC TAAGTATACC
TGCCGTCATG GAATGTCCTA TTCCGTCTAC GAATGTGATT ATCAGAAATT AAAAGCGACA
CAAACCCTTG TTGTACCAAT CGATGAAGAT GTGGAACTTT GGGATGTAAC AGTTACGAAT
AACGATTCCA AACCAAGAAA TATTAGCCTA TTTACTTATT GTGAATTTTC TTTTCATCAT
ATCATGATAG ATAATCAAAA CTTCCAGATG AGCCTTTACT GTGCTGGTTC TTCCTATGAA
GATGGTATTA CCCTGCATGA TTTATTCTAT GAAGAATTTG GTTATCAATA CTTTACCTCA
ACCCAAACAC CAGATGGTTA TGACTGTCTC CGCGATAAAT TCCTTGGCCT TTATCATACA
GAATCTGATC CAATTGGAGT TATTAATGGG GAATTAAGTG GTAGTACAGA ACTTGGAAAT
AATCACTGTG GTTCTCTTCA ACATAATTTT GTGATTCAGC CAAACGAAAC AATTCGAATT
GTTTATATGT TAGGCGAAGG GAATCATAAC GCTGGCAAAC GTATTCGTGA AAAATACAGT
AATTTATCCG CAGTCGATCA AGTCTACCAT GACTTAAAGA CTTTTTGGAG TGAAAAACAA
AGTAAATTAC AGATTCAAAC TCCAAATGAG GGAATGAATA CTTTAATTAA TACATGGACT
TTATACCAAG CCGAAATCAA TGTCATGTTC TCTCGCTTCG CTTCCTTTAT TGAAGTTGGT
GGAAGAACAG GTCTTGGATA CCGAGATACA GCACAAGATT CCATGACAAT ACCTCATTCT
AATCCAGAAA AATGTAAACA GCGAATTCGT GAGTTAATGC AGGGTCTTGT ATCCGAAGGT
TATGGACTTC ATTTATTCCA ACCAGAATGG TTTGCGAAAG ATGAAGATAA AAAGCCTTTT
AAATCACCAA CCGTAGTTCC TTCTCCAGAT AAAAATAGCA TCGTTCATGG TATAAAAGAT
GCTTGTTCTG ATGATGCATT ATGGTTAGTA TCCTCTGTTG TTGAATACAT AAAAGAGACC
GGAGAATTTG AGCTAGCAGA TGAAGTAATT ACCTATGCAG ATGGTGGCGA GGGTACTGTA
TATGAACACC TAACCAAAAT CTTAGATTTT TCTGCAAAAC AAGTTGGTGC AACCGGTATT
TGTAAAGGTC TTCGCGCCGA TTGGAATGAC TGCCTAAACC TTGGTGGTGG AGAAAGCGCA
ATGGTTTCTT TCTTGCACTA CTGGGCTCTT AATAATTTTA TCTCTCTTGC GAAGTTTTTA
AATCGTGAAG AGGATGTACA GAAATACACA GCTTTAGCCG ATCATGTAAA AGAAGTATGT
AATAAAGAAT TATGGGATCA GGAATGGTTT ATCCGTGGAA TTACGAAAAA CGGTAAAAAA
ATTGGTACAA TAAATGATAA AGAAGGTAAG ATTCACTTAG AGTCTAACTC TTGGGCTGTA
TTATCCGGTG CAGCTACCGA AGAAAAAGGT CTCAAAGCTA TGGATAGTGT ATATGAGCAT
TTATATACTC CATATGGAAT TTTACTGAAT GCTCCTTCCT ATACAGTGCC TGATGATGAC
ATTGGATTTG TTACCCGTGT TTATCCTGGA TTAAAAGAAA ATGGAGCAAT TTTTAGTCAC
CCAAATCCAT GGGCATGGGC TGCTGAATGT GAACTTGGCC GTGGTGACAG AGCAATGGAA
TTTTATAATG CACTTTGCCC ATATTATCAA AATGATAAAA TTGAAATTCG AGAAGCAGAG
CCATATTCCT ATTGCCAGTT TATTATGGGT AGAGATCACA CTGCCTTTGG TCGTGCAAGA
CACCCATTTA TGACTGGTAC TGGCGGTTGG GCATATTTTA GTGCCACTAG ATATATGCTC
GGTATTAAAC CTCAATTTGA TTATTTAGAA ATTAATCCTT GCATTCCTGG CTCATGGGAT
AGCTTCCAAG TAACTAGAGA ATGGAGAGGT GCAGTCTATG AGATTGCTGT TGAGAATCCA
GATGGAGTTA TGAAGGGCGT CAAAGAAATC TATCTTGATG GAAAATTGGT GGAAAGACTT
CCAGTTCAGG AGCCACATAC CACTCACCAT GTTCGCATTG TTATGGGAGT GTAA
 
Protein sequence
MQYGHFDQEK KEYVIDRVDL PTSWTNYLGV KDTCVVVNQT AGGYMFYKSP EYHRVTRFRG 
NSVPMDRPGH YVYLRDNETK DYWSVSWQPV GKPLDEAKYT CRHGMSYSVY ECDYQKLKAT
QTLVVPIDED VELWDVTVTN NDSKPRNISL FTYCEFSFHH IMIDNQNFQM SLYCAGSSYE
DGITLHDLFY EEFGYQYFTS TQTPDGYDCL RDKFLGLYHT ESDPIGVING ELSGSTELGN
NHCGSLQHNF VIQPNETIRI VYMLGEGNHN AGKRIREKYS NLSAVDQVYH DLKTFWSEKQ
SKLQIQTPNE GMNTLINTWT LYQAEINVMF SRFASFIEVG GRTGLGYRDT AQDSMTIPHS
NPEKCKQRIR ELMQGLVSEG YGLHLFQPEW FAKDEDKKPF KSPTVVPSPD KNSIVHGIKD
ACSDDALWLV SSVVEYIKET GEFELADEVI TYADGGEGTV YEHLTKILDF SAKQVGATGI
CKGLRADWND CLNLGGGESA MVSFLHYWAL NNFISLAKFL NREEDVQKYT ALADHVKEVC
NKELWDQEWF IRGITKNGKK IGTINDKEGK IHLESNSWAV LSGAATEEKG LKAMDSVYEH
LYTPYGILLN APSYTVPDDD IGFVTRVYPG LKENGAIFSH PNPWAWAAEC ELGRGDRAME
FYNALCPYYQ NDKIEIREAE PYSYCQFIMG RDHTAFGRAR HPFMTGTGGW AYFSATRYML
GIKPQFDYLE INPCIPGSWD SFQVTREWRG AVYEIAVENP DGVMKGVKEI YLDGKLVERL
PVQEPHTTHH VRIVMGV
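
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method. The `translate` helper and the constant names are hypothetical, and the codon table is the standard genetic code, which to my understanding matches translation table 11 for codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons). Only the first 30 bases and the last 6 bases of the gene sequence are used.

```python
# Standard genetic code laid out in TCAG order (hypothetical constant names).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence above.
first_30 = "ATGCAATACG GACATTTTGA TCAAGAAAAA"
last_6 = "GTGTAA"

print(translate(first_30))  # MQYGHFDQEK
print(translate(last_6))    # V*
```

The first ten residues match the start of the protein sequence, and the final codon TAA is a stop, which is consistent with the protein ending in V.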