Gene Cphy_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1201 
Symbol 
ID5743300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp1517785 
End bp1519074 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content32% 
IMG OID641292306 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_001558318 
Protein GI160879350 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGGAA TTGCATTTAT CTTTATTCTC CTTTTATATA TAGCAATCTA CTATGCTTAT 
GATACCTATA GTAGAATTTT TAAGCGTGGA TTCCTAGAGG AATTAATTTC GATTATTAAA
ATAAACTGCT TGCTTGCAGT AACTCTTACT TTCTCAATGT TTGTCTTCCA GGAAGGAACA
GCATATTCGA GATTATTTTT CTTGACTTTT TTTACACTTA ATATATTAAT AAATTATTTA
GCCAGATTGT ATTTTAAGTT TTTTTTACTT GCTGTTTATA AAAAGAGTGG CTCTAGCAGT
AAGCTTATGA TAATAACAAC TTTAGATAAA GCAAAAGAAG TTTTAGATAG AATTAGAAAA
GAAAATGACT GGGAGTATCA AGTTACATAC TTAACTATAG TGGACAGGCA ATTAATTGGA
GACTGGATTG AAGAAATACC TGTTAAGGCA AATTTAGTTA ATATGTATGA AGTAGCAAGA
CAAGAGGTAG TGGATGAAGT ATTTATATAT CTTCCTAATG ACTCCTCTTT ATCTATTGAT
TTAGATGAGA CAATACTTGA ATTTGTTAAT ATGGGAGTGA CGGTTAACTT AAGTATTAAT
GCATTTGGAT TAAAGGTACG TGAGAAAATA GTTCGTGAAA TAAGTGGGTA CTATGCACTG
AGTTTTAGTA CACGTTTATT AAAGGAATCC CAAAAACACT TGAAAAGGGT TATGGATGTT
GCAGGAGGAA TTGTAGGATG TATATTAACA TTAGTATTGA TTGTGTTCCT AGCCCCAGCT
ATTAAATTAG AATCACCAGG ACCAATTTTC TTTTCACAAG TGCGAGTAGG TAAAAACGGA
AGACGTTTTA AAATATATAA ATTTCGTTCC ATGGATTTAG ATGCAGAAAA GCGCAAGGAA
GAATTGATGA ATCAAAATGA AATGGAAGGC TTTATGTTTA AAATGAAAGA CGATCCGAGA
TTAACAAAAG TAGGTAAGTT TATGCGAAAA ACAAGTCTTG ATGAGTTTCC ACAGTTCTTT
AACATCTTAA AAGGAGATAT GAGTTTAGTG GGAACCAGGC CGCCTACAGA AGATGAGTTC
CTTAGGTATG AAGGGAGGCA TAAGAGGAGG CTAGCTCTTA AATGTGGATT GACTGGGTTA
TGGCAGGTTA GTGGAAGAAG TGATATTCAG GATTTTGAAG AAGTGGTTAA GTTGGATCTG
GAATATATCG ATAACTGGTC ATTTAAGTTG GATATAAAGA TATTATTAAA AACAGTGGTA
GTTGTATTAT TTGGAAAGGG ATCTAGATAA
 
Protein sequence
MYGIAFIFIL LLYIAIYYAY DTYSRIFKRG FLEELISIIK INCLLAVTLT FSMFVFQEGT 
AYSRLFFLTF FTLNILINYL ARLYFKFFLL AVYKKSGSSS KLMIITTLDK AKEVLDRIRK
ENDWEYQVTY LTIVDRQLIG DWIEEIPVKA NLVNMYEVAR QEVVDEVFIY LPNDSSLSID
LDETILEFVN MGVTVNLSIN AFGLKVREKI VREISGYYAL SFSTRLLKES QKHLKRVMDV
AGGIVGCILT LVLIVFLAPA IKLESPGPIF FSQVRVGKNG RRFKIYKFRS MDLDAEKRKE
ELMNQNEMEG FMFKMKDDPR LTKVGKFMRK TSLDEFPQFF NILKGDMSLV GTRPPTEDEF
LRYEGRHKRR LALKCGLTGL WQVSGRSDIQ DFEEVVKLDL EYIDNWSFKL DIKILLKTVV
VVLFGKGSR