Gene Cwoe_5106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5106 
Symbol 
ID8735572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5462261 
End bp5463382 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content76% 
IMG OID646505731 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003396890 
Protein GI284046550 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.324835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.598323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG CCCACCCGAT CCCATCCGTC GTCGCCGTCG AGGTCGCCGA GTTCTCGCCG 
CGAGTCGCGC CAGAGCTGGT CGTGCGGGGC GCCGCCGGCG TGCACGACCG CTCGCCGTTC
GCGCTCGTGC GCGTGCAGTG CAGCGACGGT GCGGTCGGCC ACGGCGAGGT CAGCTCGACG
CCGATCTGGA GCGGGGAGGA CGGCGCGTCC GCCGCCCACT TCGTCCGCAC CGTCCTGCGC
GAGGCGCTCG TCGGCCAGCC GCTGGCGCCG GTCGCCGCGC TGAGCGCGCG GATGGATCGC
GCGCTCGGCG CCAACCCGTT CACGAAGGCC GGCGTCAACG CCGCGCTGTG GGACGCGCTC
GGGCGCAGCA GCGGGCTCCC CGTCGCGGTC CTGCTGGGCG GGCCGTTCCG GACCGAGGTG
CCGGTCAAGC TGTCGCTCAG CGGTGGCGGT GAGGCGCTTG AGGCCAACCA CGCCGCCGCG
GTGGCGCGCG GGTTCCGTGC GTTCAAGCTG AAGGTCGGCC TCGATCCCGA CGAGGATGCC
GCGCGCTTCG CGCTCGCCCG CAAGCTCGCC GGCCGCGACG CCTTCCTCGG GATGGACGCC
AACTGCGGCT GGTCGCGCGC CGACGCGGCG CGTGCGATCG CGCTGACGGC GGCCGACCGC
CCGGCGTTCG TCGAGCAGCC GGTCGCAGCC GACGACCTCG ACGGCCTGCG CGAGCTGCGC
GGTCGCGGCG TGCCGCTGCT GGTCGACGAG TCGGTCTACT CAGTCGGCGA CCTCGCGCGC
GTGGTGCGCG CCGATGCGGC GGACGCGGTC AGCGTCTACG TCGGCAAGAG CGGCGGCCTG
GAGCGGGCGG TCGCACAGGG CCGGCTCGCC GCCGCCTTCG GCCTGCAGAC GATCATCGGC
TCGAACATGG AGGCGGACCT CGGCGCGGCC GCGCAGCTCC ACGTCGCCTG CGCGCTGGAG
GGGCTGAGCG AGACGATCCC GTCCGACATC GCCGGGCCGA TGTACTACGC CGAGCGCGTC
GCGCGGGTGC CGCTCGACAT CGACGGCCGG CGGGCGCGGC TGCCGGACGG CCCGGGCCTC
GGCGTCGAGC CGCCCGCCGA GCTGGACGGG AGCTTCGCGT GA
 
Protein sequence
MSAAHPIPSV VAVEVAEFSP RVAPELVVRG AAGVHDRSPF ALVRVQCSDG AVGHGEVSST 
PIWSGEDGAS AAHFVRTVLR EALVGQPLAP VAALSARMDR ALGANPFTKA GVNAALWDAL
GRSSGLPVAV LLGGPFRTEV PVKLSLSGGG EALEANHAAA VARGFRAFKL KVGLDPDEDA
ARFALARKLA GRDAFLGMDA NCGWSRADAA RAIALTAADR PAFVEQPVAA DDLDGLRELR
GRGVPLLVDE SVYSVGDLAR VVRADAADAV SVYVGKSGGL ERAVAQGRLA AAFGLQTIIG
SNMEADLGAA AQLHVACALE GLSETIPSDI AGPMYYAERV ARVPLDIDGR RARLPDGPGL
GVEPPAELDG SFA