Gene Cwoe_4749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4749 
Symbol 
ID8735215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5066738 
End bp5067706 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content77% 
IMG OID646505378 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003396537 
Protein GI284046197 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.219423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCT TTCTCGCGGG AGCGACCGGC GCGGTCGGCC GGGCGCTGAT CCCACAACTG 
ACCGCCGGCG GCCACCACGT CGTCGGGACC ACCCGAAGCG CGGCGAAGGC GTCAGCGCTG
TGGGAGCTGG GCGCGGAGCC CGTCGTGCTC GACGGGCTCG ACCGCGCCGC CGTCCTCGCC
GCGGTCGCCG CGGCCAGACC CGACGCGATC GTCCACCAGA TGACCGCACT CACGGGGCTG
ACGGACATGC GGAAGTTCGA GCGCGCGTTC GCGCTCACGA ACCGGCTGCG CACCGAGGGC
ACCGACCACC TGCTGGAGGC GGCGCGCGCG AGCGGCGTCG AGCGGGTCGT CGCGCAGAGC
TTCACCGGCT GGCCGAACGC GCGCGGCGGC GATCCCGGCG TGCTGCGGAC CGAGGACGAC
CCGCTCGACC CCGACCCGCC GGCGCAGCTG CGCACGACGC TCGCCGCGAT CAGACGGCTG
GAGGAGCGCG TCACGGCGGC GGGCGGCGTC GCGCTGCGCT ACGGCGGCCT CTACGGCCCC
GGCACCGGCC TCACGCGGGG CGGCGAGCAG TGGGAGGCGG TGCGCGCCCG CAAGTTCCCC
GTCGTCGGCG ACGGCGGCGG CGTCTGGTCG TTCCTGCACG TCGCCGACGC GGCGGGAGCG
ACGCTGGCGG CGCTGGAGCG GTGGACGCCG GGCGAGGTCT ACAACGTCTG CGACGACGAG
CCCGCCGCCG TGCGCGAGTG GCTCCCCGCG CTCGCGCGGA CGGCCGGCGC ACCGCCGCCG
CGGCATGTGC CGCGCTGGGT CGGACGGCTG ATCGGCGCGC ACGTCGTCGC GCTCATGTGC
GAGATCCGCG GCTCCTCGAA CGCGAAGGCG AAGCGGCAGC TCGACTGGGC GCCGGCGTGG
CCGTCGTGGC GCGAGGGCTT CGCGGCGCTC GACGGCGAAG GCGAGCGCGT CGCGACGGCT
CGGCGCTGA
 
Protein sequence
MKIFLAGATG AVGRALIPQL TAGGHHVVGT TRSAAKASAL WELGAEPVVL DGLDRAAVLA 
AVAAARPDAI VHQMTALTGL TDMRKFERAF ALTNRLRTEG TDHLLEAARA SGVERVVAQS
FTGWPNARGG DPGVLRTEDD PLDPDPPAQL RTTLAAIRRL EERVTAAGGV ALRYGGLYGP
GTGLTRGGEQ WEAVRARKFP VVGDGGGVWS FLHVADAAGA TLAALERWTP GEVYNVCDDE
PAAVREWLPA LARTAGAPPP RHVPRWVGRL IGAHVVALMC EIRGSSNAKA KRQLDWAPAW
PSWREGFAAL DGEGERVATA RR