Gene Cwoe_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3669 
Symbol 
ID8734124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3904882 
End bp3906570 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content71% 
IMG OID646504291 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003395461 
Protein GI284045121 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.281782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCGG AGCCGTTCGA CCTCAAGCAT CGCAGCCGGG CCCTGACGGA GGGTCCGCAC 
CGGGCCGCCG CCCGCTCCTA CCTCCACGGC ATCGGCTACT CCGCCGAGGA CCTCGCGAAG
CCGATCGTCG GCGTGGCGCA CTCGTGGATC GAGACGATGC CGTGCAACTT CAACAACCGC
GTGCTCGCCG CGAGAGTCAA GGAAGGCATC CGTGCCGCCG GCGGCACGCC GATGGAGCTG
AACACGATCG CGATCTCCGA CGGGATCACG ATGGGAACCT CCGGCATGCG CGCGTCGCTC
GTGTCGCGCG AGCTGATCGC CGACTCGATC GAGCTGGTCG CCAGCGCCCA CCACTTCGAC
GCGATCGTCG TCATCAGCGG CTGCGACAAG ACGATCCCCG GCACGGTGAT GGCGCTCGCG
CGCCTCGACA TCCCGTCGCT CATGCTCTAT GGCGGCTCGA TCCGGCCCGG CCGCTACAAG
GACCGCGAGG TCACGATCCT CGACGTCTTC GAGGCGGTCG GTGCGCACGC CGTCGGCAAG
ATCACCGACG CGGAGCTGCA CGAGCTGGAA GAGGTCTCCT CGCCCGGCGC CGGCGCCTGC
GCCGGCCAGT TCACCGCCAA CACGATGGCG ATGGCGTTCG AGGTGCTCGG CATCTCGCCC
GCCGGCTCGG CGATGGTGCC CGCCGAGGAC GGCAGCAAGC GCCAGGTCGC CTTCGACGCC
GGCACGCTCG TGATGGACGT GCTCAGACGC GGCCTGCGCC CGCGCGACGT CATCACCAAG
GACGCGCTGG AGAACGCGAT CGCCGCCGTC GCGATGAGCG GCGGCTCGAC CAACGCGGTG
CTCCACCTGC TGGCGGTCGC CAAGGAGATG GGCGTGCCGC TGGAGATCGA CGAGTTCGAC
CGCATCAGCG AGGCGACGCC GCTGCTCTGC GACCTCCAGC CCGGCGGCAG ATACAACGCC
ACCGACCTTT ACGCGGCCGG CGGGGTCCCC GTCGTCTTCA AGCGCCTCAG AGCGCACGGC
AGACTGCACG AGGACGCGAT CTCCGTCACC GGCCAGACGG CCGGCGAGAT CGCCGACGCC
GCCCAGGAGA CGCCCGGCCA GGTCGTCGTC CGCCCACTGG AGGACCCGCT CAAGGCGACC
GGCGGCTTCG CGATCATGAG AGGCAACGTC GCCCCCGACG GCTGTGTGAT CAAGCTCGCC
GGCCACGAGC GCCGCCACCA CGTCGGTCCG GCGCGCGTCT TCGACGGCGA GGAGGCCGCG
ATGAAGGCGG TGCTCGCGAG CGAGATCGTC GCGGACGACA TCGTCGTGAT CCGCAACGAA
GGCCCGGCGG GCGGCCCCGG CATGCGCGAG ATGCTGGCCG TGACGGCGGC GATCATCGGC
GCCGGGCTGG GCGACAGCGT CGCGCTGATC ACCGACGGCC GCTTCTCCGG CGCGAGCCAC
GGCTTCATGG CCGGCCACAT CGCGCCCGAG TCCGTTCGCG GCGGGCCGAT CGCCGCGCTG
CGCGAGGGCG ACAGGCTCAC GATCGACGTC GACGCCCGCC GCATCGACGT CGACCTGACC
GACGAGCAGA TCGCCGAGCG GGTCGCCACC TACAGACCGC TGCCGCGCGC TGACGAGCAC
ATCGACGTCG CGATCCGCAA GTACGCCAAG CTCGTCGGCA GCGCCGCAGA CGGCGCCGTC
ACGCACTGA
 
Protein sequence
MSPEPFDLKH RSRALTEGPH RAAARSYLHG IGYSAEDLAK PIVGVAHSWI ETMPCNFNNR 
VLAARVKEGI RAAGGTPMEL NTIAISDGIT MGTSGMRASL VSRELIADSI ELVASAHHFD
AIVVISGCDK TIPGTVMALA RLDIPSLMLY GGSIRPGRYK DREVTILDVF EAVGAHAVGK
ITDAELHELE EVSSPGAGAC AGQFTANTMA MAFEVLGISP AGSAMVPAED GSKRQVAFDA
GTLVMDVLRR GLRPRDVITK DALENAIAAV AMSGGSTNAV LHLLAVAKEM GVPLEIDEFD
RISEATPLLC DLQPGGRYNA TDLYAAGGVP VVFKRLRAHG RLHEDAISVT GQTAGEIADA
AQETPGQVVV RPLEDPLKAT GGFAIMRGNV APDGCVIKLA GHERRHHVGP ARVFDGEEAA
MKAVLASEIV ADDIVVIRNE GPAGGPGMRE MLAVTAAIIG AGLGDSVALI TDGRFSGASH
GFMAGHIAPE SVRGGPIAAL REGDRLTIDV DARRIDVDLT DEQIAERVAT YRPLPRADEH
IDVAIRKYAK LVGSAADGAV TH