Gene Cwoe_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0419 
Symbol 
ID8730847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp429802 
End bp431700 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content72% 
IMG OID646501033 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003392230 
Protein GI284041890 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.176558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCGC TCCGCTCTCG CACCGTCACG CACGGCCGCA ACATGGCCGG CGCCCGCGCG 
CTGCTGCGCG CCTCCGGCGT CGCGCGCGAG GACTTCGGCA AGCCGATCGT CGCGGTCGCG
AACTCCTACA CGCAGTTCGT CCCCGGCCAC ACGCACCTGA AGCCGGTCGG CGAGGTCGTC
TCGGCGGCGG TCCACGCCGC CGGCGGCGTG CCGCTCGAGT TCAACACGAT CGCCGTCGAC
GACGGGATCG CGATGGGCCA CGGCGGGATG CTCTACTCGC TGCCCTCGCG CGACCTGATC
AGCGACAGCG TCGAGTACAT GGTCAACGCG CACTGCGCCG ACGCGCTGAT CTGCATCTCC
AACTGCGACA AGATCACGCC GGGCATGCTC AACGCCGCGC TGCGGCTCGA CATCCCGACC
GTCTTCGTCT CCGGCGGGCC GATGGAGGGC GGCGTCGCGA CGCTCGTCGA CGGCACCGTC
CGCAAGGGCC TGAACCTGAT CTCCGCGATG GCCGAGGCCG TCTCGCCGGA GGTCAGCGAC
GAGGACATGG ACCTGATCGA GGAGGCCGCC TGCCCGACCT GCGGCTCCTG CTCGGGCATG
TTCACCGCCA ACTCGATGAA CTGCCTGACC GAGGCGCTCG GGCTGTCGCT GCCCGGCAAC
GGCTCGACGC TCGCGACCCA TACCGCCCGC AAGCAGCTGT ACGAGGACGC CGGCCGCACC
GTCGTCGAGA TCGCCAAGCG CTACTACGAC GAGGACGACG CGAGCGTCCT GCCGCGTGCG
ATCGCGACCC GCGAGGCGTT CGAGAACGCG ATGACGCTCG ACATCGCGAT GGGCGGCTCG
ACCAACACGA TCCTCCACCT GCTCGCCGCC GCGCGGGAGG CGGAGGTCGA CTTCGCGATG
GGCGACATCG ACGAGCTGTC GCGCCGCGTC CCCTGCGTCT GCAAGGTCGC GCCGAACGGC
ACCTACCTGA TGGAGGACGT CCACCGCGCC GGCGGGATCC CCGCGATCCT CGGCGAGCTG
CACCGCGGCG GCCTGTTGAA CGAGCAGGTC AGAACCGTCC ACGCGCGCAC GATCGACGAG
TGGCTCGGCA GCTGGGACGT GCGCGGCCCG AAGCCGTCCG AGATCGCCGT CGAGCTGTTC
CACGCGGCGC CCGGCTGCGT GCGCTCCGCG AGAGCGTTCT CGCAGTCGGA GCGGTGGGAG
TCGCTTGACG TCGACGCGGC CGACGGCTGC ATCCGCGACC TCGACCACGC CTACTCGACC
GAGGGCGGGC TCGCGATCCT CTACGGCAAC GTCGCCGAGC GCGGCTGTGT CGTGAAGACG
GCCGGCGTCG ACGAGTCGGT CTTCAGATTC AGCGGCCCGG CCGTCGTCGT CGAGTCGCAG
GAGGACGCGG TCGAGCTGAT CCTCGCCGGC GGCGTCAGAG CGGGCGACGT CGTCGTGATC
CGCTACGAGG GCCCTCGCGG CGGCCCGGGC ATGCAGGAGA TGCTCTACCC GACCTCCTAC
CTGAAGGGTC GCGGGCTCGG CAGAGCGTGC GCGCTGATCA CGGACGGCCG CTTCTCCGGC
GGCACCTCCG GCCTCTCGAT CGGCCACGTC TCGCCCGAGG CGGCGTCCGG CGGCGCGATC
GCGCTCGTGC AGAGCGGCGA CACGATCGCG ATCGACATCC CGGCGCGCTC GATCGAGCTG
GAGGTCGGCG ACCACGAGCT GGCCGAGCGC CGCCGAGCGC TTGAGGCTGC CGGCGGCTAC
GCCCCGGTCG CGCGCGAGCG TGTCGTCTCG CCGGCGCTGC GCGCCTACGC GGCGATGGCG
ACCTCCGCCG ACACCGGCGC CGTCCGCGAC GTCGACGCCG TCGAGCGCGC CGTCGCCGCC
GCCCGTCTCC AGGGCAGCGA CGTTGCGCAG GCGACCTGA
 
Protein sequence
MTPLRSRTVT HGRNMAGARA LLRASGVARE DFGKPIVAVA NSYTQFVPGH THLKPVGEVV 
SAAVHAAGGV PLEFNTIAVD DGIAMGHGGM LYSLPSRDLI SDSVEYMVNA HCADALICIS
NCDKITPGML NAALRLDIPT VFVSGGPMEG GVATLVDGTV RKGLNLISAM AEAVSPEVSD
EDMDLIEEAA CPTCGSCSGM FTANSMNCLT EALGLSLPGN GSTLATHTAR KQLYEDAGRT
VVEIAKRYYD EDDASVLPRA IATREAFENA MTLDIAMGGS TNTILHLLAA AREAEVDFAM
GDIDELSRRV PCVCKVAPNG TYLMEDVHRA GGIPAILGEL HRGGLLNEQV RTVHARTIDE
WLGSWDVRGP KPSEIAVELF HAAPGCVRSA RAFSQSERWE SLDVDAADGC IRDLDHAYST
EGGLAILYGN VAERGCVVKT AGVDESVFRF SGPAVVVESQ EDAVELILAG GVRAGDVVVI
RYEGPRGGPG MQEMLYPTSY LKGRGLGRAC ALITDGRFSG GTSGLSIGHV SPEAASGGAI
ALVQSGDTIA IDIPARSIEL EVGDHELAER RRALEAAGGY APVARERVVS PALRAYAAMA
TSADTGAVRD VDAVERAVAA ARLQGSDVAQ AT