Gene Cwoe_3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3738 
Symbol 
ID8734193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3971049 
End bp3972092 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content71% 
IMG OID646504360 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003395530 
Protein GI284045190 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.945238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0752567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTGA TGAAGCAGGA CGCGACCGAG GAAGAGATCC AGGCCGTCAT CGAGCGCGTC 
GAGGGCGCGG GCGCCCGCGC GCACCGCATC AACGGCGAGG AGCTGACGGT GATCGGCGCC
CTCGGCGACC GCGAGCACGT CCAGAAGCTG GAGCTGGAGG GTTCACCCGG GGTCGAGAAG
CTGCTCCCGA TCCAGAAGCC TTACAAGCTC GCCTCGTCGC AGATCCGCCA CGGCGAGCCG
AGCGTCCTCG AGATCGGTGG CCGCAAGATC GGCGGCGACA ACTTCGCGCT GATCGCCGGC
CCCTGCACGG TCGAGTCGCG CGAGCAGACG CTCGGCACCG CCGCGACGGT CGCCGCCGCC
GGCGTCACGC TCTTCCGCGG CGGCGCGTAC AAGCCGCGCA CGTCCCCTTA CGCCTTCCAC
GGCCTCGGGC AGGAGGGGCT GCGGCTGCTC GCCGAGGCCA AGCGGGAGAC CGGCCTGCCG
ATCGTCACCG AGCTGATGGA CGTGCGCGAC CTCGAGCCCG TGCTGGAGGT CGCCGACGTG
ATCCAGATCG GCGCGCGCAA CATGCAGAAC TACACGCTCC TGACCGAGCT CGGCCGCGCC
GGCCGCCCGG TCCTGCTCAA GCGCGGTCTG TCGGCGACGC TGGAGGAGCT GCTGAACGCC
TCCGAGTACA TCCTCAAGGA GGGCAACGAG GCGGTGATGC TGTGCGAGCG CGGGATCCGC
ACGTTCGAGA CCGCCTACCG CTTCACGCTC GACCTGACCG CGGTGCCGGT GCTGAAGGAG
CTGACGCACC TGCCGATCAT CGTCGACCCG TCGCACGCCG CCGGCCGGCG CGACCTCGTG
CAGCCGCTGT CGCTGGCCGC CGCCGCGGTC GGCGCCGACG GCATCATCGT CGAGGTCCAC
CCGAACCCCG ACGAGGCGAT CTGCGACGGA CCTCAGCAGC TCGTCGCGGC CGAGTTCGCC
GCCTACGCGG AGAAGGTCGC GCAGGCCGCG GCCGTCGCCG GCAAGACGAT CTCGACCCTG
GCCGCCGAGG CCACGGCCGC CTGA
 
Protein sequence
MIVMKQDATE EEIQAVIERV EGAGARAHRI NGEELTVIGA LGDREHVQKL ELEGSPGVEK 
LLPIQKPYKL ASSQIRHGEP SVLEIGGRKI GGDNFALIAG PCTVESREQT LGTAATVAAA
GVTLFRGGAY KPRTSPYAFH GLGQEGLRLL AEAKRETGLP IVTELMDVRD LEPVLEVADV
IQIGARNMQN YTLLTELGRA GRPVLLKRGL SATLEELLNA SEYILKEGNE AVMLCERGIR
TFETAYRFTL DLTAVPVLKE LTHLPIIVDP SHAAGRRDLV QPLSLAAAAV GADGIIVEVH
PNPDEAICDG PQQLVAAEFA AYAEKVAQAA AVAGKTISTL AAEATAA