Gene Cwoe_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4236 
Symbol 
ID8734698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4496528 
End bp4498036 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content72% 
IMG OID646504862 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_003396025 
Protein GI284045685 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.987331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.292037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA CGAGCACCAC CACCCTCAAC CACTGGATCG GCGGCCGTGA GGACGCCGGC 
ACCGGCGACC GCTTCGGCGA GGTGACGGAG TCGGCGACCG GCGAGCTCGT CGCGCGCGTC
GCGTTCGCGA CCGAGGCCGA CGTCGACCGC GCCGTACGCG TCGCGGCCGA GGCCGCCGAC
GCGTGGGGCA GATCCTCGCT CGGCCAGCGC ACGAAGGTGA TGTTCGCCTT CCGCGAGCAG
GTCAACTCGC GCCGCGACGA GCTTGCTCGC GCGATCACGC GCGAGCACGG CAAGGTCCTC
TCCGACGCCG CCGGCGAGGT CCAGCGCGGC ATGGAGGTGA TCGACTTCGC GTGCGGTCTC
GGCCACCTGC TGAAGGGCGA GATGTCCGGC CAGGTCTCGC GCGGCGTCGA CTCGTACTCG
CTGCGTCAGC CGCTCGGCGT CGTCGCCGGC ATAACGCCGT TCAACTTCCC CGTGATGGTG
CCGCTGTGGA TGGCGCCGGT CGCGCTCGCG GCCGGCAACG CGTTCGTGCT GAAGCCGTCC
GAGCAGGACC CGTCCGCCTC GCTGCTGCTC GCCGACATGC TGAAGAACGC CGGTCTGCCC
GAGGGCGTCT TCACGGTCAT CAACGGCGAC AAGGACGCGG TCAACGCGCT GCTCGTCCAC
CCCGAGGTGA GAGCGGTCTC GTTCGTCGGC TCGACGCCGA TCGCCAAGCA CGTCTACGAG
ACGGCGACGG CGCACGGCAA ACGCGTGCAG GCGCTCGGCG GCGCGAAGAA CCACGCCGTC
GTGCTGCCCG ACGCCGACCT CGACCTCGCC GCCGACGCGC TCGTCTCGGC CGGCTACGGC
TCCGCCGGCC AGCGCTGCAT GGCGGTCTCC GTCGCGGTCG CCGTCGGCGC GATCGCCGAG
CCGCTGATCG CGAAGATCCA GGAGCGGATC GCCGGCCTGA CCGTCGGCGA CGGCTTCGAC
GCGGCGTCCG AGATGGGCCC GCTCGTGAGC GAGCGCCACC TCGGCCGCGT GCGCGGCCTC
GTCGACTCCG GCGAGGGCGA CGGCGCGACG CTGCTGGCCG ACGGCCGCGC GATCGCGGTC
GAGGGCCGCG AGGGCGGCCA CTGGCTCGGC CCGACGCTGT TCGACAACGT CAGACCCGGC
ATGGCGATCT ACGACGAGGA GATCTTCGGC CCGGTGCTGT GCGTCGTGCG CGCGGACTCC
TACGACGAGG CCGTCGGCCT CGCGAACTCC AGCCCGTACG GCAACGGCGC GGCGATCTTC
ACCAACGACG GCGGCGCCGC CCGGCAGTTC GAGCAGGACA TCACGGCCGG CATGGTCGGC
GTCAACGTGC CGATCCCGGT GCCGATGGCC TACCACTCGT TCGGCGGCTG GAAGGACTCG
CTGTTCGGCG ACCTCCACGT CCACGGCCCC GACGGCGTGC GCTTCTACAC GCGCGGCAAG
GTGATCACGC GCCGCTGGCC CGACCCGGCC GACCGCGGCA TCGACCTCGG CTTCCCGGTC
CACTCGTAG
 
Protein sequence
MTATSTTTLN HWIGGREDAG TGDRFGEVTE SATGELVARV AFATEADVDR AVRVAAEAAD 
AWGRSSLGQR TKVMFAFREQ VNSRRDELAR AITREHGKVL SDAAGEVQRG MEVIDFACGL
GHLLKGEMSG QVSRGVDSYS LRQPLGVVAG ITPFNFPVMV PLWMAPVALA AGNAFVLKPS
EQDPSASLLL ADMLKNAGLP EGVFTVINGD KDAVNALLVH PEVRAVSFVG STPIAKHVYE
TATAHGKRVQ ALGGAKNHAV VLPDADLDLA ADALVSAGYG SAGQRCMAVS VAVAVGAIAE
PLIAKIQERI AGLTVGDGFD AASEMGPLVS ERHLGRVRGL VDSGEGDGAT LLADGRAIAV
EGREGGHWLG PTLFDNVRPG MAIYDEEIFG PVLCVVRADS YDEAVGLANS SPYGNGAAIF
TNDGGAARQF EQDITAGMVG VNVPIPVPMA YHSFGGWKDS LFGDLHVHGP DGVRFYTRGK
VITRRWPDPA DRGIDLGFPV HS