Gene Cwoe_4047 details


Gene Information

Locus tag: Cwoe_4047
Symbol:
ID: 8734508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4299014
End bp: 4300420
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 75%
IMG OID: 646504675
Product: Aldehyde Dehydrogenase
Protein accession: YP_003395839
Protein GI: 284045499
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
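
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 4299014
END_BP = 4300420


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1407 468
```

Both values agree with the Gene Length and Protein Length fields of the record.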


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCA CGATCACCCG TCACGATCCC GCCGACGTCC GCGAGATCGC GGCCACCTAC 
GAGCAGCACA CCGCCGCCGA CGTCACCGCC GCGTGCGACC GCGCGGCGGC CGGCGCCGAG
GTCTGGGGCC GCACCCCCGG GCCGCGGCGC GCGGCGGTCC TGCACTCCGC CGCGACGCTG
ATCGAGCAGC GCGCCGACCA GATCGCGACG GACATCACGC GCGAGGAGGG CAAGCTTGTC
TCCGACGCGC GCGGCGAGAC GCTGCGCGCG GCGGCCGTGC TGCGCTTCCA CGCCGGCGAG
GCCGAGCGCA CGCACGGCGA GGCCGGCGAC GCCGGCGATC CCGGCACGCT CGCGTTCACG
CGCCGGCGTC CGCTCGGCGT CGTCGCGCTG ATCACACCGT GGAACTTCCC GATCGCGATC
CCGGCGTGGA AGCTCGCGCC TGCGCTCGCG GCCGGCAACG CGGTCGTGCT GAAGCCGTCC
TCCCGCGCTC CGGGCGGCGC GCTCGCGCTC GTCGCCGCAT TGCGCGACGC GGGCCTGCCG
GACGGCGTCG TCGAGGTCGT CATCGGCGGC AGCGCCGTCG GCACCGCGCT GTCCGACGAC
GCGCGCGTCG CGGCGCTCTC GTTCACCGGC TCCAACGCCG TCGGCGACGC GGTGCGCGAG
CGCGTGCAGG CGCGCGGCGC GCGCTTCCAG GGCGAGCTGG GCGGCAACAA TCCGCTGATC
GTGCTCGGCG ACGCCGACGT CGCGCACGCG GCGAAGACCG CCGTCGGCGG TGCGTTCGGC
GCTGCCGGGC AGAAGTGCAC CGCGACGCGG CGCGTGATCG TCGAGCGGGC GGCGTACGAG
CCGCTGCTCG CGGCGATGCA GCGCGAGGTC GCCGCGCTGC GCACCGGCCC GGGGCTCGAC
GCCGCCTCGC AGGTGCCGCC GCTGGTCGAC CGCGACGCGC AGAAGGAGAT CCTCGACGCG
ATCGCGCAGG CGGTCTCCGC AGGCGCGTCA GCGGTCACGG GCGGGCACGC CGGCGACGGC
GAGCTGGAGC ACGGCTGCTT CGTGCTGCCG ACCGTGCTGG CCGAGGTCGG CCCCGGCATG
ACGGTGATCG ACGACGAGGT CTTCGGGCCG GTCTGCGCGG TCCTGCCGGC GGACGGGATC
GACCACGCGA TCGAGCTGGC GAACGCGACG CCGTACGGGT TGTCCGCCTC GATCTGCACG
AACGACCTCA GAGGCGCGTT CCGCTTCGTC GAGCGGATCG ACGCGGGCAT GGTGCACGTC
AACCGGCCGA CGCCGGGCGC GGATCCGCAC ATGCCGTTCG GCGGCGTGAA GGGCTCGGCC
GGCTCCGGCT ACCGCGAGCA GGGCAGAGCA GCGCTGGAGT TCTTCAGCCA GAGCCAGACC
GTCTACGTCC AGCACGACGT CCCATGA
 
Protein sequence
MASTITRHDP ADVREIAATY EQHTAADVTA ACDRAAAGAE VWGRTPGPRR AAVLHSAATL 
IEQRADQIAT DITREEGKLV SDARGETLRA AAVLRFHAGE AERTHGEAGD AGDPGTLAFT
RRRPLGVVAL ITPWNFPIAI PAWKLAPALA AGNAVVLKPS SRAPGGALAL VAALRDAGLP
DGVVEVVIGG SAVGTALSDD ARVAALSFTG SNAVGDAVRE RVQARGARFQ GELGGNNPLI
VLGDADVAHA AKTAVGGAFG AAGQKCTATR RVIVERAAYE PLLAAMQREV AALRTGPGLD
AASQVPPLVD RDAQKEILDA IAQAVSAGAS AVTGGHAGDG ELEHGCFVLP TVLAEVGPGM
TVIDDEVFGP VCAVLPADGI DHAIELANAT PYGLSASICT NDLRGAFRFV ERIDAGMVHV
NRPTPGADPH MPFGGVKGSA GSGYREQGRA ALEFFSQSQT VYVQHDVP
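
The gene and protein sequences can be cross-checked by translating one against the other. The sketch below is a minimal illustration, not the database's own method: it builds a codon table from the standard amino-acid assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in the permitted start codons), then translates the first and last lines of the gene sequence. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: the same assignments apply under translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTTCCA CGATCACCCG TCACGATCCC GCCGACGTCC GCGAGATCGC GGCCACCTAC"
last_line = "GTCTACGTCC AGCACGACGT CCCATGA"

print(translate(first_line))  # MASTITRHDPADVREIAATY
print(translate(last_line))   # VYVQHDVP
```

The two outputs match the first 20 and last 8 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.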