Gene Cwoe_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2249 
Symbol 
ID8732692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2369973 
End bp2371562 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID646502867 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003394049 
Protein GI284043709 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.408136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.876589 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCTT CCGCCACCCA GACCCTCGAG GTCGCCAGCA TCGTCGCCGG CCGCGCGGTC 
GACGGCGCCG CCGGCGGGAC GCTCGCGACC CGCAACCCGG CCGACCTGAC GCAGGTCGTC
GCGAACGTGC GGCTCGCCGA CAGCGCCGCG TTCGTGGCAG CGGCGCGCGC CGCGCACGAC
GCGCAGCCCG CGTGGGCTGC GGTGCCGGCA CCCGTGCGCG GCGCGGTCGT GCAGCAGATC
GGCCGTCTGG TCGAGGCGAG CAAGGAGTCG CTGGCGCGGC TGATCACGAC CGAGATCGGC
AAGCCGTACG CCGAGGCGCT CGGCGAGGTG CAGGAGGTCG TCGACACCTG CAACTTCTTC
ATCTCCGAGG GACGCCGGCT CTACGGCCAG ACCGTCCCGT CGGAGATGCC CGACAAGCAG
CTGTTCACGT TCCGCAAGCC GGTCGGCACG TGCGCGATCG TCACCGCCGG CAACTTCCCG
GCGGCGGTGC CGTCGTGGTA CATCGTGCCG GCGCTGCTGT GCGGCAACAC GGTCGTGTGG
AAGCCGGCCG AGTACGCGGC CGGCGTCAGC CGTGCCTTCT ACGAGCTGTT CGCGCGCGGC
GGGCTGCCGG ACGGCACGCT CAACCTCGTG CTCGCCGACG GCCCCGCGAC GTTCGCCGGG
CTGGAGCAGT CGCTGGAGCT GGGGCTCGTC GACAAGGTCG GCTTCACGGG CTCCTCCGAG
GTCGGCGTCC AGATCGGCGA GCTGTGCGGG CGCAACCTGC AGACGCCGTG CCTGGAGCTG
GGCGGCAAGA ACCCGCTCGT CGTGATGGGC GACGCCGACC TGGAGCTGGC GGTCGAGGGC
GCGCTGTTCT CCGGCTTCGG CACGGCCGGA CAGCGCTGCA CGTCGCTCGG CGTCGCGATC
GTCCACGACT CGGTCTACGA CGAGTTCCTG GAGCGCTTCG ACGCAGCCGC GCGCGCCGCG
GTCGCCGGCG ACCCGGCGGG CGACGTGCTG TTCGGGCCGC TGATGAACGA GCGCTTCGCG
GAGCGCTTCG AGCAGTGGCT CGGGCTGATC CAGCCGCACC ACCGCGTGCT CGGCTCCAGC
GGCACCGGCC GCATCACGGC CGCGAACCCG CGCGCGGGCT TCAGCGGCGG CGACCCCGAG
CGGGGCGTCT TCTACCACCC GACGATCGTC GCCGACGTGA CCACCGACGA CGAGCTGTAC
CGGCGCGAGA CGTTCGGCCC GATCGTCGCC GTCGCGCGCT TCTCGACCTT CGACGAGGCG
ATCGCGCTCG CGAACGGCCA CGGCTACGGG CTGTCGTCGG CGATCTACAC GCGCGACGCG
ACGGCGGCGC TGCGCTTCCG CGAGCGCGTC AGCGCGGGCA TGGTGTCGGT CAACAACTCG
ACGAGCGGCG CCGAGGCGCA CCTGCCGTTC GGCGGCAACG GCAAGTCCGG CAACGGCTCG
CGCCAGTCCG GCGTCTGGGT GCTCGACCAG TTCACGCGCT GGCAGTCGGT CAACTGGGAC
TTCTCCGGCA AGCTTCAGAA GGCGCAGATG GACGTCGTCG AGATCACCGC CGACGAGGGC
TTCCGGCTGG ACGGGTGGGA CGGACGCTGA
 
Protein sequence
MSPSATQTLE VASIVAGRAV DGAAGGTLAT RNPADLTQVV ANVRLADSAA FVAAARAAHD 
AQPAWAAVPA PVRGAVVQQI GRLVEASKES LARLITTEIG KPYAEALGEV QEVVDTCNFF
ISEGRRLYGQ TVPSEMPDKQ LFTFRKPVGT CAIVTAGNFP AAVPSWYIVP ALLCGNTVVW
KPAEYAAGVS RAFYELFARG GLPDGTLNLV LADGPATFAG LEQSLELGLV DKVGFTGSSE
VGVQIGELCG RNLQTPCLEL GGKNPLVVMG DADLELAVEG ALFSGFGTAG QRCTSLGVAI
VHDSVYDEFL ERFDAAARAA VAGDPAGDVL FGPLMNERFA ERFEQWLGLI QPHHRVLGSS
GTGRITAANP RAGFSGGDPE RGVFYHPTIV ADVTTDDELY RRETFGPIVA VARFSTFDEA
IALANGHGYG LSSAIYTRDA TAALRFRERV SAGMVSVNNS TSGAEAHLPF GGNGKSGNGS
RQSGVWVLDQ FTRWQSVNWD FSGKLQKAQM DVVEITADEG FRLDGWDGR