Gene Cwoe_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3884 
Symbol 
ID8734339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4122337 
End bp4123836 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content75% 
IMG OID646504506 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_003395676 
Protein GI284045336 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.163878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.985606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCG TCGCGCCGTA CGCGCCGGAG GCGCCATACG GTCTCTTCAT CGGCGGAGAG 
GAGCTGCCGG GCGACGGCGA CTTCGCCGCG CTGAACCCGT CGACCGGCGC GCAGTGGGCG
ACCGTCGCGG AGGCGTCGGC CGCGCAGGCC GACCTCGCCG TGCGGGCGGC CTCCGGCGCC
TTCTCCGGCT GGCGGCGCTC GTCGCCTGCG ACGCGCCAGC AGGTGCTGCT GGCGCTCGCC
GACGCGATCG AGGCGACGCC CGACTGGCCG GCGCTGCTGG CGACCGAGAA CGGCCGCCCG
ATCCGCGAGG CGCTCGCCGC CGACGTCCCT TTCTCCGCCG CCGTGCTGCG CTACTACGCC
GGCCTCGTGC GCGGCCTGCA CGGCGAGACG ATCCCGACCG GCGACCCGCT CAGCCACGTC
TTCACCGTGC GCGAGCCGCT CGGCGTGATC GCGGCGCTGA TCCCGTGGAA CTCGCCGTTG
ATCTCCGCCG CGCTGAAACT GGGTCCTGCG CTGGCGACCG GCAACACGGT CGTGCTGAAG
CCGTCGGAGT TCGCGGCGCC GAGCGTCGTC GAGCTGGCGC GCCGCACCGC CGGCCTGCTC
CCGCCCGGCG TCCTCAACGT GCTGACGGGC AGCGGCCCCG GGGCCGGCGC GGCGCTCGTC
GCGCATCCTG GGATCGCGAA GATCTCGTTC ACCGGCGGCG TCCCGACCGC GCGCCACATC
GTCCGCGCGA CCGCCGAGAC GCTGACGCCG ACGATCCTCG AGCTGGGCGG CAAGAGCGCG
TTCGTGATCT GCCCGGACGC CGATCTCGAC GCCGCCGTCC ACGACGCGCT GAGCGGGATC
CTCAGCCAGA ACGGCGAGGT CTGCTTCGCG GCGTCGCGGC TGTTCGTGCA CGAGGACGTG
CGCCCCGAAT TCCTCGAACG GATGCGCGCG GCGATCGCCG GCGTGCGGAT CGGCGACGCG
CTCGACGCCG GCACGCAGGT CGGCCCGCTC GTCTCCGCCG CGCACCGCGA CCGCGTGCTC
GGCTACGTCG AGCAGGCGCG CGCGGAGGGC GCCCGCGTGC TGGCCGGCGG CTCGCGGCGC
GAGCTGCCGG GCGCGCTCTC GGACGGCTAC TACGTCGAGC CGGCGCTCGT CGACGACCCG
GACGGGTCGA CGACGGCCGC GCGCGAGGAG ATCTTCGGAC CGGTCGTCGT CGCACAGACG
TGGCGCGAGG AGCGCGACGT GATCGCGCGC GCGAACGACA GCGAGTTCGG GCTCGCCGCC
GGCGTGTGGA CCCGCGACCT CGGTCGCGCC CACCGCTTCG CGGATGAGCT GGAGGCGGGG
ACGGTCTGGG TCAACACCTG GTTCCAGGTC GGCCCGGGCC AGCCGTTGGG CGGCATCAAG
CAGAGCGGGC ACGGCCGGGA GCTGTGCGCC GAGACGCTGC TCGAGTACAG CGCGCCGAAG
GCCGTGAGCA TGCGCCTGGA CGGCGCCCGC CCGGACCTGT GGGGGTCGGG ACGGTCGTAG
 
Protein sequence
MSAVAPYAPE APYGLFIGGE ELPGDGDFAA LNPSTGAQWA TVAEASAAQA DLAVRAASGA 
FSGWRRSSPA TRQQVLLALA DAIEATPDWP ALLATENGRP IREALAADVP FSAAVLRYYA
GLVRGLHGET IPTGDPLSHV FTVREPLGVI AALIPWNSPL ISAALKLGPA LATGNTVVLK
PSEFAAPSVV ELARRTAGLL PPGVLNVLTG SGPGAGAALV AHPGIAKISF TGGVPTARHI
VRATAETLTP TILELGGKSA FVICPDADLD AAVHDALSGI LSQNGEVCFA ASRLFVHEDV
RPEFLERMRA AIAGVRIGDA LDAGTQVGPL VSAAHRDRVL GYVEQARAEG ARVLAGGSRR
ELPGALSDGY YVEPALVDDP DGSTTAAREE IFGPVVVAQT WREERDVIAR ANDSEFGLAA
GVWTRDLGRA HRFADELEAG TVWVNTWFQV GPGQPLGGIK QSGHGRELCA ETLLEYSAPK
AVSMRLDGAR PDLWGSGRS