Gene Daro_3787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3787 
Symbol 
ID3567943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4073242 
End bp4074696 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content62% 
IMG OID637682262 
Productaldehyde dehydrogenase 
Protein accessionYP_286986 
Protein GI71909399 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0000000169768 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0011277 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGA TCCTGAATTT TATTAACGGC GAGTTCGTTG CTACAGGGAA GCAGTTCGAG 
AAGCGGACGC CGCTCGACAA CTCGCTGATC GGCATGGTGC ATGAAGCCGG CAAGGCCGAG
GTTGATGCCG CGGTCAAGGC AGCGCACGAT GCGCTGGAAG GGCCGTGGGG CAAGATGACC
GTGGTCGAGC GCACCGATAT CCTGAACAAG GTGGCCGACG AGATTACCCG TCGCTTCGAC
GAATTCCTCG AAGCCGAATG TGCCGATACC GGCAAGCCGA AGAGCTTGGC TTCTCATATC
GACATCCCGC GCGGCGCAGC CAATTTCAAG ATTTTCGCCG ACGTGATCAA GAACGCGCCG
ACCGAGTTCT TCGAAATGGC GACACCGGAT GGCAAGGGCG CGCTGAACTA CGCCATCCGG
CGCCCGGTGG GCGTGGTCGG TGTGGTCTGC CCGTGGAACC TGCCGCTGCT GCTGATGACC
TGGAAGGTTG GTCCGGCGCT GGCCTGCGGC AACACGGTCG TCGTCAAGCC GTCGGAAGAA
ACCCCGTCGA CGGCCACCCT GCTCGGCGAA GTGATGAATG CCTGCGGCGT GCCCAAGGGC
GTCTACAACG TCGTGCATGG TTTCGGGCCG AATTCGGCCG GCGAATTCCT GACCACCAAC
CAGAACGTGA ATGCGATCAC CTTTACTGGC GAGACCCGTA CCGGTGCTGC GATCATGAAG
GCCGCTGCCG ACGGTGCCCG CCCGGTATCG CTGGAAATGG GCGGCAAGAA CCCGGCCATC
GTTTTTGCCG ACGCCAATCT GGATGTCGCG ATCGAAGCCA CGCTGCGCTC CTGCTTCTCG
AACTGTGGCC AGGTCTGCCT GGGCACCGAG CGTGTCTATG TCGAGCGCCC GTTCTTCGAG
ACCTTCGTGG CTGCCCTGAA GGCCGGTGCC GAAAAGCTCA AGCTCGGCGT GCCGAGCGAC
CCGAGCGCCA ACATGGGGCC ACTGGTCAGC CAGGAACACC GCAACAAGGT CTTGTCGTAT
TACAAGAAGG CCGTCGAGGA GGGCGCCACC GTGGTTACTG GTGGTGGCAT CCCCGACATG
CCGGGCGAGC TGGCCAATGG CGCCTGGGTG CAGCCGACCA TCTGGACCGG TCTCGACGAC
AACGCTGCCG TGGTTCGCGA AGAGATCTTC GGACCTTGCA CGACCGTGAT GCCGTTCGAC
AGCGAAGATG AAGTGATCAA GCGCGCCAAC AACACGACCT ACGGTCTGGC CGCCTCGGTC
TTCACCCAGG ACGTCAATCG CGCTCACCGC GTTGCTGGTC GTATCGAAGC CGGCCTGGTC
TGGGTGAACA GCTGGTTCCT GCGCGACTTG CGCACGCCGT TCGGTGGCGC CAAGCAGTCC
GGTATCGGCC GCGAGGGTGG TGTGCATTCA CTTGAGTTCT ACACCGAACT CAAGAACGTC
TGCATCAAGC TGTAA
 
Protein sequence
MKQILNFING EFVATGKQFE KRTPLDNSLI GMVHEAGKAE VDAAVKAAHD ALEGPWGKMT 
VVERTDILNK VADEITRRFD EFLEAECADT GKPKSLASHI DIPRGAANFK IFADVIKNAP
TEFFEMATPD GKGALNYAIR RPVGVVGVVC PWNLPLLLMT WKVGPALACG NTVVVKPSEE
TPSTATLLGE VMNACGVPKG VYNVVHGFGP NSAGEFLTTN QNVNAITFTG ETRTGAAIMK
AAADGARPVS LEMGGKNPAI VFADANLDVA IEATLRSCFS NCGQVCLGTE RVYVERPFFE
TFVAALKAGA EKLKLGVPSD PSANMGPLVS QEHRNKVLSY YKKAVEEGAT VVTGGGIPDM
PGELANGAWV QPTIWTGLDD NAAVVREEIF GPCTTVMPFD SEDEVIKRAN NTTYGLAASV
FTQDVNRAHR VAGRIEAGLV WVNSWFLRDL RTPFGGAKQS GIGREGGVHS LEFYTELKNV
CIKL