Gene Daro_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2068 
Symbol 
ID3570190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2225238 
End bp2226866 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID637680542 
Productaldehyde dehydrogenase 
Protein accessionYP_285282 
Protein GI71907695 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value0.72096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG CCGATCAGCT GACTGCGCTA TTCCCTCGCG AAAGCGAGAT CCCCGGCGAC 
TGCCGCATCA TTGCGCCAAT TCACCAGCGC GCGATCCTGA TCAACGGTGA AATGCGGATC
TGGAAGGGCG AGACGCGGAC GGTGCATTCG CCGGTCTGCG TGCAGGGTAG TGACGGCTCG
CTGAGTCATG TCGAACTGGG CAGCGTGCCG GTCACCGGCA CCGTCGAGGC CGATGCAGCA
CTGGCCGCGG CGGTAACCGC CTACGATCAC GGCCGTGGCG CCTGGCCGAA CCTGTCGGTG
GCCGAGCGCA TTGCCTGCGT TGGCGATTTC ACCAACCAGA TCGTCGCCCG CCGCCGCGAG
ATCGTGAACC TGATCATGTG GGAAATCGGC AAGAGCCTGG CCGATTCGCA GAAGGAATTC
GACCGCACCA TCGACTACAT CCGCGCCACC GTCGAGGAGT TGAAACGGCT GGACAACAGC
AATTCGCGCT TCGAGATCGT CGACGGCACC ATCGCCCAGA TCCGCCGCTC GCCGGTCGGC
ATCGCGCTGT GCATGGGGCC GTACAACTAC CCGATGAACG AGACCTTCAG CACGCTGATC
CCGGCGCTGA TCATGGGCAA CGTCGTGCTC TTCAAGCCGC CGCGCTTTGG CGTGCTGCTC
TACTACCCGA TGCTCGAAGC TTTCCGTAGT GCCTTCCCGC CCGGCGTCAT CAACATCGTC
TATGGCCAGG GCCACGTCGT CGTTCCGCAC ATCATGGGCT CCGGCCAGGT CAATGTGCTG
GCCCTGATCG GCTCGTCCGA AGTCGCCGAC CAGCTCAAGA AATCGCACCC GAAGACCAAC
CGCCTGCGCG CCATCCTCGG CCTCGGCGCC AAGAACGCGG CGATCATCAT GCCCGACGCC
GACATCGAGC TGACGGTCAG GGAATGCATC ACCGGCGCCC TCTCCTTCAA TGGCCAACGG
TGCACGGCAA TCAAGATGAT CCTGGTGCAT CAGTCGATCG CCGAAACTTT CCTGCGTCGC
TTCTGCGAGG AAGTCGGCAA GCTGGCCATC GGCATGCCTT GGGAAGCCGG GGTCACGCTG
ACCCCGCTGC CCGAGATGGC GATGGTCACC TACATGAACG AGTGCATTGC CGATGCCCTG
AGCAAGGGCG CCAAGGTCAT CAACCCCGGC GGCGGCACCA CGGTTGAAAC CCTGTTCTAC
CCGGCTGTGA TCTTTCCGGT CAGCGAGGGC ATGAAGCTCT ACCGCGAGGA ACAGTTCGGC
CCGATCATCC CGGTCGCGAC CTTCGAGGAC ATCGAAACGC CGCTCGAGTA CGTGATCACC
TCCGACCATG GCCAGCAGGT CAGCATCTTC GGCAGCGATC CGGGGCAGAT CGCCTCGCTG
GTCGATACGC TGGTCAATCA GGTCTGCCGG GTCAATATCA ACTGCCAGTG CCAGCGCGGC
CCGGACGTCT TCCCCTTCGT CGGGCGCAGG GATTCGGCCG AAGGCACCCT GTCCGTCCAT
GACGCGCTGC GCGCCTTCTC GATCCGCACC ATGGTCGCCG CCAAGCAGAC CGAGGCCTCG
AAGAAGCTGC TCGACGCCAT CGTGCTCGGT AACAAATCGA ACTTCATCAA CACCCACTTC
ATTCTCTGA
 
Protein sequence
MKTADQLTAL FPRESEIPGD CRIIAPIHQR AILINGEMRI WKGETRTVHS PVCVQGSDGS 
LSHVELGSVP VTGTVEADAA LAAAVTAYDH GRGAWPNLSV AERIACVGDF TNQIVARRRE
IVNLIMWEIG KSLADSQKEF DRTIDYIRAT VEELKRLDNS NSRFEIVDGT IAQIRRSPVG
IALCMGPYNY PMNETFSTLI PALIMGNVVL FKPPRFGVLL YYPMLEAFRS AFPPGVINIV
YGQGHVVVPH IMGSGQVNVL ALIGSSEVAD QLKKSHPKTN RLRAILGLGA KNAAIIMPDA
DIELTVRECI TGALSFNGQR CTAIKMILVH QSIAETFLRR FCEEVGKLAI GMPWEAGVTL
TPLPEMAMVT YMNECIADAL SKGAKVINPG GGTTVETLFY PAVIFPVSEG MKLYREEQFG
PIIPVATFED IETPLEYVIT SDHGQQVSIF GSDPGQIASL VDTLVNQVCR VNINCQCQRG
PDVFPFVGRR DSAEGTLSVH DALRAFSIRT MVAAKQTEAS KKLLDAIVLG NKSNFINTHF
IL