Gene Daro_3806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3806 
Symbol 
ID3567962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4088331 
End bp4089800 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content63% 
IMG OID637682280 
Productaldehyde dehydrogenase 
Protein accessionYP_287004 
Protein GI71909417 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00657404 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCAAC CGCAAAGAAT CCACCATTTC ATCAACGGCG AATTCACCGC TTCGCCCGAT 
CCCCGCTATT TCGACAAGCG TTCGCCGGTT GACGGCCGCG TCATCGCCCA CATCGCCGAA
GCCGGCCAGG CCGATGTCGA TGCCGCCGTC ACCGCTGCCC GTGCTGCGCT GAAGGGTGAA
TGGGGCAAGC TGAGCACCGA CCAGCGCGTC GACCTGCTCT ATGGTGTGGC CAACGAAATC
ACCCGCCGCT TCGATGATTT CGTCGCTGCC GAAATGGCCG ATACCGGCCA GCCCTCGCAC
GTCCAGACGC ATGTCTTCAT TCCGCGCGGC GCGGCCAACT TCAAGGTGTT CGCCGACGTG
ATCAAGAACG TTGCCGCCGA ATCCTTCCGC ATGGCGACGC CAGACGGTAA GGGCGCACTG
AACTACGCGA TCCGTAATCC GAAGGGCGTG ATCGGCGTTA TCTCGCCGTG GAATGCGCCC
TTCCTGCTGA TGACCTGGAA GGTTGGCCCG GCACTAGCCT GTGGCAACAC CGTGGTGGTC
AAGCCTTCAG AGGAAACCCC GCTGACCGCC ACGCTGCTCG GCGAGGTGAT GAACAGCGTC
GGCATTCCCA AGGGCGTCTA TAACGTGATC AACGGCTTCG GCCCCGATTC GGCCGGCGCT
TACCTGACCC AGCATCCGGG CGTCGATGCC ATCACCTTCA CCGGCGAAAC TCGCACCGGC
ACGGCGATCA TGAAGGCCGC CGCCGAAGGC ATGCGCGACG TGTCCTTCGA ACTGGGCGGC
AAGAATGCCG GCATCGTTTT CGCCGACTGC AATTTCGAGG CAGCGGTCGA TGGCATCTTC
CGCTCCGCCT TCCTCAACAC CGGGCAGGTC TGCCTGGGCA CCGAGCGCGT CTATGTCGAG
CGGCCGATAT TCGAAAACTT CGTGCAGGCG CTGAAGGCGA AGGTCGAAGG TGTGCGCTAT
GGCCGCCCGG AAGACCACAC CAGCACTTAC GGCCCGCTGA TCAGCCAGGA ACACCGCGAC
AAGGTACTGT CGTATTACAA GAAAGCAGTC GACGAAGGGG CGACAGTCGT CACCGGCGGC
GGCGTGCCCG ACATGCCGGC CGAGCTGGCC GGTGGCAGTT GGGTGCAGCC GACCATCTGG
ACTGGCTTGC CGGAAACCGC CGCCGTGGTG CGCGAGGAAA TCTTCGGCCC GTGCTGCCAC
ATCCGCCCCT TCGACAGCGA AGAAGAAGTG ATCGAACTCG CCAACGCCAA CGACTACGGC
CTGGCGACCA CGATCTGGAC CGAGAACCTG TCGCGCGCCC ATCGCGTCGC CGAGCGCGTC
GAAGTCGGCG TCACCTGGGT GAACAGCTGG TTCCTGCGCG ACCTGCGCAC GCCCTTCGGC
GGCTCAAAGC AGTCCGGCAT CGGCCGCGAA GGCGGTGTCC ATTCGCTCGA GTTCTATACC
GAAACCCGCA ACGTCTGCAT CAAGCTCTAA
 
Protein sequence
MSQPQRIHHF INGEFTASPD PRYFDKRSPV DGRVIAHIAE AGQADVDAAV TAARAALKGE 
WGKLSTDQRV DLLYGVANEI TRRFDDFVAA EMADTGQPSH VQTHVFIPRG AANFKVFADV
IKNVAAESFR MATPDGKGAL NYAIRNPKGV IGVISPWNAP FLLMTWKVGP ALACGNTVVV
KPSEETPLTA TLLGEVMNSV GIPKGVYNVI NGFGPDSAGA YLTQHPGVDA ITFTGETRTG
TAIMKAAAEG MRDVSFELGG KNAGIVFADC NFEAAVDGIF RSAFLNTGQV CLGTERVYVE
RPIFENFVQA LKAKVEGVRY GRPEDHTSTY GPLISQEHRD KVLSYYKKAV DEGATVVTGG
GVPDMPAELA GGSWVQPTIW TGLPETAAVV REEIFGPCCH IRPFDSEEEV IELANANDYG
LATTIWTENL SRAHRVAERV EVGVTWVNSW FLRDLRTPFG GSKQSGIGRE GGVHSLEFYT
ETRNVCIKL