Gene Daro_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4020 
Symbol 
ID3567192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4318295 
End bp4319959 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content65% 
IMG OID637682493 
Productalpha subunit of malonate decarboxylase 
Protein accessionYP_287217 
Protein GI71909630 
COG category[I] Lipid transport and metabolism 
COG ID[COG4670] Acyl CoA:acetate/3-ketoacid CoA transferase 
TIGRFAM ID[TIGR01110] malonate decarboxylase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.0677734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAC CGCAACCCCG TCAATGGGAC AGCCTGCGCC AGAACCGGGC GCGCCGCCTG 
GAACGGGCGG CCAGCCTCGG CCTGGCTGGC CAGAATGGCA AGGAAATTCC GGTCGATCGC
ATCATCGACC TGCTCGAAGC CGTCATCCAG CCGGGCGACC GTGTCTGCCT CGAGGGCAAC
AACCAGAAGC AGGCCGATTT CCTGTCCGAG TCGCTGGCCG ATTGCGATCC GGCCCGTATC
AATCACCTCA GCATGGTCCA GTCTGTCCTG GCGCTTCCGA GCCACGTGGA CCTTTTCGAG
CGCGGCCTGG CAACCCGCCT CGACTTTTCT TTCAGCGGCC CGCAGGGCGC CCGGCTGGCC
AAGCTGGTCC AGGAACAGCG CATCGAGATC GGGGCCATCC ATACCTATCT CGAACTGTTC
GGGCGCTATT TCATGGATCT GACGCCGAAT GTGGCGCTGA TCGCGGCGCA GGCGGCCGAT
GCCGAGGGCA ACCTCTACCT CGGGCCGAAT ACCGAGGACA CGCCGGCCAT CGTCGAGGCG
ACCGCGTTCA AGGGCGGCAT CGTGATCGCC CAGGTCAACG AGCGCCTCGA CAAGCTGCCG
CGCGTCGATG TGCCGGCCGA CTGGGTCGAC TTCACGGTGC TGGCGCCGAA GCCCAACTAC
ATTGAGCCAC TATTCACCCG CGACCCGGCG CAGATCACCG AAGTCCAGGT GCTGATGGCG
ATGATGGCGA TCAAGGGCAT CTACGCCGAA TACGGCGTTA CCCGGCTCAA TCACGGCATC
GGCTTCGATA CCGCGGCGAT CGAGTTGCTG CTGCCGACCT ACGCTGCCGA CCTCGGCCTG
AAGGGCAAGA TCTGCACGCA CTGGGCGCTC AATCCACATC CGACGCTGAT TCCGGCCATC
GAAGCCGGTT TCGTCGAGTC GGTCCATTGT TTCGGTTCCG AAGTCGGCAT GGATGACTAC
ATCTCCGCCC GTTCCGACAT CTTTTTTACC GGTGCCGACG GCAGCATGCG TTCCAACCGG
GCGTTTTCGC AAACGGCCGG CCTTTACGCC TGCGATATGT TCATCGGCTC GACCTTGCAG
ATGGACTTGG CCGGCAACAG TTCGACCGCG ACGCTGGGCC GCATCACCGG CTTCGGCGGG
GCGCCGAACA TGGGGTCCGA TCCGCACGGC CGGCGTCATG CCAGCCCGGC CTGGCTCAAG
GCCGGGCGTG AGGCCTACGG GCCGCAGGCG ATTCGCGGCC GCAAGCTGGT GGTGCAGATG
GTCGAGACTT TCCGCGAACA CATGGCGCCG GTTTTCGTCG ACGATCTCGA TGCCTGGAAG
TTGCAGGCCA GCATGGGTTC CGACCTGCCG CCGATCATGA TCTACGGCGA CGACGTCAGC
CATATCGTTA CCGAGGAAGG CATCGCCAAC CTGCTGCTCT GCCGCACACC GGCTGAGCGC
GAGCAGGCGA TCCGCGGTGT GGCCGGCTTC ACGCCGGTCG GGATGGCGCG GGACAAGGGC
ACCGTCGAAA ACCTGCGCGA TCGCGGCATC ATCCGCCGCC CGGAAGACCT CGGCATCGAC
CCGCGCCAGG CCAGCCGCGA CCTGTTGGCC GCCCGTTCGA TCAAGGATCT GGTGCGCTGC
TCCGGTGGCC TGTACGCGCC GCCTTCACGT TTCCGCAACT GGTGA
 
Protein sequence
MNAPQPRQWD SLRQNRARRL ERAASLGLAG QNGKEIPVDR IIDLLEAVIQ PGDRVCLEGN 
NQKQADFLSE SLADCDPARI NHLSMVQSVL ALPSHVDLFE RGLATRLDFS FSGPQGARLA
KLVQEQRIEI GAIHTYLELF GRYFMDLTPN VALIAAQAAD AEGNLYLGPN TEDTPAIVEA
TAFKGGIVIA QVNERLDKLP RVDVPADWVD FTVLAPKPNY IEPLFTRDPA QITEVQVLMA
MMAIKGIYAE YGVTRLNHGI GFDTAAIELL LPTYAADLGL KGKICTHWAL NPHPTLIPAI
EAGFVESVHC FGSEVGMDDY ISARSDIFFT GADGSMRSNR AFSQTAGLYA CDMFIGSTLQ
MDLAGNSSTA TLGRITGFGG APNMGSDPHG RRHASPAWLK AGREAYGPQA IRGRKLVVQM
VETFREHMAP VFVDDLDAWK LQASMGSDLP PIMIYGDDVS HIVTEEGIAN LLLCRTPAER
EQAIRGVAGF TPVGMARDKG TVENLRDRGI IRRPEDLGID PRQASRDLLA ARSIKDLVRC
SGGLYAPPSR FRNW