Gene Daro_2352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2352 
Symbol 
ID3566034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2535764 
End bp2536951 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content64% 
IMG OID637680819 
Productthiolase 
Protein accessionYP_285558 
Protein GI71907971 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.422281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.512756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAATT CTCAAGATCC GGTCATCCTG GCCGCCGTCC GTACCCCGTT TGGCCGCCGC 
AATGGGGCCT TCCGTCAGAC CCGTCCCGAT GAACTGCTGG CCGGTATTGT CAGTGAAGCC
GTCAAGCGTT CCGGCGTTTC GGTGTCTGGT GTTGCCGACG TCATTGCCGG CTGTGTCAGT
CAGGCTGGCG AGCAGGGGGC CAACATTGCC CGCCAGGCCT TGCTGCTGGC CGGTCTGCCA
GCCGAGATTC CCGGTGTCAG CCTGAACCGG ATGTGCGGAT CGAGTCAGTA TGCGGTACAT
GCGGCTGCCC AGTCGATTCT GGCGGGCGAT GCGGAATTTT CGGTAGGCTG CGGCGTCGAG
AACATGAGCC GGGTGCCAAT GTTTCTCGAC CTGACCCTGG GCAAAGGCGA TTTCAAGGGC
TTCGACAACC TGCACCCGGG TATCACGGCC CGCTTCGCCA TCCCGCATCA GGTCGAGAGC
GCCGAACTGA TCGGCGACCA CTGGCAGATC AGCCGCGCCG AGTGCGACGA ATTTGCCCGC
GAAAGCCATC GTCGCGCCCA TGCCGCCCGG CTGGCCGGCG TGCATAAGGA AATCGTGGCG
ACGGCCGGCG TCGACAAGGA GGGCAACGCG ATTACCCTCG ATTACGACGA GGGCGTTCGC
CCGGTGATTG ATGTCGATAA GATGTCGGCC ATGCTGCCGG TGTTCCGCAC GCCTGAAACC
GGTGTCGTGA CGGCCGCCAA CGCCAGCCAG ATGTCCGACG GCGCGGCAGC GGTCGTGCTC
GGCAGTGCCG AATCTGCCGC TCGCCTGGGC CTGAAGCCGA AGGCTCGTTT CAAGGCGCGG
GTCGTGGTCG GCTCCGATCC GGTCATGCAG TTGACCGGCG TCATCCCGGC GACTCGTCTG
GCACTGAAGA AGGCCGGCCT GAGTATTGCT GATCTGGACT GGATCGAAGT CAATGAAGCC
TTTGCCACTG TGGCGATAGC CTGGGCGCGC GAGTTTTCTC CCAATATGGA TAAACTTAAC
CCGTGGGGCG GGGCGATTGC CCACGGGCAT CCGCTGGGTG GCACCGGCGC CGGCCTGATG
GCCAAGATGC TGTCCGGGCT GGAGTCGTGC AACGGCCGTT TCGGCCTGCA GGTCATGTGC
ATCGGCCACG GCATGGCGAC CGCCACCATC ATCGAACGTC TGGCCTGA
 
Protein sequence
MQNSQDPVIL AAVRTPFGRR NGAFRQTRPD ELLAGIVSEA VKRSGVSVSG VADVIAGCVS 
QAGEQGANIA RQALLLAGLP AEIPGVSLNR MCGSSQYAVH AAAQSILAGD AEFSVGCGVE
NMSRVPMFLD LTLGKGDFKG FDNLHPGITA RFAIPHQVES AELIGDHWQI SRAECDEFAR
ESHRRAHAAR LAGVHKEIVA TAGVDKEGNA ITLDYDEGVR PVIDVDKMSA MLPVFRTPET
GVVTAANASQ MSDGAAAVVL GSAESAARLG LKPKARFKAR VVVGSDPVMQ LTGVIPATRL
ALKKAGLSIA DLDWIEVNEA FATVAIAWAR EFSPNMDKLN PWGGAIAHGH PLGGTGAGLM
AKMLSGLESC NGRFGLQVMC IGHGMATATI IERLA