Gene Daro_3324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3324 
Symbol 
ID3566308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3574262 
End bp3575341 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID637681796 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_286523 
Protein GI71908936 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000917904 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGATGCCGC AAAGTAAACC GAATACCGAC GACCTTCGCA TCAAGGAAAT CAAGGAGCTG 
GTGCCGCCAG CCCACGTTTT CCGCGAGTAT CCGGTGTCCA CTCGGGCGGC GCAAACGACC
TATGTCGCCC GCCAGGCAAT TCACCGCGTG CTGCATGGCG CCGACGACCG CCTGCTGGTC
GTCATCGGCC CCTGCTCGAT CCATGACTAC GAACTGGCCA TGGATTACGC CAAGAAACTG
GCCAAGGAAG CCGAGAAATA CGCCGAGGAC CTGATCGTCG TCATGCGCGT CTATTTTGAA
AAGCCGCGGA CCACAGTTGG CTGGAAAGGC CTGATCAACG ATCCGCGCAT GGACAACACC
TTCCGCATCA ACGAAGGCCT GCGTCTGGCC CGCCGCATCC TGCTTGAGGT CAATGAGCTG
GACCTGCCTT GCGCCACCGA GTTCCTCGAC ACCATCACGC CGCAATACAC CGCCGACCTG
ATCGCCTGGG GCGCCATCGG TGCGCGCACC ACCGAGTCGC AGGTGCACCG CGAGCTGGCT
TCCGGCCTTT CCTGCCCGGT CGGTTTCAAG AACGGCACCG ACGGCAACAT GCGCATTGCC
GTGGATGCGA TCCGCTCGGC CAACTCGCCA CACCATTTCC TGTCGGTGAC CAAGTCCGGC
CACACCGCCA TCGTGTCGAC GATGGGCAAC GAGGACTGCC ACGTCATCCT GCGCGGCGGC
AAGGAACCGA ACTACGACGC GGCCAGCGTC GATGCCGCAT GCACCGAAAT CGCTAAATCC
GGCCTCGCCG CCCGGCTGAT GGTCGATTTC TCGCACGGCA ACAGCCGCAA GCAATACAAG
CTGCAAATGG AAGTCTGCGA CAGCGTGGCC GAGCAGATCG CCGGTGGCGA AGACCGCATT
GTTGGCGTCA TGGTCGAATC GCACCTCGTC GAAGGCCGCC AGGACATCTC GCCGGACAAG
CCGCTGACCT ACGGCCAGAG CGTGACTGAT GCCTGTATCA ACTGGGATGA CAGCCTGAAA
GTGCTCGAGA AACTGGCAGC TGCCGTCAGG GCAAGACGAG TCGCCGAAGC GTCAGAGTAA
 
Protein sequence
MMPQSKPNTD DLRIKEIKEL VPPAHVFREY PVSTRAAQTT YVARQAIHRV LHGADDRLLV 
VIGPCSIHDY ELAMDYAKKL AKEAEKYAED LIVVMRVYFE KPRTTVGWKG LINDPRMDNT
FRINEGLRLA RRILLEVNEL DLPCATEFLD TITPQYTADL IAWGAIGART TESQVHRELA
SGLSCPVGFK NGTDGNMRIA VDAIRSANSP HHFLSVTKSG HTAIVSTMGN EDCHVILRGG
KEPNYDAASV DAACTEIAKS GLAARLMVDF SHGNSRKQYK LQMEVCDSVA EQIAGGEDRI
VGVMVESHLV EGRQDISPDK PLTYGQSVTD ACINWDDSLK VLEKLAAAVR ARRVAEASE