Gene Daro_1964 details

Gene Information

Locus tag: Daro_1964
Symbol:
ID: 3567410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 2110218
End bp: 2111291
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 61%
IMG OID: 637680435
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_285180
Protein GI: 161377474
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
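
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2110218, 2111291)
print(length)                     # 1074
print(protein_length_aa(length))  # 357
```

Both results agree with the Gene Length and Protein Length fields of the record.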


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value: 0.341858
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.50784
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACAC AGAACATCGA CAACGTAAAC GTAACCGCCT TCGACGCCAT GCCGACGCCG 
GAAGAAATCA ACGCCCGCCA GCCGCTCAGC GCCAAGGCCG CCAAAACGGT TACCCACGGC
CGCAACCTGC TGCGCAAGAT TCTCGACCGC AAGGATCACC GCCTGTTCGT CGTTGTCGGC
CCCTGCTCCA TCCACGACCC GGTCGCCGGC CTGGATTACG CCCGCCGCCT GAAGAAACTG
GCCGATGAAG TCGGTGACGT GCTGCAGATC ATCATGCGTG TCTATTTCGA AAAACCGCGC
ACCACAGTGG GCTGGAAAGG TTATATCAAC GACCCGTTCA TGGACGACTC CTTCCAGGTC
AATGTCGGCA TGGAAAAGGC GCGTGAATTC CTGCTCCAGG TCAATGAGCT CGGCCTGCCC
GCCGGTACCG AAGCCCTCGA CCCCTACGGC CCGCAATACT ACGGCGACCT GATCACCTGG
ACCGCCATCG GCGCCCGCAC CACCGAATCG CAGACCCACC GCGAAATGTC TTCCGGCCTG
TCGACCCCGG TCGGCTTCAA GAACGCCACC AATGGCGACC TGTCAGTGGC CGTCAATGCC
ATCCTCTCCG CCTCGCGTCC GCACTCCTTC CTTGGTCTGA ACAGCGAAGG CCGTGTCGCC
ATCGTCCGCA CCAAGGGCAA CGGCTACGGC CACGTCGTGC TGCGTGGTGG TGACGGTCGT
CCGAATTACG ACACGGTCTC CGTCTCCATC GCCGAACAGG CCATGGTCAA GGCCAAGCTG
CCGGCCAACA TCGTCGTCGA TTGCTCACAC GCCAACAGCT CCAAGAAACC TGAATTGCAG
CCGCTGGTCA TGGCCGATGT CGTCAACCAG ATCCGCCTCG GCAACAAATC ACTGCTTGGC
GTGATGATCG AGTCGAACAT CGAAGCCGGC AACCAGTCCA TCCCGGCTGA CCTCAGCCAG
CTCAAGTACG GCTGCTCGGT CACCGATGGC TGTGTCGGTT GGGACACCAC CGAAAAGATG
ATCCGCGACG CCGCCGTGTT GTTGCGCGAC GTGCTGCCGG AACGTCTGTC CTGA
 
Protein sequence
MTTQNIDNVN VTAFDAMPTP EEINARQPLS AKAAKTVTHG RNLLRKILDR KDHRLFVVVG 
PCSIHDPVAG LDYARRLKKL ADEVGDVLQI IMRVYFEKPR TTVGWKGYIN DPFMDDSFQV
NVGMEKAREF LLQVNELGLP AGTEALDPYG PQYYGDLITW TAIGARTTES QTHREMSSGL
STPVGFKNAT NGDLSVAVNA ILSASRPHSF LGLNSEGRVA IVRTKGNGYG HVVLRGGDGR
PNYDTVSVSI AEQAMVKAKL PANIVVDCSH ANSSKKPELQ PLVMADVVNQ IRLGNKSLLG
VMIESNIEAG NQSIPADLSQ LKYGCSVTDG CVGWDTTEKM IRDAAVLLRD VLPERLS
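
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below is one way to do that in plain Python, and it also shows a GC fraction calculation and a simple translation. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the permitted start codons). All function names are hypothetical helpers, not a library API.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACGACAC AGAACATCGA CAACGTAAAC GTAACCGCCT TCGACGCCAT GCCGACGCCG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MTTQNIDNVNVTAFDAMPTP
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through the same functions would give the complete translation and the overall GC fraction.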