Gene Information |
Locus tag | Daro_3804 |
Symbol | |
ID | 3567960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | + |
Start bp | 4086420 |
End bp | 4087220 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637682278 |
Product | HpcH/HpaI aldolase |
Protein accession | YP_287002 |
Protein GI | 71909415 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | [TIGR02311] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
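The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Daro_3804.
length_bp = gene_length(4086420, 4087220)
print(length_bp)                  # 801
print(protein_length(length_bp))  # 266
```

Under these assumptions the reported Gene Length (801 bp) and Protein Length (266 aa) agree with the Start bp and End bp fields.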
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00568502 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAATAC CCAACAACCA GTTCAAGAGC GCTCTGCGCG CCGGCCGCGA CCAGATCGGG CTGTGGCTCG GCCTGGGCGA GACTTTCAGC GCCGAAATCT GCGCCGGGGC AGGTTTTGAC TGGTTGCTGA TCGACGCCGA ACACGGCCCC AACGACCTGC GCAGCATCCT GTCGCAATTG CAGGCGCTGG CGCCCTACCC GACCCAGCCG GTGGTCCGCC CGCCACAGGG CGATCATGTA CTGATCAAGC AACTGCTGGA AACCGGCGTC CAGACCCTGC TCATTCCGAT GGTCGAATCG GCCGAACAGG CCCGCGGACT GGTCGAGGCC ATGCGCTATC CGCCGGCCGG CATTCGCGGC GTCGGCAGCG CGCTGGCCCG AGCATCGCGC TGGGGCCGCA TCGACAACTA TGCCCATCTG GCCAATGACC AGATGTGCCT GCTGGTGCAG GTCGAGACGC GGGCCGGCTA CGAACAACTC GACAGCATCC TGGCGGTCGA CGGGGTGGAT GGGGTGTTTT TCGGCTCGGC CGACCTCGCC GCTTCCTACG GCTATCTCGG CCAGTCGACG CATCCGGAAA TCGTCGCCGC CATCGAGGAT GGCCTGCACC GCACACGAAC AGCCGGCAAG GCCGGCGGCG TGCTGTGCAG CGACCGGGAG CTCAACACCC GCTACATGGG TGCCGGTGCC AACTTTGTCG CGGTCGGCAT CGATGCGCTG CTGCTCACCG CAGCGACCAC GGCGCTATGC CGGAGCTACA AGCCGGAGGC ATTTCCAGCC GGCATCCCCG GTTCGTATTA G
|
Protein sequence | MQIPNNQFKS ALRAGRDQIG LWLGLGETFS AEICAGAGFD WLLIDAEHGP NDLRSILSQL QALAPYPTQP VVRPPQGDHV LIKQLLETGV QTLLIPMVES AEQARGLVEA MRYPPAGIRG VGSALARASR WGRIDNYAHL ANDQMCLLVQ VETRAGYEQL DSILAVDGVD GVFFGSADLA ASYGYLGQST HPEIVAAIED GLHRTRTAGK AGGVLCSDRE LNTRYMGAGA NFVAVGIDAL LLTAATTALC RSYKPEAFPA GIPGSY
|
| |
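As a rough way to relate the Gene sequence and Protein sequence rows, the sketch below translates a nucleotide string codon by codon. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation; it does not model table 11's alternative start codons. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced for illustration, and only the first 30 bases of the gene are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGCAAATAC CCAACAACCA GTTCAAGAGC"
print(translate(prefix))  # MQIPNNQFKS
```

The same functions can be applied to the full gene sequence to compare against the listed protein sequence and the reported GC content.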