Gene Information |
Locus tag | Pden_4791 |
Symbol | |
ID | 4583353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008688 |
Strand | + |
Start bp | 285774 |
End bp | 286877 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639772095 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_918548 |
Protein GI | 119387514 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
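
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 285774
END_BP = 286877


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1104 367
```

Both results agree with the Gene Length (1104 bp) and Protein Length (367 aa) fields.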
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.22928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.130512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCC GGTCCGGCCC TTTGCAGAAA GAAAGAGCCA TGCCCATCCA GACCGAGAAC CTGCATATCG CCGCATTGCG CCCCCTGCCC GCCCCGGCCG CCCTGGCCGC ATCCCTGCCG CGGGACGAAG CGGTCTCCCG CACCGTCGCC GACAGCCGCG CCGCCATCCG CGCCATCCTG GCCGGGCGCG ACGACCGGCT GCTGGTCGTC GCCGGCCCCT GCTCGGTCCA TGATCCGGCT GCGGCGCTGG ATTACGCCGC GCGCCTGGCC GAGATGCGCC ACGCGCTGTC CGACCGGCTG GAGATCGTCA TGCGGGTCTA TTTCGAAAAG CCGCGCACGA CCGTCGGCTG GAAGGGGCTG ATCAACGATC CGCATCTGGA CGGCTCGGAC CGGATCGAGG ACGGGCTGCC CCTGGCCCGC CGCCTGCTGC TGGAGATCAA CCGCATGGGC CTGCCGGCGG CGACCGAGTT CCTGGACCCG ATTCTGCCGC AATACTTCGC CGACCTGATC GCCTGGGGCG CAATCGGCGC GCGCACCACG GAAAGCCAGA TCCATCGCCA GCTGGCCTCG GGCCTGTCCT GCCCGGTGGG GTTCAAGAAC GGCACCGACG GCGGGGTGCA GGTGGCGCTG GACGCGATCC GCTCGGCTTC GCGGCCGCAC AGCTTTCCCG CGATCACCGC CGAAGGGCGC GCGGCCATCG CCACGACCAC CGGCAACGAT GCCTGCCACG TCGTGCTGCG CGGCGGCCAT GGCGGGCCGA ATTACGGCGC CGACCATGTC GCGGCAGTGG CGGCGGCTGC GGCCAAGGCG GGGATCGAGC CCGGTATCGT CATCGACGCC AGCCACGCCA ACAGCGACAA GGATCCCGCC CGACAGCCGG AGGTGATCGC CGATGTCGCG GCTCGGATCC GCACGGGCGA CAGCCGCATT CGCGGGGTCA TGCTGGAAAG CCATCTGGTG GCGGGACGGC AGGATCTGCG GGACGGCCAG GTGCCGGTCT ATGGCCAGAG CATCACCGAC GGCTGCCTGG GCTGGGAGGA CAGCCGCGCG CTGCTCCTGG ACCTTGCCGG GGCCGCGGCG ACGCGGCTGC GCTGCGCCGC CTGA
|
Protein sequence | MRFRSGPLQK ERAMPIQTEN LHIAALRPLP APAALAASLP RDEAVSRTVA DSRAAIRAIL AGRDDRLLVV AGPCSVHDPA AALDYAARLA EMRHALSDRL EIVMRVYFEK PRTTVGWKGL INDPHLDGSD RIEDGLPLAR RLLLEINRMG LPAATEFLDP ILPQYFADLI AWGAIGARTT ESQIHRQLAS GLSCPVGFKN GTDGGVQVAL DAIRSASRPH SFPAITAEGR AAIATTTGND ACHVVLRGGH GGPNYGADHV AAVAAAAAKA GIEPGIVIDA SHANSDKDPA RQPEVIADVA ARIRTGDSRI RGVMLESHLV AGRQDLRDGQ VPVYGQSITD GCLGWEDSRA LLLDLAGAAA TRLRCAA
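
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the blocks can be joined and the gene sequence checked against the GC content and protein sequence fields. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for internal codons; table 11 differs only in the start codons it allows, and this gene starts with ATG, so no special handling of the first codon is included.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGCTTCC GGTCCGGCCC TTTGCAGAAA"
print(translate(first_blocks))  # MRFRSGPLQK
```

The output matches the first block of the protein sequence. Passing the full Gene sequence value to `gc_fraction` and `translate` allows the same comparison against the GC content and Protein sequence fields.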