Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1769 |
Symbol | edd |
ID | 5713336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1835700 |
End bp | 1837505 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267687 |
Product | phosphogluconate dehydratase |
Protein accession | YP_001533112 |
Protein GI | 159044318 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.618197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTCA ATCGCGTCAT TTCCGACGTC ACGGCCAGAA TCGAAGCCCG CAGCGCGGAA GCGCGCAGCA CCTATCTCGA CCGGATGCGG CGCGCTGCCG AGGATGGGCC ACGGCGTGCG CACCTGTCCT GCGGCAACCA GGCCCATGCC TATGCGGCGA TGGGGGGCGA CAAGGAGGCC CTCGTGGCCG AGCGGAGCGC CAATATCGGT ATCGTCACCG CCTATAACGA CATGTTGAGC GCGCATCAGC CGTTCAAGGA TTACCCCGAC AAGATCAAGG AGGCCGCCCG GCGCGCAGGC GCCACGGCCC AGGTCGCGGG CGGGGTTCCG GCGATGTGCG ACGGGGTCAC GCAAGGTCAG GTCGGCATGG AGCTGAGCCT GTTCTCGCGG GACGTGATCG CCCTGGCCAC TGGCGTGGCG CTGAGCCACA ACACGTTCGA TGCCGCGGCC TATCTGGGCG TGTGCGACAA GATCGTGCCG GGGCTGGTGA TTGCCGCCGC CACCTTCGGC TACCTGCCCG GCGTGTTCGT GCCTGCCGGG CCCATGGTCA GCGGCTTGCC CAACGACCAG AAGGCCAAGG TGCGCCAGCA GTTCGCCGCC GGGGAGATCG GGCGTGACAA GCTGATGGAG GCGGAAATGG CCTCCTATCA CGGGCCGGGA ACCTGCACCT TTTACGGGAC GGCGAATTCC AACCAGATGC TGATGGAATT CATGGGGCTG CACCTGCCAG GTGCGTCCTT CGTCAACCCG GGCACGCCCC TGCGCGAGGC GCTGACCGCG GCGGCGGCGG AACGCCTGGC GGCGATCACG CAGCTCGGCA ACGAGTATCG CCCGGTCTGC GATATCCTGG ATGCCAAGGC TTTCGTGAAC GGGATCGTCG GGCTGATGGC CACGGGCGGG TCCACGAACC TGGTGATTCA CCTGCCCGCC ATGGCGCGGG CGGCGGGTGT GATCCTGGAC CTGCAGGATT TCGCGGATAT TTCGGAGGCG ACGCCGTTGA TGGCGCGGGT CTATCCCAAC GGGCTGGCGG ATGTGAACCA TTTCCATGCG GCGGGGGGTC TGGCCTACAT GATCGGGGAG CTCCTGTCCG AGGGGCTGCT GCATCCCGAC ACCAAGACGA TCGCGGGCGA CGGTCTGGCC GATTATGCCC GCGAGCCCAA GCTGATCGAC GGTGTGCTGC GCTGGGAAGA CGGGCCGCGC CGGAGCCTGA ATGCCAAGAT CCTGCGCCCG GCATCGGACG GCTTTGCGCC GTCTGGTGGG CTGAAAGAGC TCAAGGGCAA TCTCGGGCGC GGCGTGATGA AGGTGTCCGC CGTGGCACCG GAGCGCCATG TGATCGAGGC CAGGGCGCGG GTCTTCGAGG ATCAGGGTGC CGTGAAGGAC GCCTTCAAGG CGGGCGAGTT CACCGAGGAC ACGGTGGTCA TCGTGCGGTT CCAGGGGCCC AAAGCGAACG GGATGCCGGA ATTGCATGCG CTGACGCCGG TTCTGGCGGT GCTGCAAGAT CGCGGACTGA AGGTGGCGCT GGTGACGGAT GGGCGCATGT CGGGCGCGTC GGGCAAGGTG CCCGCGGCGA TCCACGTGGC GCCCGAGGCG CTGGATGGCG GGCTGATGGC CAAGGTGCGG ACCGGCGATC TGGTACGCGT GGATGCGGTG GCGGGCGTTC TGGAGGTGCT GGAGCCGGGG GTCGAGGACC GTGCCCCGGC CATGCCGGAT CTGAGCGGCA ACAGTCACGG CATCGGGCGC GAGTTGTTCG ACGTGTTCCG CACGACTGTC GGCCCGGCCA GCACCGGCGC TGGCGTGGTG GTATAA
|
Protein sequence | MALNRVISDV TARIEARSAE ARSTYLDRMR RAAEDGPRRA HLSCGNQAHA YAAMGGDKEA LVAERSANIG IVTAYNDMLS AHQPFKDYPD KIKEAARRAG ATAQVAGGVP AMCDGVTQGQ VGMELSLFSR DVIALATGVA LSHNTFDAAA YLGVCDKIVP GLVIAAATFG YLPGVFVPAG PMVSGLPNDQ KAKVRQQFAA GEIGRDKLME AEMASYHGPG TCTFYGTANS NQMLMEFMGL HLPGASFVNP GTPLREALTA AAAERLAAIT QLGNEYRPVC DILDAKAFVN GIVGLMATGG STNLVIHLPA MARAAGVILD LQDFADISEA TPLMARVYPN GLADVNHFHA AGGLAYMIGE LLSEGLLHPD TKTIAGDGLA DYAREPKLID GVLRWEDGPR RSLNAKILRP ASDGFAPSGG LKELKGNLGR GVMKVSAVAP ERHVIEARAR VFEDQGAVKD AFKAGEFTED TVVIVRFQGP KANGMPELHA LTPVLAVLQD RGLKVALVTD GRMSGASGKV PAAIHVAPEA LDGGLMAKVR TGDLVRVDAV AGVLEVLEPG VEDRAPAMPD LSGNSHGIGR ELFDVFRTTV GPASTGAGVV V
|
| |