Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1895 |
Symbol | |
ID | 3917116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2007263 |
End bp | 2009083 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444639 |
Product | phosphogluconate dehydratase |
Protein accession | YP_497169 |
Protein GI | 87199912 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.488647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATGA ACCCAACTGT CGAGCGCGTC ACCCAGCGCA TCATCGAGCG TTCGCGCAGC ACCCGCGCCG CATATCTCGA TCTTGTGGAA CGTTCGCGCG ACCAGGGGCT CAACCGGCCC AGGCTTTCGT GCGGCAACCT GGCCCACGGC TTTGCCGCAT CGGGCGAGGA CAAGCCGGCG ATCAAGTCGG GCAAGGCGAT GAACATCGGC ATCGTCACTG CCTACAACGA TATGCTTTCG GCGCATCAGC CGTATGGCCG CTATCCCGAG CAGATCAAGA TATTCGCCCG GGAAGTGGGC GCGACGGCGC AGGTTGCAGG CGGCGTTCCA GCGATGTGCG ATGGCGTCAC GCAGGGCCAG GATTCGATGG AGCTGTCGCT CTTCAGCCGC GACGTGATCG CAATGGCGAC GACAGTCGGG CTAAGCCACG CCATGTTCGA AGGCGCGCTC CTGCTCGGAA TCTGCGACAA GATCGTGCCC GGGCTGCTGA TCGGCAGCCT GCGCTTTGGC CACCTGCCGA CCATTCTCGT GCCGGCGGGC CCCATGCCGA CAGGCCTTCC CAACAAGGAG AAGGTCCGTA TCCGCCAGCT ATATGCAGAG GGCAAGGTCG GTCGCGACGA ACTGCTCGAA AGCGAGAGCG CCAGCTATCA CTCCGCCGGC ACGTGCACCT TCTATGGCAC GGCCAACTCC AACCAGATGA TGATGGAGAT GATGGGGCTG CACATGCCCG GCTCCAGCTT CGTCCTGCCC GGCACGAAGA TCCGTCAGGA ACTGACGCGG GCGGCGACGC ATCGCATCGC CCAGATTGGT TGGGATGGCG ACGATTATCG TCCTCTCGGC AGGTGCGTCG ACGAGAAGGC CATCGTCAAT GCGATCGTCG GCCTGCTGGC AACAGGTGGC TCGACCAACC ACGTGATCCA CCTGCCGGCC ATCGCGCGGG CCGCCGGCAT CCAGATAGAC TGGAACGACA TGGACGACCT GTCGCGCGTC GTCCCGCTTA TCGCCAGCGT CTATCCCAAT GGCGCGGGCG ACGTGAACTA CTTCGCAGCG GCGGGCGGCA TGCCCTATGT GATCCGCGAG CTGATCGGGT CCGGCCTTGC CCATCCGGAT ATCCTGACGG TCTACGGCCA GTCGCTGGAG GAAGGCGCCC AGCAGCCTGT CATGGAAGGC GACAACCTGC GCTGGGATCC GGCGCCCGAG GTTTCGGGAG ACGACAGCAT GCTGCGCCCT GTTTCGGCGC CGTTCCAGCC CGAAGGCGGC TTCAGGTTGC TGAAAGGCAA CCTCGGCCGG GGTACGATCA AGGTCAGCGC GGTCGATCCC TCACGCTGGA CGATCGAGGC GCCTTGCCGG GTGTTCGAGG ACCAAAATGC CGTGCTCGAC GCGTTCAAGG CCGGCGAACT GGAGCGTGAC GTCATCGTTG TCGTACGCTT CCAGGGGCCT GCCGCAAACG GCATGCCCGA ACTGCACAAG CTGACCCCGC CGCTTGGCGT CCTGCAGGAT CGCGGGTTCA AGGTCGCGCT CGTCACCGAT GGCCGTATGT CGGGCGCTTC GGGCAAGGTG CCTGCCGCAA TCCATGTCTC GCCCGAAGCC AAGCTCGGTG GCCCGCTGGC AAGGCTGCGC GACGGCGACG TGGTGCGGGT ATGCGCCAAC AGCGGCGAGC TTGTCGCGGT CGTGCCCGCC GAGGAGTGGA GCGCGCGCGA GGAAGCAGTT GCCCCGGCTA GTGCTCCCGG CGTAGGCCGC GAACTCTTCG CGCTCATGCG GCAGCATTCC GATCCCGCCG AGCGCGGCGG ATCGGCGATG CTCGCGGCGG CGGGGCTCTG A
|
Protein sequence | MAMNPTVERV TQRIIERSRS TRAAYLDLVE RSRDQGLNRP RLSCGNLAHG FAASGEDKPA IKSGKAMNIG IVTAYNDMLS AHQPYGRYPE QIKIFAREVG ATAQVAGGVP AMCDGVTQGQ DSMELSLFSR DVIAMATTVG LSHAMFEGAL LLGICDKIVP GLLIGSLRFG HLPTILVPAG PMPTGLPNKE KVRIRQLYAE GKVGRDELLE SESASYHSAG TCTFYGTANS NQMMMEMMGL HMPGSSFVLP GTKIRQELTR AATHRIAQIG WDGDDYRPLG RCVDEKAIVN AIVGLLATGG STNHVIHLPA IARAAGIQID WNDMDDLSRV VPLIASVYPN GAGDVNYFAA AGGMPYVIRE LIGSGLAHPD ILTVYGQSLE EGAQQPVMEG DNLRWDPAPE VSGDDSMLRP VSAPFQPEGG FRLLKGNLGR GTIKVSAVDP SRWTIEAPCR VFEDQNAVLD AFKAGELERD VIVVVRFQGP AANGMPELHK LTPPLGVLQD RGFKVALVTD GRMSGASGKV PAAIHVSPEA KLGGPLARLR DGDVVRVCAN SGELVAVVPA EEWSAREEAV APASAPGVGR ELFALMRQHS DPAERGGSAM LAAAGL
|
| |