Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1101 |
Symbol | |
ID | 5084704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1126046 |
End bp | 1127416 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640482659 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001167307 |
Protein GI | 146277148 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0457175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGGG GCTGGACCAA GACCGACTGG CGCGCCAAGC CGCGCATCCA GATGCCCGAC TATCCGGAGG CCGCCGCCGT CGAGGCGGTC GAGGCGCAGC TTGCGAAATA TCCGCCCCTC GTCTTCGCGG GCGAGGCCCG CAAGCTGAAG GCGGCGCTGG CCGAGGCGGC CGAGGGCCGC GCCTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTTG CGGAATTCTC GGCCGACAAC ATCCGCGACA CGTTCCGCGT GCTCCTGCAG ATGGCGGTCG TGCTCACCTA CGGGGCCAAG GTGCCGGTCG TGAAGATCGG CCGCATGGCG GGGCAGTTCG CCAAGCCGCG CTCGGCCCCG ACCGAGGTCA TCAACGGGAT GGAGCTGCCG TCCTACCGGG GCGACATCAT CAACGGCTTC GACCCGAGCC CCGAGTCGCG CATCCCCGAT CCCCGGCGGA TGCTGCAGGC CTACACACAG GCCGCGGCCT CGCTCAATCT GCTGCGCGCC TTCTCGACGG GCGGCTTCGC CGACATCCAC CGCGTCCATT CCTGGACGCT GGGCTTCTGC GAGCAGGACA AGGCCGAGCG GTATCGCGAC ATCTCGAACC GGATCTCGGA CGCGCTCGAC TTCATGTCGG CCGCGGGCGT GAACGGTTCG ACCTCGCACG ATCTGGCGAC GGTGGACTTC TACACCTCGC ACGAGGCGCT GCTGCTGGAA TATGAAGAGG CGCTCTGCCG GATCGATTCG ATCACCGGCC AGCCGATCGC GGGCTCGGGC CACATGATCT GGATCGGCGA CCGCACGCGC CAGATCGATG GCGCGCATGT CGAATTCTGC CGCGGCGTGC TGAACCCGAT CGGGCTGAAA TGCGGCCCCT CGACCACGGT CGAGGATCTC AAGGTGCTGA TGGCCAAGCT CAACCCGCAG AACGAGGCGG GGCGGCTCAC GCTGATCGCG CGCTTCGGCG CGGGCAAGGT GGGCGAGCAT CTGCCGCGGC TGATCAAGGC CGTGCGCGAG GAGGGCGCCA AGGTTACCTG GTGCTGCGAT CCGATGCACG GCAACACGAT CAAGGCGGCC TCGGGCTACA AGACCCGCCC GTTCGACTCG GTGCTGCGCG AGGTGCGCGA GTTCTTCGCG ATCCACAAGG CCGAGGGCAC GATCCCCGGC GGCGTGCATT TCGAGATGAC CGGGCAGGAC GTGACCGAAT GCACCGGCGG CCTGCGTGCG GTGACGGACG AGGATCTCTC GAACCGCTAC CACACGGCCT GCGATCCCCG CCTCAACGCC TCGCAGTCGC TGGAGCTGGC CTTCCTCGTG GCCGAGGAAC TGACCACGAT GCGCGAAGCG GGCCGGCGCG TGGCGCTGTA G
|
Protein sequence | MSRGWTKTDW RAKPRIQMPD YPEAAAVEAV EAQLAKYPPL VFAGEARKLK AALAEAAEGR AFLLQGGDCA ESFAEFSADN IRDTFRVLLQ MAVVLTYGAK VPVVKIGRMA GQFAKPRSAP TEVINGMELP SYRGDIINGF DPSPESRIPD PRRMLQAYTQ AAASLNLLRA FSTGGFADIH RVHSWTLGFC EQDKAERYRD ISNRISDALD FMSAAGVNGS TSHDLATVDF YTSHEALLLE YEEALCRIDS ITGQPIAGSG HMIWIGDRTR QIDGAHVEFC RGVLNPIGLK CGPSTTVEDL KVLMAKLNPQ NEAGRLTLIA RFGAGKVGEH LPRLIKAVRE EGAKVTWCCD PMHGNTIKAA SGYKTRPFDS VLREVREFFA IHKAEGTIPG GVHFEMTGQD VTECTGGLRA VTDEDLSNRY HTACDPRLNA SQSLELAFLV AEELTTMREA GRRVAL
|
| |