Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2058 |
Symbol | |
ID | 4022540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2306602 |
End bp | 2307990 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637962251 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_569194 |
Protein GI | 91976535 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.210314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAC GTTGGACACC CGATAGCTGG CGCAGCAAGC CGGTTCAGCA GATGCCGGAT TACCCGGACG CGAAGGCGTT GGGCGATGTC GAGGCGCAGC TGTCGACCTT TCCGCCGCTG GTTTTTGCAG GTGAGGCGCG CAACCTGAAG AAGGCGCTGG CGAGCGTGGC GGCTGGCGAA TCTTTCCTGC TTCAGGGCGG CGATTGCGCC GAGAGCTTTG CCGAGCACGG CGCCAACAAC ATCCGCGACC TGTTCCGCGT CTTCTTGCAG ATGGCGATCG TGCTGACCTA TGCGGGCGCC TCGCCCGTGG TGAAGGTCGG CCGCATCGCC GGTCAGTTCG CCAAGCCGCG CTCGGCGCCA GTCGAGAAGC GCGACGGCGT CGAACTGCCG AGCTACCGCG GCGACATCGT CAACGACATC GCCTTCACCG AGGACGCGCG CCGGCCTGAT CCGCGCCGCC AGCTCGAGGC TTATCGCCAG TCCGCCGCGA CGCTCAACCT GCTGCGCGCC TTCGCCAAGG GCGGCTACGC CAGCGTGGAG AACGTTCACC GCTGGATGCT GCAGTCGGTC AGCGACAGTC CGCAGTCGAA GGCCTATGCG GATCTTGCCG ACCGGGTCTC CGGCGCGCTC GATTTCATGC GCGCCTGCGG CCTGACCTTC GCGGTCGACA GCGCGCTCGG CACCACCGAT TTCTACACCA GCCACGAAGC GCTGCTGCTC GGCTACGAGC AGGCGATGAC CCGGATCGAT TCGACAACCG GCGACTGGTA CGCGACCTCC GGCCACATGA TCTGGATCGG CGATCGCACC CGTCAGCTCG ATCACGCCCA TATCGAGTAT TTCCGCGGCA TCAAGAATCC GATCGGCCTG AAATGCGGTC CGTCGCTGAA GACCGATGAG CTGCTGAAGC TGATCGACGT GCTCAATCCC GAGAACGAGG CGGGCCGGCT GACGCTGATC GGCCGGTTCG GCGCCGACAA GATTGGCGAC AGCCTTCCGG CGATGATCCG CACCGTACAG CGCGAGGGCC GCAAGGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACCTCG ACCTCGGGCT ACAAGACCCG GCCGTTCGAC CGCATTTTGT CGGAAGTTCG CTCGTTCTTC ACGATCCACG CCGCAGAGGG CACCCATGCC GGCGGCGTGC ATCTGGAGAT GACCGGGCAG AACGTCACCG AATGCATCGG CGGCGCGCGC GCGATCACCG ACGAGGACCT CAACAACCGC TATCACACCG CCTGCGATCC GCGGCTCAAT GCCGAGCAGT CGATCGACAT GGCGTTCCTG ATTGCGGATC TGCTGAAGCA GGGCCGCGAC GGCAAGGTGA GCCCGCTGCC GGTCGCCGCG GGACTGTGA
|
Protein sequence | MSERWTPDSW RSKPVQQMPD YPDAKALGDV EAQLSTFPPL VFAGEARNLK KALASVAAGE SFLLQGGDCA ESFAEHGANN IRDLFRVFLQ MAIVLTYAGA SPVVKVGRIA GQFAKPRSAP VEKRDGVELP SYRGDIVNDI AFTEDARRPD PRRQLEAYRQ SAATLNLLRA FAKGGYASVE NVHRWMLQSV SDSPQSKAYA DLADRVSGAL DFMRACGLTF AVDSALGTTD FYTSHEALLL GYEQAMTRID STTGDWYATS GHMIWIGDRT RQLDHAHIEY FRGIKNPIGL KCGPSLKTDE LLKLIDVLNP ENEAGRLTLI GRFGADKIGD SLPAMIRTVQ REGRKVVWSC DPMHGNTITS TSGYKTRPFD RILSEVRSFF TIHAAEGTHA GGVHLEMTGQ NVTECIGGAR AITDEDLNNR YHTACDPRLN AEQSIDMAFL IADLLKQGRD GKVSPLPVAA GL
|
| |