Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2080 |
Symbol | |
ID | 3971845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2276967 |
End bp | 2278355 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925188 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_531953 |
Protein GI | 90423583 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.273143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGC GGTGGACACT CGACAGTTGG CGCAGCAAGC CGGTGCAGCA AATGCCGGAT TATCCGGATG CCAAGGCGCT TGGCGAGGTC GAGGCGCAGC TGGCCACGTT TCCGCCGCTG GTTTTTGCAG GTGAGGCGCG CAATCTGAAG CGGGCACTGG CGCGGGTTTG CGCCGGCGAA GCCTTCCTGT TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAGCACGG CGCCAACAAT ATCCGGGATT TCTTCCGCGT GCTGCTGCAG ATGTCGGTGG TCTTGACCTA TGCCGGCGCG CTGCCGGTGG TGAAGGTCGG CCGCATCGCC GGGCAATTCG CCAAGCCGCG GTCCTCGCCG ATGGAAAAGC GCGGCGACGT CGAATTGCCG AGCTATCGCG GCGACATCGT CAACGACATC GGCTTCACTG CGGCGTCGCG GGTGCCCGAT CCGCAGCGCC AGCTGATGGC CTATCGGCAG TCGGCGGCGA CGCTGAACCT GCTGCGGGCG TTTGCCACTG GCGGCTTCGC CAATCTCGGC AGCGTGCACC AGTGGATGCT TGGTTTCCTG AAGGACTCCC ACCAGTCGCG GCGTTACAAA GAGTTGGCCG ACCGGATCTC CGACGCGCTG AACTTCATGC GCGCCTGCGG CCTCAACCTG GAAAGCCATC CGGAGTTGCG CGCCACCGAG ATCTACACCA GCCATGAGGC GCTGCTGCTC GGCTACGAGC AGGCCTTCAC CCGGGTGGAT TCCACCACCG GCGATTGGTA CGCCACCTCC GGCCACATGT TGTGGATCGG CGACCGCACC CGCCAGCTCG ATCACGCCCA TATCGAATAT TTCCGCGGCA TCAAGAATCC GATCGGGTTG AAGTGCGGCC CGTCGCTCAA GACCGATGAA TTGCTGAAGC TGATCGACGT GCTCAATCCG GACAACGAGC CGGGTCGCCT CACGCTGATC GGCCGGTTCG GCGCCGACAA GATCGGCGAC AGCCTGCCGG GGATGATCCG CGCCGTGCAG CGCGAGGGCC GCGCCGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACCTCG ACCTCGGGCT ACAAGACCCG GCCGTTCGAC CGCATCCTGT CGGAGGTGAA ATCGTTCTTC ACCATCCACG CCGCGGAAGG CACCCACGCC GGCGGCGTGC ACCTCGAGAT GACCGGCCAG GACGTCACCG AATGCATCGG CGGGGCGCGG GCGATCACCG ACGAGGACCT CAACAACCGC TATCACACCG CCTGCGATCC GCGGCTCAAT GCCGAGCAGT CGATCGACAT GGCGTTCCTG ATCGCGGAAC TGTTGAAGCA GGATCGGGTC GGCAAGGCCA GCCCGTTGCC GGTCGCCGCT GGACTGTGA
|
Protein sequence | MSERWTLDSW RSKPVQQMPD YPDAKALGEV EAQLATFPPL VFAGEARNLK RALARVCAGE AFLLQGGDCA ESFAEHGANN IRDFFRVLLQ MSVVLTYAGA LPVVKVGRIA GQFAKPRSSP MEKRGDVELP SYRGDIVNDI GFTAASRVPD PQRQLMAYRQ SAATLNLLRA FATGGFANLG SVHQWMLGFL KDSHQSRRYK ELADRISDAL NFMRACGLNL ESHPELRATE IYTSHEALLL GYEQAFTRVD STTGDWYATS GHMLWIGDRT RQLDHAHIEY FRGIKNPIGL KCGPSLKTDE LLKLIDVLNP DNEPGRLTLI GRFGADKIGD SLPGMIRAVQ REGRAVVWSC DPMHGNTITS TSGYKTRPFD RILSEVKSFF TIHAAEGTHA GGVHLEMTGQ DVTECIGGAR AITDEDLNNR YHTACDPRLN AEQSIDMAFL IAELLKQDRV GKASPLPVAA GL
|
| |