Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2197 |
Symbol | |
ID | 6409857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2379950 |
End bp | 2381338 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642712081 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001991193 |
Protein GI | 192290588 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAAC GCTGGACACC CGATAGCTGG CGCGCCAAGC CGGTGCAGCA GATGCCGCAA TATCCCGACG CGAAGGCGCT TGCGGATGTC GAGGCGCAGC TCGCGAGCTT TCCGCCGCTG GTGTTTGCGG GCGAGGCGCG CAACCTGAAG AAGGCGCTGG CCACGGTGGC GGCCGGCGAC GCTTTCCTGC TCCAGGGCGG CGATTGCGCC GAGAGCTTCG CCGAGCACGG CGCCAACAAC ATCCGCGACC TGTTCCGCGT CTTCCTGCAG ATGGCGATCG TCTTGACCTA CGCCGGCGCC TCGCCGGTGG TGAAGGTCGG CCGCATCGCC GGCCAGTTCG CCAAGCCGCG CTCCGCTCCG GTCGAGAAGC GCGACGGCGT CGAGCTGCCG AGCTATCGCG GCGACATCAT CAATGACGTC GCGTTCACCG AGGAAGCCCG CGTACCCGAT CCGCGCCGGC AGATCGAAGC GTATCGCCAG TCGGCCGCGA CGCTCAACCT GCTGCGCGCC TTCGCCAAGG GCGGCTACGC CAGCGTCGAG AACGTCCATA GCTGGATGCT GCAGTCGGTC AGCGACAGTC CGCAGTCGAA GGCCTATGCG GATCTCGCCG ATCGCGTTTC CGGCGCGCTG GATTTCATGC GCGCCTGCGG CCTGACCTTC GCCGTCGATT CCTCGCTCGG CACCACCGAT TTCTACACCA GCCACGAAGC GCTGCTGCTC GGCTACGAGC AGGCGATGAC CCGCGTCGAC TCGACCACGG GTGATTGGTA CGCGACCTCC GGCCACATGC TGTGGATCGG CGATCGTACC CGTCAGCTCG ACCACGCCCA TGTCGAGTAT TTCCGCGGTA TCAAGAATCC GATCGGGCTG AAGTGCGGTC CGTCGCTGAA GACCGACGAA CTGCTCAAGC TGATCGACAT TCTCAATCCC GACAACGAGC CGGGCCGGCT GACGCTGATC GGCCGTTTCG GCCATGAGAA GATCGGCGAG CACCTCCCGG CGATGGTTCG CGCCGTGAAG CGCGAGGGCC GGACCGTGGT GTGGTCGTGC GATCCGATGC ACGGCAACAC CATCACGTCG AACTCCGGCT ACAAGACCCG GCCGTTCGAC CGCATCCTGT CGGAAGTCCG TTCGTTCTTT GCGGTCCATG CCGCGGAGGG GACTCATGCC GGCGGCGTGC ATCTGGAGAT GACCGGCCAG AACGTCACCG AGTGTCTCGG CGGCGCCCGC GCCATCACCG ACGAAGACCT CAACAACCGC TATCACACCG CCTGCGATCC CCGGCTGAAC GCCGAGCAGT CGATCGACAT GGCGTTCCTG ATCGCGGACC TCCTGAAGCA GGGTCGGGCC GGCAAGGCCA GCCCGCTGCA GGCGGCGGCT GGCCTCTGA
|
Protein sequence | MSERWTPDSW RAKPVQQMPQ YPDAKALADV EAQLASFPPL VFAGEARNLK KALATVAAGD AFLLQGGDCA ESFAEHGANN IRDLFRVFLQ MAIVLTYAGA SPVVKVGRIA GQFAKPRSAP VEKRDGVELP SYRGDIINDV AFTEEARVPD PRRQIEAYRQ SAATLNLLRA FAKGGYASVE NVHSWMLQSV SDSPQSKAYA DLADRVSGAL DFMRACGLTF AVDSSLGTTD FYTSHEALLL GYEQAMTRVD STTGDWYATS GHMLWIGDRT RQLDHAHVEY FRGIKNPIGL KCGPSLKTDE LLKLIDILNP DNEPGRLTLI GRFGHEKIGE HLPAMVRAVK REGRTVVWSC DPMHGNTITS NSGYKTRPFD RILSEVRSFF AVHAAEGTHA GGVHLEMTGQ NVTECLGGAR AITDEDLNNR YHTACDPRLN AEQSIDMAFL IADLLKQGRA GKASPLQAAA GL
|
| |