Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0643 |
Symbol | |
ID | 3908336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 727983 |
End bp | 729320 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637882532 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_484265 |
Protein GI | 86747769 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.128459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCCATT CCGACCATAC CGCGCCGCTC GAAGCGCGCC TCAGCGCGGC GCTGAGCGGC ACCGCCCGCG TCCCCGGCGA CAAGTCGATC TCCCACCGGG CGCTGATTCT CGGCGCGCTC GCGGTCGGCG AGACCCGGAT TTCCGGCCTG CTCGAGGGCG AGGACGTGCT CAACACCGCC CGGGCGATGC GGGCGCTGGG GGCGCAGGTC GAGCGCACCG GCGATTGCGC CTGGAGCGTC CACGGCGTCG GCGTCGCGGG GTTCGCGCCG CCCGCGGCGC CGCTCGATTT CGGCAATTCC GGCACCGGCT GCCGGCTGGC GATGGGCGCG GTGGCGGGCT CGCCGATCAT CGCCACCTTC GACGGCGACG CCTCGCTGCG CTCGCGGCCG ATGCGCCGGA TCGTCGATCC GCTGGAGCAG ATGGGCGCCC GGGTGACGCA GAGCGCCGAC GGCGGCCGGC TGCCGCTGAC GCTGCAGGGC GCGCGCGACC CGCTGCCGAT CACCTACCGC ACCCCCGTAC CTTCGGCGCA GATCAAATCC GCGGTGCTGC TCGCCGGCCT GTCGGCGCCG GGCGTCACCA CCGTGATCGA GGCCGAGGCC AGCCGCGACC ATACCGAGCT GATGCTGCAG CATTTCGGCG CCACGGTCGT GACCGAGCCG GAAGGCCCCC ATGGCCGGAA GATTTCGCTG ACCGGGCAGC CCGAGCTGCG CGGCGCGCCG GTGGTGGTGC CGGCGGACCC GTCCTCGGCG GCGTTCCCGA TGGTCGCGGC GCTGATCGTG CCGGGCTCCG ACGTGGTGCT GACCGAGGTG ATGACCAACC CGCTGCGCAC CGGCCTGATC ACCACGCTGC GCGAGATGGG CGGCCTGATC GAGGAGAGCG AAACCCGCGG CGACGCCGGC GAGCCGATGG CGCGCTTCCG CATCCGCGGC TCGCAATTGC GCGGCGTCGA AGTGCCGCCG GAGCGCGCTC CGTCGATGAT CGACGAATAT CTGGTGCTGG CGGTCGCGGC CGCCTTCGCC GAGGGCACCA CGATCATGCG CGGCCTGCAC GAGCTGCGGG TCAAGGAAAG CGACCGGCTG GAAGCGACCG CGGCGATGCT GCGGGTCAAT GGCGTGACGG TCGAGATCTC GGGCGACGAT CTGATCGTCG AGGGCAAAGG CCACGTCCCG GGCGGCGGGC TGGTCGCCAC CCACATGGAT CACCGCATCG CGATGTCGGC GCTGGTGATG GGGCTGGCCG CCGACAAGCC GGTCAGGGTC GACGACACCG CCTTCATCGC CACCAGCTTC CCGGATTTCG TCCCGATGAT GCAAAGGCTC GGCGCCGAAT TCGGCTGA
|
Protein sequence | MSHSDHTAPL EARLSAALSG TARVPGDKSI SHRALILGAL AVGETRISGL LEGEDVLNTA RAMRALGAQV ERTGDCAWSV HGVGVAGFAP PAAPLDFGNS GTGCRLAMGA VAGSPIIATF DGDASLRSRP MRRIVDPLEQ MGARVTQSAD GGRLPLTLQG ARDPLPITYR TPVPSAQIKS AVLLAGLSAP GVTTVIEAEA SRDHTELMLQ HFGATVVTEP EGPHGRKISL TGQPELRGAP VVVPADPSSA AFPMVAALIV PGSDVVLTEV MTNPLRTGLI TTLREMGGLI EESETRGDAG EPMARFRIRG SQLRGVEVPP ERAPSMIDEY LVLAVAAAFA EGTTIMRGLH ELRVKESDRL EATAAMLRVN GVTVEISGDD LIVEGKGHVP GGGLVATHMD HRIAMSALVM GLAADKPVRV DDTAFIATSF PDFVPMMQRL GAEFG
|
| |