Gene RPB_0643 details


Gene Information

Locus tag: RPB_0643
Symbol:
ID: 3908336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 727983
End bp: 729320
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 72%
IMG OID: 637882532
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_484265
Protein GI: 86747769
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
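
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(727983, 729320)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1338 445
```

Both values agree with the Gene Length and Protein Length fields of this record.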


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.128459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCCATT CCGACCATAC CGCGCCGCTC GAAGCGCGCC TCAGCGCGGC GCTGAGCGGC 
ACCGCCCGCG TCCCCGGCGA CAAGTCGATC TCCCACCGGG CGCTGATTCT CGGCGCGCTC
GCGGTCGGCG AGACCCGGAT TTCCGGCCTG CTCGAGGGCG AGGACGTGCT CAACACCGCC
CGGGCGATGC GGGCGCTGGG GGCGCAGGTC GAGCGCACCG GCGATTGCGC CTGGAGCGTC
CACGGCGTCG GCGTCGCGGG GTTCGCGCCG CCCGCGGCGC CGCTCGATTT CGGCAATTCC
GGCACCGGCT GCCGGCTGGC GATGGGCGCG GTGGCGGGCT CGCCGATCAT CGCCACCTTC
GACGGCGACG CCTCGCTGCG CTCGCGGCCG ATGCGCCGGA TCGTCGATCC GCTGGAGCAG
ATGGGCGCCC GGGTGACGCA GAGCGCCGAC GGCGGCCGGC TGCCGCTGAC GCTGCAGGGC
GCGCGCGACC CGCTGCCGAT CACCTACCGC ACCCCCGTAC CTTCGGCGCA GATCAAATCC
GCGGTGCTGC TCGCCGGCCT GTCGGCGCCG GGCGTCACCA CCGTGATCGA GGCCGAGGCC
AGCCGCGACC ATACCGAGCT GATGCTGCAG CATTTCGGCG CCACGGTCGT GACCGAGCCG
GAAGGCCCCC ATGGCCGGAA GATTTCGCTG ACCGGGCAGC CCGAGCTGCG CGGCGCGCCG
GTGGTGGTGC CGGCGGACCC GTCCTCGGCG GCGTTCCCGA TGGTCGCGGC GCTGATCGTG
CCGGGCTCCG ACGTGGTGCT GACCGAGGTG ATGACCAACC CGCTGCGCAC CGGCCTGATC
ACCACGCTGC GCGAGATGGG CGGCCTGATC GAGGAGAGCG AAACCCGCGG CGACGCCGGC
GAGCCGATGG CGCGCTTCCG CATCCGCGGC TCGCAATTGC GCGGCGTCGA AGTGCCGCCG
GAGCGCGCTC CGTCGATGAT CGACGAATAT CTGGTGCTGG CGGTCGCGGC CGCCTTCGCC
GAGGGCACCA CGATCATGCG CGGCCTGCAC GAGCTGCGGG TCAAGGAAAG CGACCGGCTG
GAAGCGACCG CGGCGATGCT GCGGGTCAAT GGCGTGACGG TCGAGATCTC GGGCGACGAT
CTGATCGTCG AGGGCAAAGG CCACGTCCCG GGCGGCGGGC TGGTCGCCAC CCACATGGAT
CACCGCATCG CGATGTCGGC GCTGGTGATG GGGCTGGCCG CCGACAAGCC GGTCAGGGTC
GACGACACCG CCTTCATCGC CACCAGCTTC CCGGATTTCG TCCCGATGAT GCAAAGGCTC
GGCGCCGAAT TCGGCTGA
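
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out like the one above. This is a minimal sketch that assumes the sequence is given as plain text in blocks of bases separated by spaces and line breaks; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three 10-base blocks of the gene sequence, used only as a short demo.
print(round(gc_content("TTGAGCCATT CCGACCATAC CGCGCCGCTC"), 1))
```

Passing the full gene sequence gives a value that can be compared against the listed GC content.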
 
Protein sequence
MSHSDHTAPL EARLSAALSG TARVPGDKSI SHRALILGAL AVGETRISGL LEGEDVLNTA 
RAMRALGAQV ERTGDCAWSV HGVGVAGFAP PAAPLDFGNS GTGCRLAMGA VAGSPIIATF
DGDASLRSRP MRRIVDPLEQ MGARVTQSAD GGRLPLTLQG ARDPLPITYR TPVPSAQIKS
AVLLAGLSAP GVTTVIEAEA SRDHTELMLQ HFGATVVTEP EGPHGRKISL TGQPELRGAP
VVVPADPSSA AFPMVAALIV PGSDVVLTEV MTNPLRTGLI TTLREMGGLI EESETRGDAG
EPMARFRIRG SQLRGVEVPP ERAPSMIDEY LVLAVAAAFA EGTTIMRGLH ELRVKESDRL
EATAAMLRVN GVTVEISGDD LIVEGKGHVP GGGLVATHMD HRIAMSALVM GLAADKPVRV
DDTAFIATSF PDFVPMMQRL GAEFG
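
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted initiation codons. The sketch below illustrates the relationship between the two sequences. The codon table is built from the standard compact amino-acid string, and translating the first codon as M whenever it is a table 11 start codon is a simplifying assumption that treats the input as a complete CDS. `translate` is an illustrative name, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at its initiation codon, so a first
    codon found in START_CODONS is translated as M.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("TTGAGCCATT CCGACCATAC C"))  # MSHSDHT
```

The output matches the first seven residues of the protein sequence, and the last six codons of the gene (GGC GCC GAA TTC GGC TGA) translate to GAEFG followed by a stop, matching its end.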