Gene RPC_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2303 
Symbol 
ID3973744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2502835 
End bp2504103 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content67% 
IMG OID637925411 
Productaminodeoxychorismate lyase 
Protein accessionYP_532176 
Protein GI90423806 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0207028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA GGCCGCCGAT TTCGCCAAAG AGCCCGCGTG CCGCGCTGGA GCCTGAGCAG 
GTGCCGCCGC CGCCGAAGCG GTCCGACCGC GCCCGCAATC CGCTGGTGGT GATCGGCAAT
GCGATCATCA CCGTGGTGCT GGTGGTGATG ATCGGCACCG GCGGCATCTA CGTCTACGGC
AAGCAGAAGA TCGAAGCTCC CGGTCCCTTG CAGGACGACA AGATCGTCAA CATTCCGCAG
CGCGCCGGGA TGGGCGACAT CGGCGACATT CTGCAGCGCG AAGGCGTGAT CGATAATAAT
CGTTGGGCCT TCATCGGCAG CGTGTTCGCG CTGAAGGCGC GCGCCGATCT GAAGCCCGGC
GAATACTCGT TTCAGAAGAA CGCCAGCCTG CGCGACGTGA TCGCCACCAT CGTCGAAGGC
AAGGTGGTGC AGCACGCGGT GACGATCCCC GAAGGTTTGA CCTCGGAACA GATCCTGGCG
CGGCTGACGG AGAACGACAT CTTCTCCGGC AATGTGCGGG AGATGCCGCG CGAGGGCACC
TTGCTGCCGG AGACCTACAA ATTCCCGCGC GGCACCACCC GCGAATCGGT GATCGTGCGG
ATGCAGCAGG CGCAGAAGCG GGTGCTCGCC GAGATCTGGG AGCGCCGCAA TCCCGACGTG
CCGGTGAAGA CCCCGGAGCA ATTGGTGACG CTGGCCTCCA TCGTCGAGAA GGAGACCGGC
AAGGCCGACG AGCGCAGCCG GGTCGCCGCG GTCTACGTCA ATCGGCTGCG CCAGAAGATG
AAGCTGCAGT CCGATCCGAC CATCATCTAC GGCCTGGTCG GCGGCAAGGG CACGCTGGGC
CGCCCGATCA AGCGCAGCGA GATCATCCAG CCGTCGCCCT ACAACACCTA TGTGGTCGAG
GGCCTGCCGC CGGGGCCGAT CGCCAATCCG GGCCGCGCCT CGCTGGAAGC CGCCGCCAAT
CCGGCGCGCA CCCGCGATCT GTTCTTCGTC GCCGACGGCA GCGGCGGACA CAGCTTCACC
GAGACCTACG ATCAGCACCA GAAGAACGTC GCCAGGCTGC GCACCCTGGA ACGCCAGATC
CAGAACGACA CCGTCGAGCC GCCGGACGAC GCCGCGCCGG CGGCGTCGCC GGCCGCGCCG
GATGCCAACG CCGCGGTGCC GGCCACCGCT GCGCCGTCGA AATCGCAGAA ACGGACGCGT
GCCGCCGCGC CGGCGCGGCA AGGTGCTGCG CAGCCGGCGG CGCCGACCGC GCCGGCCAGC
CAAGAATAG
 
Protein sequence
MSERPPISPK SPRAALEPEQ VPPPPKRSDR ARNPLVVIGN AIITVVLVVM IGTGGIYVYG 
KQKIEAPGPL QDDKIVNIPQ RAGMGDIGDI LQREGVIDNN RWAFIGSVFA LKARADLKPG
EYSFQKNASL RDVIATIVEG KVVQHAVTIP EGLTSEQILA RLTENDIFSG NVREMPREGT
LLPETYKFPR GTTRESVIVR MQQAQKRVLA EIWERRNPDV PVKTPEQLVT LASIVEKETG
KADERSRVAA VYVNRLRQKM KLQSDPTIIY GLVGGKGTLG RPIKRSEIIQ PSPYNTYVVE
GLPPGPIANP GRASLEAAAN PARTRDLFFV ADGSGGHSFT ETYDQHQKNV ARLRTLERQI
QNDTVEPPDD AAPAASPAAP DANAAVPATA APSKSQKRTR AAAPARQGAA QPAAPTAPAS
QE