Gene RPD_2955 details


Gene Information

Locus tag: RPD_2955
Symbol:
ID: 4023458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3293822
End bp: 3294895
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 65%
IMG OID: 637963155
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_570083
Protein GI: 91977424
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
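
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 3293822
end_bp = 3294895
gene_length_bp = 1074
protein_length_aa = 357

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so a CDS of N codons
# encodes N - 1 amino acids.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, the 1074 bp span corresponds to 358 codons, that is, 357 residues plus one stop codon, which agrees with the listed protein length.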


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGC GGAAGCACGG CCTCACCTAT GCTGATTCCG GCGTCGACAT CGACGCGGGC 
AATCGTCTTG TCGACCTGAT CAAGCCGATG GTGCGGGCCA CCGCTCGCCC CGGCGCCGAT
TCCGAAATCG GCGGCTTCGG CGGGCTGTTC GATCTGAAAG CCGCAGGCTT CAAGGACCCG
GTTCTGGTGG CGGCCACCGA CGGCGTCGGC ACCAAGATCA AGGTTGCGAT CGAGGCCGGA
TTGCACGCCG GCATCGGGAT CGATCTGGTC GCAATGTCGG TGAACGACCT CGTGGTGCAG
GGCGCCGAGC CGCTGTTCTT TCTCGACTAC TTCGCCTGCG GAAAACTCGA TCCGGAAGCC
ACGGCCGAAA TCGTCGCCGG GGTGGCCGAA GGCTGCCGCG AGTCCGGCTG CGCGCTGATC
GGCGGCGAGA CCGCCGAGAT GCCGGGGCTC TACAAGGACG GCGACTACGA TCTCGCCGGC
TTCGCGGTGG GCGCGGCCGA GCGTGGAACC CTGTTGCCCT CCCCGGACAT TGCCAAAGGC
GATGCAGTGA TCGGGCTCGC CTCTTCGGGC GTGCATTCGA ACGGCTTTTC GCTGGTGCGC
AAGATCGTCG AGAAATCCGG CCTGCCCTAT GACGCGCAGG CGCCGTTCTC GCCGGTGATG
ACGCTCGGTG GCGCATTGCT GACGCCGACC AAACTTTACG TGAAATCGTG CCTGAACGCG
ATCCGCACGA CCGGCGCGAT CAAAGGACTG GCGCATATCA CGGGCGGCGG ATTCACCGAC
AACATCCCGC GCGTACTCCC GAAACATCTC GGCGTCGGAA TCGATCTGCC GCGACTTCCA
GTGTTGCCGG TGTTCAAATG GCTCGCCGAG CAAGGCGGCA TCGCCGAACT CGAATTGCTG
CGCACCTTCA ACTGCGGCAT CGGAATGATC GCGATCGTCA GAGCCGACGC CGTCGACGCC
GTCACCGAGG CGCTCACCAG CAGCGGGGAA AGCGTCCATC TGCTCGGTGA AGTGATCGAG
GCCACGGGCG AGCATCGCGT CGTTTACGAC GGTCACCTCG ATCTCGGTCG GTGA
 
Protein sequence
MTERKHGLTY ADSGVDIDAG NRLVDLIKPM VRATARPGAD SEIGGFGGLF DLKAAGFKDP 
VLVAATDGVG TKIKVAIEAG LHAGIGIDLV AMSVNDLVVQ GAEPLFFLDY FACGKLDPEA
TAEIVAGVAE GCRESGCALI GGETAEMPGL YKDGDYDLAG FAVGAAERGT LLPSPDIAKG
DAVIGLASSG VHSNGFSLVR KIVEKSGLPY DAQAPFSPVM TLGGALLTPT KLYVKSCLNA
IRTTGAIKGL AHITGGGFTD NIPRVLPKHL GVGIDLPRLP VLPVFKWLAE QGGIAELELL
RTFNCGIGMI AIVRADAVDA VTEALTSSGE SVHLLGEVIE ATGEHRVVYD GHLDLGR
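
The relationship between the gene sequence, the GC content, and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the method the database used. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes an unambiguous A/C/G/T sequence (any other character raises a `KeyError` in `translate`) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table, differing only in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
first_block = "ATGACCGAGC GGAAGCACGG CCTCACCTAT"
print(translate(first_block))  # MTERKHGLTY
print(round(gc_content(first_block)))
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison against the protein sequence and GC content listed in the record.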