Gene RPB_2489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2489 
Symbol 
ID3910278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2847061 
End bp2848134 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content67% 
IMG OID637884388 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_486105 
Protein GI86749609 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.240095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC GGAAGCACGG CCTCACCTAT GCCGATTCCG GCGTCGATAT CGACGCCGGC 
AATCGCCTCG TCGACCTGAT CAAGCCGATG GTCCGGGCCA CCGCCCGGGC CGGCGCGGAC
TCCGAAATCG GCGGCTTCGG CGGACTGTTC GACTTGAAGG CCGCGGGCTT CAAGGACCCG
GTGCTGGTGG CGGCGACCGA CGGCGTCGGA ACCAAGATCA AGGTCGCGAT CGAGGCCGGG
GTGCACGCCG GGATCGGTAT CGACCTGGTC GCGATGTCGG TCAATGATCT GGTCGTCCAG
GGCGCCGAGC CGCTGTTCTT TCTCGACTAT TTCGCGTGCG GCAAACTCGA TCCCGAGGCC
GTGGCCGAAA TCGTCGCAGG CGTCGCCGAA GGCTGCCGCG AGTCTGGCTG TGCGCTGATC
GGCGGCGAGA CCGCGGAAAT GCCGGGCCTC TACAAGGACG GCGACTATGA TCTCGCCGGC
TTCGCGGTCG GCGCGGCGGA ACGCGGCACG CTGCTGCCCT CCCCCGACAT CACCGCCGGC
GACGCGGTGA TCGGGCTGGC CTCATCCGGG GTGCATTCGA ACGGGTTTTC GCTGGTCCGC
AAGATCGTCG AAAAATCCGG CCTGCCCTAC GACGCCAAGG CGCCGTTCTC GCCGGTGATG
ACGCTCGGCG GAGCGCTGCT GACGCCGACC CGGCTCTACG TGAAATCCTG TCTGCAGGCG
ATCCGCACCA CCGGCGCGAT CAAGGGGCTG GCGCATATCA CCGGTGGCGG TTTCACCGAC
AACATCCCGC GGGTGCTGCC GAAGCATCTC GGCGTCGGCA TCGACCTGCC GCGGCTGCCG
GTGCTGCCGG TGTTCAAATG GCTGGCCGAA CAAGGCGACA TCGCCGAACT CGAACTGCTG
CGCACCTTCA ATTGCGGCAT CGGCATGATC GCGATCGTCA AGGCGGACGC CGTCGACGCC
GTCACCGAGG CGCTGACCGC GGGCGGCGAG AGCGTGCATC TGCTCGGCGA AGTGATCGCG
GCGAAGGGCG AACATCGCGT CGTCTACGAC GGTCACCTCG ACCTGTCCTG GTGA
 
Protein sequence
MTERKHGLTY ADSGVDIDAG NRLVDLIKPM VRATARAGAD SEIGGFGGLF DLKAAGFKDP 
VLVAATDGVG TKIKVAIEAG VHAGIGIDLV AMSVNDLVVQ GAEPLFFLDY FACGKLDPEA
VAEIVAGVAE GCRESGCALI GGETAEMPGL YKDGDYDLAG FAVGAAERGT LLPSPDITAG
DAVIGLASSG VHSNGFSLVR KIVEKSGLPY DAKAPFSPVM TLGGALLTPT RLYVKSCLQA
IRTTGAIKGL AHITGGGFTD NIPRVLPKHL GVGIDLPRLP VLPVFKWLAE QGDIAELELL
RTFNCGIGMI AIVKADAVDA VTEALTAGGE SVHLLGEVIA AKGEHRVVYD GHLDLSW