Gene Rpal_3461 details


Gene Information

Locus tag: Rpal_3461
Symbol:
ID: 6411135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3706320
End bp: 3707393
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 67%
IMG OID: 642713340
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_001992437
Protein GI: 192291832
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGC GGAAGCACGG CCTCACCTAC GCCGACTCCG GCGTCGACAT CGATGCGGGC 
AACCGGCTCG TCGATCTGAT CAAGCCGATG GTGCGCGCCA CTGCGCGGGC CGGCGCGGAT
TCCGAGATCG GCGGCTTCGG CGGCCTGTTC GATCTGAAGG CGGCGGGCTT CAAGGATCCG
GTGCTGGTCG CCGCCACCGA CGGCGTCGGC ACCAAGATCA AGGTGGCGAT CGACGCCGGG
CTGCACACCG GCATCGGCAT CGATTTGGTG GCGATGTCGG TCAACGACCT CGTGGTGCAA
GGCGCCGAGC CGCTGTTCTT TCTCGATTAC TTCGCCTGCG GCAAGCTCGA TCCGGAGGCC
GCGGCCGAGA TCGTCGCCGG CGTGGCCGAA GCCTGCCGCG AGTCCGGCTG CGCACTGATC
GGCGGCGAGA CCGCCGAAAT GCCGGGCCTG TATAAGGACG GCGACTACGA CCTGGCCGGT
TTCTCGGTCG GCGCCGCCGA GCGCGGCACG CTGCTGCCGT CGAAGGGCAT CGCCGAAGGT
GATGCGGTGA TCGGACTGGC GTCCTCCGGC GTGCACTCCA ACGGCTTCTC GCTGGTCCGC
AAGATCGTCG AGAAATCCGG CCTGCCCTAT GACGCGCCGG CGCCGTTTTC GCCGGTGATG
ACGCTCGGCG GTGCGCTGCT CGCGCCGACC AAGCTCTATG TGAAGTCCTG CTTGCAGGCG
ATCCGCGACA CTGGCGCCGT CAAAGGCCTC GCCCACATCA CCGGCGGCGG CTTCACCGAG
AACATTCCGC GCGTGCTGCC GAAGCACCTC GGCGTCGGCA TCGACCTGCC GCGGATCCCG
GTGCTGCCGG TGTTCAAATG GCTCGCCGAG CAAGGCGAGA TCGCCGAACT CGAATTGCTG
CGCACCTTCA ACTGCGGCAT CGGCATGGTC ATCATCGTCA AGGCCGAGGC CGTCGATCAG
GTCACCGAGA GCCTCACCGC CAGCGGCGAG AGCGTGCACC TGCTCGGTCA GGTCATTGCC
GCCAAGGGCG AGCAGCGCGT GGTCTATGAT GGCCACCTCG ACCTCGCCTG GTGA
 
Protein sequence
MTERKHGLTY ADSGVDIDAG NRLVDLIKPM VRATARAGAD SEIGGFGGLF DLKAAGFKDP 
VLVAATDGVG TKIKVAIDAG LHTGIGIDLV AMSVNDLVVQ GAEPLFFLDY FACGKLDPEA
AAEIVAGVAE ACRESGCALI GGETAEMPGL YKDGDYDLAG FSVGAAERGT LLPSKGIAEG
DAVIGLASSG VHSNGFSLVR KIVEKSGLPY DAPAPFSPVM TLGGALLAPT KLYVKSCLQA
IRDTGAVKGL AHITGGGFTE NIPRVLPKHL GVGIDLPRIP VLPVFKWLAE QGEIAELELL
RTFNCGIGMV IIVKAEAVDQ VTESLTASGE SVHLLGQVIA AKGEQRVVYD GHLDLAW
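
Consistency check (illustrative sketch)

The length, GC content, and translation fields in this record can be cross-checked against the sequences above. The following is a minimal Python sketch, not part of the original record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The amino acid assignments are those of NCBI translation table 11, which are the same as in the standard code; the table's alternative start codons are not modeled here.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_row = "ATGACCGAGC GGAAGCACGG CCTCACCTAC"
print(translate(first_row))  # MTERKHGLTY

# 1074 bp is 358 codons; the final codon (TGA) is a stop, leaving 357 residues.
print(1074 // 3 - 1)  # 357
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the Protein sequence, Protein Length, and GC content fields of this record.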