Gene RPD_1423 details


Gene Information

Locus tag: RPD_1423
Symbol:
ID: 4021900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1587385
End bp: 1588479
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 67%
IMG OID: 637961615
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_568561
Protein GI: 91975902
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
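
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the 1095 bp CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1587385
END_BP = 1588479


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1095
print(expected_protein_length(length_bp))  # 364
```

Both values agree with the Gene Length (1095 bp) and Protein Length (364 aa) fields above.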


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGCTC CCGTTCAGTT GGTGCTCAAA CCCGGCGATA CGATCGGCAT TCTCGGCGGC 
GGCCAGCTCG GCCGGATGCT GGCGATGGCC GCGGCGCGGC TCGGCCTGCG CTGCCATGTG
TTCTCGCCGG ACCCCGATTC GCCGGCGTTC GACGTGGTGC AGAACGCGAC CTGCGCCGAA
TATGCCGATG TCGAGGCGCT GGAAATGTTC GCCTCCGACG TCGACGTCAT CACCTACGAA
TTCGAGAACG TGCCGGCCTC GGCCGCTTTG GTGCTGGCGG CGCGCAAACC CGTGCTGCCC
GACTACAGGA TCCTGGAGAC CACGCAGGAC CGGCTCGGCG AGAAGGACTT CGTGACCAAG
CTCGGCATCG GCACAGCCGC TTATGCCGAC GTGACGTCGC CGCAGATGCT GCGTGCCGCG
ATCGCCAGGC TCGGTCTGCC CGCGGTGCTG AAAACGCGCC GGTTCGGCTA TGACGGCAAG
GGCCAGATCA TCCTGCGCGA GGGCGACGAC CCCGACGCGG CCTGGGCCAA GCTGGAAACC
CGCGCAGCGA TTCTCGAGGC GTTCGTGCCC TTCGAGCGCG AGGTATCGGT GATTGCCGCG
CGTGGCAGCG ACGGCCAGGT GGTGTGTTAC GACGTCACCG AAAACGAGCA CCGCGATCAC
ATTCTGAAAG TGTCGCGGGT GCCGGCGCCG GTGACCGATG CGGTCGCGGA CGAGGCGCGG
CGGATCGCCA AAACGATCGC CGACGCGCTG AATTACGTCG GCGTGCTCGG CGTCGAGATG
TTTGTGGTGC CGGGCGACGG CGGCGCCAGG GTGCTGGTCA ATGAAATCGC CCCGCGCGTG
CACAATTCCG GTCACTGGAC GCTCGACGGC GCCTCGGTGT CTCAATTCGA ACAGCACATC
CGGGCGATCG CCGGCTGGCC GCTGGCGGAA CCGCTGCGCC ACGGCCGCGT CACCATGACC
AATCTGATCG GCCACGACGT CGACGATTAT GCGCGCTGGC TGACGGTCCC GGGCGCCACG
GTGCATCTTT ACGGCAAGCG GACCGCCCTG CCCGGCCGCA AGATGGGCCA TGTGACGGTG
ATCGAGCCCC AGTGA
 
Protein sequence
MTAPVQLVLK PGDTIGILGG GQLGRMLAMA AARLGLRCHV FSPDPDSPAF DVVQNATCAE 
YADVEALEMF ASDVDVITYE FENVPASAAL VLAARKPVLP DYRILETTQD RLGEKDFVTK
LGIGTAAYAD VTSPQMLRAA IARLGLPAVL KTRRFGYDGK GQIILREGDD PDAAWAKLET
RAAILEAFVP FEREVSVIAA RGSDGQVVCY DVTENEHRDH ILKVSRVPAP VTDAVADEAR
RIAKTIADAL NYVGVLGVEM FVVPGDGGAR VLVNEIAPRV HNSGHWTLDG ASVSQFEQHI
RAIAGWPLAE PLRHGRVTMT NLIGHDVDDY ARWLTVPGAT VHLYGKRTAL PGRKMGHVTV
IEPQ
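
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This matches how an alternative bacterial start codon is read as methionine at the initiation position. The minimal sketch below translates the first 30 bases and the last 15 bases of the gene sequence. The codon table is the standard amino acid assignment, which translation table 11 shares; the `START_CODONS` set is an assumed subset of the start codons that table allows, and `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of common bacterial start codons, each read as Met
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGACGGCTC CCGTTCAGTT GGTGCTCAAA"))  # MTAPVQLVLK
# Last 15 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("ATCGAGCCCC AGTGA"))                  # IEPQ
```

The two outputs correspond to the first ten and the last four residues of the protein sequence above.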