Gene RPD_1752 details


Gene Information

Locus tag: RPD_1752
Symbol:
ID: 4022234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1967842
End bp: 1970052
Gene Length: 2211 bp
Protein Length: 736 aa
Translation table: 11
GC content: 67%
IMG OID: 637961946
Product: phosphoribosylformylglycinamidine synthase II
Protein accession: YP_568889
Protein GI: 91976230
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
TIGRFAM ID: [TIGR01736] phosphoribosylformylglycinamidine synthase II
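
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in exactly one stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1967842
END_BP = 1970052
GENE_LENGTH_BP = 2211
PROTEIN_LENGTH_AA = 736


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1970052 - 1967842 + 1 gives 2211 bp, and 2211 / 3 - 1 gives 736 aa, matching the listed gene and protein lengths.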


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCGCTC CCGAGCCGAA AATCACCCCC GAACTGGTCG CCAGCCATGG CCTCAAGCCG 
GACGAATATC AGCGCATTCT CGACCTGATC GGGCGCGAGC CGAGCTTCAC CGAACTCGGC
ATCTTCTCGG CGATGTGGAA CGAGCATTGC TCGTATAAAT CGTCGCGGAT CCATCTGCGC
GGGCTGCCGA CCAAGGCGCC GTGGGTGCTG CAGGGGCCGG GCGAGAACGC CGGCGTGATC
GACATCGGCG ACAATCAGGC GGTGGTCTTC AAGATGGAGA GCCACAACCA CCCGAGCTAT
ATCGAGCCCT ATCAGGGCGC GACCACCGGC GTCGGCGGCA TTCTGCGCGA CGTCTTCACC
ATGGGCGCGC GGCCGATCGC CTGCCTCAAT GCGTTGTCGT TCGGTGCGCC GGAGCACCCG
AAGACGCGGC ACCTGGTCTC GGGCGTCGTC GCCGGCGTCG GCGGCTATGG CAATTCCTTC
GGCGTGCCCA CCGTCGGCGG CCAGACCCGC TTCCACACCC GCTATGACGG CAACATCCTG
GTCAACGCGA TGGCGGTCGG CCTCGCTGAT GCCGACAAGA TTTTCCTCGC TGCGGCGTCG
GGCGTCGGGA TGCCGATCGT CTATCTCGGC TCCAAGACCG GCCGCGACGG CATGGGCGGC
GCCACCATGG CCTCGGCCGA GTTCGGCGAG GGCTCCGAAG AGAAGCGCCC GACCGTGCAG
GTCGGCGATC CGTTCGCCGA GAAGCTGCTG CTCGAAGCCT GTCTCGAGAT CATGGCCAAT
GATTGCGTGA TCGCGATTCA GGACATGGGC GCGGCAGGAT TGACCTGTTC GGCGGTGGAG
ATGGGCGCCA AGGGCGACCT CGGCGTCGAT CTCGATCTCG ACGCGGTGCC GACCCGCGAG
ACCGGTATGA CCGCCTACGA GATGATGCTC TCCGAAAGCC AGGAGCGGAT GCTCATGGTC
CTGAAGCCGG AGAAGGAAAA GGAAGCCGAG GCGATCTTCA GGAAGTGGGG CCTCGACTTC
GCCGTGGTCG GCTACACCAC GCCGACCAAG CGTTTCGTCG TCAAGCACGG CGGCAAGGCA
ATGGCCGATC TGCCGATCAA GGAACTCGGC GACGAAGCCC CGCTGTATGA TCGGCCCTGG
GTCGAGAGCC CCAAGCTGCC GGTGATCCAC GCCCGCGAGG TCAACGCGCC GAACAGCATC
GCCGACGCGC TGGAGAAGCT GCTGGCGACG CCGGACCTGT GCAGCAAGCG CTGGGTCTGG
GAGCAGTACG ACCACGTCAT CGGCGGCAAC ACGTTGCAGC GACCCGGTGG CGACGCCGCG
GTGGTGCGGG TGCAGGACGG CCCCAAGGGC CTTGCGCTGA CCGTCGACGT CACGCCGCGC
TATTGCGAGG CCGATCCGTT CGAGGGCGGC AAGCAGGCGG TCGCCGAAGC CTGGCGCAAC
ATCACCGCGG TCGGCGGCAA GCCGCTCGCG ATCACCGACA ACCTCAATTT CGGCAATCCG
GAGCGGCCCG AGATCATGGG CCAGTTCGTC GGCTGCCTGA AGGGCATCGC CGAAGCCTGC
ATCGCGTTCG ACTTCCCGGT CGTGTCCGGC AACGTCTCGC TTTACAACGA GACCTCGGGA
CGCGGCATCC TGCCGACCCC CTCGATCGGC GGCGTCGGCC TGCTCGACGA CTTCACCAAA
TCGGCGACGC TCGCCTTCAA GGCCGAGGGC GAGGCGATCC TGCTGATCGG CGAGACCCAC
GGCTGGCTCG GCCAGTCGGT GTATCTGCGC GACATCTGCG GCCGCGAAGA GGGCGCGCCG
CCGCCGGTCG ATCTCGCCTG CGAGAAGCGC CATGGCGACG TGGTGCGCGG CATGATCCAC
GCCGGCACCG CCACCGCGGT GCATGACCTG TCCGATGGCG GCCTGCTGGT CGCACTCGCC
GAAATGGCGC TCGCAGGCTC GATCGGCGCC TCCCTCGAGG CGCCGCCGGA CGGCATCGTG
CCGCATGCCT GGTGGTTCGG CGAGGATCAG GGCCGCTATC TCGTCACCGT CAAGGAAGAC
GATCTGCTCA CGGTGCTGTC GAAGATGAAA TCGGTCGGCG TGTCGTGCGA GCAGATCGGC
CGGACCGCCG GCCACACGCT GAAGATCGAA GGCGAGCGCG CGCTCGACCT CAAGGCGCTG
CGCCACGCCC ACGAGCACTG GCTGCCGGAC TACATGGGCG GGAAGAACTA G
 
Protein sequence
MSAPEPKITP ELVASHGLKP DEYQRILDLI GREPSFTELG IFSAMWNEHC SYKSSRIHLR 
GLPTKAPWVL QGPGENAGVI DIGDNQAVVF KMESHNHPSY IEPYQGATTG VGGILRDVFT
MGARPIACLN ALSFGAPEHP KTRHLVSGVV AGVGGYGNSF GVPTVGGQTR FHTRYDGNIL
VNAMAVGLAD ADKIFLAAAS GVGMPIVYLG SKTGRDGMGG ATMASAEFGE GSEEKRPTVQ
VGDPFAEKLL LEACLEIMAN DCVIAIQDMG AAGLTCSAVE MGAKGDLGVD LDLDAVPTRE
TGMTAYEMML SESQERMLMV LKPEKEKEAE AIFRKWGLDF AVVGYTTPTK RFVVKHGGKA
MADLPIKELG DEAPLYDRPW VESPKLPVIH AREVNAPNSI ADALEKLLAT PDLCSKRWVW
EQYDHVIGGN TLQRPGGDAA VVRVQDGPKG LALTVDVTPR YCEADPFEGG KQAVAEAWRN
ITAVGGKPLA ITDNLNFGNP ERPEIMGQFV GCLKGIAEAC IAFDFPVVSG NVSLYNETSG
RGILPTPSIG GVGLLDDFTK SATLAFKAEG EAILLIGETH GWLGQSVYLR DICGREEGAP
PPVDLACEKR HGDVVRGMIH AGTATAVHDL SDGGLLVALA EMALAGSIGA SLEAPPDGIV
PHAWWFGEDQ GRYLVTVKED DLLTVLSKMK SVGVSCEQIG RTAGHTLKIE GERALDLKAL
RHAHEHWLPD YMGGKN
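
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation and a rough CDS sanity check. The helper names (`clean_blocks`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the check accepts only ATG as a start codon, which is a simplification: translation table 11 also permits alternative start codons.

```python
def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Stop codons in translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, terminal stop codon."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in STOP_CODONS
    )


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTCCGCTC CCGAGCCGAA AATCACCCCC GAACTGGTCG CCAGCCATGG CCTCAAGCCG"
last_line = "CGCCACGCCC ACGAGCACTG GCTGCCGGAC TACATGGGCG GGAAGAACTA G"

print(len(clean_blocks(first_line)))        # bases on a full line
print(clean_blocks(last_line)[-3:])         # final codon of the gene
```

To apply this to the whole record, paste every line of the gene sequence into one string, pass it through `clean_blocks`, and then call `gc_fraction` and `looks_like_cds` on the result.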