Gene RPC_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0023 
SymbolpurH 
ID3971448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp27674 
End bp29266 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content67% 
IMG OID637923137 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_529921 
Protein GI90421551 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ACCCGCGCCG CGTGACCCGC GCTCTTCTGT CCGTTTCCGA TAAAGCGGGC 
CTGATCGACT TCGCCCGCGC GCTGGTCGAC CACGGCGTCG AACTGGTCTC CACCGGCGGC
ACCGCCAAAG CGATCGCCGC TGCCGGGCTC GCGGTCAAGG ATGTCTCCGA GCTCACCGGC
TTTCCGGAAA TGATGGACGG CCGGGTCAAG ACCCTGCATC CGAAGGTGCA CGGCGGCCTG
TTGGCGGTCC GCGGCAATGC CGAGCACGTC AAGGCGATGG CCGACCACGA CATCGCGCCG
ATCGACCTGT TGGTGGTCAA CCTCTATCCG TTCGAGGCCA CCGTCGACAA AGGCGCCGGT
TACGAAGACT GCATCGAGAA CATCGACATC GGCGGGCCGG CGATGATCCG CGCCGCTGCG
AAAAACCACG ACGACGTCGC GGTGGTGGTG GAAGCCGCCG ATTATCAGGC GGTGCTCGAC
GAACTCGCGG CCAACAAGGG CGCAACGACA CTGACCTTGC GCAAGAAGCT CGCCGCCAAG
GCCTATGCGC GCACCGCGGC TTACGACGCG GCGATCTCCA ACTGGTTCGC CGATCAGCTG
AAGACCGCGG CGCCGGATTT CCGCGCCATC GGTGGCCGGC TGATCCAGAG CCTGCGCTAC
GGCGAAAACC CGCACCAGAG TGCTGCGTTC TACCGCACCC CGGATCACTG CCCGGGCGTC
GCCACCGCGC GGCAGATCCA GGGCAAGGAA CTATCCTACA ACAACATCAA CGATACCGAC
GCCGCCTATG AGTGCATCGG CGAGTTCGAC GCCACGCGCA CTGCGGCCTG CGTCATCGTC
AAGCACGCCA ACCCCTGTGG TGTGGCGGAG GGCTCGAGCC TGCTGGCCGC CTATCGGTCG
GCGCTGGCCT GCGATTCGAC CTCCGCGTTC GGCGGCATCG TGGCGCTGAA CCGCACCCTG
GATGCCGAGG CCGCGCGCGC CATCACCGAG ATCTTCACCG AAGTGATCAT CGCGCCCGAC
GCCACGGACG AGGCGATCGC GATCGTTGCC GCGAAGAAGA ATCTGCGGCT GCTGCTGGCC
GGCCAATTGC CGGATCCGCG CGCGCCTGGG CTCACCTACA AGACGGTGGC CGGCGGTCTG
TTGGTGCAGT CGCGCGATAA CGCCGTGGTC GAGGATATGG CGCTGAAGGC GGTTACCAAG
CGGCAGCCGA CCGAGGCCGA GCTGCGCGAT CTGAAATTCG CCTTCCGGGT CGCCAAGCAC
GTGAAGTCCA ACACGATTGT GTATGCGAAA GACCTCGCCA CCGTCGGCAT CGGCGCCGGC
CAGATGAGCC GCGTCGACTC CGCGCGGATC GCCGCGCGCA AGGCCGAGGA TGCGGCGGCC
GAGCTGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA TGCGTTCTTC
CCGTTCGCCG ACGGCATGCT GGCCTGCATC GAGGCCGGCG CCACCGCGGT GATCCAGCCC
GGCGGCTCGG TGCGCGACGA CGAGGTGATC AAGGCCGCCG ACGACGCCGG CATCGCCATG
GTGTTCACCG GCGTGCGGCA TTTTAGGCAT TGA
 
Protein sequence
MTDHPRRVTR ALLSVSDKAG LIDFARALVD HGVELVSTGG TAKAIAAAGL AVKDVSELTG 
FPEMMDGRVK TLHPKVHGGL LAVRGNAEHV KAMADHDIAP IDLLVVNLYP FEATVDKGAG
YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EAADYQAVLD ELAANKGATT LTLRKKLAAK
AYARTAAYDA AISNWFADQL KTAAPDFRAI GGRLIQSLRY GENPHQSAAF YRTPDHCPGV
ATARQIQGKE LSYNNINDTD AAYECIGEFD ATRTAACVIV KHANPCGVAE GSSLLAAYRS
ALACDSTSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ATDEAIAIVA AKKNLRLLLA
GQLPDPRAPG LTYKTVAGGL LVQSRDNAVV EDMALKAVTK RQPTEAELRD LKFAFRVAKH
VKSNTIVYAK DLATVGIGAG QMSRVDSARI AARKAEDAAA ELKLAAPMTK GSVVASDAFF
PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGVRHFRH