Gene Rpal_0029 details

Gene Information

Locus tag: Rpal_0029
Symbol: purH
ID: 6407670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 31529
End bp: 33121
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 66%
IMG OID: 642709936
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001989067
Protein GI: 192288462
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
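
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below is illustrative only: the helper names `span_length` and `coding_codons` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon.

```python
# Values copied from the Gene Information listing above.
start_bp = 31529
end_bp = 33121
gene_length_bp = 1593
protein_length_aa = 530


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon is included."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1593
print(coding_codons(gene_length_bp))   # 530
```

Under these assumptions, both results agree with the listed gene length (1593 bp) and protein length (530 aa).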


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAGA ATCCCCGTCG CGTCACCCGT GCCCTGTTGT CCGTGTCCGA TAAGACCGGT 
CTGATCGATT TCGCCCGTGC GCTCGCCGGC CATGGCGTCG AACTGGTTTC GACCGGTGGC
ACCGCCAAGG CGATCGCTGC GGCCGGCCTT CCGGTCAAGG ACGTCTCCGA GCTGACCGGT
TTCCCTGAGA TGATGGATGG TCGGGTCAAG ACGCTGCATC CGAAGGTGCA TGGCGGCCTG
CTGGCGATTC GCGACAACGA CGAACACACC CAGGCGATGG CCGCGCACGG CATCCCGCAG
ATCGACCTCC TGGTGGTGAA CCTCTATCCG TTCGAAGCTA CCGTCGACAA AGGCGCCTCT
TACGAGGACT GCATCGAGAA CATCGACATC GGCGGCCCGG CGATGATCCG CGCCGCCGCC
AAGAACCACG ACGACGTCGC GGTTGTGGTC GAGGCCAGCG ATTATCAGGC GGTGCTGGAT
GAGCTGACTG CCAACAACGG CGCCACCACG CTGCCACTGC GCAAGCGCCT CGCTGCCAAG
GCCTATGCGC GCACGGCCGC TTATGATGCG GCGATCTCCA ACTGGTTCGC GCTGCAGCTC
AAGACCGATG CGCCGGACTT CCGCGCGATC GGTGGACGGC TGATCCAGAG CCTCCGCTAC
GGCGAAAATC CGCACCAGAC CGCGGCGTTC TACGCCACGC CGGAGAAGCG TCCGGGCGTT
GCCACCGCGC GCCAGGTGCA GGGCAAGGAG CTGTCCTACA ACAACATCAA CGACACCGAC
GCCGCCTATG AGTGCGTCGG CGAGTTCGAC GCGGCGCGTA CCGCGGCCTG CGTCATCGTC
AAGCACGCCA ATCCTTGCGG CGTCGCCGAG GGATCGAGCC TGCTCGACGC CTACAAGAAG
GCGCTGGCTT GCGACTCCGT CTCGGCGTTC GGAGGCATCG TCGCGCTCAA CCGCACGCTC
GACGCCGAAG CGGCGCGCGC CATCACCGAG ATCTTCACCG AAGTGATCAT CGCGCCGGAC
GCCACCGACG AAGCGATCGC GATCGTCGCG GCGAAGAAGA ACCTGCGGCT GCTGCTGGCG
GGCGCGCTGC CCGATCCGCG CGCCAACGGT CTGACCTACA AGACCGTCGC CGGCGGCCTG
CTGGTGCAGA GCCGCGACAA TGCGGTGGTC GATGACATGG CGCTGAAGGT CGTCACCAAG
CGGCAGCCGA CCGAAGCCGA GCTGCGTGAC CTGAAGTTCG CGTTCCGCGT CGGCAAGCAC
GTCAAGTCCA ACACCATCAT CTATGCCAAG GACCTCGCCA CTGTCGGTAT CGGTGCCGGT
CAGATGAGCC GCGTCGACTC CGCCCGCATC GCCGCCCGCA AGGCGCAGGA TGCGGCCGAG
GCGATGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA CGCGTTCTTC
CCGTTCGCCG ACGGCATGCT GGCCTGTATC GAAGCCGGCG CCACCGCGGT GATCCAGCCC
GGCGGCTCGG TTCGCGACGA CGAAGTGATC AAGGCTGCGG ACGACGCCGG CATCGCCATG
GTGTTCACCG GCACCCGGCA CTTCCGACAC TAA
 
Protein sequence
MTQNPRRVTR ALLSVSDKTG LIDFARALAG HGVELVSTGG TAKAIAAAGL PVKDVSELTG 
FPEMMDGRVK TLHPKVHGGL LAIRDNDEHT QAMAAHGIPQ IDLLVVNLYP FEATVDKGAS
YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EASDYQAVLD ELTANNGATT LPLRKRLAAK
AYARTAAYDA AISNWFALQL KTDAPDFRAI GGRLIQSLRY GENPHQTAAF YATPEKRPGV
ATARQVQGKE LSYNNINDTD AAYECVGEFD AARTAACVIV KHANPCGVAE GSSLLDAYKK
ALACDSVSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ATDEAIAIVA AKKNLRLLLA
GALPDPRANG LTYKTVAGGL LVQSRDNAVV DDMALKVVTK RQPTEAELRD LKFAFRVGKH
VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAE AMKLAAPMTK GSVVASDAFF
PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH
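
The sequences above are printed in space-separated groups of ten letters, so they need to be cleaned before further use. The following Python sketch shows one way to do that and to compute the GC fraction of a nucleotide string. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Drop the spaces and line breaks used for grouping; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence block, copied from the listing above.
first_line = "ATGACCCAGA ATCCCCGTCG CGTCACCCGT GCCCTGTTGT CCGTGTCCGA TAAGACCGGT"
dna = clean_sequence(first_line)

print(len(dna), dna[:3])           # 60 ATG
print(round(gc_fraction(dna), 2))  # GC fraction of this first line only
```

Applied to the full gene sequence instead of the first line, the same functions give a value that can be compared with the GC content and gene length listed under Gene Information.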