Gene lpp0526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp0526 
SymbolpurH 
ID3116895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp572638 
End bp574227 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content40% 
IMG OID637579222 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_122864 
Protein GI54296495 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAA AACAAATTTA CTCTTCTTTC AAGCCACGAC GAGCACTACT GAGTGTTTCC 
GATAAAAGAG GAATAAGAGA ACTCGGTCAG GCCTTACACG ACCAAGGAGT GGAATTGATA
GCCACAGGAA ACACAGCTGC GATCTTAAGG GAACATAAAT TACCAGTTAC CGATGTCAGT
GAGTGCACGG GATTTCCCGA AATAATGGAT GGAAGAGTCA AAACCCTTCA TCCAGCTATT
CATGCTGGTT TATTAGCCCG CGGAGAACAA GACAGCCCAG TTTTAAGACA ACATGGAATA
AAACCTATTG ATTTGCTGAT TGTCAATTTA TACCCCTTTG AACAAGTCAT TAACCACCCG
GACTGCGATT TTAATAAGGC CATTGAAAAC ATTGACATTG GTGGACCAAC AATGGTGCGA
GCGGCAGCTA AAAATCATGC CCATACTTAT GTCATCGTTG ATCCAAATGA TTACTCAAAA
TTAATTCATT ATTTACAAGA TCAGAAAGCA CCTTCTCATT GGAATTTTGC GTTAGCTAAA
AAAGCATTTG CCCATACTGC AGCCTACGAT GCCGCAATTG CAAATTATTT GACCACTTTG
GATAATGATT ATGTTCCAAC TGGATTTCCC GACATACTAA CCTGCCAATT TAATAAAATT
ACTGATCTTC GTTATGGTGA AAATCCCCAT CAGCAAGCTA TTTTTTATGC TGATAAAAAT
TCACATCCAG GCTCTCTAAG TACAGCAGCC TTATTACAGG GAAAACAATT ATCCTATAAT
AATATCCTTG ATGCCGATGC AGCTCTCGAT TGTGTGAAAT CATTTTCCAA TGAAAAGTCA
GTTTGCGTCA TTGTAAAACA TACCAACCCC TGCGGCATTG CACTATCCGA CACATCACTT
GACGCTTACT TAAAAGCCTT TCAAAGTGAT CCGATATCCG CTTATGGTGG AATTATTGCT
TTCAATGGAA CTCTGGATAG CGATACGGCC AAAGCAATAT TGGAAAAGCA ATTTGTGGAA
GTGATTATTG CACCTGATGC CAATGAAGAA GCAAAAAAAA TTCTGGCAAC CAAAGAGAAC
ATTCGTGTTC TTCTAACTGG TTTTTGGCAA CAAGGTAATA ATTTTAGATT AAGTATGAAG
AAAGTCGATG GTGGTCTATT AGTACAAGAA CACGATTCTC TTTCTCTGGA ATCATGTGAG
TTACAAACCG TTACCCAAGT AAAACCCACT GATGAACAAC TACAAAATTT AATGTTCGCT
TGGTTAGCAG CCAAGCATGT CAAATCCAAT GCCATTGTAT ACGCTAATGA TTTGGCCACT
ATAGGGATTG GCGGAGGACA AACCAGTCGA GTAATGAGTG CTCGTATTGG CTTGTGGCAA
GCAGAGCAAA TGGGGTTTGA TCCCAAAGGC GCTGTTATGG CCTCAGACGC ATTTATCCCT
TTCCCTGATA CTATTGAAAT AGCTGCGAAG GCTGGTATAT CCGCAATTAT TCAGCCAGGG
GGTTCTATCA GAGATGAAAA AATTATTTCT TGCGCCGACC AACACAACAT AGCGATGATT
TTTACAGGAG TGCGACATTT TAAACATTAA
 
Protein sequence
MAKKQIYSSF KPRRALLSVS DKRGIRELGQ ALHDQGVELI ATGNTAAILR EHKLPVTDVS 
ECTGFPEIMD GRVKTLHPAI HAGLLARGEQ DSPVLRQHGI KPIDLLIVNL YPFEQVINHP
DCDFNKAIEN IDIGGPTMVR AAAKNHAHTY VIVDPNDYSK LIHYLQDQKA PSHWNFALAK
KAFAHTAAYD AAIANYLTTL DNDYVPTGFP DILTCQFNKI TDLRYGENPH QQAIFYADKN
SHPGSLSTAA LLQGKQLSYN NILDADAALD CVKSFSNEKS VCVIVKHTNP CGIALSDTSL
DAYLKAFQSD PISAYGGIIA FNGTLDSDTA KAILEKQFVE VIIAPDANEE AKKILATKEN
IRVLLTGFWQ QGNNFRLSMK KVDGGLLVQE HDSLSLESCE LQTVTQVKPT DEQLQNLMFA
WLAAKHVKSN AIVYANDLAT IGIGGGQTSR VMSARIGLWQ AEQMGFDPKG AVMASDAFIP
FPDTIEIAAK AGISAIIQPG GSIRDEKIIS CADQHNIAMI FTGVRHFKH