Gene lpp2232 details


Gene Information

Locus tag: lpp2232
Symbol: lly
ID: 3117551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Legionella pneumophila str. Paris
Kingdom: Bacteria
Replicon accession: NC_006368
Strand:
Start bp: 2567720
End bp: 2568766
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 41%
IMG OID: 637580931
Product: 4-hydroxyphenylpyruvate dioxygenase (legiolysin)
Protein accession: YP_124544
Protein GI: 54298175
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
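
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a complete CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive coordinate span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = cds_lengths(2567720, 2568766)
print(gene_len, protein_len)  # 1047 348
```

Under these assumptions the result agrees with the Gene Length (1047 bp) and Protein Length (348 aa) fields.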


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCAAAATA ACAACCCCTG CGGATTAGAT GGCTTTGCCT TTTTAGAGTT TTCAGGCCCT 
GATAGGAATA AATTACATCA ACAATTTTCT GAGATGGGGT TTCAGGCCGT TGCCCACCAT
AAAAATCAAG ACATTACTCT TTTCAAACAA GGGGAAATAC AATTTATAGT GAATGCGGCT
TCCCATTGTC AGGCAGAAGC GCATGCTTCA ACTCATGGTC CAGGCGCTTG TGCAATGGGC
TTTAAAGTAA AAGATGCCAA AGCCGCTTTT CAACACGCTA TCGCGCATGG CGGTATAGCA
TTTCAGGATG CACCTCATGC CAATCACGGC TTGCCAGCCA TCCAGGCTAT TGGTGGTAGT
GTTATTTATT TTGTCGATGA AGAACACCAA CCCTTCTCTC ATGAATGGAA TATTACCTCA
CCAGAACCCG TAATTGGAAA TGGTCTGACC GCAATCGACC ATCTCACCCA TAACGTTTAT
CGCGGTAATA TGGATAAATG GGCCAGTTTT TATGCTTCCA TTTTTAACTT CCAGGAAATT
CGTTTTTTCA ATATCAAAGG AAAAATGACT GGTTTGGTCA GTCGAGCATT AGGTAGCCCT
TGTGGCAAAA TCAAAATTCC TTTAAACGAA TCCAAAGATG ATTTATCACA AATTGAAGAG
TTTCTTCATG AATATCATGG CGAGGGCATT CAACACATCG CTCTCAATAC CAAGGATATT
TATAAAACAG TCAACGGCTT AAGAAAACAA GGGGTCAAAT TCCTGGATGT ACCGGATACT
TACTATGAGA TGATTAATGA CCGCCTCCCA TGGCATAAGG AGCCACTGAA TCAACTCCAT
GCTGAGAAAA TTTTAATTGA TGGAGAAGCA GATCCCAAAG ACGGCTTGTT ACTGCAAATA
TTTACTGAAA ACATATTTGG ACCGGTCTTT TTTGAAATTA TTCAACGCAA AGGCAATCAG
GGGTTTGGTG AAGGGAATTT CCAGGCTCTA TTCGAAGCTA TTGAAAGAGA TCAAGTTCGA
CGCGGTACTT TAAAAGAATT AACCTAG
 
Protein sequence
MQNNNPCGLD GFAFLEFSGP DRNKLHQQFS EMGFQAVAHH KNQDITLFKQ GEIQFIVNAA 
SHCQAEAHAS THGPGACAMG FKVKDAKAAF QHAIAHGGIA FQDAPHANHG LPAIQAIGGS
VIYFVDEEHQ PFSHEWNITS PEPVIGNGLT AIDHLTHNVY RGNMDKWASF YASIFNFQEI
RFFNIKGKMT GLVSRALGSP CGKIKIPLNE SKDDLSQIEE FLHEYHGEGI QHIALNTKDI
YKTVNGLRKQ GVKFLDVPDT YYEMINDRLP WHKEPLNQLH AEKILIDGEA DPKDGLLLQI
FTENIFGPVF FEIIQRKGNQ GFGEGNFQAL FEAIERDQVR RGTLKELT
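
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the first 60-nucleotide block of the gene sequence and builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. The helper names `translate` and `gc_fraction` are hypothetical, and whitespace in the pasted sequence is assumed to be layout only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First block of the gene sequence from the record above.
first_block = "ATGCAAAATA ACAACCCCTG CGGATTAGAT GGCTTTGCCT TTTTAGAGTT TTCAGGCCCT"
print(translate(first_block))  # MQNNNPCGLDGFAFLEFSGP
```

The printed peptide matches the first 20 residues of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence is one way to check the Protein Length and GC content fields.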