Gene lpl2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpl2204 
Symbollly 
ID3114280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Lens 
KingdomBacteria 
Replicon accessionNC_006369 
Strand
Start bp2518550 
End bp2519596 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content41% 
IMG OID637583978 
Product4-hydroxyphenylpyruvate dioxygenase (legiolysin) 
Protein accessionYP_127539 
Protein GI54295124 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA ATAACCCCTG CGGATTAGAT GGCTTTGCCT TTTTAGAGTT TTCAGGCCCT 
GATAGGAATA AATTACATCA GCAATTTTCT GAGATGGGGT TTCAGGCCGT TGCCCACCAT
AAAAATCAAG ACATTACTCT TTTCAAACAA GGGGAAATAC AATTTATAGT GAATGCGGCC
TCCCATTGTC AGGCAGAAGC GCATGCTTCA ACTCATGGTC CAGGCGCTTG TGCAATGGGC
TTTAAAGTAA AAGATGCCAA AGCCGCTTTT CAACACGCTA TCGCGCATGG CGGTATAGCA
TTTCAGGATG CGCCTCATGC CAATCACGGC TTGCCAGCCA TCCAGGCTAT TGGTGGTAGT
GTTATTTATT TTGTCGATGA AGAACACCAA CCCTTCTCTC ATGAATGGAA TATTACCTCG
CCAGAACCCG TAGTTGGAAA TGGTCTGACC GCAATCGACC ATCTCACCCA TAACGTTTAT
CGCGGTAATA TGGATAAATG GGCCAGTTTC TATGCTTCCA TTTTTAACTT CCAGGAAATT
CGTTTTTTCA ATATCAAAGG AAAAATGACT GGTTTGGTCA GTCGAGCATT AGGTAGCCCT
TGTGGCAAAA TCAAAATTCC TTTAAACGAA TCCAAAGATG ATTTATCACA AATTGAAGAG
TTTCTTCATG AATATCATGG CGAGGGCATT CAACACATCG CTCTCAATAC CAATGATATT
TATAAAACAG TCAACGGCTT AAGAAAACAA GGGGTCAAAT TCCTGGATGT GCCGGATACT
TACTATGAGA TGATTAATGA CCGTCTCCCA TGGCACAAGG AGCCACTGAA TCAACTCCAT
GCTGAGAAAA TTTTAATTGA TGGAGAAGCA GATCCCAAAG ACGGCTTGTT ACTGCAAATA
TTTACTGAAA ACATATTTGG ACCAGTCTTT TTTGAAATTA TTCAACGCAA AGGCAATCAG
GGGTTTGGTG AAGGGAATTT CCAGGCTCTA TTCGAAGCTA TTGAAAGAGA TCAAGTTCGA
CGTGGTACTT TAAAAGAATT AAGCTAG
 
Protein sequence
MQNNNPCGLD GFAFLEFSGP DRNKLHQQFS EMGFQAVAHH KNQDITLFKQ GEIQFIVNAA 
SHCQAEAHAS THGPGACAMG FKVKDAKAAF QHAIAHGGIA FQDAPHANHG LPAIQAIGGS
VIYFVDEEHQ PFSHEWNITS PEPVVGNGLT AIDHLTHNVY RGNMDKWASF YASIFNFQEI
RFFNIKGKMT GLVSRALGSP CGKIKIPLNE SKDDLSQIEE FLHEYHGEGI QHIALNTNDI
YKTVNGLRKQ GVKFLDVPDT YYEMINDRLP WHKEPLNQLH AEKILIDGEA DPKDGLLLQI
FTENIFGPVF FEIIQRKGNQ GFGEGNFQAL FEAIERDQVR RGTLKELS