Gene lpp3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp3049 
Symbol 
ID3116754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp3472423 
End bp3474084 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content38% 
IMG OID637581751 
Producthypothetical protein 
Protein accessionYP_125351 
Protein GI54298982 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC GGATTGTTTC ATTATTTGTC ATTTCAACAG CTATTTCATC TGTAAACGCT 
GCAACCCAAG CAATCGTATG GGGTGAGTCA ACGAAAGTAT TACCTCAATT TATGCAAGCA
CATCCTTTTA AGAAACAAGG ATTATTGCAA ACTGTAAAAA GTCAGCAAAG TGATTATAAA
CTGGAGTTAC AAAATAATTC TAACCATGCG ACCAGGCATG CTCGTTATCA AATAACCTAC
AAGGGGGTTC CGGTTTGGGG TTATCAAGTC ATTTTTCATT CCAAAGATGG CACTAAAGAA
ACTGTAACAG GTATGAATAT TACCGGAATT GAACAGGATA TTAATTCTAC TGAGGGAAAA
TTGAGCCCGA ATGATATAGA ACAAAAGATT CTTGGAAAGG TAGCTGAGCC GGTTAAATTT
AAAAACCTTA AAAAGGTTAT TTATATAGAT AAAGGAAATA AAGCTCATTT GGCTTATCAT
TTGTCTTACT ATTCCAATAG TGAAAAAAAA CATGTGAATG CACCTAATTA TCTTATTGAT
GCCAATAGCG GTGAGATTTT GAAACAATGG GATGAGGTAC GTCATGAAAG AATTGGTCAG
GGATTAGGCG GTAATGCGTT TACTCTCCCA TACCGTCAGG GAATGTTTCA GCATGGTAAT
GCTTTGCCGG GGCTTCCCTC ATTAGGGAAG TTTGATGTTA ATGTTGAGGA TGGATTATGT
CGTGTTGAAA ATGAATCCAT AAAAGTAATG AATTTGGAAA ATCATAATAT AGGTTATGAC
TTCTTTCCAA TTACTATATT TGCAGAGTCT GTACTGAATT TAAGTGCCTT TTCCTACCCA
TGTAATGAAA CCAACTTGTT TTTAAACTAT GCTGATGGCA GAACAGGTCC TGTCAATTAT
GCTTTTTCTC CAGTTAACGA TACGATGTAT TTTGCCCAAC AAACGTTAGA TATGTATCAA
AAAATTTATG GTGTTAATCG TCCAATAGGT GATGATTTAC CTATACGGGC TTACACCCAT
CTTGGTGATA TGGATAACGC TTTTGCAGTA CCAACCATCA GTCTTGATGG GGTAGTTCTT
GCACATCAGC AAATTGTAAT CGGAAATGGT GATGAATTTT TAACAGCTCC CGCCCAGAGT
GTATTGGGAC ATGAATTATC GCACAATTTT ACTGCCTTGC ATTCCGGATT GATGTATGAA
GGGCAATCTG GGGGGATCAA TGAATCTTTC TCTGATATGG CGGCAATTGC ATTGTTAGAT
TATCTTAGTA AAGATTATCC ATGGTATTGG GATGGTGAGG ATTGGACCAT TGGGCGTGAA
GCTGTAAAAA GTGGGCAACC TATTCGTTAT TTGGATGATC CAGCCAAGGA TGGAATGTCT
ATAGGGCATG CTAGTGAATA CACTGATGCG TTGGATGTGC ATATAACGAG CGGAGTATTT
AATAAAGCAT TTTATTTATT AGCACATAAA CCAGGCTGGT CTATACAAAA AGCATTTCAG
GTTATGGTTG ATGCCAATAT GAATTATTGG TCTCCTATTG CATACTATGA TTTTGCTGCA
TGTGGCGTCA TTCAGGCAAC CATAGATAAG CATTGGGATA AAACACCTGT TATCGAGGCA
TTTGCCGAGG TGGGAGTCGT TTGTCCGATG CATAAAAGCT AG
 
Protein sequence
MKKRIVSLFV ISTAISSVNA ATQAIVWGES TKVLPQFMQA HPFKKQGLLQ TVKSQQSDYK 
LELQNNSNHA TRHARYQITY KGVPVWGYQV IFHSKDGTKE TVTGMNITGI EQDINSTEGK
LSPNDIEQKI LGKVAEPVKF KNLKKVIYID KGNKAHLAYH LSYYSNSEKK HVNAPNYLID
ANSGEILKQW DEVRHERIGQ GLGGNAFTLP YRQGMFQHGN ALPGLPSLGK FDVNVEDGLC
RVENESIKVM NLENHNIGYD FFPITIFAES VLNLSAFSYP CNETNLFLNY ADGRTGPVNY
AFSPVNDTMY FAQQTLDMYQ KIYGVNRPIG DDLPIRAYTH LGDMDNAFAV PTISLDGVVL
AHQQIVIGNG DEFLTAPAQS VLGHELSHNF TALHSGLMYE GQSGGINESF SDMAAIALLD
YLSKDYPWYW DGEDWTIGRE AVKSGQPIRY LDDPAKDGMS IGHASEYTDA LDVHITSGVF
NKAFYLLAHK PGWSIQKAFQ VMVDANMNYW SPIAYYDFAA CGVIQATIDK HWDKTPVIEA
FAEVGVVCPM HKS