Gene lpp1285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp1285 
SymbolhtrA 
ID3117617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp1430577 
End bp1432044 
Gene Length1468 bp 
Protein Length466 aa 
Translation table11 
GC content39% 
IMG OID637579980 
Productperiplasmic serine protease Do; heat shock protein HtrA 
Protein accessionYP_123609 
Protein GI54297240 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TCAGGATTAA TAATATGATT ACACGTATAA AATTATTCAT AGCAAGTCTG 
CTGGTATTTT TCATCCCGGT AGCCGATGCA GCACAAGATC TTACAAACAT GCCCAGCCTG
GCTCCTGTTC TGAAAAATGC TATGCCGGCT ATTGTTAACG TGGCTGTTCA AGGCTACTTA
CCTAATAATA TGGCATCCGG AAATGCGGAC GATGATGACG GGGAAAACAG TAAACAGCCT
TCCAGAATAC CTGAAAAAGG CAGAAAATTT GAAAGTATAG GGTCCGGAGT TATCATTGAC
CCTAAAAATG GCATTATTAT AACGAATGAC CATGTTATTC GTAACGCCAA TTTAATTACT
ATAACTCTTC AAGATGGCAG GAGATTAAAA GCTCGCCTGA TTGGAGGTGA TAGTGAAACA
GACTTGGCTG TTTTAAAAAT TGATGCTAAA AATTTGAAAT CCCTGGTAAT TGGTGATTCC
GATAAGCTGG AAGTCGGAGA TTATGTTGTT GCTATTGGAA ACCCCTTTGG ATTAAACAGC
TTTGGAAATA GTCAATCTGC TACGTTTGGA ATAGTGAGTG CCTTAAAACG CAGCGATTTA
AATATTGAAG GTGTTGAAAA CTTTATTCAA ACTGACGCAG CTATCAATCC TGGTAATTCA
GGAGGTGCTT TGGTCAATGC AAAAGGCGAA TTAATTGGCA TTAATACAGC CATTATTTCA
CCCTATGGAG GAAATGTAGG TATTGGTTTT GCGATCCCAA TTAATATGGT AAAAGATGTA
GCGCAGCAAA TCATTAAATT TGGCTCTATT CATCGTGGTT TAATGGGTAT ATTTGTTCAA
CATTTAACAC CAGAACTTGC CCAATCAATG GGATATGCCG AAGATTTTCA AGGAGCTTTA
GTATCACAGG TCAATGAAAA TTCACCTGCT CAATTGGCTG GTCTAAAATC AGGCGATGTC
ATTGTACAAA TTAATGACAC CAAGATAACT CAGGCAACAC AGGTAAAAAC AACTATTAGC
CTGTTGCGAG CCGGCTCTAC TGCTAAAATT AAAATCTTGC GGGATAATAA GCCGCTTACA
TTAGATGTAG AAGTCACAGA TATCAAAAAA CATGAACAAA AATTACAATC CAATAATCCA
TTTCTCTACG GATTAGCCTT ACGTAATTTT GAACAAGAAT CACCGCCTCA TGGTAATGTT
GTTGGAGTTC AGGTTGTAGG TGCTTCGGAA ACCAGTGCGG GCTGGCGAGC TGGCTTAAGA
CCAGGAGATA TAATCATTTC TGCTAATAAA ACACCGGTTA AAGACATTAA ATCTTTACAA
GCTGTTGCAC ACGACAAAAA GAAACAGCTA TTAGTCCAGG TGCTGAGAGG AGCAGGAGCA
CTCTACCTGT TGATTATTTA AATTTTTTAT CCAAATAATG AAGTAAAGCA TCCTCTTTAT
TTAAGCCCAT CTCTGAGAGA TGGGCTTA
 
Protein sequence
MSKIRINNMI TRIKLFIASL LVFFIPVADA AQDLTNMPSL APVLKNAMPA IVNVAVQGYL 
PNNMASGNAD DDDGENSKQP SRIPEKGRKF ESIGSGVIID PKNGIIITND HVIRNANLIT
ITLQDGRRLK ARLIGGDSET DLAVLKIDAK NLKSLVIGDS DKLEVGDYVV AIGNPFGLNS
FGNSQSATFG IVSALKRSDL NIEGVENFIQ TDAAINPGNS GGALVNAKGE LIGINTAIIS
PYGGNVGIGF AIPINMVKDV AQQIIKFGSI HRGLMGIFVQ HLTPELAQSM GYAEDFQGAL
VSQVNENSPA QLAGLKSGDV IVQINDTKIT QATQVKTTIS LLRAGSTAKI KILRDNKPLT
LDVEVTDIKK HEQKLQSNNP FLYGLALRNF EQESPPHGNV VGVQVVGASE TSAGWRAGLR
PGDIIISANK TPVKDIKSLQ AVAHDKKKQL LVQVLRGAGA LYLLII