Gene lpp0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp0019 
Symbol 
ID3117166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp23666 
End bp25342 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content37% 
IMG OID637578719 
Producthypothetical protein 
Protein accessionYP_122371 
Protein GI54296002 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAA AAATAATGTT TTTTATTTTG TCGATATCTA CTTCAAGTAT TTTTGCTGCT 
GACAATGTAG ATTTGTATCA AGCCCCTCTC AATAGCATCA ATAAATACCC TATACTACAA
ACACCAAAGA ATGCGATTAT TTTAAAGAGT TCTTCTGCCG TTATTGATAA TTCATTGCAA
AAATTAAATC AAACAAAAGA AGATAATCAA ATGATTGTTC GTTATCAGCA ACTGTATAAA
GGAATACCTG TTATTGGCGC CCAAGTGATG ATTACTAAAG GAACAGACTC AGGAGTGCAG
TCCAATGACA ATGCAGAGGT GAACGGCCAT TTATTGGATA ATATAGAACT TAATACGAAA
CCGGCTATTA GTGCGCAACA AGCGAAGGAA TATGCAAAAA AATCCTATTT TCAATTTAGC
CCCCAATCTA ACATACAACA GGAAACAGCT GAATTACAGA TTCGGCCAGA CCATAATAAT
CAATTAAAGC TGGTTTATTT GGTTTCATTT AAAAGCGTGC AACAGGATGG TAAACCAGAC
TGGCCTTTTT TTGTTATTGA TGCTCAAACA GGAGCTTTGA TTAAGCAATG GAACAATATC
AAAAATTATT TGGATACAGG GCCTGGAGGC AATGAGAAAG TTCAGGAATA TTGGTATGGT
AAAGATGGAT TGCCTGCTTT GGATGTGACT CAAAATGGCA GCCAATGCGT CATGGAAAAC
TCAAAAGTCA AGTTGGTTAA TCTCCATTCT CAATGGGATT GGGAAAACAC GATAAATACT
CCTTTTGAAT ACGTTTGTAA CAATAATATA GAAGAGAATA TTAATGGAGG ATTTTCTCCT
GGTAATGATG CGTATTATTT CGGACATGTT ATTGTTGATA TGTACAAAGA CTGGTATGGA
CTTAATGCCT TACAACATTC TAATGGTGCT CCAATGCAAT TGGTTATGCG AGTTCATTTT
GGGCAAAACT ATGATAATGC TTTTTGGGAT GGACAAGCTA TGTCATTTGG AGATGGGTTG
GATTTTTACC CATTGGTTTC TTTAGATGTA GCCGGTCATG AAGTGACTCA TGGTTTTACA
GAGCAGCATT CTGGTCTTGA GTATCATGAT CAATCAGGTG CACTTAATGA GTCCCTATCT
GATATGGCAG GACAAGCGTC AAGAGCTTAT CTTTTGGAAA AAAATCCTCA GTTGTATAAC
AAAGCTTACT TACAGCCCAA TGAAGTCACA TGGGGTATTG GAGAAACAAT AGTTCGTGAT
TCTTATGGCA AAGCTTTGCG ATTCATGGAT TACCCATCCT CTGATGGAAG CTCCGCAGAT
TGTTTAGACA AAGGTATTGC GCAAAACAAT GGCAGCTATT GTGCTATCAA TTATGATGAG
GTAGTAGCCT ATGCCAATGC ACATATCGCA CTTCCTCAAG AACGCCAGAG CTTCATAGTT
CATACAGCCA GTGGTGTGTT CAATAAGGCT TTTTACTTAA TGTCTAAGGA TATGGGTATT
AAAAACGCTT ATCACATCAT GGTTGTTGCT AACACAAAAT ATTGGACTCC TACGACAGAC
TTTAAAAATG GAGCTTGCGG AGTCATTTAT GCTGCCAGGG ATTTAAATAC TGATATCAAT
AAGGTTAAGT CTGCTTTTGG TCAAGTAGGT ATTGATATAG CCGGGTGTGC TATTTAG
 
Protein sequence
MLKKIMFFIL SISTSSIFAA DNVDLYQAPL NSINKYPILQ TPKNAIILKS SSAVIDNSLQ 
KLNQTKEDNQ MIVRYQQLYK GIPVIGAQVM ITKGTDSGVQ SNDNAEVNGH LLDNIELNTK
PAISAQQAKE YAKKSYFQFS PQSNIQQETA ELQIRPDHNN QLKLVYLVSF KSVQQDGKPD
WPFFVIDAQT GALIKQWNNI KNYLDTGPGG NEKVQEYWYG KDGLPALDVT QNGSQCVMEN
SKVKLVNLHS QWDWENTINT PFEYVCNNNI EENINGGFSP GNDAYYFGHV IVDMYKDWYG
LNALQHSNGA PMQLVMRVHF GQNYDNAFWD GQAMSFGDGL DFYPLVSLDV AGHEVTHGFT
EQHSGLEYHD QSGALNESLS DMAGQASRAY LLEKNPQLYN KAYLQPNEVT WGIGETIVRD
SYGKALRFMD YPSSDGSSAD CLDKGIAQNN GSYCAINYDE VVAYANAHIA LPQERQSFIV
HTASGVFNKA FYLMSKDMGI KNAYHIMVVA NTKYWTPTTD FKNGACGVIY AARDLNTDIN
KVKSAFGQVG IDIAGCAI