Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | lpp0517 |
Symbol | icmE/dotG |
ID | 3117707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Legionella pneumophila str. Paris |
Kingdom | Bacteria |
Replicon accession | NC_006368 |
Strand | + |
Start bp | 558398 |
End bp | 561561 |
Gene Length | 3164 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637579213 |
Product | hypothetical protein |
Protein accession | YP_122855 |
Protein GI | 54296486 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAGGGTTA TAAGCAAATG GCGAGCAAAA AAGAAAATCT AAAGTCACTG TTTTCCAATA CTCGGACTCG AGTGATTATC GTTTTTACAG CAGCTTTATT AATCATCGCG GTAGTAATAG GTTTTTTTAA AATAAGAAGC GCTACTACCA GTTCGATTGC TGCAGCTGAA GTGTCTACAG TGCCGGGAGG AATTCAATCT ATTCCAGGAG TTTTAGATCC AACCGCTCAA TATGCCAAAT TGCAAGAAGA ACAAAATATA ACCCAGGCAC AAGTCGCTGA AAAAACAGGT GGGAGTGCTA TTCCAACAAT TATCCGCACC CAAGCACTAG GAGAAGGAGT TGGTGTTATC GGTTCCCAAA GTGGAGTGGG TTTTGCTGCT TTGGCTCAAG AAGAGTTAGG TGGTCCTCAA CGAAGTTTAT GGATACAAGA GTTACAAGAT GGTGGTTGTA GCAAATCTGT GATTACGAAA GTTATGAATC AGGGAGCTCA ATTAACTGAT TTGAAAGCGG CTTGTAGCTG CGTTCAGTTA AAAGATAGTG GATACGGTTT GCAAGAATTA GAGCAGGTCT GTGAGTGTAA AGAGCTAAAA GCCGCTGGTT ATAATGCAAG GCAGTTGAAA GAAGCAGGTT ATAGTGCAGG TCGATTGCGA AATTGTGGGT TTGATGCGTG TGAATTACGT AATGCGGGTT TTACAGCTCA GGAAATGAAA GATGGAGGTT TTTCAGACGG GGAATTAAAA GGAGCTGGGT TCTCTGATGC TGAAATTGCA AAGGCAAGTG GCTTACCAGA TGGTATAACT GCAGATGATG TGCGCAAAGC AGGATGTGGA GCTGCTGCAT TGGCCAAATT ACGTCAGGCA GGGGTTAGTG CGTCCGCGAT TAGAAAAATA AGTGGTTGCA CCGCTGAGCA GTTGAAAGCG GCAGGTTATA CCGCTAAAGA GTTAAAAGAT GCAGGTTTTA GTGCTGCGGA CTTAAAACGA GCAGGTTTTT CTGCTGCAGA ATTAAAAGAT GCCGGATTTA CTGCAAGAGA TTTGTTAAAC GCAGGATTTA CACCAGCTGA TCTGGCAAAG GCAGGATTTT CTAATGCTCA AATCAAAGCT GCTCAGGCTG AGCTTCCGCC TGGAATTACT CCGCAAGATG TGAAGAATGC TGGTTGTGAT GTGGAGGCTT TGAAAAAGGA AAGGGAAGCA GGTGTTAGTG CTGCTTTAAT CAGACAATAC GCGGGATGTA GCGCACAGGC CCTCAAAGCT GCAGGATTCA CTGATGCGGA TTTGGCGAGC GCAGGATTTA CACCAGCTCA AATCAGTGCA GCAACTCCTT TGAGTGATGC GGAAATAAAG GCTGCTGGTT GTGATCCTGA TAAACTGAAA AAATTATTTT CTGCTGGTGT ATCTGCAAAA CGCATCAAAG AACTCAATGG TTGCAGTGCG GAAGCTTTAA AAGCTGCGGG GTATGATGCG CAATCCCTGC TTGCGGCAGG ATTTACACCT CAAGAATTGC TTGCGGCAGG ATTTACACCA AAGCAGTTGG AAGATGCTGG ATTAAATCCC GCATCGATTA TTGCAGACGG CCGTGTCGCA GACTGCAGCG TAGAGTCATT AAAAAAAGCG AGGGCAGCTG GTGTTAGTGC CTTGACAATA AAGCAAACAT TAGGATGTTC CGCTGCAGCA TTGAAAGCAG CTGGTTATAC TGCAAAAGAG CTGAAAGACG CCGGGTTTAC TGCTGCTGAA TTAAAAGCAG CCGGTTTTAG TGCGAAAGAT CTAAAAGATG CCGGTTTTAC AGCAAAGGAA TTGCGTGATG CGGGTTTCTC CGCCCAGGAA TTAAAAGATG TGGGTTTTAG TGCAAAAGAC TTAAAAGACG CTGGGTTTTC TGCTGCTGAA TTAAAAGCGG CTGGATTTAC TGCAGCCCAG CTAAAGGCTG CTGGGTTTTC TGCAAAAGAT CTAAAAGATG CTGGTTTCTC TGCAGCGGAA TTAAAAGCAG CCGGATTTAG TGCAAAGGAG TTGAAGGATG CCGGGTTTAG CGCCTCGGAT TTGAAAAATG CCGGGTTTAG CGCAAAAGAA TTGAAAGATG CAGGATTTAG CGCCTCAGAT TTGAAAAACG CAGGATTTAG TGCCTCAGAA TTAAAGAATG CTGGATATTC AGCTGATGAG CTAAAAAAAG CTGGATATAC CTCAGCTGAG CTGAGAAATG CTGGATTTTC TCCACAAGAA TCAGCCGTTG CAGGGTTACA AGGGCCTGAT TTGCAACAAC TCGACTCAAG CATTACAGGA ATTCCCTCAA TTCCTGGCGC TACTCCAAGG CCAACAACCA GTGATGCTGC CTCTAGTGCT GAGCAGCTTC AGGCCATTTT GCAAAAGCAA AATGAACAAT TGGCTGAGCA AAAATATCAG CAAGAAATTC AACAAAGAAC TTCTGACATG CTCACAGCGG CGACTCAGCT AGTTCAAGAC TGGAAGAAGG TTGAAACCCA GGTGTATACA GAAGGTACAG AAGAAACAAA AACATCTGGT GGTGAAAGCT CAGTTCCAGG AGCAGGCACT GGTGCAGGTG CTAATAATCA ACCAGTAGAG CAAGGTGCAA GTGGTGCTCA GAATCAGGCT ATTATCAAAA CGGGCGATAT TATGTTTGCG GTTCTGGATA CCGCTGTGAA TAGTGATGAG CCAGGTCCAA TATTGGCAAC TATTGTTACT GGTAAGCTAA AAGGCTCAAA ATTAATAGGA AGCTTCAACT TACCGTCCAA TGCGGACAAG ATGGTGATTA CCTTTAATAC AATGTCAATT CCAGGAGCAG AAAAAACTAT ATCCATATCG GCTTATGCAA TTGATCCAAA CACGGCAAGA ACAGCCCTAT CCAGCAGAAC TAATCATCAC TATTTAATGC GATATGGTTC CCTGTTTGCT TCTTCTTTCT TACAAGGATT TGGAAATGCA TTCCAGTCGG CAAATACAAC AATCACCATT GGGGGTACTG GTGGTGGTAA TAATATTACT GTAGCCAATG GTGTTGGTCG CTCAACTTTG GAAAATGCGG TTATAGGATT GGCTACCGTT GGAAAGGCAT GGAGTCAACA GGCGCAACAA TTGTTTAATA CACCAACAAC TGTGGAAGTT TATTCTGGTA CAGGTTTGGG TATTTTATTT ACCCAGGATG TTACAACAAT TTAA
|
Protein sequence | MASKKENLKS LFSNTRTRVI IVFTAALLII AVVIGFFKIR SATTSSIAAA EVSTVPGGIQ SIPGVLDPTA QYAKLQEEQN ITQAQVAEKT GGSAIPTIIR TQALGEGVGV IGSQSGVGFA ALAQEELGGP QRSLWIQELQ DGGCSKSVIT KVMNQGAQLT DLKAACSCVQ LKDSGYGLQE LEQVCECKEL KAAGYNARQL KEAGYSAGRL RNCGFDACEL RNAGFTAQEM KDGGFSDGEL KGAGFSDAEI AKASGLPDGI TADDVRKAGC GAAALAKLRQ AGVSASAIRK ISGCTAEQLK AAGYTAKELK DAGFSAADLK RAGFSAAELK DAGFTARDLL NAGFTPADLA KAGFSNAQIK AAQAELPPGI TPQDVKNAGC DVEALKKERE AGVSAALIRQ YAGCSAQALK AAGFTDADLA SAGFTPAQIS AATPLSDAEI KAAGCDPDKL KKLFSAGVSA KRIKELNGCS AEALKAAGYD AQSLLAAGFT PQELLAAGFT PKQLEDAGLN PASIIADGRV ADCSVESLKK ARAAGVSALT IKQTLGCSAA ALKAAGYTAK ELKDAGFTAA ELKAAGFSAK DLKDAGFTAK ELRDAGFSAQ ELKDVGFSAK DLKDAGFSAA ELKAAGFTAA QLKAAGFSAK DLKDAGFSAA ELKAAGFSAK ELKDAGFSAS DLKNAGFSAK ELKDAGFSAS DLKNAGFSAS ELKNAGYSAD ELKKAGYTSA ELRNAGFSPQ ESAVAGLQGP DLQQLDSSIT GIPSIPGATP RPTTSDAASS AEQLQAILQK QNEQLAEQKY QQEIQQRTSD MLTAATQLVQ DWKKVETQVY TEGTEETKTS GGESSVPGAG TGAGANNQPV EQGASGAQNQ AIIKTGDIMF AVLDTAVNSD EPGPILATIV TGKLKGSKLI GSFNLPSNAD KMVITFNTMS IPGAEKTISI SAYAIDPNTA RTALSSRTNH HYLMRYGSLF ASSFLQGFGN AFQSANTTIT IGGTGGGNNI TVANGVGRST LENAVIGLAT VGKAWSQQAQ QLFNTPTTVE VYSGTGLGIL FTQDVTTI
|
| |