Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0525 |
Symbol | lon |
ID | 6969992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 528481 |
End bp | 530880 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384572 |
Product | DNA-binding ATP-dependent protease La |
Protein accession | YP_002269086 |
Protein GI | 209399962 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000828536 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.526776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCATCTG ATTACCTGGC GGAAATTAAA CTAAGAGAGA GCTCTATGAA TCCTGAGCGT TCTGAACGCA TTGAAATCCC CGTATTGCCG CTGCGCGATG TGGTGGTTTA TCCGCACATG GTCATCCCCT TATTTGTCGG GCGGGAAAAA TCTATCCGTT GTCTGGAAGC GGCGATGGAC CATGATAAAA AAATTATGCT GGTCGCGCAG AAAGAAGCTT CAACGGATGA GCCGGGTGTA AACGATCTTT TCACCGTCGG GACCGTGGCC TCTATATTGC AGATGCTGAA ACTGCCTGAC GGCACCGTCA AAGTGCTGGT CGAGGGGTTA CAGCGCGCGC GTATTTCTGC GCTCTCTGAC AATGGCGAAC ACTTTTCTGC GAAGGCGGAG TATCTGGAGT CGCCGACCAT TGATGAGCGA GAACAGGAAG TGCTGGTGCG TACTGCAATC AGCCAGTTCG AAGGCTACAT CAAGCTGAAC AAAAAAATCC CACCAGAAGT GCTGACGTCG TTGAATAGCA TCGACGATCC GGCGCGTCTG GCGGATACCA TTGCTGCACA TATGCCGCTG AAACTGGCTG ACAAACAGTC CGTTCTGGAG ATGTCCGACG TTAACGAACG TCTGGAATAT CTGATGGCAA TGATGGAATC GGAAATCGAT CTGCTGCAGG TTGAGAAACG CATTCGCAAC CGCGTTAAAA AGCAGATGGA GAAATCCCAG CGTGAGTACT ATCTGAACGA GCAAATGAAA GCTATTCAGA AAGAACTCGG TGAGATGGAC GACGCGCCGG ACGAAAACGA AGCCCTGAAG CGCAAAATCG ACGCGGCGAA GATGCCGAAA GAGGCAAAAG AGAAAGCGGA AGCAGAGTTG CAGAAGCTGA AAATGATGTC TCCGATGTCG GCAGAAGCGA CCGTAGTGCG TGGTTATATC GACTGGATGG TACAGGTACC GTGGAATGCG CGCAGCAAGG TCAAAAAAGA CCTGCGTCAG GCGCAGGAAA TCCTTAATAC CGACCATTAT GGTCTGGAGC GCGTGAAAGA TCGCATCCTT GAGTATCTTG CGGTTCAAAG CCGTGTCAAC AAAATCAAGG GACCGATCCT TTGCCTGGTA GGGCCGCCGG GGGTAGGTAA AACCTCCCTG GGTCAGTCCA TTGCCAAAGC CACCGGGCGT AAATATGTCC GTATGGCGCT GGGCGGCGTG CGTGATGAAG CGGAAATCCG TGGTCACCGC CGTACTTACA TCGGTTCTAT GCCGGGTAAA TTGATCCAGA AAATGGCGAA AGTGGGCGTT AAAAACCCGC TGTTCCTGCT CGATGAGATC GACAAAATGT CTTCTGACAT GCGTGGCGAT CCGGCTTCCG CACTGCTTGA AGTGCTGGAT CCAGAGCAGA ACGTGGCCTT CAGCGATCAC TACCTGGAAG TGGATTACGA TCTCAGCGAC GTGATGTTTG TCGCGACGTC GAACTCCATG AACATTCCGG CACCACTGCT GGATCGTATG GAAGTGATTC GCCTCTCCGG TTATACCGAA GATGAAAAAC TGAACATCGC CAAACGTCAC CTGCTGCCGA AGCAGATTGA ACGTAATGCA CTGAAAAAAG GTGAGCTGAC CGTCGACGAT AGCGCCATTA TCGGCATTAT TCGTTACTAC ACCCGTGAGG CGGGCGTGCG TGGTCTGGAG CGTGAAATCT CCAAGCTGTG CCGTAAAGCG GTTAAGCAGT TACTGCTCGA TAAGTCATTA AAACATATCG AAATTAACGG CGACAACCTG CATGACTACC TTGGTGTTCA GCGTTTCGAC TATGGTCGCG CTGATAACGA AAACCGTGTC GGTCAGGTAA CCGGTCTGGC GTGGACGGAA GTGGGCGGTG ACTTGCTGAC CATTGAAACC GCATGTGTTC CGGGTAAAGG CAAACTGACC TATACCGGTT CGCTCGGCGA AGTGATGCAG GAGTCTATTC AGGCTGCGTT AACGGTGGTT CGCGCGCGTG CGGAAAAACT GGGGATCAAC CCTGATTTTT ACGAAAAACG TGACATCCAC GTCCACGTAC CGGAAGGTGC GACGCCGAAA GATGGTCCGA GTGCCGGTAT TGCTATGTGC ACCGCGCTGG TTTCTTGCCT GACCGGTAAC CCGGTTCGTG CCGATGTGGC AATGACCGGT GAGATCACTC TGCGTGGTCA GGTACTGCCG ATCGGTGGTT TGAAAGAAAA ACTACTGGCA GCGCATCGCG GCGGGATTAA AACAGTGTTA ATTCCGTTCG AAAATAAACG CGATCTGGAA GAGATTCCTG ACAACGTAAT TGCCGATCTG GACATTCATC CTGTGAAGCG CATTGAGGAA GTTCTGACTC TGGCGCTGCA AAATGAACCG TCTGGCATGC AGGTTGTGAC TGCAAAATAG
|
Protein sequence | MSSDYLAEIK LRESSMNPER SERIEIPVLP LRDVVVYPHM VIPLFVGREK SIRCLEAAMD HDKKIMLVAQ KEASTDEPGV NDLFTVGTVA SILQMLKLPD GTVKVLVEGL QRARISALSD NGEHFSAKAE YLESPTIDER EQEVLVRTAI SQFEGYIKLN KKIPPEVLTS LNSIDDPARL ADTIAAHMPL KLADKQSVLE MSDVNERLEY LMAMMESEID LLQVEKRIRN RVKKQMEKSQ REYYLNEQMK AIQKELGEMD DAPDENEALK RKIDAAKMPK EAKEKAEAEL QKLKMMSPMS AEATVVRGYI DWMVQVPWNA RSKVKKDLRQ AQEILNTDHY GLERVKDRIL EYLAVQSRVN KIKGPILCLV GPPGVGKTSL GQSIAKATGR KYVRMALGGV RDEAEIRGHR RTYIGSMPGK LIQKMAKVGV KNPLFLLDEI DKMSSDMRGD PASALLEVLD PEQNVAFSDH YLEVDYDLSD VMFVATSNSM NIPAPLLDRM EVIRLSGYTE DEKLNIAKRH LLPKQIERNA LKKGELTVDD SAIIGIIRYY TREAGVRGLE REISKLCRKA VKQLLLDKSL KHIEINGDNL HDYLGVQRFD YGRADNENRV GQVTGLAWTE VGGDLLTIET ACVPGKGKLT YTGSLGEVMQ ESIQAALTVV RARAEKLGIN PDFYEKRDIH VHVPEGATPK DGPSAGIAMC TALVSCLTGN PVRADVAMTG EITLRGQVLP IGGLKEKLLA AHRGGIKTVL IPFENKRDLE EIPDNVIADL DIHPVKRIEE VLTLALQNEP SGMQVVTAK
|
| |