Gene Information |
Locus tag | EcSMS35_0482 |
Symbol | lon |
ID | 6146991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 487190 |
End bp | 489544 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615376 |
Product | DNA-binding ATP-dependent protease La |
Protein accession | YP_001742583 |
Protein GI | 170682239 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
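The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(487190, 489544)
print(length_bp)                  # 2355
print(protein_length(length_bp))  # 784
```

Under these assumptions, the 2355 bp span corresponds to 785 codons, of which 784 encode amino acids.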
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000167472 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCCTG AGCGTTCTGA ACGCATTGAA ATCCCCGTAT TGCCGCTGCG CGATGTGGTG GTTTATCCGC ACATGGTCAT CCCCTTATTT GTCGGGCGGG AAAAATCTAT CCGTTGTCTG GAAGCGGCGA TGGACCATGA TAAAAAAATT ATGCTGGTCG CGCAGAAAGA AGCTTCAACG GATGAGCCGG GTGTAAACGA TCTTTTCACC GTCGGGACCG TGGCCTCTAT ATTGCAGATG CTGAAACTGC CTGACGGCAC CGTCAAAGTG CTGGTCGAGG GGTTACAGCG CGCGCGTATT TCTGCGCTCT CTGACAATGG CGAACACTTT TCTGCGAAGG CGGAGTATCT GGAGTCGCCG ACCATTGATG AGCGGGAACA GGAAGTGCTG GTGCGTACTG CAATCAGCCA GTTCGAAGGC TACATCAAGC TGAACAAAAA AATTCCACCA GAAGTGCTGA CGTCGCTGAA TAGCATCGAC GATCCGGCGC GTCTGGCGGA TACCATTGCT GCACATATGC CGCTGAAACT GGCTGACAAA CAGTCCGTTC TGGAGATGTC CGACGTTAAC GAACGTCTGG AATATCTGAT GGCAATGATG GAATCGGAAA TCGATCTGCT GCAGGTTGAG AAACGCATTC GCAACCGTGT TAAAAAGCAG ATGGAGAAAT CCCAGCGTGA GTACTATCTG AACGAGCAAA TGAAAGCTAT TCAGAAAGAA CTCGGTGAAA TGGACGACGC GCCGGACGAA AACGAAGCCC TGAAGCGCAA AATCGACGCG GCGAAAATGC CGAAAGAGGC AAAAGAGAAA GCGGAAGCAG AGTTGCAGAA GCTGAAAATG ATGTCTCCGA TGTCGGCAGA AGCGACCGTA GTGCGTGGTT ATATCGACTG GATGGTACAG GTACCGTGGA ATGCGCGTAG CAAGGTCAAA AAAGACCTGC GTCAGGCGCA GGAAATCCTT GATACCGACC ATTATGGTCT GGAGCGCGTG AAAGATCGCA TCCTTGAGTA TCTTGCGGTT CAAAGCCGTG TCAACAAAAT CAAGGGACCG ATCCTCTGCC TGGTAGGGCC ACCGGGGGTA GGTAAAACGT CCCTGGGGCA GTCCATCGCC AAAGCCACCG GGCGTAAATA TGTCCGTATG GCGCTGGGCG GCGTGCGTGA TGAAGCGGAA ATCCGTGGTC ACCGCCGTAC TTACATCGGT TCTATGCCGG GTAAACTGAT CCAGAAAATG GCGAAAGTGG GCGTGAAAAA CCCGCTGTTC CTGCTCGATG AGATCGACAA AATGTCTTCT GACATGCGAG GCGATCCGGC TTCCGCACTG CTTGAAGTGC TGGATCCAGA GCAGAACGTA GCGTTCAGCG ACCACTACCT GGAAGTGGAT TACGACCTTA GCGACGTGAT GTTTGTCGCG ACGTCGAACT CCATGAACAT TCCGGCACCG CTGCTGGATC GTATGGAAGT GATTCGCCTC TCCGGTTATA CCGAAGATGA AAAACTGAAC ATCGCCAAAC GTCACCTGCT GCCGAAGCAG ATTGAACGTA ATGCACTGAA AAAAGGTGAG CTGACCGTCG ACGATAGCGC CATTATCGGC ATTATTCGTT ACTACACCCG TGAGGCGGGC GTGCGTGGTC TGGAGCGTGA AATCTCCAAA CTGTGTCGCA AAGCGGTTAA GCAGTTACTG CTCGATAAGT CATTAAAACA TATCGAAATT AACGGCGATA ACCTGCATGA CTATCTCGGT GTTCAGCGTT TCGACTATGG TCGCGCGGAT AACGAAAACC GTGTCGGTCA GGTAACTGGT 
CTGGCGTGGA CGGAAGTGGG CGGTGACTTG CTGACCATTG AAACCGCGTG CGTTCCGGGT AAAGGCAAAC TGACCTATAC CGGATCGCTT GGCGAAGTGA TGCAGGAGTC CATTCAGGCG GCGTTAACGG TGGTTCGTGC GCGTGCGGAA AAACTGGGGA TCAACCCTGA TTTTTATGAA AAACGCGACA TCCACGTCCA CGTACCGGAA GGTGCGACGC CGAAAGATGG TCCGAGTGCC GGTATTGCTA TGTGCACCGC GCTGGTTTCT TGCCTGACCG GTAACCCGGT TCGTGCCGAT GTGGCAATGA CCGGTGAGAT CACTCTGCGT GGTCAGGTAC TGCCTATCGG TGGTTTGAAA GAAAAACTAC TGGCAGCGCA TCGCGGCGGG ATTAAAACAG TGTTAATTCC GTTCGAAAAT AAACGCGATC TGGAAGAGAT TCCTGACAAC GTAATTGCCG ATCTGGATAT TCATCCTGTG AAACGCATTG AGGAAGTTCT GACTCTGGCG CTGCAAAATG AACCGTCTGG CATGCAGGTT GTGACTGCAA AATAG
Protein sequence | MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKKI MLVAQKEAST DEPGVNDLFT VGTVASILQM LKLPDGTVKV LVEGLQRARI SALSDNGEHF SAKAEYLESP TIDEREQEVL VRTAISQFEG YIKLNKKIPP EVLTSLNSID DPARLADTIA AHMPLKLADK QSVLEMSDVN ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE LGEMDDAPDE NEALKRKIDA AKMPKEAKEK AEAELQKLKM MSPMSAEATV VRGYIDWMVQ VPWNARSKVK KDLRQAQEIL DTDHYGLERV KDRILEYLAV QSRVNKIKGP ILCLVGPPGV GKTSLGQSIA KATGRKYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF LLDEIDKMSS DMRGDPASAL LEVLDPEQNV AFSDHYLEVD YDLSDVMFVA TSNSMNIPAP LLDRMEVIRL SGYTEDEKLN IAKRHLLPKQ IERNALKKGE LTVDDSAIIG IIRYYTREAG VRGLEREISK LCRKAVKQLL LDKSLKHIEI NGDNLHDYLG VQRFDYGRAD NENRVGQVTG LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAE KLGINPDFYE KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GQVLPIGGLK EKLLAAHRGG IKTVLIPFEN KRDLEEIPDN VIADLDIHPV KRIEEVLTLA LQNEPSGMQV VTAK
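The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below strips the whitespace, translates DNA codon by codon, and computes a GC fraction. It assumes the standard codon assignments, which translation table 11 shares for elongation. Table 11 differs mainly in the start codons it permits, which does not matter here because this gene begins with ATG. The helper names are hypothetical, and the code is not the database's own pipeline.

```python
# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGAATCCTG AGCGTTCTGA ACGCATTGAA"
print(translate(first_blocks))  # MNPERSERIE
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to cross-check the Protein sequence and GC content fields.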