Gene EcSMS35_0482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0482 
Symbollon 
ID6146991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp487190 
End bp489544 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content52% 
IMG OID641615376 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_001742583 
Protein GI170682239 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000167472 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCTG AGCGTTCTGA ACGCATTGAA ATCCCCGTAT TGCCGCTGCG CGATGTGGTG 
GTTTATCCGC ACATGGTCAT CCCCTTATTT GTCGGGCGGG AAAAATCTAT CCGTTGTCTG
GAAGCGGCGA TGGACCATGA TAAAAAAATT ATGCTGGTCG CGCAGAAAGA AGCTTCAACG
GATGAGCCGG GTGTAAACGA TCTTTTCACC GTCGGGACCG TGGCCTCTAT ATTGCAGATG
CTGAAACTGC CTGACGGCAC CGTCAAAGTG CTGGTCGAGG GGTTACAGCG CGCGCGTATT
TCTGCGCTCT CTGACAATGG CGAACACTTT TCTGCGAAGG CGGAGTATCT GGAGTCGCCG
ACCATTGATG AGCGGGAACA GGAAGTGCTG GTGCGTACTG CAATCAGCCA GTTCGAAGGC
TACATCAAGC TGAACAAAAA AATTCCACCA GAAGTGCTGA CGTCGCTGAA TAGCATCGAC
GATCCGGCGC GTCTGGCGGA TACCATTGCT GCACATATGC CGCTGAAACT GGCTGACAAA
CAGTCCGTTC TGGAGATGTC CGACGTTAAC GAACGTCTGG AATATCTGAT GGCAATGATG
GAATCGGAAA TCGATCTGCT GCAGGTTGAG AAACGCATTC GCAACCGTGT TAAAAAGCAG
ATGGAGAAAT CCCAGCGTGA GTACTATCTG AACGAGCAAA TGAAAGCTAT TCAGAAAGAA
CTCGGTGAAA TGGACGACGC GCCGGACGAA AACGAAGCCC TGAAGCGCAA AATCGACGCG
GCGAAAATGC CGAAAGAGGC AAAAGAGAAA GCGGAAGCAG AGTTGCAGAA GCTGAAAATG
ATGTCTCCGA TGTCGGCAGA AGCGACCGTA GTGCGTGGTT ATATCGACTG GATGGTACAG
GTACCGTGGA ATGCGCGTAG CAAGGTCAAA AAAGACCTGC GTCAGGCGCA GGAAATCCTT
GATACCGACC ATTATGGTCT GGAGCGCGTG AAAGATCGCA TCCTTGAGTA TCTTGCGGTT
CAAAGCCGTG TCAACAAAAT CAAGGGACCG ATCCTCTGCC TGGTAGGGCC ACCGGGGGTA
GGTAAAACGT CCCTGGGGCA GTCCATCGCC AAAGCCACCG GGCGTAAATA TGTCCGTATG
GCGCTGGGCG GCGTGCGTGA TGAAGCGGAA ATCCGTGGTC ACCGCCGTAC TTACATCGGT
TCTATGCCGG GTAAACTGAT CCAGAAAATG GCGAAAGTGG GCGTGAAAAA CCCGCTGTTC
CTGCTCGATG AGATCGACAA AATGTCTTCT GACATGCGAG GCGATCCGGC TTCCGCACTG
CTTGAAGTGC TGGATCCAGA GCAGAACGTA GCGTTCAGCG ACCACTACCT GGAAGTGGAT
TACGACCTTA GCGACGTGAT GTTTGTCGCG ACGTCGAACT CCATGAACAT TCCGGCACCG
CTGCTGGATC GTATGGAAGT GATTCGCCTC TCCGGTTATA CCGAAGATGA AAAACTGAAC
ATCGCCAAAC GTCACCTGCT GCCGAAGCAG ATTGAACGTA ATGCACTGAA AAAAGGTGAG
CTGACCGTCG ACGATAGCGC CATTATCGGC ATTATTCGTT ACTACACCCG TGAGGCGGGC
GTGCGTGGTC TGGAGCGTGA AATCTCCAAA CTGTGTCGCA AAGCGGTTAA GCAGTTACTG
CTCGATAAGT CATTAAAACA TATCGAAATT AACGGCGATA ACCTGCATGA CTATCTCGGT
GTTCAGCGTT TCGACTATGG TCGCGCGGAT AACGAAAACC GTGTCGGTCA GGTAACTGGT
CTGGCGTGGA CGGAAGTGGG CGGTGACTTG CTGACCATTG AAACCGCGTG CGTTCCGGGT
AAAGGCAAAC TGACCTATAC CGGATCGCTT GGCGAAGTGA TGCAGGAGTC CATTCAGGCG
GCGTTAACGG TGGTTCGTGC GCGTGCGGAA AAACTGGGGA TCAACCCTGA TTTTTATGAA
AAACGCGACA TCCACGTCCA CGTACCGGAA GGTGCGACGC CGAAAGATGG TCCGAGTGCC
GGTATTGCTA TGTGCACCGC GCTGGTTTCT TGCCTGACCG GTAACCCGGT TCGTGCCGAT
GTGGCAATGA CCGGTGAGAT CACTCTGCGT GGTCAGGTAC TGCCTATCGG TGGTTTGAAA
GAAAAACTAC TGGCAGCGCA TCGCGGCGGG ATTAAAACAG TGTTAATTCC GTTCGAAAAT
AAACGCGATC TGGAAGAGAT TCCTGACAAC GTAATTGCCG ATCTGGATAT TCATCCTGTG
AAACGCATTG AGGAAGTTCT GACTCTGGCG CTGCAAAATG AACCGTCTGG CATGCAGGTT
GTGACTGCAA AATAG
 
Protein sequence
MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKKI MLVAQKEAST 
DEPGVNDLFT VGTVASILQM LKLPDGTVKV LVEGLQRARI SALSDNGEHF SAKAEYLESP
TIDEREQEVL VRTAISQFEG YIKLNKKIPP EVLTSLNSID DPARLADTIA AHMPLKLADK
QSVLEMSDVN ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE
LGEMDDAPDE NEALKRKIDA AKMPKEAKEK AEAELQKLKM MSPMSAEATV VRGYIDWMVQ
VPWNARSKVK KDLRQAQEIL DTDHYGLERV KDRILEYLAV QSRVNKIKGP ILCLVGPPGV
GKTSLGQSIA KATGRKYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF
LLDEIDKMSS DMRGDPASAL LEVLDPEQNV AFSDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IAKRHLLPKQ IERNALKKGE LTVDDSAIIG IIRYYTREAG
VRGLEREISK LCRKAVKQLL LDKSLKHIEI NGDNLHDYLG VQRFDYGRAD NENRVGQVTG
LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAE KLGINPDFYE
KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GQVLPIGGLK
EKLLAAHRGG IKTVLIPFEN KRDLEEIPDN VIADLDIHPV KRIEEVLTLA LQNEPSGMQV
VTAK