Gene ECH74115_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0525 
Symbollon 
ID6969992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp528481 
End bp530880 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content52% 
IMG OID643384572 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_002269086 
Protein GI209399962 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000828536 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.526776 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCATCTG ATTACCTGGC GGAAATTAAA CTAAGAGAGA GCTCTATGAA TCCTGAGCGT 
TCTGAACGCA TTGAAATCCC CGTATTGCCG CTGCGCGATG TGGTGGTTTA TCCGCACATG
GTCATCCCCT TATTTGTCGG GCGGGAAAAA TCTATCCGTT GTCTGGAAGC GGCGATGGAC
CATGATAAAA AAATTATGCT GGTCGCGCAG AAAGAAGCTT CAACGGATGA GCCGGGTGTA
AACGATCTTT TCACCGTCGG GACCGTGGCC TCTATATTGC AGATGCTGAA ACTGCCTGAC
GGCACCGTCA AAGTGCTGGT CGAGGGGTTA CAGCGCGCGC GTATTTCTGC GCTCTCTGAC
AATGGCGAAC ACTTTTCTGC GAAGGCGGAG TATCTGGAGT CGCCGACCAT TGATGAGCGA
GAACAGGAAG TGCTGGTGCG TACTGCAATC AGCCAGTTCG AAGGCTACAT CAAGCTGAAC
AAAAAAATCC CACCAGAAGT GCTGACGTCG TTGAATAGCA TCGACGATCC GGCGCGTCTG
GCGGATACCA TTGCTGCACA TATGCCGCTG AAACTGGCTG ACAAACAGTC CGTTCTGGAG
ATGTCCGACG TTAACGAACG TCTGGAATAT CTGATGGCAA TGATGGAATC GGAAATCGAT
CTGCTGCAGG TTGAGAAACG CATTCGCAAC CGCGTTAAAA AGCAGATGGA GAAATCCCAG
CGTGAGTACT ATCTGAACGA GCAAATGAAA GCTATTCAGA AAGAACTCGG TGAGATGGAC
GACGCGCCGG ACGAAAACGA AGCCCTGAAG CGCAAAATCG ACGCGGCGAA GATGCCGAAA
GAGGCAAAAG AGAAAGCGGA AGCAGAGTTG CAGAAGCTGA AAATGATGTC TCCGATGTCG
GCAGAAGCGA CCGTAGTGCG TGGTTATATC GACTGGATGG TACAGGTACC GTGGAATGCG
CGCAGCAAGG TCAAAAAAGA CCTGCGTCAG GCGCAGGAAA TCCTTAATAC CGACCATTAT
GGTCTGGAGC GCGTGAAAGA TCGCATCCTT GAGTATCTTG CGGTTCAAAG CCGTGTCAAC
AAAATCAAGG GACCGATCCT TTGCCTGGTA GGGCCGCCGG GGGTAGGTAA AACCTCCCTG
GGTCAGTCCA TTGCCAAAGC CACCGGGCGT AAATATGTCC GTATGGCGCT GGGCGGCGTG
CGTGATGAAG CGGAAATCCG TGGTCACCGC CGTACTTACA TCGGTTCTAT GCCGGGTAAA
TTGATCCAGA AAATGGCGAA AGTGGGCGTT AAAAACCCGC TGTTCCTGCT CGATGAGATC
GACAAAATGT CTTCTGACAT GCGTGGCGAT CCGGCTTCCG CACTGCTTGA AGTGCTGGAT
CCAGAGCAGA ACGTGGCCTT CAGCGATCAC TACCTGGAAG TGGATTACGA TCTCAGCGAC
GTGATGTTTG TCGCGACGTC GAACTCCATG AACATTCCGG CACCACTGCT GGATCGTATG
GAAGTGATTC GCCTCTCCGG TTATACCGAA GATGAAAAAC TGAACATCGC CAAACGTCAC
CTGCTGCCGA AGCAGATTGA ACGTAATGCA CTGAAAAAAG GTGAGCTGAC CGTCGACGAT
AGCGCCATTA TCGGCATTAT TCGTTACTAC ACCCGTGAGG CGGGCGTGCG TGGTCTGGAG
CGTGAAATCT CCAAGCTGTG CCGTAAAGCG GTTAAGCAGT TACTGCTCGA TAAGTCATTA
AAACATATCG AAATTAACGG CGACAACCTG CATGACTACC TTGGTGTTCA GCGTTTCGAC
TATGGTCGCG CTGATAACGA AAACCGTGTC GGTCAGGTAA CCGGTCTGGC GTGGACGGAA
GTGGGCGGTG ACTTGCTGAC CATTGAAACC GCATGTGTTC CGGGTAAAGG CAAACTGACC
TATACCGGTT CGCTCGGCGA AGTGATGCAG GAGTCTATTC AGGCTGCGTT AACGGTGGTT
CGCGCGCGTG CGGAAAAACT GGGGATCAAC CCTGATTTTT ACGAAAAACG TGACATCCAC
GTCCACGTAC CGGAAGGTGC GACGCCGAAA GATGGTCCGA GTGCCGGTAT TGCTATGTGC
ACCGCGCTGG TTTCTTGCCT GACCGGTAAC CCGGTTCGTG CCGATGTGGC AATGACCGGT
GAGATCACTC TGCGTGGTCA GGTACTGCCG ATCGGTGGTT TGAAAGAAAA ACTACTGGCA
GCGCATCGCG GCGGGATTAA AACAGTGTTA ATTCCGTTCG AAAATAAACG CGATCTGGAA
GAGATTCCTG ACAACGTAAT TGCCGATCTG GACATTCATC CTGTGAAGCG CATTGAGGAA
GTTCTGACTC TGGCGCTGCA AAATGAACCG TCTGGCATGC AGGTTGTGAC TGCAAAATAG
 
Protein sequence
MSSDYLAEIK LRESSMNPER SERIEIPVLP LRDVVVYPHM VIPLFVGREK SIRCLEAAMD 
HDKKIMLVAQ KEASTDEPGV NDLFTVGTVA SILQMLKLPD GTVKVLVEGL QRARISALSD
NGEHFSAKAE YLESPTIDER EQEVLVRTAI SQFEGYIKLN KKIPPEVLTS LNSIDDPARL
ADTIAAHMPL KLADKQSVLE MSDVNERLEY LMAMMESEID LLQVEKRIRN RVKKQMEKSQ
REYYLNEQMK AIQKELGEMD DAPDENEALK RKIDAAKMPK EAKEKAEAEL QKLKMMSPMS
AEATVVRGYI DWMVQVPWNA RSKVKKDLRQ AQEILNTDHY GLERVKDRIL EYLAVQSRVN
KIKGPILCLV GPPGVGKTSL GQSIAKATGR KYVRMALGGV RDEAEIRGHR RTYIGSMPGK
LIQKMAKVGV KNPLFLLDEI DKMSSDMRGD PASALLEVLD PEQNVAFSDH YLEVDYDLSD
VMFVATSNSM NIPAPLLDRM EVIRLSGYTE DEKLNIAKRH LLPKQIERNA LKKGELTVDD
SAIIGIIRYY TREAGVRGLE REISKLCRKA VKQLLLDKSL KHIEINGDNL HDYLGVQRFD
YGRADNENRV GQVTGLAWTE VGGDLLTIET ACVPGKGKLT YTGSLGEVMQ ESIQAALTVV
RARAEKLGIN PDFYEKRDIH VHVPEGATPK DGPSAGIAMC TALVSCLTGN PVRADVAMTG
EITLRGQVLP IGGLKEKLLA AHRGGIKTVL IPFENKRDLE EIPDNVIADL DIHPVKRIEE
VLTLALQNEP SGMQVVTAK