Gene Information |
Locus tag | ECH74115_5201 |
Symbol | |
ID | 6968874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4844989 |
End bp | 4846509 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388866 |
Product | putative ATP-dependent protease |
Protein accession | YP_002273286 |
Protein GI | 209397578 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0606] Predicted ATPase with chaperone activity |
TIGRFAM ID | [TIGR00368] Mg chelatase-related protein |
|
|
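The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are inclusive positions on the replicon and that the reported gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4844989
end_bp = 4846509

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1521 506
```

Both results agree with the Gene Length (1521 bp) and Protein Length (506 aa) rows.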
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTGT CAATTGTTCA TACCCGCGCA GCCCTGGGAG TAAATGCGCC CCCAATCACT GTTGAGGTAC ATATCAGTAA AGGTTTGCCC GGCTTAACGA TGGTGGGCTT ACCAGAAACA ACGGTAAAAG AAGCTCGCGA TCGCGTGCGC AGCGCCATTA TCAATAGCGG ATATGAATAT CCGGCGAAAA AAATCACCAT TAACCTTGCT CCAGCCGATC TACCAAAAGA AGGGGGACGA TATGATTTAC CTATCGCCAT TGCGTTGCTG GCGGCCTCAG AACAGCTTAC AGCCAATAAG TTAGATGAAT ATGAATTAGT CGGAGAACTG GCGCTTACAG GCGCTCTGCG TGGCGTTCCC GGTGCAATCT CCAGTGCAAC TGAAGCTATT AAGTCGGGCA GAAAAATTAT CGTCGCGAAA GATAACGAAG ATGAAGTGGG GCTAATTAAC GGTGAAGGAT GCCTGGTAGC CGATCATCTG CAAGCTGTCT GTGCGTTTCT GGAAGGTAAG CACGCTCTCG AACGCCCGAA ACCAACTGAT GCAGTATCCC GGGCGCTACA ACATGATCTC AGTGATGTTG TCGGTCAGGA GCAAGGAAAG CGAGGACTGG AAATTACCGC CGCTGGCGGG CACAACCTTT TACTGATTGG GCCGCCGGGA ACAGGTAAAA CAATGCTCGC CAGCCGTATT AATGGTCTTT TGCCAGATTT AAGCAATGAA GAGGCACTGG AGAGCGCTGC GATATTAAGT CTGGTAAATG CTGAATCAGT ACAAAAACAA TGGCGGCAGC GCCCGTTCCG CTCACCTCAT CACAGTGCGT CGTTAACGGC GATGGTGGGC GGTGGTGCAA TTCCAGGGCC AGGTGAAATT TCGTTGGCGC ATAACGGCGT GCTTTTTCTT GATGAGCTAC CTGAATTTGA ACGGCGTACA CTGGATGCTT TGCGAGAGCC GATTGAATCC GGGCAGATCC ATCTTTCACG CACACGAGCA AAAATAACCT ATCCAGCCCG TTTCCAGCTT GTTGCGGCGA TGAATCCCAG CCCTACCGGA CATTATCAGG GAAACCATAA CCGCTGCACG CCAGAACAGA CATTACGTTA TCTCAACCGG CTCTCGGGGC CCTTTCTCGA CCGCTTCGAT CTCTCACTGG AGATCCCATT ACCACCCCCC GGCATTTTGA GTAAAACGGT AGTGCCGGGA GAAAGCAGCA CCACCGTTAA ACAACGTGTA ATGGCTGCCA GAGAGCGCCA ATTTAAGCGG CAGAATAAGT TGAACGCCTG GCTGGATAGT CCGGAAATAC GCAAATTCTG CAAGCTTGAG AGCGAAGATG CGCAGTGGCT GGAAGAAACG CTGATCCATC TGGGGTTATC GATTCGTGCC TGGCAGCGGT TATTGAAAGT TGCACGAACC ATTGCTGATA TTGATCAGTC TGACATTATC ACACGTCAGC ATTTGCAGGA GGCAGTTAGC TATCGTGCGA TTGACCGTTT GCTCATCCAT CTGCAAAAAC TACTGACATA A
|
Protein sequence | MSLSIVHTRA ALGVNAPPIT VEVHISKGLP GLTMVGLPET TVKEARDRVR SAIINSGYEY PAKKITINLA PADLPKEGGR YDLPIAIALL AASEQLTANK LDEYELVGEL ALTGALRGVP GAISSATEAI KSGRKIIVAK DNEDEVGLIN GEGCLVADHL QAVCAFLEGK HALERPKPTD AVSRALQHDL SDVVGQEQGK RGLEITAAGG HNLLLIGPPG TGKTMLASRI NGLLPDLSNE EALESAAILS LVNAESVQKQ WRQRPFRSPH HSASLTAMVG GGAIPGPGEI SLAHNGVLFL DELPEFERRT LDALREPIES GQIHLSRTRA KITYPARFQL VAAMNPSPTG HYQGNHNRCT PEQTLRYLNR LSGPFLDRFD LSLEIPLPPP GILSKTVVPG ESSTTVKQRV MAARERQFKR QNKLNAWLDS PEIRKFCKLE SEDAQWLEET LIHLGLSIRA WQRLLKVART IADIDQSDII TRQHLQEAVS YRAIDRLLIH LQKLLT
|
| |
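The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline the database used. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGTCACTGT CAATTGTTCA TACCCGCGCA"))  # MSLSIVHTRA
# Last 21 bases of the gene sequence, including the TAA stop codon.
print(translate("CTGCAAAAAC TACTGACATA A"))  # LQKLLT
```

The two outputs match the first ten and last six residues of the listed protein sequence.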