Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5675 |
Symbol | |
ID | 6967429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5314972 |
End bp | 5318295 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389308 |
Product | hypothetical protein |
Protein accession | YP_002273704 |
Protein GI | 209400410 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3264] Small-conductance mechanosensitive channel |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00234961 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.548719 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG ACGGCCCCCG ATAGCAAACA AATCACCCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC GCGCAGCCAG AAGTCGTAGA GGCACTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA GGTTCCCTTG AGCGCATCAA GCAATATCAA GAAGTCATTG ATAATTATCC GAAACTCTCC GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGG ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGTCAGTT GCTGGATAAA AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGG ACGCTTACCG GCAATACTCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCT GCACGTCTTA AGGCGCTCGT TGATGAACTG GAGCTGGCGC AGCTGTCTGC CAATAACCGC CAGGAATTAG CGCGCTTGCG CTCTGAGCTG GCTGAAAAAG AGAGCCAGCA ACTGGATGCG TATTTGCAGG CCTTGCGTAA TCAATTGAAC AGCCAACGCC AGCTTGAGGC AGAGCGGGCG CTGGAAAGTA CCGAATTACT AGCAGAAAAC AGTGCCGATT TGCCGAAAGA TATCGTCGCG CAATTCAAAA TTAACCGCGA ACTATCGGCG GCGCTGAATC AACAGGCGCA GCGGATGGAT CTCGTTGCCT CGCAACAGCG TCAGGCTGCC AGCCAGACGT TACAGGTCCG GCAGGCGTTG AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ATCTGCTCGG TGAAGCGTTG CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACCGCTG AGCAAAACCG TATTCTGGAA GCACAGCTAC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGATACGCTA CTGCTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TGGAAGATGC GCTGAAAGAA GTGAACGAAG CGACCCACCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC GCCTGGCCGC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACGATTTT GCCGCTGTTT GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT GAACGTTCGG CGGCGAAAGT CGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGCTGCCGG TGCTGTGGAT GACGCTGGGT TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGTGA TGGCGTCACA GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCTA CCTTTGCCCG CCCGAACGGC TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CTCGTGGGAT GCGCTACTAC CTGATGAGCA TCGGGCTTAT TGTGCCACTG ATTATGGCGC TGATGATGTT CGATAACCTC GACGACCGCG AATTTTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATCC CGCTGTATCT CAACAAAGAA GGCAGCGGCG ACAATATTAC CAACCATATG TTGTGGAACA TGATGATTGG CGCGCCACTG GTTGCCATTC TGGCGTCGGC GGTGGGTTAT CTGGCAACGG CACAGGCGCT GTTAGCGAGG CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC TGGATGCTGA TCCAACGACG CAGGCTGGCG TTTGACCGGG CGAAGCATCG CCGGGCAGAG ATGTTAGCGC AACGTGCGCG TGGCGAAGAG GAAGCACATC ATCACAGTAG CCCGGAAGGG GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA GTCCTTGCGG CTAGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCGG TCATTGTGCT GTGGTCAGAA ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA CAGGGCGTAG AAAGTCTGGA GCCGATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG TTTATCATCA CCACGCAGCT GGTGCGCAAT CTGCCCGCGC TGCTGGAACT GGCGATTTTG CAGCACCTGG ATTTAACGCC GGGTACCGGC TACGCCATCA CCACCATCAC CAAATATCTG CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCGATGATTG GTATTGAGTG GTCGAAATTG CAGTGGCTGG TCGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA AATTTTCGCC AACTTTATCT CTGGCCTGAT CATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTG ACAATTCGCG ATCTTACTGG TAGCGTGACG AAAATTAATA CCCGCGCAAC CACCATCAGC GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATTCCGGC CCCTGCCGAT GCCAACAGTG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC CAGCTGATTC TGGCTGGCTT CCATGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG ATGCGTCTGG AAAGCCTCAA CGGTAAACAA ACGGGGAGAA CGCTGACGTC TGCGGGCAAA GGTCGTCAGG CGGGAAGTTT GTAA
|
Protein sequence | MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK GSLERIKQYQ EVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LVASQQRQAA SQTLQVRQAL NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ MRLESLNGKQ TGRTLTSAGK GRQAGSL
|
| |