Gene ECH74115_5675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5675 
Symbol 
ID6967429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5314972 
End bp5318295 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content56% 
IMG OID643389308 
Producthypothetical protein 
Protein accessionYP_002273704 
Protein GI209400410 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3264] Small-conductance mechanosensitive channel 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00234961 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.548719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG 
ACGGCCCCCG ATAGCAAACA AATCACCCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC
GCGCAGCCAG AAGTCGTAGA GGCACTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA
GGTTCCCTTG AGCGCATCAA GCAATATCAA GAAGTCATTG ATAATTATCC GAAACTCTCC
GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGG
ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGTCAGTT GCTGGATAAA
AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA
CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGG
ACGCTTACCG GCAATACTCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCT
GCACGTCTTA AGGCGCTCGT TGATGAACTG GAGCTGGCGC AGCTGTCTGC CAATAACCGC
CAGGAATTAG CGCGCTTGCG CTCTGAGCTG GCTGAAAAAG AGAGCCAGCA ACTGGATGCG
TATTTGCAGG CCTTGCGTAA TCAATTGAAC AGCCAACGCC AGCTTGAGGC AGAGCGGGCG
CTGGAAAGTA CCGAATTACT AGCAGAAAAC AGTGCCGATT TGCCGAAAGA TATCGTCGCG
CAATTCAAAA TTAACCGCGA ACTATCGGCG GCGCTGAATC AACAGGCGCA GCGGATGGAT
CTCGTTGCCT CGCAACAGCG TCAGGCTGCC AGCCAGACGT TACAGGTCCG GCAGGCGTTG
AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ATCTGCTCGG TGAAGCGTTG
CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG
GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA
CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACCGCTG AGCAAAACCG TATTCTGGAA
GCACAGCTAC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGATACGCTA
CTGCTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TGGAAGATGC GCTGAAAGAA
GTGAACGAAG CGACCCACCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC
GCCTGGCCGC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT
CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACGATTTT GCCGCTGTTT
GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT
GAACGTTCGG CGGCGAAAGT CGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC
ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGCTGCCGG TGCTGTGGAT GACGCTGGGT
TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGTGA TGGCGTCACA
GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCTA CCTTTGCCCG CCCGAACGGC
TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CTCGTGGGAT GCGCTACTAC
CTGATGAGCA TCGGGCTTAT TGTGCCACTG ATTATGGCGC TGATGATGTT CGATAACCTC
GACGACCGCG AATTTTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG
CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATCC CGCTGTATCT CAACAAAGAA
GGCAGCGGCG ACAATATTAC CAACCATATG TTGTGGAACA TGATGATTGG CGCGCCACTG
GTTGCCATTC TGGCGTCGGC GGTGGGTTAT CTGGCAACGG CACAGGCGCT GTTAGCGAGG
CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC
TGGATGCTGA TCCAACGACG CAGGCTGGCG TTTGACCGGG CGAAGCATCG CCGGGCAGAG
ATGTTAGCGC AACGTGCGCG TGGCGAAGAG GAAGCACATC ATCACAGTAG CCCGGAAGGG
GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA GTCCTTGCGG
CTAGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCGG TCATTGTGCT GTGGTCAGAA
ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA
CAGGGCGTAG AAAGTCTGGA GCCGATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG
TTTATCATCA CCACGCAGCT GGTGCGCAAT CTGCCCGCGC TGCTGGAACT GGCGATTTTG
CAGCACCTGG ATTTAACGCC GGGTACCGGC TACGCCATCA CCACCATCAC CAAATATCTG
CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCGATGATTG GTATTGAGTG GTCGAAATTG
CAGTGGCTGG TCGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA AATTTTCGCC
AACTTTATCT CTGGCCTGAT CATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTG
ACAATTCGCG ATCTTACTGG TAGCGTGACG AAAATTAATA CCCGCGCAAC CACCATCAGC
GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC
AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATTCCGGC CCCTGCCGAT
GCCAACAGTG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG
ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC
GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC
CAGCTGATTC TGGCTGGCTT CCATGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG
ATGCGTCTGG AAAGCCTCAA CGGTAAACAA ACGGGGAGAA CGCTGACGTC TGCGGGCAAA
GGTCGTCAGG CGGGAAGTTT GTAA
 
Protein sequence
MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK 
GSLERIKQYQ EVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK
SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS
ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA
LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LVASQQRQAA SQTLQVRQAL
NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL
RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE
VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF
GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG
YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY
LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE
GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR
WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR
LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV
FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL
QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS
DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV
IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ
MRLESLNGKQ TGRTLTSAGK GRQAGSL