Gene ECH74115_4562 details

Gene Information

Locus tag: ECH74115_4562
Symbol:
ID: 6969913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4229185
End bp: 4232985
Gene Length: 3801 bp
Protein Length: 1266 aa
Translation table: 11
GC content: 54%
IMG OID: 643388273
Product: hypothetical protein
Protein accession: YP_002272708
Protein GI: 209398580
COG category: [S] Function unknown
COG ID: [COG3164] Predicted membrane protein
TIGRFAM ID: [TIGR02099] conserved hypothetical protein TIGR02099
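
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The short sketch below checks the arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon. Both are assumptions that happen to reproduce the values in this record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4229185
end_bp = 4232985

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3801 1266
```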


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0190737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC 
CTGCTGGTTA GCGGCCTGCG TATTGCTTTA CCGCATCTTG ACGCCTGGCG TCCGGAAATC
CTCAACAAAA TAGAATCCGC GACTGGCATG CCGGTAGAAG CCAGTCAGCT CTCAGCCAGC
TGGCAGAATT TTGGCCCGAC GCTTGAAGCG CACGACATCC GTGCAGAACT AAAGGATGGC
GGCGAATTTT CGGTTAAACG CGTTACTCTG GCGCTGGATG TCTGGCAGAG CCTGTTACAT
ATGCGCTGGC AGTTTCGCGA CCTCACTTTC TGGCAGCTGC GCTTTCGCAC CAACACTCCT
ATCACCAGCG GTGGTAGCGA TGACAGTCTG GAAGCCAGTC ACATCAGCGA TCTGTTTCTT
CGTCAATTTG ACCATTTCGA TCTCCGCGAC AGTGAAGTCA GTTTCCTGAC GCCATCCGGT
CAGCGCGCCG AGCTGGCGAT CCCACAACTC ACCTGGCTGA ACGATCCACG TCGACACCGT
GCGGAAGGCC TGGTAAGCCT CTCCAGCCTT ACCGGACAGC ACGGCGTGAT GCAGGTGCGC
ATGGATTTGC GCGATGATGA GGGGTTGTTA AGCAATGGTC GCGTCTGGCT CCAGGCGGAT
GACATCGACC TGAAGCCGTG GCTCGGTAAA TGGATGCAGG ACAATATTGC GCTGGAAACG
GCGCAGTTCT CCCTTGAAGG CTGGATGACG ATCGACAAAG GCGATGTAAC CGGCGGTGAC
GTCTGGCTGA AACAGGGTGG TGCCAGCTGG TTGGGCGAGA AGGAAACGCA TACGCTGTCG
GTGGATAATC TGACCGCGCA TATTACGCGT GAAAATCCGG GATGGCAGTT CTCTATTCCC
GATACACGGA TCACGATGGA CGGCAAACCC TGGCCGAGCG GAGCATTGAC GCTGGCCTGG
ATACCGGAAC AGGACGTTGG CGGCAAAGAC AATAAACGCA GTGACGAACT CCGGATTCGC
GCCAGTAATC TGGAGCTGGC AGGCCTGGAG GGCATACGCC CGCTGGCCGC GAAACTTTCA
CCTGCACTGG GTGATGTTTG GCGCTCTACA CAACCGAGCG GCAAGATTAA CACTCTGGCG
CTGGATATCC CGCTTCAGGC GGCAGACAAG ACCCGTTTTC AGGCATCGTG GAGCGATCTG
GCCTGGAAGC AATGGAAATT ATTACCGGGT GCGGAACACT TCTCCGGGAC GCTTTCCGGC
AGCGTTGAAA ATGGTTTGCT TACCGCGTCG ATGAAGCAGG CAAAGATGCC TTACGAAACG
GTATTCCGTG CGCCACTGGA AATCGCCGAC GGCCAGGCAA CTATAAGCTG GCTGAACAAT
GACAAAGGTT TCCAGCTGGA TGGGCGTAAT ATTGACGTTA AAGCCAAAGC CGTCCATGCG
CGCGGCGGTT TTCGTTACCT GCAACCTGCT AACGATGAAC CCTGGCTGGG TATTCTGGCT
GGCATCAGTA CCGATGATGG TTCACAAGCC TGGCGCTATT TCCCGGAAAA CTTGATGGGT
AAAGACCTGG TTGATTACTT AAGTGGCGCG ATTCAGGGCG GTGAAGCGGA TAACGCGACG
CTGGTTTATG GTGGCAATCC GCAACTCTTC CCCTATAAAC ACAACGAAGG TCAGTTTGAA
GTGCTGGTGC CGCTGCGCAA CGCGAAGTTT GCCTTCCAGC CGGACTGGCC TGCATTAACT
AACCTTGGTA TTGAACTGGA CTTTATTAAC GACGGTTTAT GGATGAAAAC CGATGGCGTT
AATCTGGGCG GCGTGCGCGC GAGTAATCTC ACCGCAGTGA TCCCTGACTA CTCAAAAGAA
AAACTGCTGA TTGACGCTGA CATTAAAGGT CCGGGTAAAG CCGTTGGCCC TTACTTTGAT
GAGACACCGC TGAAAGATTC TCTGGGTGCG ACCCTGCAAG AACTCCAGCT CGACGGCGAT
GTGAATGCTC GCTTACATCT TGATATCCCG CTGAACGGCG AGCTGGTAAC CGCGAAAGGT
GAAGTGACGC TGCGTAATAA CAGTCTGTTT ATCAAACCAC TCGACAGCAC CCTGAAAAAT
TTGAGCGGTA AATTCAGCTT TATCAATGGC GATCTGCAAA GTGAACCACT GACAGCAAGC
TGGTTTAATC AGCCGTTGAA CGTGGATTTT TCCACCAAAG AAGGGGCAAA AGCCTACCAG
GTCGCGGTGA ATCTCAACGG TAACTGGCAA CCGGCGAAAA CCGGCGTTCT GCCTGCAGCG
GTGAACGAAG CATTGAGTGG CAGCGTGGCG TGGGATGGTA AAGTGGGCAT TGTTCTGCCT
TATCATGCTG GCGCGACGTA TAACGTAGAG CTAAACGGCG ATTTGAAGAA TGTGAGCAGT
CACTTACCTT CACCGTTAGC CAAACCTGCG GGTGAACCAC TGCCGGTAAA CGTTAAGGTT
GATGGCAATC TCAACAGCTT TGAATTAACC GGACAGGCTG GTGCGGATAA TCATTTCAAT
AGCCGCTGGT TGCTCGGTCA AAAGCTGACG CTCGACCGTG CTATTTGGGC GGCAGACAGT
AAAACGCTCC CGCCGTTGCC GGAACAAAGT GGTGTTGAAC TCAATATGCC GCCGATGAAT
GGTGCCGAGT GGCTGGCCCT GTTTCAGAAA GGTGCGGCGG AGAGTGTCGG TGGTGCAGCG
AGTTTCCCAC AACACATAAC GTTACGTACG CCTATGTTGT CGCTGGGAAA TCAGCAATGG
AATAACCTGA GTATTGTTTC GCAACCGACG GCAAATGGCA CCCAGGTTGA GGCGCAGGGG
CGTGAAATCA ACGCCACGCT GGCGATGCGT AATAACGCGC CGTGGCTGGC GAATATCAAA
TATCTTTATT ACAACCCGAG CGTGGCGAAA ACTCGTGGTG ATTCAACGCC GTCATCACCT
TTCCCGACAA CGGAACGCAT TAACTTCCGT GGCTGGTCGG ACGCCCAAAT ACGATGCGCA
GAGTGCTGGT TCTGGGGGCA AAAATTCGGG CGCATTGACA GTGATATCAC CATTTCTGGC
AATACATTAA CGCTGACCAA TGGACTGATT GATACTGGTT TTTCGCGGCT CACTGCCGAT
GGTGAATGGA TTAACAATCC GGGGAATGAA CGTACCTCGC TGAAAGGAAA ACTGCGCGGG
CAGAAAATTG ATGCCGCCGC AGAATTTTTT GGTGTCACGA CGCCCATACG GCAGTCGTCA
TTTAATGTGG ATTACGATTT ACACTGGCGT AAAGCACCGT GGCAGCCAGA TGAGGCGACG
TTGAATGGCA TCATTCATAC TCAACTGGGT AAAGGCGAAA TTACCGAAAT CAATACCGGA
CATGCCGGGC AATTGCTGCG CTTATTGAGC GTAGATGCCC TGATGCGTAA GCTGCGTTTT
GATTTCAGAG ACACTTTTGG CGAAGGGTTC TATTTTGACT CCATTCGCAG CACCGCGTGG
ATTAAAGACG GCGTTATGCA CACCGACGAC ACGCTGGTGG ATGGCCTGGA GGCGGATATC
GCCATGAAAG GGTCGGTAAA TCTGGTACGT CGCGACCTGA ATATGGAAGC GGTTGTCGCA
CCAGAGATTT CTGCGACAGT GGGCGTGGCT GCGGCTTTTG CGGTTAACCC CATTGTTGGC
GCGGCAGTGT TTGCCGCCAG TAAAGTGCTG GGGCCGCTGT GGAGCAAAGT CTCCATTTTG
CGCTATCACA TTTCGGGTCC GCTGGACGAT CCGCAAATCA ACGAAGTGTT GCGCCAACCG
CGTAAAGAAA AAGCGCAATG A
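
The GC content reported for this gene is the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows. The function name `gc_percent` is hypothetical, and the example uses only the first 60-base row of the gene sequence, so its result will not necessarily equal the 54% reported for the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The record prints bases in space-separated groups of ten,
    # so strip all whitespace before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence, with all rows concatenated, would give the gene-level figure.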
 
Protein sequence
MRRLPGILLL TGAALVVIAA LLVSGLRIAL PHLDAWRPEI LNKIESATGM PVEASQLSAS 
WQNFGPTLEA HDIRAELKDG GEFSVKRVTL ALDVWQSLLH MRWQFRDLTF WQLRFRTNTP
ITSGGSDDSL EASHISDLFL RQFDHFDLRD SEVSFLTPSG QRAELAIPQL TWLNDPRRHR
AEGLVSLSSL TGQHGVMQVR MDLRDDEGLL SNGRVWLQAD DIDLKPWLGK WMQDNIALET
AQFSLEGWMT IDKGDVTGGD VWLKQGGASW LGEKETHTLS VDNLTAHITR ENPGWQFSIP
DTRITMDGKP WPSGALTLAW IPEQDVGGKD NKRSDELRIR ASNLELAGLE GIRPLAAKLS
PALGDVWRST QPSGKINTLA LDIPLQAADK TRFQASWSDL AWKQWKLLPG AEHFSGTLSG
SVENGLLTAS MKQAKMPYET VFRAPLEIAD GQATISWLNN DKGFQLDGRN IDVKAKAVHA
RGGFRYLQPA NDEPWLGILA GISTDDGSQA WRYFPENLMG KDLVDYLSGA IQGGEADNAT
LVYGGNPQLF PYKHNEGQFE VLVPLRNAKF AFQPDWPALT NLGIELDFIN DGLWMKTDGV
NLGGVRASNL TAVIPDYSKE KLLIDADIKG PGKAVGPYFD ETPLKDSLGA TLQELQLDGD
VNARLHLDIP LNGELVTAKG EVTLRNNSLF IKPLDSTLKN LSGKFSFING DLQSEPLTAS
WFNQPLNVDF STKEGAKAYQ VAVNLNGNWQ PAKTGVLPAA VNEALSGSVA WDGKVGIVLP
YHAGATYNVE LNGDLKNVSS HLPSPLAKPA GEPLPVNVKV DGNLNSFELT GQAGADNHFN
SRWLLGQKLT LDRAIWAADS KTLPPLPEQS GVELNMPPMN GAEWLALFQK GAAESVGGAA
SFPQHITLRT PMLSLGNQQW NNLSIVSQPT ANGTQVEAQG REINATLAMR NNAPWLANIK
YLYYNPSVAK TRGDSTPSSP FPTTERINFR GWSDAQIRCA ECWFWGQKFG RIDSDITISG
NTLTLTNGLI DTGFSRLTAD GEWINNPGNE RTSLKGKLRG QKIDAAAEFF GVTTPIRQSS
FNVDYDLHWR KAPWQPDEAT LNGIIHTQLG KGEITEINTG HAGQLLRLLS VDALMRKLRF
DFRDTFGEGF YFDSIRSTAW IKDGVMHTDD TLVDGLEADI AMKGSVNLVR RDLNMEAVVA
PEISATVGVA AAFAVNPIVG AAVFAASKVL GPLWSKVSIL RYHISGPLDD PQINEVLRQP
RKEKAQ
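
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first row of the gene sequence. The function name `translate_cds` is hypothetical, and forcing the first residue to M is a simplification that assumes the input begins at the start codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon assignments as the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, ignoring whitespace between bases."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        # Assumption: the sequence starts at the initiation codon,
        # which is read as Met even when it is GTG.
        protein[0] = "M"
    return "".join(protein).rstrip("*")  # remove the terminal stop


# First row of the gene sequence, copied from the record.
first_row = "GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCTGCC"
print(translate_cds(first_row))  # prints: MRRLPGILLLTGAALVVIAA
```

The output matches the first 20 residues of the protein sequence above.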