Gene ECH74115_B0113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0113 
Symbol 
ID6966433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp72408 
End bp74129 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content48% 
IMG OID643384009 
Producthypothetical protein 
Protein accessionYP_002268488 
Protein GI209395618 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTAA ATACCGGACA GAACAGGCCT ACTTTCAGCT GGAGCGCGCT GGGATGGGCA 
ATTTTTTATT TTGGCTTTTT TTCCACTCTC CTGCAGGTCA TCATCTTCAG CAGTGGGTAC
AGCGGAACGA ATGGAATACG GGACTCACTG TTATTCAGTT GTCTGTGGTT GATCCCGGTG
TTTCTCTATC CTGATCGGAT AAAAATAATT GCAGCTGTTG TCGGTTTCAT TCTCTGGGGC
ACGTCGCTGG CAGCACTTTG TTATTATTTT CTCTATGGTC ATGAATTCTC TCAAAGTGTT
CTTTTCGTTA TGTTTGAAAC GAATGCCAGA GAGGCTGGTG AATATTTCAG CCAGTATTTC
AGCCTTAAAC TGTTGCTTAT CTCGCTGGTA TATACTGCGG TGTCCGTTTT TCTGTGGACA
CGTCTGCGTC CTGTATATAT TCCTTTGCCA TGGCGGAGAA TTGTCTCTTT CCTGCTGCTT
TATGCTCTGC TTCTGCATCC GGTTGTTCTG AAATCGTTAA TCAGACAGGA GCCGCTGAAT
GATACTCTAG GCAAACTGGC ATCCCGAATG GAGCCTGCTG CCCCCTGGCA GTTTGTATCC
AGTTATTATC AGTACCATCA GCAACTGAAT GCACTGACAA CCTTCCTGAA TGAAAATAGC
GCACTGCCAC CACTGGGTAA TCTCAGGGAT GAATCAGGGG AGAGACCACG CACACTGGTC
CTGGTGATTG GTGAGTCGAC ACAGCGCGAA CGCATGAGCT TATACGGGTA TCTACGTGAA
ACGACGCCGG AGCTGGATGC ACTGCGTAAA ACCGATCCGG GTCTTACTGT GTTTAATAAT
GTGGTGGCAT CGCGTCCGTA CACCATTGAA GCATTGCAAC AGGCCCTTAC TTTCGCCAAC
GAAAAGAATC CTGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAGCAGGCA
GGCTATAAAA CATTCTGGAT TACCAACCAG CAGACAATCA CAGCCCGTAA CACCATGCTC
ACTGTATTTT CGCGCCAGAC GGACAGGCAG TACTACATGA ATCAGCAGCG AACACAAAGT
GCGCGTGAAT ATGACACTAA CGTGCTGAAG CCGTTCCGGG AAGTGCTGAA TGACCCTGCA
CCAAAGAAAC TGATCATCGT GCATCTGCTG GGTACGCACA TTAAGTATAA ATACCGTTAT
CCGGAAGGTC AGGGGCGGTT TGATGGCATT ACAGGGCATA TTCCCACTGG ATTAAATGCG
AAAGAGCTGG AAGTGTATAA CGATTACGAT AATGCCAATC TGTTTAACGA TCATGTGGTG
GCCAGTCTGA TAAAGGACTT CAGAGCAACT GCGCCGGACG GCTTTCTGCT TTATTTTTCA
GACCATGGTG AAGAAGTATA TGATACTCCG CCATATAAAA CGCAGGGACG GAATGAAGAT
AACCCCACAC GTCCCATGTA CACTGTTCCG TTCCTGCTGT GGACCTCGGA AAAGTGGCAT
GCTGCACATC CGCGAGATTT TTCGCAGTAT GTTGACCGCA AATACAGTCT GGCTGAACTG
ATCCACACCT GGTCAGATTT GGCGGGACTG ACATACGATG GTTACGATCC GACCCGTTCT
CTGGTGAATC CGCAATTCAG GGAAACCACC CGCTGGATTG GAAATCCGTA TAAAAAGAAT
GGGCTCACTG ATTTCGACAC TCTTCCGTAT GGTGAGCCAT AG
 
Protein sequence
MHLNTGQNRP TFSWSALGWA IFYFGFFSTL LQVIIFSSGY SGTNGIRDSL LFSCLWLIPV 
FLYPDRIKII AAVVGFILWG TSLAALCYYF LYGHEFSQSV LFVMFETNAR EAGEYFSQYF
SLKLLLISLV YTAVSVFLWT RLRPVYIPLP WRRIVSFLLL YALLLHPVVL KSLIRQEPLN
DTLGKLASRM EPAAPWQFVS SYYQYHQQLN ALTTFLNENS ALPPLGNLRD ESGERPRTLV
LVIGESTQRE RMSLYGYLRE TTPELDALRK TDPGLTVFNN VVASRPYTIE ALQQALTFAN
EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTITARNTML TVFSRQTDRQ YYMNQQRTQS
AREYDTNVLK PFREVLNDPA PKKLIIVHLL GTHIKYKYRY PEGQGRFDGI TGHIPTGLNA
KELEVYNDYD NANLFNDHVV ASLIKDFRAT APDGFLLYFS DHGEEVYDTP PYKTQGRNED
NPTRPMYTVP FLLWTSEKWH AAHPRDFSQY VDRKYSLAEL IHTWSDLAGL TYDGYDPTRS
LVNPQFRETT RWIGNPYKKN GLTDFDTLPY GEP