Gene ECH74115_0124 details


Gene Information

Locus tag: ECH74115_0124
Symbol:
ID: 6968493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 133760
End bp: 135613
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 54%
IMG OID: 643384201
Product: hypothetical protein
Protein accession: YP_002268724
Protein GI: 209398273
COG category:
COG ID:
TIGRFAM ID:
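
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record.
start_bp = 133760
end_bp = 135613
gene_length_bp = 1854
protein_length_aa = 617


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record: 1854 bp and 617 aa.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, 135613 - 133760 + 1 = 1854 bp, and 1854 / 3 - 1 = 617 aa, which agrees with the listed Gene Length and Protein Length.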


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00449065
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCGC TAATTTGCAG TGCCGGGCTT 
TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGCC TGTGGAAACG
CAATCGACAC AACTGGCTGT GTCTGACGCT GCCGCAGTTA CGCTTCCTGC AACGGTTTCC
GCACCTCCCG TAACACCCGC CGTCGTCAAA TCCGCATTCA GCACTGCACA AATAGATCAA
TGGGTCGCGC CCGTCGCGCT GTATCCCGAC GCCCTACTTT CGCAGGTGCT GATGGCATCA
ACCTATCCGA CAAACGTTGC TCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA
GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCGTGGG ACGCCAGCGT TAAATCACTG
GTGGCCTTTC CACAATTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG
GGCGATGCTT TTCTGGCCCA GCCGCAGGAC GTGATGGACT CGGTACAACG ATTGCGGCAA
CTGGCACAAC AAACCGGCTC GCTGAAGTCA TCAACCGAAC AGAAAGTTAT TACCACAACG
AAGAAAACTG TACCGGTAAC ACAGACAGTC ACGGCTCCCG TCATACCATC CAATACCGTT
TCAACTGCCA ACCCTGTCAT TACAGAGCCT GCAACAACCG TCATTTCCAT TGAGCCCGGC
AATCCTGATG TGGTCTATAT TCCCAACTAC AACCCAACCG TGGTTTACGG GAACTGGGCC
AATACTGCGT ATCCGCCGGT TTATCTGCCA CCACCAGCCG GAGAACCGTT TGTTGACAGC
TTTGTACGCG GATTCGGCTA TAGCATGGGC GTTGCTACCA CGTACGCACT ATTCAGCAGC
ATCGACTGGG ATGACGACGA TCATGACCAT CATCATCATG ACGATGATAA TTATCATCAC
CACGATGGCG GTCATCGTGA CGGTAATGGC TGGCAACATA ACGGCGACAA CATCAATATC
GACGTCAACA ATTTCAACCG TATCACCGGT GAGCATCTTA CTGATAAGAA TATGGCATGG
CGGCACAATC CAAACTACCG TAATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG
TTTCATCAAA CTGATGTCAA CGGCGGAATG AGTGCCACGC AGCTACCTGC TCCAACACGC
GACAGCCAGC GTCAGGCGGC AGCAAGTCAG TTTCAGCAAC GAACACACGC CGCCCCCGTC
ATTACACGAG ATACCCAACG TCAGGCAGCG GCACAGCGGT TTAATGAAGC TGAACACTAT
GGGAGCTATG ACGACTTCCG CGACTTCAGC CGTCGCCAAC CACTGACCCA GCAACAAAAG
GACGCCGCTC GTCAGCGTTA TCAGTCAGCT TCTCCTGAGC AGCGCCAGGC AGTTCACGAG
AAAATGCAGA CTAACCCGCA GAACCAGCAG CGAAGAGAGG CAGCGCGTGA GCGCATTCAG
CCCGCCTCGC CTGAGCAGCG CCAGGCAGTC CGCGAGAAAA TGCAGACTAA CCCACAGATC
CAGCAGCGAA GAGACGCAGC GCGTGAGCGT ATTCAGTCAG CCTCGCCTGA GCAGCGCCAG
GTGTTTAAGG AAAAAGTACA GCAGCGCCCA CTGAACCAAC AGCAACGTGA TAACGCCCGC
CAGCGTGTTC AATCAGCATC ACCTGAACAA CGTCAGGTTT TTCGGGAGAA AGCTCAGGAG
AGCCGCCCAC AACGTCTAAA CGACAGTAAC CATACTGCCA GGCTGAATAA CGAGCAACGG
TCAGCAGTAC GCGAACGTCT CTCTGAGCGC GGAGCAAGGC GACTGGAAAG GTAA
 
Protein sequence
MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAPVET QSTQLAVSDA AAVTLPATVS 
APPVTPAVVK SAFSTAQIDQ WVAPVALYPD ALLSQVLMAS TYPTNVAQAV QWSHDNPLKQ
GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ
LAQQTGSLKS STEQKVITTT KKTVPVTQTV TAPVIPSNTV STANPVITEP ATTVISIEPG
NPDVVYIPNY NPTVVYGNWA NTAYPPVYLP PPAGEPFVDS FVRGFGYSMG VATTYALFSS
IDWDDDDHDH HHHDDDNYHH HDGGHRDGNG WQHNGDNINI DVNNFNRITG EHLTDKNMAW
RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ FQQRTHAAPV
ITRDTQRQAA AQRFNEAEHY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA SPEQRQAVHE
KMQTNPQNQQ RREAARERIQ PASPEQRQAV REKMQTNPQI QQRRDAARER IQSASPEQRQ
VFKEKVQQRP LNQQQRDNAR QRVQSASPEQ RQVFREKAQE SRPQRLNDSN HTARLNNEQR
SAVRERLSER GARRLER
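
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be translated and its GC fraction computed. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
first_bases = "ATGAAAATGA CTTTGCCGTT TAAACCCCAT"
print(translate(first_bases))  # MKMTLPFKPH
```

The output matches the first 10 residues of the listed protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would allow the same comparison against the complete protein sequence and the listed GC content.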