Gene ECH74115_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1074 
Symbol 
ID6967476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1097318 
End bp1099582 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content50% 
IMG OID643385086 
Producthypothetical protein 
Protein accessionYP_002269585 
Protein GI209400423 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.361238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG 
CCCCAATTGC CTGGGACATT AACCCTTGCG TTTCTGACTC TCTTCGCCTG CGTACTGGCA
TTTATCCCTG TTAAAACCGT CCGTTATATC GCGCTGACGT TGCTGTTTTT CGTTTGGGGC
ATATTATCAG CAAAGCAAAT TTTGTTGGCA GGAGAAACCT TAACTGGCGC GACGCAGGAT
GCAATTGTTG AGATCACTGC TACTGACGGC ATGACCACTC ATTACGGTCA AATTACTTAT
CTACAAGGTC AACGTATATT CCCTGCGCCA GGCCTTGTGC TGTATGGCGA ATATCTTCCG
CAAGCGGTTT GTGCCGGACA ACTGTGGTCA ATGAAACTCA AAGTTCGTGC CGTTCATGGT
CAACTTAATG ATGGCGGCTT TGATAGCCAG CGTTATGCCA TTGCCCAGCA TCAGCCGCTC
ACCGGCCGCT TTCTGCAGGC AAGTGTTATT GAACCGAATT GTAGCCTGCG TGCACAGTAT
CTGGCGTCAC TACAAACAAC GCTGCAACCC TATCCGTGGA ATGCGGTTAT TCTTGGTTTA
GGTATGGGGG AACGGTTATC CGTCCCTAAA GAAATCAAAA ATATCATGCG TGATACTGGA
ACGGCGCATT TAATGGCGAT ATCGGGATTG CACATCGCTT TTGCGGCGTT GCTGGCTGCC
GGACTCATTC GCAGTGGACA AATTTTTCTG CCTGGGCGCT GGATCCACTG GCAAATGCCA
TTAATTGGCG GAATCTGCTG TGCTGCTTTT TATGCCTGGC TGACGGGAAT GCAACCTCCT
GCATTGCGTA CCGTTGTGGC GCTTGCTACG TGGGGTATGC TTAAGTTAAG TGGGCGACAG
TGGAGTGGCT GGGATGTATG GATATGTTGT CTGGCGGCAA TTTTGCTGAT GGATCCTGTT
GTCATTCTCT CGCAAAGTTT ATGGCTCTCT GCCGCTGCGG TCGCGGCATT GATATTTTGG
TATCAGTGGT TTCCCTGTCC TGAGTGGCAA CTGCCGCCGG TATTGCGTGC AGTTGTTTCC
CTCATCCATC TGCAACTGGG AATCACACTT CTGCTTATGC CCGTGCAAAT CGTCATTTTT
CATGGCATTA GTCTGACCTC GTTTATTGCA AATCTATTAG CAATTCCCTT GGTGACATTT
ATCACGGTTC CGTTGATCCT CGCCGCGATG GTTGTGCATT TAAGCGGGCC GTTAATCCTG
GATCAAGGGT TATGGTTTCT TGCCGACCGG TCTTTGGCTT TACTTTTCTT GGGGTTAAAG
AGTTTGCCAG AAGGGTGGAT CAACATTGCT GAACGTTGGC AATGGCTATC ATTTTCCCCA
TGGTTCTTAC TGGTGGTATG GCGATTAAAC GCCTGGCGAA CGTTGCCAGC AATGTGTGTG
GCTGTAGGCT TGCTGATGTG CTGGCCGCTG TGGCAAAAAC CTCGACCTGA CGAGTGGCAA
GTGTACATGC TTGATGTCGG GCAAGGGCTG GCAATGGTGA TAGCCAGAAA CGGCAAAGCG
ATTCTCTATG ACACGGGACT GGCCTGGCCT GAAGGGGATA GTGGGCAACA ACTGATTATC
CCCTGGCTCC ACTGGCATAA TCTTGAACCG GAAGGCGTTA TCCTGAGTCA TGAACATCTG
GATCACCGGG GAGGGCTGGA CTCAATATTG CATACATGGC CGATGTTATG GATCAGAAGT
CCGTTAAACT GGGAACATCA TCAGCCCTGT GTGCGTGGCG AAGCGTGGCA ATGGCAAGGA
TTGCGTTTCA GCGCGCACTG GCCTTTACAA GGTCGCAACG ATAAAGGAAA TAACCATTCC
TGTGTGGTTA AGGTTGATGA CGGGACGAAT AGCATTCTTC TAACCGGTGA TATTGAAGCC
CCAGCTGAAC AAAAGATGCT AAGCCGTTAC TGGCAGCAAG TGCAGGCAAC ATTGCTTCAG
GTACCTCACC ATGGCAGTAA TACCTCATCA TCATTGCCAT TAATTCAGCG AGTGAATGGA
AAAGTGGCAC TCGCATCGGC ATCGCGCTAT AACGCATGGC GACTGCCCTC TAACAAAGTT
AAGCATCGCT ATCAACAACA AGGTTATACG TGGCTTGATA CTCCTCATCA GGGGCAAGTA
ACGGTCGATT TTTCAGCGCA AGGCTGGCGG ATTAGCAGCC TCAGGGAGCA AATTTTACCT
CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG
 
Protein sequence
MKITTVGVCI ICGIFPLLIL PQLPGTLTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG 
ILSAKQILLA GETLTGATQD AIVEITATDG MTTHYGQITY LQGQRIFPAP GLVLYGEYLP
QAVCAGQLWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY
LASLQTTLQP YPWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA
GLIRSGQIFL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAT WGMLKLSGRQ
WSGWDVWICC LAAILLMDPV VILSQSLWLS AAAVAALIFW YQWFPCPEWQ LPPVLRAVVS
LIHLQLGITL LLMPVQIVIF HGISLTSFIA NLLAIPLVTF ITVPLILAAM VVHLSGPLIL
DQGLWFLADR SLALLFLGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV
AVGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII
PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG
LRFSAHWPLQ GRNDKGNNHS CVVKVDDGTN SILLTGDIEA PAEQKMLSRY WQQVQATLLQ
VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSNKV KHRYQQQGYT WLDTPHQGQV
TVDFSAQGWR ISSLREQILP RWYHQWFGVP VDNG