Gene Information |
Locus tag | ECH74115_1074 |
Symbol | |
ID | 6967476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1097318 |
End bp | 1099582 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385086 |
Product | hypothetical protein |
Protein accession | YP_002269585 |
Protein GI | 209400423 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
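The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1097318, 1099582)
print(length_bp)                  # 2265
print(protein_length(length_bp))  # 754
```

Both results agree with the Gene Length (2265 bp) and Protein Length (754 aa) fields.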
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.361238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAATAA CGACAGTCGG CGTATGCATA ATTTGCGGAA TTTTTCCGTT GCTGATTTTG CCCCAATTGC CTGGGACATT AACCCTTGCG TTTCTGACTC TCTTCGCCTG CGTACTGGCA TTTATCCCTG TTAAAACCGT CCGTTATATC GCGCTGACGT TGCTGTTTTT CGTTTGGGGC ATATTATCAG CAAAGCAAAT TTTGTTGGCA GGAGAAACCT TAACTGGCGC GACGCAGGAT GCAATTGTTG AGATCACTGC TACTGACGGC ATGACCACTC ATTACGGTCA AATTACTTAT CTACAAGGTC AACGTATATT CCCTGCGCCA GGCCTTGTGC TGTATGGCGA ATATCTTCCG CAAGCGGTTT GTGCCGGACA ACTGTGGTCA ATGAAACTCA AAGTTCGTGC CGTTCATGGT CAACTTAATG ATGGCGGCTT TGATAGCCAG CGTTATGCCA TTGCCCAGCA TCAGCCGCTC ACCGGCCGCT TTCTGCAGGC AAGTGTTATT GAACCGAATT GTAGCCTGCG TGCACAGTAT CTGGCGTCAC TACAAACAAC GCTGCAACCC TATCCGTGGA ATGCGGTTAT TCTTGGTTTA GGTATGGGGG AACGGTTATC CGTCCCTAAA GAAATCAAAA ATATCATGCG TGATACTGGA ACGGCGCATT TAATGGCGAT ATCGGGATTG CACATCGCTT TTGCGGCGTT GCTGGCTGCC GGACTCATTC GCAGTGGACA AATTTTTCTG CCTGGGCGCT GGATCCACTG GCAAATGCCA TTAATTGGCG GAATCTGCTG TGCTGCTTTT TATGCCTGGC TGACGGGAAT GCAACCTCCT GCATTGCGTA CCGTTGTGGC GCTTGCTACG TGGGGTATGC TTAAGTTAAG TGGGCGACAG TGGAGTGGCT GGGATGTATG GATATGTTGT CTGGCGGCAA TTTTGCTGAT GGATCCTGTT GTCATTCTCT CGCAAAGTTT ATGGCTCTCT GCCGCTGCGG TCGCGGCATT GATATTTTGG TATCAGTGGT TTCCCTGTCC TGAGTGGCAA CTGCCGCCGG TATTGCGTGC AGTTGTTTCC CTCATCCATC TGCAACTGGG AATCACACTT CTGCTTATGC CCGTGCAAAT CGTCATTTTT CATGGCATTA GTCTGACCTC GTTTATTGCA AATCTATTAG CAATTCCCTT GGTGACATTT ATCACGGTTC CGTTGATCCT CGCCGCGATG GTTGTGCATT TAAGCGGGCC GTTAATCCTG GATCAAGGGT TATGGTTTCT TGCCGACCGG TCTTTGGCTT TACTTTTCTT GGGGTTAAAG AGTTTGCCAG AAGGGTGGAT CAACATTGCT GAACGTTGGC AATGGCTATC ATTTTCCCCA TGGTTCTTAC TGGTGGTATG GCGATTAAAC GCCTGGCGAA CGTTGCCAGC AATGTGTGTG GCTGTAGGCT TGCTGATGTG CTGGCCGCTG TGGCAAAAAC CTCGACCTGA CGAGTGGCAA GTGTACATGC TTGATGTCGG GCAAGGGCTG GCAATGGTGA TAGCCAGAAA CGGCAAAGCG ATTCTCTATG ACACGGGACT GGCCTGGCCT GAAGGGGATA GTGGGCAACA ACTGATTATC CCCTGGCTCC ACTGGCATAA TCTTGAACCG GAAGGCGTTA TCCTGAGTCA TGAACATCTG GATCACCGGG GAGGGCTGGA CTCAATATTG CATACATGGC CGATGTTATG GATCAGAAGT CCGTTAAACT GGGAACATCA TCAGCCCTGT GTGCGTGGCG AAGCGTGGCA ATGGCAAGGA 
TTGCGTTTCA GCGCGCACTG GCCTTTACAA GGTCGCAACG ATAAAGGAAA TAACCATTCC TGTGTGGTTA AGGTTGATGA CGGGACGAAT AGCATTCTTC TAACCGGTGA TATTGAAGCC CCAGCTGAAC AAAAGATGCT AAGCCGTTAC TGGCAGCAAG TGCAGGCAAC ATTGCTTCAG GTACCTCACC ATGGCAGTAA TACCTCATCA TCATTGCCAT TAATTCAGCG AGTGAATGGA AAAGTGGCAC TCGCATCGGC ATCGCGCTAT AACGCATGGC GACTGCCCTC TAACAAAGTT AAGCATCGCT ATCAACAACA AGGTTATACG TGGCTTGATA CTCCTCATCA GGGGCAAGTA ACGGTCGATT TTTCAGCGCA AGGCTGGCGG ATTAGCAGCC TCAGGGAGCA AATTTTACCT CGTTGGTATC ATCAGTGGTT TGGCGTGCCA GTGGATAACG GGTAG
|
Protein sequence | MKITTVGVCI ICGIFPLLIL PQLPGTLTLA FLTLFACVLA FIPVKTVRYI ALTLLFFVWG ILSAKQILLA GETLTGATQD AIVEITATDG MTTHYGQITY LQGQRIFPAP GLVLYGEYLP QAVCAGQLWS MKLKVRAVHG QLNDGGFDSQ RYAIAQHQPL TGRFLQASVI EPNCSLRAQY LASLQTTLQP YPWNAVILGL GMGERLSVPK EIKNIMRDTG TAHLMAISGL HIAFAALLAA GLIRSGQIFL PGRWIHWQMP LIGGICCAAF YAWLTGMQPP ALRTVVALAT WGMLKLSGRQ WSGWDVWICC LAAILLMDPV VILSQSLWLS AAAVAALIFW YQWFPCPEWQ LPPVLRAVVS LIHLQLGITL LLMPVQIVIF HGISLTSFIA NLLAIPLVTF ITVPLILAAM VVHLSGPLIL DQGLWFLADR SLALLFLGLK SLPEGWINIA ERWQWLSFSP WFLLVVWRLN AWRTLPAMCV AVGLLMCWPL WQKPRPDEWQ VYMLDVGQGL AMVIARNGKA ILYDTGLAWP EGDSGQQLII PWLHWHNLEP EGVILSHEHL DHRGGLDSIL HTWPMLWIRS PLNWEHHQPC VRGEAWQWQG LRFSAHWPLQ GRNDKGNNHS CVVKVDDGTN SILLTGDIEA PAEQKMLSRY WQQVQATLLQ VPHHGSNTSS SLPLIQRVNG KVALASASRY NAWRLPSNKV KHRYQQQGYT WLDTPHQGQV TVDFSAQGWR ISSLREQILP RWYHQWFGVP VDNG
|
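The sequences above are printed in space-separated blocks of ten characters, so they need normalising before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates codons with the standard codon assignments. Translation table 11 uses the same codon-to-residue assignments as the standard code and differs in its permitted start codons, which this sketch does not model. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, ordered by first, second, third base over "TCAG".
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def normalise(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = normalise("ATGAAAATAA CGACAGTCGG CGTATGCATA")
print(len(prefix))        # 30
print(translate(prefix))  # MKITTVGVCI
```

The translated prefix matches the first block of the protein sequence listed above. Applying `gc_fraction` to the full normalised gene sequence gives a value to compare with the GC content field.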