Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_B0072 |
Symbol | |
ID | 6966456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 35267 |
End bp | 37240 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643383973 |
Product | hypothetical protein |
Protein accession | YP_002268452 |
Protein GI | 209395548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.385572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATTG CCGGGCTGCG TGAAGAGCCC GCACAGGACT GGTTATCTTT GCATAAACGG CTGGCCAGTG ATGGTCTGTA TATCACGATG CAGGAAGGGG AACTGGTGGT GAAGGATGGC TGGGACAGAG CGCGGGAAGG TGTTGCACTC AGTTCGTTCG GACCGTTATG GACAGCTGAA AAACTGGGGC GGAAACTGGG GGAATATCAG CCTGTACCGA CAGATATTTT CAGTCAGGTG GGGACACCCG GTCGTTATGA TCCGGAAGCC ATAAACGTTG ATATCCGGCC GGAAAAAGTG GCAGAAACGG AGAGTCTGAA ACAGTACGCC TGCCGTCATT TTGCCGAACG CTTACCGGAA ATGGCGAGAA ATGGTGAGCT GGAAAGTTGC CTGGATGTGC ACAGAACTCT GGCAACGGCT GGATTATGGA TGGGGATCCA GCACGGGCAT CTGGTACTGC ATGATGGATT TGATAAACAA CAGACACCAG TACGTGCAGA CAGTGTGTGG CCGTTAATGA CGCTGGATTA TATGCAGGAT CTGGATGGTG GATGGCAGCC TGTACCGAAG GATATTTTTA CTCAGGTTAT ACCCGGAGAA CGGTTCCGGG GACGTAATCT TGGTACTCAG GCTGTCAGTG ATTACGAATG GTATCGTATG CGGATGGGGA CAGGTCCACA GGGAGCAATA AAGCGCGAAC TGTTTTCTGA TAAGGAAAGT CTGTGGGGGT ACACAACTGT TCAGTGTGAG TCGTTGATTG AAGATATGAT CGCGGGCGGG AATTTCAGCT GGCAGGCGTG CCATGAGATG TTTGCGCGTA AAGGGCTTAT GTTGCAGAAA CAGCATCATG GTCTTGTGAT TGTGGATGCG TTTAATCATG AGCTCACACC AGTGAAGGCC AGCAGTATCC ACCCGGATTT GACGTTATCC AGGGCTGAGC CGCAGGCAGG GCCATTTGAG ATTGCAGGTG CAGATATATT TGAACGGGTG AAGCCTGAAT GTCGCTACAA TCCGGAACTT GCTGCCAGCG ATGAAGTGGA GCCCGGCTTC CGGCGCGATC CTGTACTGCG TCGTGAACGC CGTGAAGCCC GTGCAGCTGC GCGGGAAGAT TTGCGCGCGC GTTATCTTGC ATGGAAGGAG CACTGGCGTA AACCCGACTT ACGTTACGGA GAACGGTTAC GGGAAATTCA TGCTGCCTGC CGTCGGCGCA AGGCGTATAT CCGTGTGCAG TTCCGGGATC CACAATTGCG TAAGCTGCAT TACCATATTG CGGAAGTGCA GCGTATGCAG GCGCTGATCA GGCTGAAGGA GAGTGTGAAG GAAGAGCGGT TGTCACTGAT TGAGGAGGGT AAGTGGTATC CTTTGTCCTA CCGGCAGTGG GTGGAGCAAC AGGCGGTACA GGGAGACAGA GCTGCATTAT CACAGTTACG TGGCTGGGAT TATCGTGATC GTCGCAAAGA CAAGCGACGA ACAACGAATG CCGACCGCTG CGTGATTCTC TGTGAACCGG GAGGAACTCC ACTTTATGAA GACACTGGCG TTCTCGAAGC CCGTCTGCAG AAAGATGGCA GTGTGCGTTT TCGTGATCGG AGGAATGGCG AGTTAGTCTG TGTGGATTAT GGGGACCGTG TGGTGTTCTA CCATCATCAG GACAGAAATG AACTGGTGGA TAAGCTGAAT CTGATTGCTC CTGTATTGTT TGATCGTGAG CCAGGAATGG GTTTTGAACC GGAAGGCTCA TATCAACAGT TTAATGATGT ATTTGCGGAA ATGGTGGCCT GGCATAATGC AGCTGGAATA ACCGGAAATG GACACTTTGT AATCAGCCGC CCGGATGTGG ATTTACATCG TCAGCGTAGT GAACAGTACT ACCACGAATA TATCAGGCAA CAGAAAAGCA TATCCGGGGG GCATGGCGCA TCTTATGCTC CAGTTCAGGA TAATGAGTGG ACGCCACCAT CACCTGGAAT GTAG
|
Protein sequence | MSIAGLREEP AQDWLSLHKR LASDGLYITM QEGELVVKDG WDRAREGVAL SSFGPLWTAE KLGRKLGEYQ PVPTDIFSQV GTPGRYDPEA INVDIRPEKV AETESLKQYA CRHFAERLPE MARNGELESC LDVHRTLATA GLWMGIQHGH LVLHDGFDKQ QTPVRADSVW PLMTLDYMQD LDGGWQPVPK DIFTQVIPGE RFRGRNLGTQ AVSDYEWYRM RMGTGPQGAI KRELFSDKES LWGYTTVQCE SLIEDMIAGG NFSWQACHEM FARKGLMLQK QHHGLVIVDA FNHELTPVKA SSIHPDLTLS RAEPQAGPFE IAGADIFERV KPECRYNPEL AASDEVEPGF RRDPVLRRER REARAAARED LRARYLAWKE HWRKPDLRYG ERLREIHAAC RRRKAYIRVQ FRDPQLRKLH YHIAEVQRMQ ALIRLKESVK EERLSLIEEG KWYPLSYRQW VEQQAVQGDR AALSQLRGWD YRDRRKDKRR TTNADRCVIL CEPGGTPLYE DTGVLEARLQ KDGSVRFRDR RNGELVCVDY GDRVVFYHHQ DRNELVDKLN LIAPVLFDRE PGMGFEPEGS YQQFNDVFAE MVAWHNAAGI TGNGHFVISR PDVDLHRQRS EQYYHEYIRQ QKSISGGHGA SYAPVQDNEW TPPSPGM
|
| |