Gene Information |
Locus tag | ECH74115_5805 |
Symbol | |
ID | 6969043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5437955 |
End bp | 5440891 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643389433 |
Product | hypothetical protein |
Protein accession | YP_002273825 |
Protein GI | 209397351 |
COG category | |
COG ID | |
TIGRFAM ID | |
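The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the helper name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon, which is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Start bp and End bp as listed for ECH74115_5805
print(feature_lengths(5437955, 5440891))  # -> (2937, 978)
```

The result matches the listed Gene Length (2937 bp) and Protein Length (978 aa).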
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATTGAAA ATGAGAGCGG TTATTTGATT ACCTATTTCA GAAGGATATA TCAGGCGCTT CGTGGGCAAT CATCTCAATA TCCTGCCGCT AAAATTGCAC AAGAATTAGG CGATACGATT CCTGTCACAA TGCAAAACGA ACTGGTCTAT GAAATAGCCG GTGAATTCTG CGAGACTCTC TGCCGATTAC TTAGTGAGCA TCCTCCTCAT AGCAGTGATC CCGTTTCTGC ATTACGTAAG CTCTCTCCAG ACTGGCATCT TCAACTTCCA CTCGTTCTCC CCGAAGCGAA TGCTGCAGAG ATTGTAAGGC GATTACTTTC TCAATCTTCT GAAATACGCA GTGCAAGTAG TCTTCAGGTT GAAAGAATCT GGGTCGATGT TGATGACAGT TGGTATTGCG ATGCCCGGTT CCGATTCCCG GCCACAATGC GTACAGAACA GTTAACCTCT TTGTTTGAAT GCCATATTCA GCCGGAGCAG ACCCGACTCA TCATTTCAGG AAAATGGAAA AATGGCGGGG CAAGGTTGGC AATGCTAAGC CGTTATGAGC AGCAAGATTG GCGGGTCGAG TTATTACCTA TTGCGATGCA AAAACTCTCT GGTGCAGATG CAATGGCCGA AATCTCATTG TCGCTTCATG AAGGTCCAAT TCTGTTAGGT CACACAATTC CAAAAGGCGG TTATGAACTT ACTGAAGAAC TTCCCTGGGT TTTTGAAGCG ATGAATGAGA GTGAATCGCA GCTAAAACTT GTTGGTATGG GGTCTGTGAG TTCCAGGCTA AATGCTCTGT TTATTTCGCT ACCGAAAAAT AGCCATTTAG ATATTAGTGG AGAAGGTGAG TTTGATATCC CAAGGTTGCT GAAAAACAGT GAACGCAGCT TAACTAAAAT AAGTGGAGTA TTTAGCGTTG TTTTACATGA TGGTGCTGTT TGTACAATTC GCACCCAGCA ACTTTATGAT TCTGCGATTG AATATTATAT TAAATCGACA GAAGTTGAAC TGGTTAAATC GGATTATCCT GTCCATCGAG CCTGGCCGAA GATCGGTTGG AAAAAGGATC TGCAATATGG CATTGTACCG GAGAAAGAAC TTTTTTGGCG TTCCATTCGT TCAGGTAATA ATGCCTGGTA TTCTGTCGCA TCAGAAATGC CAAAAGGACA GATAGAAGTC CGGCGAATAG TTAATGATGA GGTTTTATTT AGCGGTAAAG TTGTAGTTTT ACCAGCAGAC TTCGATATTA ATATTATACC TGAGAGTGCT CAGCAAGGCA TTATTATGCT TTCTGGTATA ACGGATACTA GAATTGATAA ATATTCAAAC AATGAAAAAG TAACACTTAA GTCCGATTAT TCACAAAACG AGTGTGCTAT TTATTATAAT TCATCGCTGA TGCTGGAAAA TACCGTTGAT TTACGGGTCT CCTGGAAAGA TGGTTCTAAT CTTAAACTAT TATTACCTAA GCCGGTTAGC GGTGGCCGAT TTGTAACTAA TGATGGTTCT GTTCATTTTG ATGGTGTGGC ATCTATAGCA CATTTGCATG GAATAGATGC TGAGTTATTA ACCATATCAT GTGCTGGAAG AGGATATCTT AATATCGAGT TATTGGATGA AAATCCAGTA GCTGAAAAAT TCCGCTATTT ACATGCCGAC CTTCCTCTTT TATCAGGACG CAATGACAAA TTACAACAAA TTTCACTTTA TGAGAACTAT AATTTGCTAA ATGCTATGCT GGCATGTGCA TGGAACAGCA ATAGTACATT ATGTGTTGAT TTTTACTCTG ACAGATTTGG AAAGGATAAA 
GCAACACTCA ATATTAAACG CTACGATGGT AGTTTTATTG AACACGATCA AGGGTTACTG GTTGATATAA AAAATTCTGT TGTTTTTCCT GCAAATAGAA TAGACGAACT GGTTGTTGAT GCTATTTCTC TTAAGAATCC TGGCTTACAC ATATCATTGT TAAAAAAAGA TGAGTTTGCT TATGATCTTT CAGCTCTGAA TGTTCAGGAT AGTCCATGGT TAATTGTGGG AAAACTTGAT GGTACAGCTC GCATTGCACC AGTAATTAAA TGGATGCTAC CTGTATTGCA GACAAATGAT TTATTACTAA ATGCTCTATG CGAAGCAGAC CCAGAACAGC GCAAAAAAAA TTTTAATGAG CTAATTTTTG AAATAGATAA CAACCCATTG CAAAATTATT GCTGTTTATT AACAGAGTAT ATTAAGAAAT ACAAAATGAA TAATGGCTTA TCCTTGCTGG ATCTGGACTT GTTCAGAGGT ATTTCGAGTA ATTACCGCGT GGTTGTTCAA TTGTTAATAT CATCATGTCT TTCTGGTGAT AGCGATACGA TTTATGATAT ACAGGAAGAA TTACCCTTTT CATGGGGATG GATTCCCGTT TCAATCTGGA AAGATGTTTT CCAAAAATGT TGGACTTATC TGGAGAAACA GATTAACGAT AAAACATTAG CATTACATAT ATTGCAACCC TTTATTGCTT TTATGAACCA TCGTGCACAT ATCGATCGTC GGCTGGCTCC GATTGCGAAT ATGTTACTTA CATATAGTGA GAGCCTACCA ACCGGTTGTG ATGTATTGCC AACTGTTAGT CGTGAGCAGT TTAATGAAGC TAAACAGATG CTATTAAGGA ACCCCGACAG CTTTGGGCGT ATCAGTATCT TCCCTAAAGA ACTTTGGTCT AGTGCTATTA CTCCAGAGTT AAAATCTGTT TTTAATAAGC TTTGGATTGA AGATAAATAT CACTCACGGC TTGAAAAACG TTTTAATTTG ATGTTAGTCG CAGCGCTGTT AACCCAAAAA GATAATAACC TGATACATCA ACTGTCTGCG CTTTTTGAAT TTCACTATCA GCAAGCCCCA CAGCAATTAG GGGTAATCTA TCAATATTAT TTTGAACAAG CAGGAGTATG TCATTGA
Protein sequence | MIENESGYLI TYFRRIYQAL RGQSSQYPAA KIAQELGDTI PVTMQNELVY EIAGEFCETL CRLLSEHPPH SSDPVSALRK LSPDWHLQLP LVLPEANAAE IVRRLLSQSS EIRSASSLQV ERIWVDVDDS WYCDARFRFP ATMRTEQLTS LFECHIQPEQ TRLIISGKWK NGGARLAMLS RYEQQDWRVE LLPIAMQKLS GADAMAEISL SLHEGPILLG HTIPKGGYEL TEELPWVFEA MNESESQLKL VGMGSVSSRL NALFISLPKN SHLDISGEGE FDIPRLLKNS ERSLTKISGV FSVVLHDGAV CTIRTQQLYD SAIEYYIKST EVELVKSDYP VHRAWPKIGW KKDLQYGIVP EKELFWRSIR SGNNAWYSVA SEMPKGQIEV RRIVNDEVLF SGKVVVLPAD FDINIIPESA QQGIIMLSGI TDTRIDKYSN NEKVTLKSDY SQNECAIYYN SSLMLENTVD LRVSWKDGSN LKLLLPKPVS GGRFVTNDGS VHFDGVASIA HLHGIDAELL TISCAGRGYL NIELLDENPV AEKFRYLHAD LPLLSGRNDK LQQISLYENY NLLNAMLACA WNSNSTLCVD FYSDRFGKDK ATLNIKRYDG SFIEHDQGLL VDIKNSVVFP ANRIDELVVD AISLKNPGLH ISLLKKDEFA YDLSALNVQD SPWLIVGKLD GTARIAPVIK WMLPVLQTND LLLNALCEAD PEQRKKNFNE LIFEIDNNPL QNYCCLLTEY IKKYKMNNGL SLLDLDLFRG ISSNYRVVVQ LLISSCLSGD SDTIYDIQEE LPFSWGWIPV SIWKDVFQKC WTYLEKQIND KTLALHILQP FIAFMNHRAH IDRRLAPIAN MLLTYSESLP TGCDVLPTVS REQFNEAKQM LLRNPDSFGR ISIFPKELWS SAITPELKSV FNKLWIEDKY HSRLEKRFNL MLVAALLTQK DNNLIHQLSA LFEFHYQQAP QQLGVIYQYY FEQAGVCH
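The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons are permitted, and that does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a plus-strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGATTGAAA ATGAGAGCGG TTATTTGATT"))  # -> MIENESGYLI
```

Applied to the opening of the gene sequence, this reproduces the first ten residues of the listed protein; the final codons `GTA TGT CAT TGA` likewise give the C-terminal `VCH` followed by a stop.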