Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5486 |
Symbol | |
ID | 6968476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5131815 |
End bp | 5134001 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643389132 |
Product | ShET2 enterotoxin, N- region family |
Protein accession | YP_002273529 |
Protein GI | 209399314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACTC ATATTCCTCG TAGTTCTTTC TCTGCAAATA TTAATAATAC AGCCCAGACA AATGAACACC AAACCCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC AAGGAGCTGG CGACACCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA CAATGGTCGC CATATCTTTA GTAACAAGGA TTTTGTCATT AAGTTTTCCA TATCTGTCTT ACAAGCTGAT AAGAAAGAAA TTACGATAAT TAATAAAAAT GAAAACACGA CACTCACTCA AACCATTGCT CCAATATTTG AAGAATACCT AATGGAAATT TTACCTCAAC GCTCAGACGC TCTTGATAAA AAAGAATTAA ATCTAAACTC AGATAGAAAA GAAAAAGAAT TCCCAAGAGT TAAGCTTAAT GGTCAATGTT ATTTTCCGGG GCGCCCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA CAATATATTA ATGATATTTA TCAGAATGTT GACTACAAAC CCCATCAAGA TGATTACTCT TCAGCTGAAA AATTTCTCAC TCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG ATTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG AAGGCATATT TTGACAAGAT GGAATCAAAT GACCTCAGTG TTATGGCTGC CATATTATTG GTAGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTATC CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATCATGAGC GAAAGTAAAG AGGATATTAA ACACTATTCA CTGATGGATT TTATGAATGT AGATTATAGC CTCCTGAAAT GGTCAAATGA TCATGTTATT AACCAATCTG TTGCAATAAT TCCAGCACTT CCAAAAGAAC AGCTATTGAT GCTAAAAGGA ACTGTGGATG AAATAACCCC TCCATTATCA CCTGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAA GCAACTGATG ATTCAACTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT ATTAACCTCC CCGGTTTATA TTTGGCTATA AATTATGGTA ATGCGGATAT CGTTGAGACT ATTTTTAATT CATTGTCAGA GCCAGGATAT GAAGGATTAC TCTCGAAAAA AAATCTCATG CATATTCTGG AGGCAAAAGA TAAAAATGGT TTTTCTGGAT TATTTTTAGC GATATCACGT AAGGATAAAA ATGTTGTAAC CTCGATTCTG AACGCCTTAC CTAAACTGGC CGCAACACAT CATTTAGATA ACGAACAAGT GTATAAATTC CTGAGTGCCA AAAATAGTAC GTCCAGCCAT GTTTTATACC ATGTTATGGC GAATGGTGAT GCCGACATGC TGAAAATTGT TTTGGACGCG TTATCTTTGT TAATTCGCAC ATGTCATTTG ACTAAAGAAC AGGTACTCGA TCTCCTGAAG GCAAAGGATT TTTATGGTTG CCCAGGACTA TACCTGGCGA TGCAAAATGG ACATAGCGAT ATCGTGAAAG TTATTCTTGA AGCATTGCCC AGCCTGGCCC AGGAAATTAA CATTTCAGCT TCCGATATTG TAGATCTTCT GACCGCTAAA AGTCTTGCGC GCGACACGGG TTTGTTTATG GCCATGCAGC GCGGACATAT GAACGTTATT AATACTATTT TTAACGCATT ACCCACTCTG TTTAATACGT TTAAATTCGA TAAAAAAAAT ATGAAGCCCC TCCTCCTGGC AAATAATTCT AATGAATACC CAGGTTTGTT TTCAGCGATA CAGCATAAAC AGCAAAACGT TGTAGAGATG GTTTATCTTG CTTTATCTGA CCATGCACGC CTGTTTGGAT TTACCGCTGA AGATATTATG GATTTTTGGC AACACAAAGC GCCACAAAAA TACTCTGCCT TTGAGTTGGC TTGTGAATTG GGTCACCGGG TTATTGCTGA ATTAATCTTT AATACATTAA ATAAGATGGC TGAAAGCTTT GGCTTTACGG ATAACCCTCG ATACATTGCG GAGAAAAATT ATATGGAAGC TTTACTCAAA AAAGCATCTC CCCATACCGT ACGCTAA
|
Protein sequence | MITHIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRHNGR HIFSNKDFVI KFSISVLQAD KKEITIINKN ENTTLTQTIA PIFEEYLMEI LPQRSDALDK KELNLNSDRK EKEFPRVKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS SAEKFLTHFN KKCKNQTLAL ISSRPEGRCV AACGDFGLVM KAYFDKMESN DLSVMAAILL VDNHALTVRL RIKNTTEGCI HYVVSVYDPN VTNDKIRIMS ESKEDIKHYS LMDFMNVDYS LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG TVDEITPPLS PATMNLLMAI GQNHQLKQLM IQLQKMPELH RTEMLTAYNS INLPGLYLAI NYGNADIVET IFNSLSEPGY EGLLSKKNLM HILEAKDKNG FSGLFLAISR KDKNVVTSIL NALPKLAATH HLDNEQVYKF LSAKNSTSSH VLYHVMANGD ADMLKIVLDA LSLLIRTCHL TKEQVLDLLK AKDFYGCPGL YLAMQNGHSD IVKVILEALP SLAQEINISA SDIVDLLTAK SLARDTGLFM AMQRGHMNVI NTIFNALPTL FNTFKFDKKN MKPLLLANNS NEYPGLFSAI QHKQQNVVEM VYLALSDHAR LFGFTAEDIM DFWQHKAPQK YSAFELACEL GHRVIAELIF NTLNKMAESF GFTDNPRYIA EKNYMEALLK KASPHTVR
|
| |