Gene Information |
Locus tag | ECH74115_5138 |
Symbol | |
ID | 6969763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4777860 |
End bp | 4779848 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643388809 |
Product | ShET2 enterotoxin, N- region family |
Protein accession | YP_002273235 |
Protein GI | 209400950 |
COG category | |
COG ID | |
TIGRFAM ID | |
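The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(4777860, 4779848)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1989 bp) and Protein Length (662 aa) fields.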
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.103144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.23823 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGATA ACGTAACTGT TAGCAGAGTA TGTATACAAT CTCCTTCTTT CGTGCCTGAT TTGGATGGAG AAAAAAATAA ATCTCAATTA TTCGTTGACG ATATAGTTGC GTATCTTAAA AGTCCTTCAG TTTATTCACT TGAAAAAGAG GGGCCGTTAA ATCATTTTGT TAATCATTGT TCTGAAGTTG AGTTAGGTTT CTATAGCGAT GGTGCATATT CAATTCTTGT CTCCAGATCT AAGCAACAAC CTGAAGGTAT GATTTTAACC GTAAGCGATG CAGACGCAAT CAATATAGTA CATATTTCCG TATCTCCAGT GCTTATAAAA TTCCTGGATG ATATTTTTAC TTGCCTTCAT ACGTACCCTG ATGATGAGAG TTTTACAAAA GAGCAGATAA AAGCTAATAG CAAATATGAT ATTGTAGATT ATAATTGCCT GTTGCATTTT ACTGGAAAAC CAAAAAGTTT AATAGAATGT AGACATTTTG CTCTGCAATA CTGTATAGAT TCAATGAATG AGCATACAGG GAAAGTTCCA TTAAAGGCTT ACTATTCATC TCCGGAAGAT ATACAAAAAC ATATTCCTTT CGAGCTTGAG CAGCAATTTA ACAATCTACA AAAAAATCCA CCACCCGGTA CATGCGTCGT TGCCAGTGAT AAGTTTGGGG AGGCATTATC TGTCTTTTTT CACAGAATGG AAAAAGAGAA GTTAACGCAT ATGACGGCAA TCGTTCAATC TCAAACACAC GCTATGGCCG TCCGCTTGAG GATCAAAAAA ACGCCTGCTG GTGAAACAGA ATATGTTGTA TCCTTTTATG ATCCTAATGC AACCAATACT GCAGTACGCT ATAAAGCAAA CAACTGTGAT TCTTTTGGGT CATTGCAATC GTTTATAAAT ATTCAACAGG CAAAACAAAA ATGGGTAATA ACAGATATTT GCTCCGAGTG TGTAGGAATA ACCCCTTATC TCCCTCGGGA ACAAGCCCAT TTATTAAGTG GTATTGAAAA TGAGTTGCAA CCTCCATTAT CACCACCAGC ATTATTTCTA TTAATGAGAA TGGGGATATA TAAAAACATT GTTCTTTTTT TCGATAAATT AAAAAACTCT CAAGAAATGA CAGCATCAAA GGCTCTTGAT ATTCTTGCTG CGAAATCACC TGAAGGAATA TATGGGTTAT GTGTATTATT GTATCACAAT ACTATTGATA AGTTTAATGA TTACATAACA AATTTAAAAG AGTTGACCAG AAAATATAAT TTTAGCCAAG AGGACCTGGA AACTCTACTC CTTGCGAAAG ATAATCTCGG AGTGAGCTGG ATTCCCAGGG CTTTGAAAAA TAATCAAAAT AAAATTGTCA AAGCATGGTT GTTGGCGATA GATGACTTTG AGAAAGAATT TGGGGTAAAT AAAAATGAAA TACTTCTTCG TATAGGAAAG GAAATAGACT CAATTGATGA TTTAAATAGC GCTATTAGAA CCAATGATTA TAATGTTGTT AATATATTGC TAGCCAATAT AAAAGCCAAA ATGTTTAAAA ATGAATTAAA TAAAGAAGAT ATATTGAAAC TGATGGCAGC AAGAGAAAAA GTGGCGGGAG CATCAGACAA ATGGACGAAG GCATCAGGCT TATATTCTGC GATAGTGAAA GGGCATACGA AGATTGTTGC TGCCTGGATG GAGACAGCTG AAGTGATAGC CAGCCATTAT GAAAATGATA AAGATGTAGT GAGAGAACTC CTGTCGCTGA GCAGAAATAA TGCAGTTTGC TCTTTGTATG TTGCCAGCTA TAAGACAATG 
AGTAAGCAGG TCATTGATGT ATATCTGAAT GCGGCGATTC GCCTGGCGTT GCAACACGGG TTCACTTTCG ATGAGATTTT GGAGCAGTTT ACCCGTGACT TTGATGGGAA GTCATTCTCT CTTGCGGTAG AGAAAGCGGA TGATATATAT GGGTCTCTGG CTGAAAATAT TCAAAATTGT GGTTGGTGA
|
Protein sequence | MVDNVTVSRV CIQSPSFVPD LDGEKNKSQL FVDDIVAYLK SPSVYSLEKE GPLNHFVNHC SEVELGFYSD GAYSILVSRS KQQPEGMILT VSDADAINIV HISVSPVLIK FLDDIFTCLH TYPDDESFTK EQIKANSKYD IVDYNCLLHF TGKPKSLIEC RHFALQYCID SMNEHTGKVP LKAYYSSPED IQKHIPFELE QQFNNLQKNP PPGTCVVASD KFGEALSVFF HRMEKEKLTH MTAIVQSQTH AMAVRLRIKK TPAGETEYVV SFYDPNATNT AVRYKANNCD SFGSLQSFIN IQQAKQKWVI TDICSECVGI TPYLPREQAH LLSGIENELQ PPLSPPALFL LMRMGIYKNI VLFFDKLKNS QEMTASKALD ILAAKSPEGI YGLCVLLYHN TIDKFNDYIT NLKELTRKYN FSQEDLETLL LAKDNLGVSW IPRALKNNQN KIVKAWLLAI DDFEKEFGVN KNEILLRIGK EIDSIDDLNS AIRTNDYNVV NILLANIKAK MFKNELNKED ILKLMAAREK VAGASDKWTK ASGLYSAIVK GHTKIVAAWM ETAEVIASHY ENDKDVVREL LSLSRNNAVC SLYVASYKTM SKQVIDVYLN AAIRLALQHG FTFDEILEQF TRDFDGKSFS LAVEKADDIY GSLAENIQNC GW
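The relationship between the Gene sequence and Protein sequence fields can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is given 5' to 3' on the coding strand (the record lists the gene on the minus strand, but the sequence shown already starts with ATG), and it uses the standard codon assignments, which translation table 11 shares for internal codons. The function names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the Gene sequence field above.
print(translate("ATGGTTGATA ACGTAACTGT TAGCAGAGTA"))
print(translate("TGTGGTTGGTGA"))
```

The two fragments translate to the first ten and last three residues of the Protein sequence field. Passing the complete Gene sequence value to `translate` and `gc_percent` would allow the full protein and the GC content field to be checked in the same way.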