Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0124 |
Symbol | |
ID | 6968493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 133760 |
End bp | 135613 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384201 |
Product | hypothetical protein |
Protein accession | YP_002268724 |
Protein GI | 209398273 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00449065 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA CTTTGCCGTT TAAACCCCAT GTGCTGGCGC TAATTTGCAG TGCCGGGCTT TGTGCCGCCT CTGCCGGGCT ATATATAAAA AGCCGCACAG TGGAAGCGCC TGTGGAAACG CAATCGACAC AACTGGCTGT GTCTGACGCT GCCGCAGTTA CGCTTCCTGC AACGGTTTCC GCACCTCCCG TAACACCCGC CGTCGTCAAA TCCGCATTCA GCACTGCACA AATAGATCAA TGGGTCGCGC CCGTCGCGCT GTATCCCGAC GCCCTACTTT CGCAGGTGCT GATGGCATCA ACCTATCCGA CAAACGTTGC TCAAGCAGTG CAATGGTCGC ACGATAATCC ACTTAAACAA GGCGATGCTG CTATTCAGGC GGTATCTGAC CAGCCGTGGG ACGCCAGCGT TAAATCACTG GTGGCCTTTC CACAATTGAT GGCATTGATG GGCGAAAACC CGCAATGGGT GCAAAACCTG GGCGATGCTT TTCTGGCCCA GCCGCAGGAC GTGATGGACT CGGTACAACG ATTGCGGCAA CTGGCACAAC AAACCGGCTC GCTGAAGTCA TCAACCGAAC AGAAAGTTAT TACCACAACG AAGAAAACTG TACCGGTAAC ACAGACAGTC ACGGCTCCCG TCATACCATC CAATACCGTT TCAACTGCCA ACCCTGTCAT TACAGAGCCT GCAACAACCG TCATTTCCAT TGAGCCCGGC AATCCTGATG TGGTCTATAT TCCCAACTAC AACCCAACCG TGGTTTACGG GAACTGGGCC AATACTGCGT ATCCGCCGGT TTATCTGCCA CCACCAGCCG GAGAACCGTT TGTTGACAGC TTTGTACGCG GATTCGGCTA TAGCATGGGC GTTGCTACCA CGTACGCACT ATTCAGCAGC ATCGACTGGG ATGACGACGA TCATGACCAT CATCATCATG ACGATGATAA TTATCATCAC CACGATGGCG GTCATCGTGA CGGTAATGGC TGGCAACATA ACGGCGACAA CATCAATATC GACGTCAACA ATTTCAACCG TATCACCGGT GAGCATCTTA CTGATAAGAA TATGGCATGG CGGCACAATC CAAACTACCG TAATGGTGTG CCCTATCATG ATCAGGATAT GGCAAAGCGG TTTCATCAAA CTGATGTCAA CGGCGGAATG AGTGCCACGC AGCTACCTGC TCCAACACGC GACAGCCAGC GTCAGGCGGC AGCAAGTCAG TTTCAGCAAC GAACACACGC CGCCCCCGTC ATTACACGAG ATACCCAACG TCAGGCAGCG GCACAGCGGT TTAATGAAGC TGAACACTAT GGGAGCTATG ACGACTTCCG CGACTTCAGC CGTCGCCAAC CACTGACCCA GCAACAAAAG GACGCCGCTC GTCAGCGTTA TCAGTCAGCT TCTCCTGAGC AGCGCCAGGC AGTTCACGAG AAAATGCAGA CTAACCCGCA GAACCAGCAG CGAAGAGAGG CAGCGCGTGA GCGCATTCAG CCCGCCTCGC CTGAGCAGCG CCAGGCAGTC CGCGAGAAAA TGCAGACTAA CCCACAGATC CAGCAGCGAA GAGACGCAGC GCGTGAGCGT ATTCAGTCAG CCTCGCCTGA GCAGCGCCAG GTGTTTAAGG AAAAAGTACA GCAGCGCCCA CTGAACCAAC AGCAACGTGA TAACGCCCGC CAGCGTGTTC AATCAGCATC ACCTGAACAA CGTCAGGTTT TTCGGGAGAA AGCTCAGGAG AGCCGCCCAC AACGTCTAAA CGACAGTAAC CATACTGCCA GGCTGAATAA CGAGCAACGG TCAGCAGTAC GCGAACGTCT CTCTGAGCGC GGAGCAAGGC GACTGGAAAG GTAA
|
Protein sequence | MKMTLPFKPH VLALICSAGL CAASAGLYIK SRTVEAPVET QSTQLAVSDA AAVTLPATVS APPVTPAVVK SAFSTAQIDQ WVAPVALYPD ALLSQVLMAS TYPTNVAQAV QWSHDNPLKQ GDAAIQAVSD QPWDASVKSL VAFPQLMALM GENPQWVQNL GDAFLAQPQD VMDSVQRLRQ LAQQTGSLKS STEQKVITTT KKTVPVTQTV TAPVIPSNTV STANPVITEP ATTVISIEPG NPDVVYIPNY NPTVVYGNWA NTAYPPVYLP PPAGEPFVDS FVRGFGYSMG VATTYALFSS IDWDDDDHDH HHHDDDNYHH HDGGHRDGNG WQHNGDNINI DVNNFNRITG EHLTDKNMAW RHNPNYRNGV PYHDQDMAKR FHQTDVNGGM SATQLPAPTR DSQRQAAASQ FQQRTHAAPV ITRDTQRQAA AQRFNEAEHY GSYDDFRDFS RRQPLTQQQK DAARQRYQSA SPEQRQAVHE KMQTNPQNQQ RREAARERIQ PASPEQRQAV REKMQTNPQI QQRRDAARER IQSASPEQRQ VFKEKVQQRP LNQQQRDNAR QRVQSASPEQ RQVFREKAQE SRPQRLNDSN HTARLNNEQR SAVRERLSER GARRLER
|
| |