Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1559 |
Symbol | |
ID | 6971742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1523030 |
End bp | 1526431 |
Gene Length | 3402 bp |
Protein Length | 1133 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643385526 |
Product | hypothetical protein |
Protein accession | YP_002270020 |
Protein GI | 209396457 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0000950021 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATAA CAAACTATAT ACTGCCAACA AGTCGTACTC ATGGTTCATT CTCAACTATA AAATCATGGG ACACAATGAA TTATATTAAA CATTTAATCA GACATACAAA TGACCCTATA TTTGAAGAAC AATTTTATAA AATAACACAA TCTCATATTG ACTTTGACAA AAGAGCTAAA GATGAAAAAA ATGACACCAT TAACATTTAT GATAACTTTT TCTATTCATC TAATGATGAT CTTGATTCTA AAATTAGAAG TATGTTAAAT AATTTATATG AGAAAAGCTT AACTTTCCGA AGAATAATTA ATTATTATGT GAAGGAAATA AACTTAAGTG ATTATGGCTT TCTAAAATGT AAGATTTTAC CAGCATATGC TTATAACTAT GAGATGGAAA ATGATGCCCC CCCAAAAATA CTAATTCCAA TTGACCATAA TTTAAATTTT ATTGATGCAA AATATAATGG AGAAACTTAT CGGGGAAATG AAGAGTTTGC TATTAATCTT TTTCTGCAGC ATATATTACA TAATGACATA CAAGAACAAA CATCGATAGA CTTATACACG AGCATAATAA ATAAAGAGTT GGATAGCAAT AGAAAATCAT ACAATAATGA AATTTTTAAC AATTTCTCTT TTGATAAGTC TGTAAAGTTG AATTCATATA ACTATATTGC AGATGATATA GAGCAAGTAA TCGATAAAGG AAGCAAAGTT CAATTGGAGG TATATAATTT ATTATCCGAA GAAAAGATAT TTGAACATAA AATTATGAAT AATTGGACAA GGAGCATAAA AAATATATTG ACGACATATT TGTTTATGTC ATCAGGAGCG GTGACAGCCA GAAATGTTCA AACCTTTTCT CCAACAATAA ATAATGAGTC AAGGATTCGA TTGCCGAGAG CATTGCCAGT AGGCCATCCA TATCCTGAGG AACATAAGGC TTCTGGTTTC TCCCCTTTTA TGATGGGGGG GCTGAGTGGT GATATTCTTC CGGAAATTTT AACGGGGAAT GGACCATCTA TATTTTTTAA CGGAAAACAT AATAACCAAC ATGATGGAGC TTTTGGAAAA ATAATAGATT TTACCCAAAA TGGAAATAAA ATAAGTGCAA AAGATAAAGA AATAATAAAA AGATATATTT TTGATAAGAT CAATGTTTTG ATTAAAGAGT ATTTCATTAG AACTGGTAAA AATTCTCATA CCCCATTTGA AGTTTTTATA AAGGAGCGAT TATTTAATCA ATATGATATT TTTAAAACAT TGGCTAGAGA TATATTGGCA CACCCATTAG TAATATATGA TGCGGGTTAC AAAAATTATC ATGAGTCATT AAATGCTGCT ATTGCAATAA ACTCTAGACC ATTACAAGAA ATACATTATG GTGATGTTTT ATATCATTAT CATAAAAATG ACATCTCTTT GGGAGTAGAT ACTCTTTACG GGAGGGAAAG TTTTGATATT GTACTGGATG CAATGAACGT ATATAGAAAA AGCAAAAAAA TGAGAGTTAT TTCCAATAAT GAGATGAAAA AAAGCATTAA AATATCTGAA TTAGTTATCC ATAATATTAT AAAGAAAGGA TTGACTAATT GTTTGCTTAA AAAGGATGTT CTTAATGCCA GATATGATCT TATTAGAGAT ATTCTTCGAT ATTCTTTAAA TATACGACAG GGAATTAAAC ATGATGATGT TAATAGAATA GCGGAAAATA TAATAAAAAA GTATGGTATA ACTGAGGGTA TGAATCCTAA ACCTAGGAAT GCCAGAATAT CTAAAGAATT GCTTTTATTA GCTGTTGATA GACAGATTGA GTGGGCGAAA AAACATTTTA TAACAAAAGA TGTATTGGAA AATGTTGTGT CAAAATGTGA TTTATCATCT ATCTTTAATG TTAATAAAGT GCTTCAGAAT ACTATTCTTG AGTTTGTCCA TGAAATTAAT AATATATCAT CTGCTCGCTG GATGTCAAAA TCAGAAAAGA ATAATAAACA AAAAGAGGCA ATAGAAAAGT TCAAAAAAGA AGTATCCCAT ATGAATGGCG GGCAGCAGTT TATTTGGGGG TTTGATAAGG TTATTCAAGA AGGCTTAAGT GGATTGATTG AGTTAAGTAT CGATATTAAT GATAGTACAA ATCATCGTGA TAAGTCTTCT CTTTCTCCTG ATGGGAGAGC TGTGTTACAT TTTTTAGGTA CAATTTGGAA TATGGCGATG GGAGCTGTAC CTGGTTATAA TGCATTGTCT GGTGTAAGTA GCATTTTACA TAGTGCTATA GTAAAAGAAT CTAGCAATAT CTGTGATTAT ATTCAGGGGG CTGTACGTAT TGGAATGGAC TTTGTTCCAG GCACTCGCTC TGACTTACAT AGCCGTTCGC TGCAGATAAA ATATGAGGCT TTGAAGCATA TAGAAAAAAA CATTAATGAT AATATTATTT ATCATCCGAG TAATAATGCT AATTTCTATT CTGTAATTGA GTCAATTGAT GGTAATGACT TTATATATAA CGAAAAACAA TCTAAAATAT TAGAAATGAA ACAGGATCGT GGGGGGAATA GATATAGTGC AGTAGATCTT AACTCTTCTA AGTATGGGTA TTATGAGAAA GTTGGCGGTG GTTTTTATAG ATATATAGAA TCCTTTAACC CCATATCTTC AGAGACACCA AATAAAATAG TCTACAAGGG GGAATCAGTA GATTTAACTA AGGAGCCAAA TTCGGAATTA TATTCAGGTA GGTATTCTAT AAATAACAAA CAGGTTAATG TTTATTTCTT TCGTGACGCT GATGGTACAT TTTATAAATC AGAAGGTCTT CATGGTGGGG GAGTTATTAG ATACATAGAT AAACCGTATT CTCAGTTAAG AGAAGGAGAT ATTGGGTATG ATGAGGATTT GTTGGATATA TACGATGATT CTCCGGTGCT TGAAGACACG TTGCCTGCTT TATCTTCTGA AATAGTACCA ACTCCAGAAC ATAGTATTAA ACAAATTTAT TCGAAAATTA AGGAGGGGCA CATAGAACTG TCCGATTCAG ACATCATATT GTGTCGCGGC ACAACCGGTA TTCAAGCTGA AAATATCGTT GAATATAAAA CTGCTGGAGG GCTTCCTGAT TCAAATCCAA ATGTAAAAGC ACCAGATGAA TATATGGCAC AACAGCAGGT ACGTATTGGA AGAATATTGC CTGAATACAC ATCGGATCTT AGCGTTGCTG ATCGGTTTAG TCGTGAGCAT TATCTAATAG TTGTTAAAGT AAAGGCAAAA TATATCACAC GAGGAAGTGT TACAGAGAGT GGTTGGGTTA TAGATAAGAC CGCACCTGTT GAACCACTTG CGATAATTGA TAGAACTTTT GGTATGAAGG AAAATATCTC AATGGTAAAT GCATCGAAAT AG
|
Protein sequence | MKITNYILPT SRTHGSFSTI KSWDTMNYIK HLIRHTNDPI FEEQFYKITQ SHIDFDKRAK DEKNDTINIY DNFFYSSNDD LDSKIRSMLN NLYEKSLTFR RIINYYVKEI NLSDYGFLKC KILPAYAYNY EMENDAPPKI LIPIDHNLNF IDAKYNGETY RGNEEFAINL FLQHILHNDI QEQTSIDLYT SIINKELDSN RKSYNNEIFN NFSFDKSVKL NSYNYIADDI EQVIDKGSKV QLEVYNLLSE EKIFEHKIMN NWTRSIKNIL TTYLFMSSGA VTARNVQTFS PTINNESRIR LPRALPVGHP YPEEHKASGF SPFMMGGLSG DILPEILTGN GPSIFFNGKH NNQHDGAFGK IIDFTQNGNK ISAKDKEIIK RYIFDKINVL IKEYFIRTGK NSHTPFEVFI KERLFNQYDI FKTLARDILA HPLVIYDAGY KNYHESLNAA IAINSRPLQE IHYGDVLYHY HKNDISLGVD TLYGRESFDI VLDAMNVYRK SKKMRVISNN EMKKSIKISE LVIHNIIKKG LTNCLLKKDV LNARYDLIRD ILRYSLNIRQ GIKHDDVNRI AENIIKKYGI TEGMNPKPRN ARISKELLLL AVDRQIEWAK KHFITKDVLE NVVSKCDLSS IFNVNKVLQN TILEFVHEIN NISSARWMSK SEKNNKQKEA IEKFKKEVSH MNGGQQFIWG FDKVIQEGLS GLIELSIDIN DSTNHRDKSS LSPDGRAVLH FLGTIWNMAM GAVPGYNALS GVSSILHSAI VKESSNICDY IQGAVRIGMD FVPGTRSDLH SRSLQIKYEA LKHIEKNIND NIIYHPSNNA NFYSVIESID GNDFIYNEKQ SKILEMKQDR GGNRYSAVDL NSSKYGYYEK VGGGFYRYIE SFNPISSETP NKIVYKGESV DLTKEPNSEL YSGRYSINNK QVNVYFFRDA DGTFYKSEGL HGGGVIRYID KPYSQLREGD IGYDEDLLDI YDDSPVLEDT LPALSSEIVP TPEHSIKQIY SKIKEGHIEL SDSDIILCRG TTGIQAENIV EYKTAGGLPD SNPNVKAPDE YMAQQQVRIG RILPEYTSDL SVADRFSREH YLIVVKVKAK YITRGSVTES GWVIDKTAPV EPLAIIDRTF GMKENISMVN ASK
|
| |