Gene ECH74115_1559 details


Gene Information

Locus tag: ECH74115_1559
Symbol:
ID: 6971742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1523030
End bp: 1526431
Gene Length: 3402 bp
Protein Length: 1133 aa
Translation table: 11
GC content: 32%
IMG OID: 643385526
Product: hypothetical protein
Protein accession: YP_002270020
Protein GI: 209396457
COG category:
COG ID:
TIGRFAM ID:
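
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1523030, 1526431)
print(length, protein_length(length))  # 3402 1133
```

Both values agree with the Gene Length and Protein Length fields of this record.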


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0000950021
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATAA CAAACTATAT ACTGCCAACA AGTCGTACTC ATGGTTCATT CTCAACTATA 
AAATCATGGG ACACAATGAA TTATATTAAA CATTTAATCA GACATACAAA TGACCCTATA
TTTGAAGAAC AATTTTATAA AATAACACAA TCTCATATTG ACTTTGACAA AAGAGCTAAA
GATGAAAAAA ATGACACCAT TAACATTTAT GATAACTTTT TCTATTCATC TAATGATGAT
CTTGATTCTA AAATTAGAAG TATGTTAAAT AATTTATATG AGAAAAGCTT AACTTTCCGA
AGAATAATTA ATTATTATGT GAAGGAAATA AACTTAAGTG ATTATGGCTT TCTAAAATGT
AAGATTTTAC CAGCATATGC TTATAACTAT GAGATGGAAA ATGATGCCCC CCCAAAAATA
CTAATTCCAA TTGACCATAA TTTAAATTTT ATTGATGCAA AATATAATGG AGAAACTTAT
CGGGGAAATG AAGAGTTTGC TATTAATCTT TTTCTGCAGC ATATATTACA TAATGACATA
CAAGAACAAA CATCGATAGA CTTATACACG AGCATAATAA ATAAAGAGTT GGATAGCAAT
AGAAAATCAT ACAATAATGA AATTTTTAAC AATTTCTCTT TTGATAAGTC TGTAAAGTTG
AATTCATATA ACTATATTGC AGATGATATA GAGCAAGTAA TCGATAAAGG AAGCAAAGTT
CAATTGGAGG TATATAATTT ATTATCCGAA GAAAAGATAT TTGAACATAA AATTATGAAT
AATTGGACAA GGAGCATAAA AAATATATTG ACGACATATT TGTTTATGTC ATCAGGAGCG
GTGACAGCCA GAAATGTTCA AACCTTTTCT CCAACAATAA ATAATGAGTC AAGGATTCGA
TTGCCGAGAG CATTGCCAGT AGGCCATCCA TATCCTGAGG AACATAAGGC TTCTGGTTTC
TCCCCTTTTA TGATGGGGGG GCTGAGTGGT GATATTCTTC CGGAAATTTT AACGGGGAAT
GGACCATCTA TATTTTTTAA CGGAAAACAT AATAACCAAC ATGATGGAGC TTTTGGAAAA
ATAATAGATT TTACCCAAAA TGGAAATAAA ATAAGTGCAA AAGATAAAGA AATAATAAAA
AGATATATTT TTGATAAGAT CAATGTTTTG ATTAAAGAGT ATTTCATTAG AACTGGTAAA
AATTCTCATA CCCCATTTGA AGTTTTTATA AAGGAGCGAT TATTTAATCA ATATGATATT
TTTAAAACAT TGGCTAGAGA TATATTGGCA CACCCATTAG TAATATATGA TGCGGGTTAC
AAAAATTATC ATGAGTCATT AAATGCTGCT ATTGCAATAA ACTCTAGACC ATTACAAGAA
ATACATTATG GTGATGTTTT ATATCATTAT CATAAAAATG ACATCTCTTT GGGAGTAGAT
ACTCTTTACG GGAGGGAAAG TTTTGATATT GTACTGGATG CAATGAACGT ATATAGAAAA
AGCAAAAAAA TGAGAGTTAT TTCCAATAAT GAGATGAAAA AAAGCATTAA AATATCTGAA
TTAGTTATCC ATAATATTAT AAAGAAAGGA TTGACTAATT GTTTGCTTAA AAAGGATGTT
CTTAATGCCA GATATGATCT TATTAGAGAT ATTCTTCGAT ATTCTTTAAA TATACGACAG
GGAATTAAAC ATGATGATGT TAATAGAATA GCGGAAAATA TAATAAAAAA GTATGGTATA
ACTGAGGGTA TGAATCCTAA ACCTAGGAAT GCCAGAATAT CTAAAGAATT GCTTTTATTA
GCTGTTGATA GACAGATTGA GTGGGCGAAA AAACATTTTA TAACAAAAGA TGTATTGGAA
AATGTTGTGT CAAAATGTGA TTTATCATCT ATCTTTAATG TTAATAAAGT GCTTCAGAAT
ACTATTCTTG AGTTTGTCCA TGAAATTAAT AATATATCAT CTGCTCGCTG GATGTCAAAA
TCAGAAAAGA ATAATAAACA AAAAGAGGCA ATAGAAAAGT TCAAAAAAGA AGTATCCCAT
ATGAATGGCG GGCAGCAGTT TATTTGGGGG TTTGATAAGG TTATTCAAGA AGGCTTAAGT
GGATTGATTG AGTTAAGTAT CGATATTAAT GATAGTACAA ATCATCGTGA TAAGTCTTCT
CTTTCTCCTG ATGGGAGAGC TGTGTTACAT TTTTTAGGTA CAATTTGGAA TATGGCGATG
GGAGCTGTAC CTGGTTATAA TGCATTGTCT GGTGTAAGTA GCATTTTACA TAGTGCTATA
GTAAAAGAAT CTAGCAATAT CTGTGATTAT ATTCAGGGGG CTGTACGTAT TGGAATGGAC
TTTGTTCCAG GCACTCGCTC TGACTTACAT AGCCGTTCGC TGCAGATAAA ATATGAGGCT
TTGAAGCATA TAGAAAAAAA CATTAATGAT AATATTATTT ATCATCCGAG TAATAATGCT
AATTTCTATT CTGTAATTGA GTCAATTGAT GGTAATGACT TTATATATAA CGAAAAACAA
TCTAAAATAT TAGAAATGAA ACAGGATCGT GGGGGGAATA GATATAGTGC AGTAGATCTT
AACTCTTCTA AGTATGGGTA TTATGAGAAA GTTGGCGGTG GTTTTTATAG ATATATAGAA
TCCTTTAACC CCATATCTTC AGAGACACCA AATAAAATAG TCTACAAGGG GGAATCAGTA
GATTTAACTA AGGAGCCAAA TTCGGAATTA TATTCAGGTA GGTATTCTAT AAATAACAAA
CAGGTTAATG TTTATTTCTT TCGTGACGCT GATGGTACAT TTTATAAATC AGAAGGTCTT
CATGGTGGGG GAGTTATTAG ATACATAGAT AAACCGTATT CTCAGTTAAG AGAAGGAGAT
ATTGGGTATG ATGAGGATTT GTTGGATATA TACGATGATT CTCCGGTGCT TGAAGACACG
TTGCCTGCTT TATCTTCTGA AATAGTACCA ACTCCAGAAC ATAGTATTAA ACAAATTTAT
TCGAAAATTA AGGAGGGGCA CATAGAACTG TCCGATTCAG ACATCATATT GTGTCGCGGC
ACAACCGGTA TTCAAGCTGA AAATATCGTT GAATATAAAA CTGCTGGAGG GCTTCCTGAT
TCAAATCCAA ATGTAAAAGC ACCAGATGAA TATATGGCAC AACAGCAGGT ACGTATTGGA
AGAATATTGC CTGAATACAC ATCGGATCTT AGCGTTGCTG ATCGGTTTAG TCGTGAGCAT
TATCTAATAG TTGTTAAAGT AAAGGCAAAA TATATCACAC GAGGAAGTGT TACAGAGAGT
GGTTGGGTTA TAGATAAGAC CGCACCTGTT GAACCACTTG CGATAATTGA TAGAACTTTT
GGTATGAAGG AAAATATCTC AATGGTAAAT GCATCGAAAT AG
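
The gene sequence above is printed in groups of 10 bases, so the spacing has to be removed before the sequence is analysed. A minimal sketch of that cleanup with a simple GC-fraction helper; the function names are illustrative, and the way the database derives its own GC content field is not stated here:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, copied as displayed.
first_row = "ATGAAAATAA CAAACTATAT ACTGCCAACA AGTCGTACTC ATGGTTCATT CTCAACTATA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full listing through `clean_sequence` should give a string whose length equals the Gene Length field.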
 
Protein sequence
MKITNYILPT SRTHGSFSTI KSWDTMNYIK HLIRHTNDPI FEEQFYKITQ SHIDFDKRAK 
DEKNDTINIY DNFFYSSNDD LDSKIRSMLN NLYEKSLTFR RIINYYVKEI NLSDYGFLKC
KILPAYAYNY EMENDAPPKI LIPIDHNLNF IDAKYNGETY RGNEEFAINL FLQHILHNDI
QEQTSIDLYT SIINKELDSN RKSYNNEIFN NFSFDKSVKL NSYNYIADDI EQVIDKGSKV
QLEVYNLLSE EKIFEHKIMN NWTRSIKNIL TTYLFMSSGA VTARNVQTFS PTINNESRIR
LPRALPVGHP YPEEHKASGF SPFMMGGLSG DILPEILTGN GPSIFFNGKH NNQHDGAFGK
IIDFTQNGNK ISAKDKEIIK RYIFDKINVL IKEYFIRTGK NSHTPFEVFI KERLFNQYDI
FKTLARDILA HPLVIYDAGY KNYHESLNAA IAINSRPLQE IHYGDVLYHY HKNDISLGVD
TLYGRESFDI VLDAMNVYRK SKKMRVISNN EMKKSIKISE LVIHNIIKKG LTNCLLKKDV
LNARYDLIRD ILRYSLNIRQ GIKHDDVNRI AENIIKKYGI TEGMNPKPRN ARISKELLLL
AVDRQIEWAK KHFITKDVLE NVVSKCDLSS IFNVNKVLQN TILEFVHEIN NISSARWMSK
SEKNNKQKEA IEKFKKEVSH MNGGQQFIWG FDKVIQEGLS GLIELSIDIN DSTNHRDKSS
LSPDGRAVLH FLGTIWNMAM GAVPGYNALS GVSSILHSAI VKESSNICDY IQGAVRIGMD
FVPGTRSDLH SRSLQIKYEA LKHIEKNIND NIIYHPSNNA NFYSVIESID GNDFIYNEKQ
SKILEMKQDR GGNRYSAVDL NSSKYGYYEK VGGGFYRYIE SFNPISSETP NKIVYKGESV
DLTKEPNSEL YSGRYSINNK QVNVYFFRDA DGTFYKSEGL HGGGVIRYID KPYSQLREGD
IGYDEDLLDI YDDSPVLEDT LPALSSEIVP TPEHSIKQIY SKIKEGHIEL SDSDIILCRG
TTGIQAENIV EYKTAGGLPD SNPNVKAPDE YMAQQQVRIG RILPEYTSDL SVADRFSREH
YLIVVKVKAK YITRGSVTES GWVIDKTAPV EPLAIIDRTF GMKENISMVN ASK
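
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that and uses the standard codon assignments, which translation table 11 shares with the standard code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and the tail of its last row.
print(translate("ATGAAAATAA CAAACTATAT ACTGCCAACA"))  # MKITNYILPT
print(translate("GCATCGAAAT AG"))  # ASK
```

The two fragments reproduce the first ten and last three residues of the protein sequence shown above.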