Gene ECH74115_1565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1565 
Symbol 
ID6966561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1529076 
End bp1530437 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content40% 
IMG OID643385530 
Producthypothetical protein 
Protein accessionYP_002270024 
Protein GI209400220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00000172498 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTCCTA CATCGCAATT ACGACCGACC GGGACATTCT GCTCCTATTC CGCTGAAACA 
TCAGCAGACA TCAAAAGCGA AATCACACCA ATTCAGATAG AAGAAGCGCG GGCCAGTGGT
CGTTTATATA TCAAAGATTG TGATATTGAG TATCTGCCAC AGTTACCAAA CGAAATAACA
TCAGTTACAA TCGAAAACTG CAACAACCTG ACAACCCTTA CAGGATTGCC GGTTAATACA
CAAAACCTCT CCGTCATTAA CTGTGAAAAA TTACAAATCA CAGACATGCC ATCAACCGTA
AAAAATCTAC ATATTGAATT AACTGATTCA CCATTTATAC ATTTCATATC TGAAGGCATC
GAGTGCCTGA CGGTTTGCCA CTGCTATATA TCTGGAGTGC CAGAGAGTGT CCGCTACCTT
GAAATAAAAG GTAGCGCCAC AGACAGCATA AAAAATGTTC CAAACGGGTT ATCATCTCTC
AGCATCAATA GCTATAACCC GGAGAATCAG GCCAGAATTG ACAACCTGAT ATCACCGTCA
CTGAAGACGC TATCGCTGAC TGGATGTAGC AATATTATAC TGCCGGAGAA ACTTCCGGAG
AGTGTGACAT CGGTAACCAT TCATGCGGAG CAGAAAACCA CGTGGAACAT CGGTGTTGAA
GGGATGCCTG ATGGGCTGGA TCTTGATTTA CAAAATGTAC TACTCTCTCC AGATGTAGTT
AAAGCAAAAA ACATCACCTT TCAGGGCAAC GCTCTGGATG TGGCCTTACA CTTTCGCGAG
GGAGACATTG TCTATGGACT ATCTTCACCC AGAGAAAAAC TTGTCAACAG CATTAAACTA
GTTAACGACT TTTCCAAAAA AGATATTATA ACTCAGAATA CGTTAACAAA CGCAGTATGG
GACCCCAGAA CACCTCGCAA ATATAAGCAA GATCCACTTA TCAAAAGAGC ATTAAATGAA
CACGAAAGAG GAATAAAATT TAAACAACAC TTAAAGAATC ACAATAATTA TAATGTTACC
ATGGCCGACC TTTCCGTATA CAATCGCGAC AAATTATGGG CAAAAACAAG CAAGGCCGGC
CTAGAGTTTC AGACATTAAC ACGCAATAAA ACGGTTATTT TTTGTGCGGA TGAGCTTGTC
AACTCACTCA AACTCATAGC TAACAAGTCA GAGGGCTATG GCCAGAGTAT TACCGCCAGC
GAATTACGAT GGATTTACCG TAATAAAGAC AACAACCAAA TAATGAAAAA CATAAAATTT
TATCTACATG GCAAAGAGAT ACCAGCAGAA AGAATATTAG ATACACCAGA ATGGAAAGAC
TATCGTCCAA AATACTCTGG TTCCACATAT AAATATTCTT AA
 
Protein sequence
MLPTSQLRPT GTFCSYSAET SADIKSEITP IQIEEARASG RLYIKDCDIE YLPQLPNEIT 
SVTIENCNNL TTLTGLPVNT QNLSVINCEK LQITDMPSTV KNLHIELTDS PFIHFISEGI
ECLTVCHCYI SGVPESVRYL EIKGSATDSI KNVPNGLSSL SINSYNPENQ ARIDNLISPS
LKTLSLTGCS NIILPEKLPE SVTSVTIHAE QKTTWNIGVE GMPDGLDLDL QNVLLSPDVV
KAKNITFQGN ALDVALHFRE GDIVYGLSSP REKLVNSIKL VNDFSKKDII TQNTLTNAVW
DPRTPRKYKQ DPLIKRALNE HERGIKFKQH LKNHNNYNVT MADLSVYNRD KLWAKTSKAG
LEFQTLTRNK TVIFCADELV NSLKLIANKS EGYGQSITAS ELRWIYRNKD NNQIMKNIKF
YLHGKEIPAE RILDTPEWKD YRPKYSGSTY KYS