Gene ECH74115_4916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4916 
Symbol 
ID6972391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4553712 
End bp4555403 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content50% 
IMG OID643388601 
Productphosphoethanolamine transferase 
Protein accessionYP_002273028 
Protein GI209399321 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACA TCAAATCGAT TACACAGCAG AAGCTGAGCT TTTTGCTTGC AATCTATATT 
GGCCTTTTTA TGAATGGCGC GGTTTTTTAC CGCCGCTTCG GCAGCTATGC GCACGATTTT
ACCGTCTGGA AAGGCATTTC TGCTGTTGTT GAACTGGCCG CCACCGTACT GGTGACCTTC
TTTTTACTAC GTCTTCTTTC GCTGTTTGGC CGCCGCAGCT GGCGTATTCT GGCATCGCTG
GTTGTGCTCT TTTCCGCAGG TGCCAGCTAT TACATGACCT TCCTTAATGT GGTCATTGGT
TATGGCATCA TCGCTTCCGT CATGACCACC GATATCGACC TGTCAAAAGA AGTTGTTGGT
CTGAACTTTA TTCTCTGGTT AATCGCCGTT AGCGCATTGC CTCTTATCCT TATCTGGAAT
AACCGCTGTC GCTACACCTT GCTCCGACAA CTGCGAACCC CAGGGCAGCG TATTCGCAGC
CTGGCGGTCG TCGTACTGGC GGGAATTATG GTTTGGGCAC CGATTCGTTT GCTGGATATC
CAGCAGAAGA AAGTGGAGAG GGCGACCGGC GTTGATTTGC CGAGTTATGG CGGTGTCGTA
GCGAACTCTT ATCTGCCATC AAACTGGCTT TCTGCGTTGG GGCTGTACGC CTGGGCGCGG
GTCGATGAAT CTTCCGATAA TAATTCATTG CTTAACCCGG CGAAGAAATT CACTTATCAG
GCACCGCAAA ATGTTAATGA TACTTATGTC GTGTTTATCA TCGGTGAAAC CACGCGTTGG
GACCATATGG GCATTTTCGG CTATGAGCGT AATACCACGC CGAAGCTGGC CCAGGAGAAA
AATCTGGCGG CGTTCCGTGG TTACTCCTGT GATACCGCAA CCAAACTCTC TCTGCGTTGC
ATGTTTGTAC GTCAGGGGGG CGCGGAAGAT AATCCGCAGC GCACATTAAA AGAACAGAAC
ATTTTCGCGG TACTGAAGCA GTTAGGATTC AGTTCTGACC TCTACGCTAT GCAGAGCGAA
ATGTGGTTCT ACAGCAATAC GATGGCGGAC AATATTGCTT ACCGTGAACA GATTGGTGCG
GAGCCACGTA ACCGTGGCAA GCCGGTAGAT GATATGTTGC TGGTAGACGA AATGCAGCAA
TCGCTGGGGC GCAACCCGGA TGGTAAGCAT CTAATCATTC TGCATACCAA AGGTTCGCAC
TTTAACTACA CCCAGCGTTA CCCGCGCAGC TTTGCGCAGT GGAAGCCGGA ATGTATTGGT
GTTGATAGCG GATGTACCAA AGCGCAGATG ATCAACTCCT ATGACAACTC GGTGACCTAT
GTGGATCACT TTATCTCCAG TGTGATTGAT CAGGTTCGCG ATAAGAAAGC GATTGTGTTC
TACGCAGCTG ACCACGGCGA GTCAATTAAT GAACGCGAGC ACCTGCACGG CACGCCGCGT
GAACTGGCAC CGCCGGAGCA GTTCCGCGTA CCGATGATGG TCTGGATGTC AGATAAATAT
CTGGAAAATC CGGTCAATGC GCAGGCGTTT GCGCAGCTGA AAAAAGCAGC CGACATGAAA
GTGCCACGCC GTCACGTAGA GCTGTACGAC ACCATCATGG GTTGTCTTGG CTATACTTCA
CCGGATGGTG GAATTAACGA AAACAACAAC TGGTGTCACA TCCCGCAGAC AAAAGAGGCA
GCGGCTAACT AA
 
Protein sequence
MRYIKSITQQ KLSFLLAIYI GLFMNGAVFY RRFGSYAHDF TVWKGISAVV ELAATVLVTF 
FLLRLLSLFG RRSWRILASL VVLFSAGASY YMTFLNVVIG YGIIASVMTT DIDLSKEVVG
LNFILWLIAV SALPLILIWN NRCRYTLLRQ LRTPGQRIRS LAVVVLAGIM VWAPIRLLDI
QQKKVERATG VDLPSYGGVV ANSYLPSNWL SALGLYAWAR VDESSDNNSL LNPAKKFTYQ
APQNVNDTYV VFIIGETTRW DHMGIFGYER NTTPKLAQEK NLAAFRGYSC DTATKLSLRC
MFVRQGGAED NPQRTLKEQN IFAVLKQLGF SSDLYAMQSE MWFYSNTMAD NIAYREQIGA
EPRNRGKPVD DMLLVDEMQQ SLGRNPDGKH LIILHTKGSH FNYTQRYPRS FAQWKPECIG
VDSGCTKAQM INSYDNSVTY VDHFISSVID QVRDKKAIVF YAADHGESIN EREHLHGTPR
ELAPPEQFRV PMMVWMSDKY LENPVNAQAF AQLKKAADMK VPRRHVELYD TIMGCLGYTS
PDGGINENNN WCHIPQTKEA AAN