Gene ECH74115_4937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4937 
Symbol 
ID6966648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4575162 
End bp4576157 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content47% 
IMG OID643388620 
Productacyltransferase family protein 
Protein accessionYP_002273047 
Protein GI209400863 
COG category[S] Function unknown 
COG ID[COG3274] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.125366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCA AAATTTACTG GATTGATAAC CTGCGAGGGA TAGCGTGTTT AATGGTGGTG 
ATGATTCACA CCACTACCTG GTATGTGACC AATGCTCATA GTGTTAGCCC CGTCACCTGG
GATATCGCCA ATGTTCTGAA CTCTGCCTCT CGTGTCAGCG TGCCGCTATT TTTCATGATT
TCCGGCTATC TCTTTTTTGG CGAACGCAGC GCCCAGCCGC GCCATTTCTT GCGTATCGGC
TTATGTCTGT TTTTTTATAG CGCAATCGCG CTGCTCTACA TTGCACTGTT TACCTCCATT
AATGTGGAGT TAGCGCTGAA AAACCTGCTG CAAAAGCCAG TGTTTTACCA CTTATGGTTT
TTCTTCGCGA TTGCGGTGAT TTATCTGGTT TCACCGCTGA TTCAGGTGAA GAACGTCGGC
GGAAAAATGT TGCTGGTGCT AATGGTGGTG ATTGGTATCA TCGCTAACCC AAACACCGTG
CCGCAGAAAA TCGACGGTTT TGAATGGCTG CCAATTAACT TATATATCAA TGGCGATACT
TTTTACTACA TTCTGTATGG CATGTTGGGC CGCGCTTTAG GGATGATGGA CACACAGCAT
AAAGCACTGT CGTGGGTGAG CGCCGCACTG TTTGCGACGG GAGTTTTTAT TATCTCTCGC
GGGACATTAT ATGAATTGCA GTGGCGCGGA AATTTTGCCG ATACCTGGTA TCCTTACTGT
GGGCCGATGG TTTTTATCTG CGCAATCGCG CTATTGACTC TGGTTAAAAA CACGCTGGAT
ACGCGTACCA TTCGCGGACT TGGCTTAATC TCCCGCCATT CGTTGGGTAT ATACGGATTC
CACGCCTTGA TTATCCATGC GCTGCGCACC CGGGGAATTG AGCTTAAAAA TTGGCCAATA
CTGGATATTA TTTGGATTTT TTGCGCGACG TTGGCAGCGA GTTTGTTACT TTCTATGCTG
GTACAACGAA TCGACAGAAA CAGATTAGTG AGTTAA
 
Protein sequence
MQPKIYWIDN LRGIACLMVV MIHTTTWYVT NAHSVSPVTW DIANVLNSAS RVSVPLFFMI 
SGYLFFGERS AQPRHFLRIG LCLFFYSAIA LLYIALFTSI NVELALKNLL QKPVFYHLWF
FFAIAVIYLV SPLIQVKNVG GKMLLVLMVV IGIIANPNTV PQKIDGFEWL PINLYINGDT
FYYILYGMLG RALGMMDTQH KALSWVSAAL FATGVFIISR GTLYELQWRG NFADTWYPYC
GPMVFICAIA LLTLVKNTLD TRTIRGLGLI SRHSLGIYGF HALIIHALRT RGIELKNWPI
LDIIWIFCAT LAASLLLSML VQRIDRNRLV S