Gene ECH74115_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0094 
SymbolmurF 
ID6969213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp99236 
End bp100594 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content55% 
IMG OID643384171 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_002268694 
Protein GI209398779 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGCG TAACCCTTAG CCAACTTACC GACATTCTCA ACGGTGAACT GCAAGGTGCA 
GATATCACCC TTGATGCTGT AACCACTGAT ACCCGAAAAC TGACGCCGGG CTGCCTGTTT
GTTGCCCTGA AAGGCGAACG TTTCGATGCT CATGATTTTG CCGACCAGGC GAAAGCTGGC
GGCGCAGGCG CACTACTGGT TAGCCGTCCG CTGGATATCG ACCTGCCGCA GTTAATCGTC
AAGGATACGC GTCTGGCGTT TGGTGAACTG GCTGCATGGG TTCGCCAGCA AGTTCCGGCG
CGCGTGGTTG CTCTGACAGG TTCCTCCGGC AAAACCTCCG TTAAAGAGAT GACGGCGGCG
ATTTTAAGCC AGTGCGGCAA CACGCTTTAT ACGGCAGGCA ATCTCAACAA CGACATCGGT
GTACCGATGA CGCTGTTGCG CTTAACGCCG GAATACGATT ACGCAGTTAT TGAACTTGGC
GCGAACCATC AGGGCGAAAT TGTCTGGACT GTGAGTCTGA CTCGCCCGGA AGCTGCGCTG
GTCAACAACC TGGCAGCGGC GCATCTGGAA GGTTTTGGCT CGCTTGCGGG TGTCGCGAAA
GCGAAAGGTG AAATCTTTAG CGGCCTGCCG GAAAACGGTA TCGCCATTAT GAACGCCGAC
AACAACGACT GGCTGAACTG GCAGAGCGTA ATTGGCTCAC GCAAAGTGTG GCGTTTCTCA
CCTAATGCCG CCAACAGCGA TTTCACCGCC ACCAATATCC ATGTGACTTC GCACGGTACG
GAATTTACCC TGCAAACCCC AACCGGTAGC GTGGATGTTC TGCTGCCGTT GCCGGGGCGT
CACAATATTG CGAATGCGCT GGCAGCCGCT GCGCTCTCCA TGTCCGTGGG CGCAACGCTT
GATGCTATCA AAGCGGGGCT GGCAAATCTG AAAGCTGTTC CAGGCCGTTT GTTCCCCATT
CAACTGGCAG AAAACCAGTT GCTGCTCGAC GACTCCTACA ACGCCAATGT CGGTTCAATG
ACCGCCGCTG TTCAGGTACT GGCTGAAATG CCGGGCTACC GCGTGCTGGT GGTGGGCGAT
ATGGCGGAAC TGGGCGCTGA AAGCGAAGCC TGCCATGTAC AGGTGGGCGA GGCGGCAAAA
GCTGCTGGTA TTGACCGCGT GTTAAGCGTG GGTAAACAAA GCCATGCTAT CAGCACCGCC
AGCGGCGTTG GCGAACATTT TGCTGAGAAA ACTGCGTTAA TTACGCGTCT TAAATCACTG
ATTGCTGAGC AACAGGTAAT TACGATTTTA GTTAAGGGTT CACGTAGTGC CGCCATGGAA
GAGGTAGTAC GCGCTTTACA GGAGAATGGG ACATGTTAG
 
Protein sequence
MISVTLSQLT DILNGELQGA DITLDAVTTD TRKLTPGCLF VALKGERFDA HDFADQAKAG 
GAGALLVSRP LDIDLPQLIV KDTRLAFGEL AAWVRQQVPA RVVALTGSSG KTSVKEMTAA
ILSQCGNTLY TAGNLNNDIG VPMTLLRLTP EYDYAVIELG ANHQGEIVWT VSLTRPEAAL
VNNLAAAHLE GFGSLAGVAK AKGEIFSGLP ENGIAIMNAD NNDWLNWQSV IGSRKVWRFS
PNAANSDFTA TNIHVTSHGT EFTLQTPTGS VDVLLPLPGR HNIANALAAA ALSMSVGATL
DAIKAGLANL KAVPGRLFPI QLAENQLLLD DSYNANVGSM TAAVQVLAEM PGYRVLVVGD
MAELGAESEA CHVQVGEAAK AAGIDRVLSV GKQSHAISTA SGVGEHFAEK TALITRLKSL
IAEQQVITIL VKGSRSAAME EVVRALQENG TC