Gene ECH74115_4082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4082 
SymbolamiC 
ID6972414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3774554 
End bp3775807 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content54% 
IMG OID643387840 
ProductN-acetylmuramoyl-L-alanine amidase AmiC 
Protein accessionYP_002272280 
Protein GI209398948 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.354645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGAT CCAACACTGC AATCAGCCGT CGTCGTTTAC TGCAAGGCGC GGGTGCCATG 
TGGCTATTGA GCGTAAGTCA GGTCAGTCTG GCTGCGGTCA GCCAGGTCGT GGCGGTGCGC
GTCTGGCCTG CGTCCAGCTA CACCCGCGTG ACGGTAGAAT CAAATCGTCA GCTGAAATAT
AAGCAGTTCG CGTTGAGTAA CCCGGAACGC GTGGTGGTGG ATATCGAAGA TGTAAACCTG
AACTCGGTAC TCAAGGGGAT GGCGGCACAA ATTCGTGCAG ACGACCCGTT CATCAAGTCG
GCGCGCGTCG GGCAATTTGA CCCGCAAACC GTACGGATGG TTTTTGAATT AAAGCAAAAC
GTAAAACCGC AGCTGTTTGC CCTTGCGCCG GTCGCCGGGT TTAAAGAGCG TCTGGTGATG
GACCTCTATC CGGCCAATGC ACAGGATATG CAGGACCCGC TGCTGGCGCT GCTGGAGGAT
TACAACAAAG GCGACCTCGA AAAGCAGGTG CCGCCAGCGC AAAGTGGTCC ACAACCAGGT
AAAGCAGGGC GCGATCGTCC GATTGTCATT ATGCTTGACC CTGGCCACGG TGGCGAAGAC
TCCGGTGCGG TGGGGAAATA CAAAACGCGC GAAAAAGATG TGGTATTGCA AATAGCTCGC
CGTTTGCGCT CTCTGATCGA GAAAGAGGGC AATATGAAGG TGTACATGAC GCGCAATGAA
GACATCTTCA TTCCGTTGCA AGTGCGCGTA GCAAAAGCTC AGAAACAGCG TGCTGACCTG
TTTGTCTCTA TCCATGCCGA CGCCTTTACC AGTCGCCAGC CGAGCGGTTC CTCGGTGTTT
GCGCTCTCAA CCAAAGGTGC AACCAGTACT GCGGCAAAAT ATCTGGCACA AACCCAGAAC
GCCTCCGACT TGATTGGTGG CGTGAGCAAA AGCGGTGACC GCTATGTCGA CCACACCATG
TTCGATATGG TGCAGTCGCT GACCATTGCT GACAGCCTTA AGTTTGGTAA AGCGGTGCTG
AATAAGCTCG GTAAAATCAA CAAGCTGCAT AAAAATCAAG TTGAACAGGC CGGGTTTGCC
GTACTAAAGG CACCAGATAT TCCCTCCATT CTGGTCGAAA CGGCGTTTAT CAGTAACGTT
GAGGAAGAGC GTAAACTGAA AACGGCGACC TTCCAGCAGG AAGTTGCGGA GTCTATTCTT
GCGGGAATTA AAGCGTATTT TGCCGATGGG GCGACCCTGG CGAGAAGGGG ATGA
 
Protein sequence
MSGSNTAISR RRLLQGAGAM WLLSVSQVSL AAVSQVVAVR VWPASSYTRV TVESNRQLKY 
KQFALSNPER VVVDIEDVNL NSVLKGMAAQ IRADDPFIKS ARVGQFDPQT VRMVFELKQN
VKPQLFALAP VAGFKERLVM DLYPANAQDM QDPLLALLED YNKGDLEKQV PPAQSGPQPG
KAGRDRPIVI MLDPGHGGED SGAVGKYKTR EKDVVLQIAR RLRSLIEKEG NMKVYMTRNE
DIFIPLQVRV AKAQKQRADL FVSIHADAFT SRQPSGSSVF ALSTKGATST AAKYLAQTQN
ASDLIGGVSK SGDRYVDHTM FDMVQSLTIA DSLKFGKAVL NKLGKINKLH KNQVEQAGFA
VLKAPDIPSI LVETAFISNV EEERKLKTAT FQQEVAESIL AGIKAYFADG ATLARRG