Gene EcolC_0898 details


Gene Information

Locus tag: EcolC_0898
Symbol:
ID: 6064549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 971695
End bp: 972948
Gene length: 1254 bp
Protein length: 417 aa
Translation table: 11
GC content: 54%
IMG OID: 641600301
Product: N-acetylmuramoyl-L-alanine amidase
Protein accession: YP_001723894
Protein GI: 170018940
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0860] N-acetylmuramoyl-L-alanine amidase
TIGRFAM ID:
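
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 971695
end_bp = 972948
gene_length_bp = 1254
protein_length_aa = 417


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1254
print(expected_protein_length(gene_length_bp))  # 417
```

Under those assumptions both values agree with the listed gene length (1254 bp) and protein length (417 aa).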


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGAT CCAACACTGC AATCAGCCGT CGTCGTTTAC TGCAAGGCGC GGGTGCCATG 
TGGCTATTGA GCGTAAGTCA GGTCAGTCTG GCTGCGGTCA GCCAGGTCGT GGCGGTGCGC
GTCTGGCCTG CGTCCAGCTA CACCCGCGTG ACGGTGGAAT CAAATCGTCA GCTGAAATAT
AAGCAGTTCG CGTTGAGTAA CCCTGAACGC GTGGTGGTGG ATATCGAAGA TGTAAACCTG
AACTCGGTAC TCAAGGGGAT GGCGGCACAA ATTCGTGCAG ACGACCCGTT CATCAAGTCG
GCGCGCGTCG GGCAATTTGA CCCGCAAACC GTACGGATGG TTTTTGAATT AAAGCAAAAC
GTAAAACCGC AGCTGTTTGC CCTTGCGCCG GTCGCCGGGT TTAAAGAGCG TCTGGTGATG
GACCTCTATC CGGCCAATGC ACAGGATATG CAGGACCCGC TGCTGGCGCT GCTGGAGGAT
TACAACAAAG GCGACCTCGA AAAGCAGGTG CCGCCAGCGC AAAGTGGTCC ACAACCGGGT
AAAGCAGGGC GCGATCGTCC GATTGTCATT ATGCTTGACC CTGGCCACGG TGGCGAAGAC
TCCGGTGCGG TGGGGAAATA CAAAACGCGC GAAAAAGATG TGGTATTGCA AATAGCTCGC
CGTTTGCGCT CTCTGATCGA GAAAGAGGGC AATATGAAGG TGTACATGAC GCGCAATGAA
GACATCTTCA TTCCATTGCA AGTGCGCGTA GCAAAAGCCC AGAAACAGCG TGCTGACCTG
TTTGTCTCTA TCCATGCCGA CGCCTTTACC AGTCGTCAGC CGAGCGGTTC CTCTGTGTTT
GCGCTCTCAA CCAAAGGTGC AACCAGTACT GCGGCAAAAT ATCTGGCACA AACCCAGAAC
GCCTCGGACT TGATTGGTGG CGTGAGCAAA AGCGGTGACC GCTATGTCGA CCACACCATG
TTCGATATGG TACAGTCGCT GACCATTGCC GACAGCCTGA AGTTTGGTAA AGCGGTACTG
AATAAGCTCG GTAAAATCAA CAAGCTGCAT AAAAATCAAG TTGAACAGGC CGGGTTTGCC
GTACTAAAGG CACCAGATAT TCCCTCCATT CTGGTCGAAA CGGCGTTTAT CAGTAACGTT
GAGGAAGAGC GTAAACTGAA AACGGCGACT TTCCAGCAGG AAGTTGCGGA GTCTATTCTT
GCGGGGATTA AAGCGTATTT TGCCGATGGG GCGACGCTGG CGAGAAGGGG ATGA
 
Protein sequence
MSGSNTAISR RRLLQGAGAM WLLSVSQVSL AAVSQVVAVR VWPASSYTRV TVESNRQLKY 
KQFALSNPER VVVDIEDVNL NSVLKGMAAQ IRADDPFIKS ARVGQFDPQT VRMVFELKQN
VKPQLFALAP VAGFKERLVM DLYPANAQDM QDPLLALLED YNKGDLEKQV PPAQSGPQPG
KAGRDRPIVI MLDPGHGGED SGAVGKYKTR EKDVVLQIAR RLRSLIEKEG NMKVYMTRNE
DIFIPLQVRV AKAQKQRADL FVSIHADAFT SRQPSGSSVF ALSTKGATST AAKYLAQTQN
ASDLIGGVSK SGDRYVDHTM FDMVQSLTIA DSLKFGKAVL NKLGKINKLH KNQVEQAGFA
VLKAPDIPSI LVETAFISNV EEERKLKTAT FQQEVAESIL AGIKAYFADG ATLARRG
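
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record: it builds the codon table from the amino-acid string that NCBI publishes for translation table 11 (identical to the standard code for codon-to-residue assignments), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only two short in-frame excerpts of the gene are used here; pasting the full gene sequence into `translate` works the same way.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 24 nucleotides of the gene sequence above (both in frame).
print(translate("ATGTCAGGAT CCAACACTGC AATCAGCCGT"))  # MSGSNTAISR
print(translate("GCGACGCTGG CGAGAAGGGG ATGA"))        # ATLARRG
```

The two excerpts translate to the first ten and last seven residues of the protein sequence, with the final TGA codon read as the stop.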