Gene EcSMS35_2964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2964 
SymbolamiC1 
ID6145539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3036388 
End bp3037641 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content53% 
IMG OID641617833 
ProductN-acetylmuramoyl-L-alanine amidase AmiC 
Protein accessionYP_001744985 
Protein GI170681690 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGAT CCAACACTGC AATCAGCCGT CGTCGTTTAC TGCAAGGCGC GGGTGCCATG 
TGGCTATTGA GCGTAAGTCA GGTCAGCCTG GCTGCGGTCA GCCAGGTCGT GGCGGTGCGC
GTCTGGCCTG CGTCCAGCTA CACCCGCGTG ACGGTAGAAT CTAATCGTCA GCTGAAATAT
AAGCAGTTCG CGTTGAGTAA TCCTGAACGC GTGGTGGTGG ATATCGAAGA TGTAAACCTG
AACTCGGTGC TCAAGGGGAT GGCTGCGCAA ATCCGCGCTG ACGACCCGTT CATCAAGTCG
GCGCGCGTCG GGCAATTTGA CCCGCAAACC GTACGTATGG TTTTTGAATT AAAGCAAAAC
GTAAAACCGC AGCTGTTTGC CCTTGCGCCG GTCGCCGGGT TTAAAGAGCG TCTGGTGATG
GATCTCTATC CTGCCAATGC ACAGGATATG CAGGACCCGC TGCTGGCGCT GCTGGAGGAT
TACAACAAAG GCGACCTCGA AAAGCAGGTG CCGCCAGCAC AAAGTGGTCC ACAACCGGGT
AAAGCTGGGC GGGATCGTCC GATTGTCATT ATGCTTGACC CTGGTCACGG TGGCGAAGAC
TCCGGTGCGG TGGGGAAATA CAAAACACGC GAAAAAGACG TAGTATTGCA AATAGCTCGC
CGTCTGCGCT CTCTGATCGA GAAAGAGGGC AATATGAAGG TGTACATGAC GCGCAATGAA
GACATCTTCA TTCCGTTGCA AGTGCGCGTA GCAAAAGCCC AGAAACAGCG TGCTGACTTG
TTTGTCTCTA TCCATGCCGA CGCCTTTACC AGTCGCCAGC CGAGCGGTTC CTCGGTGTTT
GCGCTCTCAA CCAAAGGCGC AACCAGTACT GCGGCAAAAT ATCTGGCACA AACCCAGAAC
GCCTCGGACT TGATTGGTGG CGTAAGCAAA AGCGGTGACC GCTATGTCGA CCACACTATG
TTCGATATGG TGCAGTCGCT GACCATTGCT GACAGCCTGA AGTTTGGTAA AGCGGTGCTG
AATAAGCTCG GTAAAATCAA CAAGCTGCAT AAAAATCAAG TTGAACAGGC CGGGTTTGCC
GTACTAAAGG CACCAGATAT TCCCTCCATT CTGGTCGAAA CGGCGTTTAT CAGTAACGTT
GAGGAAGAGC GTAAACTGAA AACGGCGACT TTCCAGCAGG AAGTTGCGGA GTCTATTCTT
GCGGGAATTA AAGCGTATTT TGCCGATGGG GCGACGCTGG CGAGAAGGGG ATAA
 
Protein sequence
MSGSNTAISR RRLLQGAGAM WLLSVSQVSL AAVSQVVAVR VWPASSYTRV TVESNRQLKY 
KQFALSNPER VVVDIEDVNL NSVLKGMAAQ IRADDPFIKS ARVGQFDPQT VRMVFELKQN
VKPQLFALAP VAGFKERLVM DLYPANAQDM QDPLLALLED YNKGDLEKQV PPAQSGPQPG
KAGRDRPIVI MLDPGHGGED SGAVGKYKTR EKDVVLQIAR RLRSLIEKEG NMKVYMTRNE
DIFIPLQVRV AKAQKQRADL FVSIHADAFT SRQPSGSSVF ALSTKGATST AAKYLAQTQN
ASDLIGGVSK SGDRYVDHTM FDMVQSLTIA DSLKFGKAVL NKLGKINKLH KNQVEQAGFA
VLKAPDIPSI LVETAFISNV EEERKLKTAT FQQEVAESIL AGIKAYFADG ATLARRG