Gene EcSMS35_4640 details


Gene Information

Locus tag: EcSMS35_4640
Symbol: amiB
ID: 6143598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4740847
End bp: 4742184
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 54%
IMG OID: 641619456
Product: N-acetylmuramoyl-l-alanine amidase II
Protein accession: YP_001746564
Protein GI: 170680514
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0860] N-acetylmuramoyl-L-alanine amidase
TIGRFAM ID:
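
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The record does not state that convention, so it is an assumption here, and the function names below are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4740847, 4742184)
print(length)                  # 1338
print(protein_length(length))  # 445
```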


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.4247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0772035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTATC GCATCAGAAA TTGGTTGGTA GCGACGTTGC TGCTGCTGTG CGCGCAGGTG 
GGTGCCGCGA CGCTCTCTGA TATTCAGGTT TCTAACGGCA ACCAACAGGC GCGGATAACG
TTGAGTTTTA TTGGCGATCC TGATTATGCG TTTAGCCATC AAAGCAAACG CATCGTGGCG
CTCGATATCA AACAAACGGG CGTGATTCAG GGACTGCCGT TGTTGTTCAG CGGCAATAAT
CTGGTGAAGG CGATTCGCTC TGGAACGCCT AAAGATGCAC AAACGCTACG GCTGGTGGTC
GATCTTACCG AAAATGGTAA AACCGAAGCG GTGAAGCGGC AGAATGGCAG CAATTACACT
GTCGTCTTTA CGATTAACGC CGATGCGCCG CCACCGCCTC CTCCGCCGCC TGTGGTTGCG
AAACGCGTTG AAACGCCTGC GGTTGGCGCA CCGCGCGTCA GCGAACCGGC GCGCAATCCG
TTTAAAACGG AAAGTAACCG CACTACGGGT GTTATCAGCA GTAATACGGT AACGCGTCCG
GCAGCGCGCG CGACGGCTAA CACTGGCGAT AAAATTATCA TCGCTATTGA TGCCGGACAC
GGCGGTCAGG ATCCTGGCGC TATCGGCCCC GGTGGTACGC GGGAGAAAAA TGTCACCATC
GCCATCGCAC GTAAATTACG TACTTTGCTC AATGACGATC CAATGTTTAA AGGCGTTTTA
ACCCGTGACG GGGATTACTT TATTTCGGTG ATGGGGCGCA GCGATGTGGC ACGTAAGCAA
AACGCCAATT TCCTCGTGTC GATTCACGCT GATGCCGCAC CAAACCGCAG TGCGACTGGC
GCTTCCGTAT GGGTGCTCTC TAACCGTCGT GCAAACAGCG AGATGGCAAG CTGGCTGGAA
CAGCATGAGA AACAGTCGGA GCTACTGGGC GGAGCGGGCG ATGTGCTGGC GAACAGTCAG
TCTGACCCCT ATTTGAGCCA GGCGGTGCTG GATTTACAGT TCGGTCATTC CCAGCGGGTA
GGGTATGATG TAGCGACCAG TATGATCAGT CAGTTGCAAC GCATTGGCGA AATTCATAAA
CGTCGACCAG AACACGCCAG CCTTGGCGTT CTGCGTTCGC CGGATATCCC ATCAGTACTG
GTCGAAACCG GTTTTATCAG CAACAACAGC GAAGAACGTT TGCTGGCGAG CGACGATTAC
CAACAACAGC TGGCAGAAGC CATTTATAAA GGTCTGCGCA ATTATTTCCT TGCGCATCCG
ATGCAATCTG CGCCGCAGGG TGCAACGGCA CAAACTGCCA GTACGGTGAC GACGCCAGAT
CGTACGCTGC CAAACTAA
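
The gene sequence above is displayed in space-separated groups of 10 bases, so the grouping has to be stripped before any computation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed percentage describes that line rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGATGTATC GCATCAGAAA TTGGTTGGTA GCGACGTTGC TGCTGCTGTG CGCGCAGGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1338-base sequence through the same two functions would give a figure comparable to the GC content field in the record.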
 
Protein sequence
MMYRIRNWLV ATLLLLCAQV GAATLSDIQV SNGNQQARIT LSFIGDPDYA FSHQSKRIVA 
LDIKQTGVIQ GLPLLFSGNN LVKAIRSGTP KDAQTLRLVV DLTENGKTEA VKRQNGSNYT
VVFTINADAP PPPPPPPVVA KRVETPAVGA PRVSEPARNP FKTESNRTTG VISSNTVTRP
AARATANTGD KIIIAIDAGH GGQDPGAIGP GGTREKNVTI AIARKLRTLL NDDPMFKGVL
TRDGDYFISV MGRSDVARKQ NANFLVSIHA DAAPNRSATG ASVWVLSNRR ANSEMASWLE
QHEKQSELLG GAGDVLANSQ SDPYLSQAVL DLQFGHSQRV GYDVATSMIS QLQRIGEIHK
RRPEHASLGV LRSPDIPSVL VETGFISNNS EERLLASDDY QQQLAEAIYK GLRNYFLAHP
MQSAPQGATA QTASTVTTPD RTLPN
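
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the NCBI-style 64-letter amino-acid string, whose assignments for translation table 11 match the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The function name is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, matched to the amino-acid string
# used for translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed lines of the gene sequence.
print(translate("ATGATGTATC GCATCAGAAA TTGGTTGGTA GCGACGTTGC TGCTGCTGTG CGCGCAGGTG"))
# MMYRIRNWLVATLLLLCAQV
print(translate("CGTACGCTGC CAAACTAA"))
# RTLPN
```

The two outputs match the first 20 and last 5 residues of the protein sequence listed above.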