Gene EcSMS35_0096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0096 
SymbolmurC 
ID6142605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp107398 
End bp108873 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content55% 
IMG OID641614997 
ProductUDP-N-acetylmuramate--L-alanine ligase 
Protein accessionYP_001742213 
Protein GI170683694 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0263957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAC AACAATTGGC AAAACTGCGT TCCATCGTGC CCGAAATGCG TCGCGTTCGG 
CACATACATT TTGTCGGCAT CGGTGGTGCC GGTATGGGCG GTATTGCCGA AGTTCTGGCC
AATGAAGGTT ATCAGATCAG TGGTTCCGAT TTAGCGCCAA ATCCGGTCAC GCAGCAGTTA
ATGAATCTGG GAGCGACGAT TTATTTCAAC CATCGCCCGG AAAACGTACG TGATGCCAGC
GTGGTCGTTG TTTCCAGCGC GATTTCTGCC GATAACCCGG AAATTGTTGC AGCTCATGAA
GCGCGTATTC CGGTGATCCG TCGTGCTGAA ATGCTGGCTG AGTTAATGCG TTTTCGTCAT
GGCATCGCCA TTGCCGGAAC ACACGGCAAA ACGACAACCA CCGCGATGGT TTCCAGCATC
TACGCAGAAG CGGGGCTCGA CCCAACCTTC GTTAACGGCG GGCTGGTAAA AGCGGCGGGG
GTTCATGCGC GTTTGGGGCA TGGTCGGTAC CTGATTGCCG AAGCAGATGA GAGTGATGCA
TCGTTCCTGC ATCTGCAACC GATGGTGGCG ATTGTCACCA ATATCGAAGC CGACCACATG
GATACCTACC AGGGCGACTT TGAGAATTTA AAACAGACTT TTATTAATTT TCTGCACAAC
CTGCCGTTTT ACGGTCGTGC GGTGATGTGT GTTGATGATC CGGTGATCCG CGAATTGTTA
CCGCGTGTGG GACGTCAGAC CACGACTTAC GGCTTCAGCG AAGATGCCGA CGTGCGTGTA
GAAGATTATC AGCAGATTGG CCCGCAGGGG CACTTTACGC TGCTGCGCCA GGACAAAGAG
CCGATGCGCG TCACCCTGAA TGCGCCAGGT CGTCATAACG CGCTGAACGC CGCAGCTGCG
GTTGCGGTTG CTACGGAAGA GGGAATTGAC GACGAGGCTA TTTTGCGTGC GCTGGAGAGC
TTCCAGGGGA CGGGGCGCCG TTTCGATTTC CTCGGTGAAT TCCCGCTGGA GCCAGTGAAT
GGAAAAAGCG GTACGGCAAT GCTGGTCGAT GACTACGGCC ACCACCCGAC GGAAGTGGAC
GCCACTATTA AAGCGGCGCG CGCAGGCTGG CCGGATAAAA ACCTGGTAAT GCTGTTTCAG
CCGCACCGTT TTACCCGTAC GCGCGACCTG TATGATGATT TCGCCAATGT GCTGACGCAG
GTTGATACCC TGTTGATGCT GGAAGTGTAT CCGGCTGGTG AAGCGCCAAT TCCGGGAGCG
GACAGCCGTT CGCTGTGTCG CACAATTCGT GGACGTGGGA AAATTGATCC CATTCTGGTG
CCGGACCCGG CGCAGGTAGC CGAGATGCTG GCACCGGTAT TAACCGGTAA CGACCTGATT
CTCGTTCAGG GGGCTGGTAA TATCGGAAAA ATTGCTCGTT CTTTAGCTGA AATCAAACTG
AAGCCGCAAA CTCCGGAGGA AGAACAACAT GACTGA
 
Protein sequence
MNTQQLAKLR SIVPEMRRVR HIHFVGIGGA GMGGIAEVLA NEGYQISGSD LAPNPVTQQL 
MNLGATIYFN HRPENVRDAS VVVVSSAISA DNPEIVAAHE ARIPVIRRAE MLAELMRFRH
GIAIAGTHGK TTTTAMVSSI YAEAGLDPTF VNGGLVKAAG VHARLGHGRY LIAEADESDA
SFLHLQPMVA IVTNIEADHM DTYQGDFENL KQTFINFLHN LPFYGRAVMC VDDPVIRELL
PRVGRQTTTY GFSEDADVRV EDYQQIGPQG HFTLLRQDKE PMRVTLNAPG RHNALNAAAA
VAVATEEGID DEAILRALES FQGTGRRFDF LGEFPLEPVN GKSGTAMLVD DYGHHPTEVD
ATIKAARAGW PDKNLVMLFQ PHRFTRTRDL YDDFANVLTQ VDTLLMLEVY PAGEAPIPGA
DSRSLCRTIR GRGKIDPILV PDPAQVAEML APVLTGNDLI LVQGAGNIGK IARSLAEIKL
KPQTPEEEQH D