Gene EcHS_A0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0097 
SymbolmurC 
ID5590911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp102520 
End bp103995 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content54% 
IMG OID640919285 
ProductUDP-N-acetylmuramate--L-alanine ligase 
Protein accessionYP_001456880 
Protein GI157159562 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.0154515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAC AACAATTGGC AAAACTGCGT TCCATCGTGC CCGAAATGCG TCGCGTTCGG 
CACATACATT TTGTCGGCAT TGGTGGTGCC GGTATGGGCG GTATTGCCGA AGTTCTGGCC
AATGAAGGTT ATCAGATCAG TGGTTCCGAT TTAGCGCCAA ATCCGGTCAC GCAGCAGTTA
ATGAATCTGG GTGCGACGAT TTATTTCAAC CATCGCCCGG AAAACGTACG TGATGCCAGC
GTGGTCGTTG TTTCCAGCGC CATTTCTGCC GATAACCCGG AAATTGTCGC AGCTCATGAA
GCGCGTATTC CGGTGATCCG TCGTGCCGAA ATGCTGGCTG AGTTAATGCG TTTTCGTCAT
GGCATCGCCA TTGCAGGAAC GCACGGCAAA ACGACAACCA CCGCGATGGT TTCCAGCATC
TACGCAGAAG CGGGGCTCGA CCCAACCTTC GTTAACGGCG GGCTGGTAAA AGCGGCGGGG
GTTCATGCGC GTTTGGGGCA TGGTCGGTAC CTGATTGCCG AAGCAGATGA GAGTGATGCA
TCGTTCCTGC ATCTGCAACC GATGGTGGCG ATTGTCACCA ATATCGAAGC CGACCACATG
GATACCTACC AGGGCGACTT TGAGAATTTA AAACAGACTT TTATTAATTT TCTGCACAAC
CTGCCGTTTT ACGGTCGTGC GGTGATGTGT GTTGATGATC CGGTGATCCG CGAATTGTTA
CCGCGAGTGG GACGTCAGAC CACGACTTAC GGCTTCAGCG AAGATGCCGA CGTGCGTGTA
GAAGATTATC AGCAGATTGG CCCGCAGGGG CACTTTACGC TGCTGCGCCA GGACAAAGAG
CCGATGCGCG TCACCCTGAA TGCGCCAGGT CGTCATAACG CGCTGAACGC CGCAGCTGCG
GTTGCGGTTG CTACGGAAGA GGGAATTGAT GACGAGGCTA TTTTACGTGC GCTGGAGAGC
TTCCAGGGGA CTGGGCGCCG TTTTGATTTC CTCGGTGAAT TCCCGCTGGA GCCAGTGAAT
GGTAAAAGCG GTACGGCAAT GCTGGTCGAT GACTACGGCC ACCACCCGAC GGAAGTTGAC
GCCACCATTA AAGCGGCGCG CGCAGGCTGG CCGGATAAAA ACCTGGTAAT GCTGTTTCAG
CCGCACCGTT TTACCCGTAC GCGCGACCTG TATGATGATT TCGCCAATGT GCTGACGCAG
GTTGATACCC TGTTGATGCT GGAAGTGTAT CCGGCTGGCG AAGCGCCAAT TCCGGGAGCG
GACAGCCGTT CGCTGTGTCG CACAATTCGT GGACGTGGGA AAATTGATCC CATTCTGGTG
CCGGACCCTG CGCAGGTAGC CGAGATGCTG GCACCGGTAT TAACCGGTAA CGACCTGATT
CTCGTTCAGG GGGCTGGTAA TATCGGAAAA ATTGCCCGTT CTTTAGCTGA AATCAAACTG
AAGCCGCAAA CTCCGGAGGA AGAACAACAT GACTGA
 
Protein sequence
MNTQQLAKLR SIVPEMRRVR HIHFVGIGGA GMGGIAEVLA NEGYQISGSD LAPNPVTQQL 
MNLGATIYFN HRPENVRDAS VVVVSSAISA DNPEIVAAHE ARIPVIRRAE MLAELMRFRH
GIAIAGTHGK TTTTAMVSSI YAEAGLDPTF VNGGLVKAAG VHARLGHGRY LIAEADESDA
SFLHLQPMVA IVTNIEADHM DTYQGDFENL KQTFINFLHN LPFYGRAVMC VDDPVIRELL
PRVGRQTTTY GFSEDADVRV EDYQQIGPQG HFTLLRQDKE PMRVTLNAPG RHNALNAAAA
VAVATEEGID DEAILRALES FQGTGRRFDF LGEFPLEPVN GKSGTAMLVD DYGHHPTEVD
ATIKAARAGW PDKNLVMLFQ PHRFTRTRDL YDDFANVLTQ VDTLLMLEVY PAGEAPIPGA
DSRSLCRTIR GRGKIDPILV PDPAQVAEML APVLTGNDLI LVQGAGNIGK IARSLAEIKL
KPQTPEEEQH D