Gene EcSMS35_0091 details


Gene Information

Locus tag: EcSMS35_0091
Symbol: murF
ID: 6146512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 101283
End bp: 102641
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 55%
IMG OID: 641614992
Product: UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase
Protein accession: YP_001742208
Protein GI: 170682834
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0770] UDP-N-acetylmuramyl pentapeptide synthase
TIGRFAM ID: [TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase
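
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(101283, 102641)
    print(length, "bp")                    # gene length from the coordinates
    print(protein_length_aa(length), "aa")  # protein length from the gene length
```

Under these assumptions the coordinates 101283 to 102641 reproduce the listed gene length of 1359 bp and protein length of 452 aa.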


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.667428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAGCG TAACCCTTAG CCAACTTACC GATATTCTCA ACGGTGAACT GCAAGGTGCA 
GATATTACCC TTGATGCTGT AACCACTGAC ACGCGAAAAC TGACGCCGGG CTGCCTGTTT
GTTGCCCTGA AAGGCGAACG TTTCGATGCT CATGATTTTG CCGACCAGGC GAAAGCTGGC
GGCGCAGGCG CACTACTGGT TAGCCGTCCG CTGGATATCG ATCTGCCGCA GTTAATCGTC
AAGGATACGC GTCTGGCGTT TGGTGAACTG GCTGCATGGG TTCGCCAGCA AGTTCCGGCG
CGCGTGGTTG CTCTGACAGG TTCCTCCGGC AAAACATCCG TTAAAGAGAT GACGGCGGCG
ATTTTAAGCC AGTGCGGCAA CACGCTTTAT ACGGCAGGCA ATCTCAACAA CGACATCGGC
GTACCGATGA CGCTGTTGCG CTTAACGCCG GAATACGATT ACGCAGTTAT TGAACTTGGC
GCGAACCATC AGGGCGAAAT TGCCTGGACT GTGAGTCTGA CTCGCCCGGA AGCGGCGCTG
GTCAACAACC TGGCAGCGGC ACATCTGGAA GGTTTTGGCT CGCTTGCGGG TGTCGCGAAA
GCGAAAGGTG AAATCTTTAG CGGCCTGCCG GAAAACGGTA TCGCCATCAT GAACGCTGAC
AACAACGACT GGCTGAACTG GCAGAGCGTA ATTGGCTCAC GCAAAGTGTG GCGTTTCTCA
CCCAATGCCG CCAACAGCGA TTTCACCGCC ACCAATATCC ATGTGACTTC GCACGGTACG
GAATTTACCC TGCAAACCCC AACCGGTAGC GTGGATGTTC TGCTGCCGTT GCCGGGGCGT
CACAATATTG CGAATGCGCT GGCAGCCGCT GCGCTCTCCA TGGCCGTGGG CGCAACGCTT
GATGCTATCA AAGCGGGGCT GGCAAATCTG AAAGCTGTTC CAGGCCGTCT GTTCCCCATT
CAACTGGTAG AAAACCAGTT GCTGCTCGAC GACTCCTACA ACGCCAATGT TGGTTCAATG
ACTGCAGCAG TCCAGGTACT GGCTGAAATG CCGGGCTACC GCGTGCTGGT GGTGGGCGAT
ATGGCGGAAC TGGGCGCTGA AAGCGAAGCC TGCCATATAC AGGTGGGCGA AGCGGCAAAA
GCAGCTGGTA TTGACCGCGT GTTAAGCGTG GGCAAACAAA GCCATGCTAT CAGCACCGCC
AGCGGCGTTG GCGAACATTT TTCCGATAAA ACTGCGCTTA TCGCGCGTCT TAAATCACTG
ATTGCTGAGC AACAGGTAAT TACGATTTTA GTTAAGGGTT CACGTAGTGC TGCCATGGAA
GAGGTAGTAC GCGCTTTACA GGAGAATGGG ACATGTTAG
 
Protein sequence
MISVTLSQLT DILNGELQGA DITLDAVTTD TRKLTPGCLF VALKGERFDA HDFADQAKAG 
GAGALLVSRP LDIDLPQLIV KDTRLAFGEL AAWVRQQVPA RVVALTGSSG KTSVKEMTAA
ILSQCGNTLY TAGNLNNDIG VPMTLLRLTP EYDYAVIELG ANHQGEIAWT VSLTRPEAAL
VNNLAAAHLE GFGSLAGVAK AKGEIFSGLP ENGIAIMNAD NNDWLNWQSV IGSRKVWRFS
PNAANSDFTA TNIHVTSHGT EFTLQTPTGS VDVLLPLPGR HNIANALAAA ALSMAVGATL
DAIKAGLANL KAVPGRLFPI QLVENQLLLD DSYNANVGSM TAAVQVLAEM PGYRVLVVGD
MAELGAESEA CHIQVGEAAK AAGIDRVLSV GKQSHAISTA SGVGEHFSDK TALIARLKSL
IAEQQVITIL VKGSRSAAME EVVRALQENG TC
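
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, assuming the gene sequence is given on the coding strand in frame 1 and that the space-separated 10-character blocks are layout only. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for codon-to-residue assignments. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 of a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two blocks of the gene sequence; compare with the start of the protein.
    print(translate("ATGATTAGCG TAACCCTTAG"))
    # Last nine bases of the gene sequence; compare with the end of the protein.
    print(translate("ACATGTTAG"))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives an end-to-end consistency check, and `gc_percent` on the same input can be compared with the GC content field.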