Gene EcolC_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3571 
SymbolmurF 
ID6065724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3903573 
End bp3904931 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content55% 
IMG OID641602988 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_001726512 
Protein GI170021558 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00253313 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTAGCG TAACCCTTAG CCAACTTACC GACATTCTCA ACGGTGAACT GCAAGGTGCA 
GATATCACCC TTGATGCTGT AACCACTGAT ACCCGAAAAC TGACGCCGGG CTGCCTGTTT
GTTGCCCTGA AAGGCGAACG TTTCGATGCT CATGATTTTG CCGACCAGGC GAAAGCTGGC
GGCGCAGGCG CACTACTGGT TAGCCGTCCG CTGGACATCG ACCTGCCGCA GTTAATCGTC
AAGGATACGC GTCTGGCGTT TGGTGAACTG GCTGCATGGG TTCGCCAGCA AGTTCCGGCG
CGCGTGGTTG CTCTGACAGG TTCCTCCGGC AAAACCTCCG TTAAAGAGAT GACGGCGGCT
ATTTTAAGCC AGTGCGGCAA CACGCTTTAT ACGGCAGGCA ATCTCAACAA CGACATCGGT
GTACCGATGA CGCTGTTGCG CTTAACGCCG GAATACGATT ACGCAGTTAT TGAACTTGGC
GCGAACCATC AGGGCGAAAT AGCCTGGACT GTGAGTCTGA CTCGCCCGGA AGCTGCGCTG
GTCAACAACC TGGCAGCGGC GCATCTGGAA GGTTTTGGCT CGCTTGCGGG TGTCGCGAAA
GCGAAAGGTG AAATCTTTAG CGGCCTGCCG GAAAACGGTA TCGCCATTAT GAACGCCGAC
AACAACGACT GGCTGAACTG GCAGAGCGTA ATTGGCTCAC GCAAAGTGTG GCGTTTCTCA
CCCAATGCCG CCAACAGCGA TTTCACCGCC ACCAATATCC ATGTGACCTC GCACGGTACG
GAATTTACCC TACAAACCCC AACCGGTAGC GTCGATGTTC TGCTGCCGTT GCCGGGGCGT
CACAATATTG CGAATGCGCT GGCAGCCGCT GCGCTCTCCA TGTCCGTGGG CGCAACGCTT
GATGCTATCA AAGCGGGGCT GGCAAATCTG AAAGCTGTTC CAGGCCGTCT GTTCCCCATC
CAACTGGCAG AAAACCAGTT GCTGCTCGAC GACTCCTACA ACGCCAATGT CGGTTCAATG
ACTGCAGCAG TCCAGGTACT GGCTGAAATG CCGGGCTACC GCGTGCTGGT GGTGGGCGAT
ATGGCGGAAC TGGGCGCTGA AAGCGAAGCC TGCCATGTAC AGGTGGGCGA GGCGGCAAAA
GCTGCTGGTA TTGACCGCGT GTTAAGCGTG GGTAAACAAA GCCATGCTAT CAGCACCGCC
AGCGGCGTTG GCGAACATTT TGCTGATAAA ACTGCGTTAA TTACGCGTCT TAAATCACTG
ATTGCTGAGC AACAGGTAAT TACGATTTTA GTTAAGGGTT CACGTAGTGC CGCCATGGAA
GAGGTAGTAC GCGCTTTACA GGAGAATGGG ACATGTTAG
 
Protein sequence
MISVTLSQLT DILNGELQGA DITLDAVTTD TRKLTPGCLF VALKGERFDA HDFADQAKAG 
GAGALLVSRP LDIDLPQLIV KDTRLAFGEL AAWVRQQVPA RVVALTGSSG KTSVKEMTAA
ILSQCGNTLY TAGNLNNDIG VPMTLLRLTP EYDYAVIELG ANHQGEIAWT VSLTRPEAAL
VNNLAAAHLE GFGSLAGVAK AKGEIFSGLP ENGIAIMNAD NNDWLNWQSV IGSRKVWRFS
PNAANSDFTA TNIHVTSHGT EFTLQTPTGS VDVLLPLPGR HNIANALAAA ALSMSVGATL
DAIKAGLANL KAVPGRLFPI QLAENQLLLD DSYNANVGSM TAAVQVLAEM PGYRVLVVGD
MAELGAESEA CHVQVGEAAK AAGIDRVLSV GKQSHAISTA SGVGEHFADK TALITRLKSL
IAEQQVITIL VKGSRSAAME EVVRALQENG TC