Gene EcSMS35_3485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3485 
SymbolmurA 
ID6144571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3561848 
End bp3563107 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content53% 
IMG OID641618314 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001745461 
Protein GI170679709 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.027658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAT TTCGTGTTCA GGGGCCAACG AAGCTCCAGG GCGAAGTCAC AATTTCCGGC 
GCTAAAAATG CTGCTCTGCC TATCCTTTTT GCCGCACTAC TGGCGGAAGA ACCGGTAGAG
ATCCAGAACG TCCCGAAACT AAAAGACGTC GATACATCAA TGAAGCTGCT AAGCCAGCTG
GGTGCGAAAG TAGAACGTAA TGGTTCTGTG CATATTGATG CCCGCGACGT TAATGTATTC
TGCGCACCTT ACGATCTGGT TAAAACCATG CGTGCTTCTA TCTGGGCGCT GGGGCCGCTG
GTAGCGCGCT TTGGTCAGGG GCAAGTTTCA CTGCCTGGCG GTTGTACGAT TGGTGCACGT
CCGGTTGATC TACACATTTC TGGTCTCGAA CAATTAGGCG CGACCATCAA ACTGGAAGAA
GGTTACGTTA AAGCTTCCGT CGATGGTCGT TTGAAAGGCG CACATATCGT GATGGATAAA
GTCAGCGTTG GCGCAACGGT GACCATCATG TGTGCTGCAA CCCTGGCCGA AGGCACCACG
ATTATTGAAA ACGCAGCGCG TGAACCGGAA ATCGTCGATA CCGCGAACTT CCTGATTACG
CTGGGTGCGA AAATTAGCGG TCAGGGCACC GATCGTATCG TCATTGAAGG TGTGGAACGT
TTAGGCGGCG GTGTCTATCG CGTGCTGCCG GATCGTATCG AAACCGGTAC TTTCCTGGTG
GCGGCGGCGA TCTCTCGCGG CAAAATTATC TGCCGTAACG CGCAGCCAGA TACTCTGGAC
GCCGTGCTGG CGAAACTGCG TGACGCTGGA GCGGACATCG AAGTCGGCGA AGACTGGATT
AGCCTGGATA TGCATGGCAA ACGTCCGAAG GCTGTTAACG TACGTACCGC GCCGCATCCG
GCATTCCCGA CCGATATGCA GGCCCAGTTC ACGCTGTTGA ACCTGGTGGC AGAAGGGACC
GGATTCATCA CCGAAACGGT CTTTGAAAAC CGCTTTATGC ATGTGCCAGA GCTGAGCCGT
ATGGGCGCGC ACGCCGAAAT CGAAAGCAAT ACCGTTATTT GTCACGGTGT TGAAAAACTT
TCTGGCGCAC AGGTTATGGC AACCGATCTA CGTGCATCAG CAAGCCTGGT GCTGGCTGGC
TGTATTGCGG AAGGGACGAC GGTAGTTGAT CGTATTTATC ACATCGATCG TGGCTACGAA
CGCATTGAAG ACAAACTGCG CGCTTTAGGT GCAAATATTG AGCGTGTGAA AGGCGAGTAA
 
Protein sequence
MDKFRVQGPT KLQGEVTISG AKNAALPILF AALLAEEPVE IQNVPKLKDV DTSMKLLSQL 
GAKVERNGSV HIDARDVNVF CAPYDLVKTM RASIWALGPL VARFGQGQVS LPGGCTIGAR
PVDLHISGLE QLGATIKLEE GYVKASVDGR LKGAHIVMDK VSVGATVTIM CAATLAEGTT
IIENAAREPE IVDTANFLIT LGAKISGQGT DRIVIEGVER LGGGVYRVLP DRIETGTFLV
AAAISRGKII CRNAQPDTLD AVLAKLRDAG ADIEVGEDWI SLDMHGKRPK AVNVRTAPHP
AFPTDMQAQF TLLNLVAEGT GFITETVFEN RFMHVPELSR MGAHAEIESN TVICHGVEKL
SGAQVMATDL RASASLVLAG CIAEGTTVVD RIYHIDRGYE RIEDKLRALG ANIERVKGE