Gene EcSMS35_4148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4148 
SymbolwecA 
ID6143502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4247563 
End bp4248666 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content49% 
IMG OID641618971 
Productundecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase 
Protein accessionYP_001746103 
Protein GI170682041 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID[TIGR02380] undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTTAC TGACAGTGAG TACTGATCTC ATCAGTATTT TTTTATTCAC GACACTGTTT 
CTGTTTTTTG CCCGTAAGGT GGCAAAAAAA GTCGGTTTAG TGGATAAACC AAACTTCCGC
AAACGTCACC AGGGATTGAT ACCTCTCGTT GGGGGGATTT CGGTTTACGC AGGGATTTGC
TTCACGTTCG GAATTGTCGA TTACTATATT CCGCATGCAT CTCTCTATCT CGCTTGTGCT
GGTGTGCTTG TTTTCATTGG CGCGCTGGAT GACCGTTTTG ATATCAGCGT AAAAATCCGT
GCCACCATAC AGGCCGCTGT TGGCATTGTT ATGATGGTGT TCGGCAAACT TTATCTCAGT
AGCCTGGGTT ATATCTTTGG CTCCTGGGAG ATGGTGCTCG GACCGTTTGG TTACTTCCTG
ACGCTATTTG CCGTCTGGGC GGCCATTAAT GCGTTCAACA TGGTTGATGG CATTGATGGC
TTGCTGGGCG GGTTGTCCTG CGTCTCGTTT GCAGCAATCG GTATGATTTT GTGGTTCGAC
GGGCAAACCA GTCTCGCAAT CTGGTGCTTT GCGATGATCG CCGCTATCCT GCCATACATC
ATGCTTAACC TTGGTATCCT GGGTCGCCGC TACAAAGTCT TTATGGGGGA TGCGGGCAGT
ACGCTGATTG GTTTTACCGT TATCTGGATC CTGCTCGAAA CGACTCAGGG AAAAACGCAC
CCAATAAGCC CGGTTACTGC ATTATGGATA ATCGCCATTC CACTAATGGA TATGGTGGCG
ATTATGTACC GTCGCCTGCG TAAAGGCATG AGCCCATTCT CTCCTGACCG TCAGCATATT
CACCATTTGA TCATGCGTGC CGGGTTTACT TCCCGCCAGG CGTTTGTGCT GATTACCCTT
GCCGCAGCAC TGCTCGCTTC CATTGGCGTG CTGGCAGAAT ATTCTCATTT TGTCCCGGAG
TGGGTCATGC TGGTGCTCTT TTTGCTAGCA TTCTTCCTCT ATGGATATTG CATCAAGCGT
GCCTGGAAAG TTGCTCGCTT TATTAAGCGC GTAAAACGCA GACTGCGTAG AAATCGTGGT
GGCAGCCCCA ATTTAACCAA ATAA
 
Protein sequence
MNLLTVSTDL ISIFLFTTLF LFFARKVAKK VGLVDKPNFR KRHQGLIPLV GGISVYAGIC 
FTFGIVDYYI PHASLYLACA GVLVFIGALD DRFDISVKIR ATIQAAVGIV MMVFGKLYLS
SLGYIFGSWE MVLGPFGYFL TLFAVWAAIN AFNMVDGIDG LLGGLSCVSF AAIGMILWFD
GQTSLAIWCF AMIAAILPYI MLNLGILGRR YKVFMGDAGS TLIGFTVIWI LLETTQGKTH
PISPVTALWI IAIPLMDMVA IMYRRLRKGM SPFSPDRQHI HHLIMRAGFT SRQAFVLITL
AAALLASIGV LAEYSHFVPE WVMLVLFLLA FFLYGYCIKR AWKVARFIKR VKRRLRRNRG
GSPNLTK