Gene EcSMS35_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0190 
SymbollpxD 
ID6147512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp210153 
End bp211178 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID641615091 
ProductUDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
Protein accessionYP_001742307 
Protein GI170683962 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1044] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
TIGRFAM ID[TIGR01853] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000507466 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCAA TTCGACTGGC TGATTTAGCG CAGCAGTTGG ATGCAGAACT ACACGGTGAT 
GGCGATATCG TCATCACCGG CGTTGCGTCC ATGCAATCTG CACAAACAGG TCACATTACG
TTCATGGTTA ACCCAAAATA CCGTGAGCAT TTAGGCTTGT GCCAGGCGTC CGCGGTTGTC
ATGACCCAGG ACGATCTTCC TTTCGCGAAA AGTGCCGCGC TGGTAGTGAA GAATCCCTAC
CTGACTTACG CGCGCATGGC GCAAATTTTA GATACCACGC CGCAGCCCGC GCAGAACATT
GCACCCAGTG CAGTGATTGA CGCGACGGCG AAGCTGGGTA ACAACGTCTC AATTGGCGCT
AACGCGGTGA TTGAGTCCGG CGTTGAACTG GGCGATAACG TGATTATCGG TGCCGGTTGC
TTCGTAGGTA AAAACAGCAA AATCGGTGCA GGTTCGCGTC TCTGGGCGAA CGTAACCATT
TACCATGAGA TCCAGATCGG TCAGAATTGC CTGATCCAGT CCGGAACAGT GGTAGGCGCA
GACGGCTTTG GTTATGCCAA CGATCGTGGT AACTGGGTGA AGATCCCGCA GATAGGTCGC
GTAATTATTG GCGATCGCGT GGAGATCGGC GCATGTACTA CCATCGATCG CGGTGCGCTG
GATGACACTG TTATTGGCAA TGGTGTGATC ATTGATAACC AGTGCCAGAT TGCACATAAC
GTCGTGATTG GCGACAATAC GGCAGTTGCC GGTGGCGTCA TTATGGCGGG CAGCCTGAAA
ATTGGTCGTT ACTGCATGAT CGGCGGAGCC AGCGTAATCA ACGGGCATAT GGAAATATGC
GACAAAGTGA CGGTTACGGG CATGGGTATG GTGATGCGTC CCATCACTGA ACCAGGCGTC
TATTCCTCAG GCATTCCGCT GCAACCCAAC AAAGTCTGGC GCAAAACCGC TGCACTGGTG
ATGAACATTG ATGACATGAG CAAGCGTCTG AAATCGCTTG AGCGCAAGGT TAATCAACAA
GACTAA
 
Protein sequence
MPSIRLADLA QQLDAELHGD GDIVITGVAS MQSAQTGHIT FMVNPKYREH LGLCQASAVV 
MTQDDLPFAK SAALVVKNPY LTYARMAQIL DTTPQPAQNI APSAVIDATA KLGNNVSIGA
NAVIESGVEL GDNVIIGAGC FVGKNSKIGA GSRLWANVTI YHEIQIGQNC LIQSGTVVGA
DGFGYANDRG NWVKIPQIGR VIIGDRVEIG ACTTIDRGAL DDTVIGNGVI IDNQCQIAHN
VVIGDNTAVA GGVIMAGSLK IGRYCMIGGA SVINGHMEIC DKVTVTGMGM VMRPITEPGV
YSSGIPLQPN KVWRKTAALV MNIDDMSKRL KSLERKVNQQ D