Gene EcSMS35_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0101 
SymbollpxC 
ID6145349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp113191 
End bp114108 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content51% 
IMG OID641615002 
ProductUDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase 
Protein accessionYP_001742218 
Protein GI170681175 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0774] UDP-3-O-acyl-N-acetylglucosamine deacetylase 
TIGRFAM ID[TIGR00325] UDP-3-0-acyl N-acetylglucosamine deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000241937 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.583142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAC AAAGGACACT TAAACGTATC GTTCAGGCGA CGGGTGTCGG TTTACATACC 
GGCAAGAAAG TCACCCTGAC GTTACGCCCT GCGCCGGCCA ACACCGGGGT CATCTATCGT
CGCACCGACT TGAATCCACC GGTAGATTTC CCGGCCGATG CCAAATCTGT GCGTGATACC
ATGCTCTGTA CGTGTCTGGT CAACGAGCAT GATGTACGGA TTTCAACCGT AGAGCACCTC
AATGCTGCTC TCGCGGGCTT GGGCATCGAT AACATTGTTA TCGAAGTTAA CGCGCCGGAA
ATCCCGATCA TGGACGGCAG CGCCGCTCCG TTTGTATACC TGCTGCTTGA CGCCGGTATC
GACGAGCTGA ACTGCGCCAA GAAATTTGTT CGCATCAAAG AGACTGTTCG TGTCGAAGAT
GGCGATAAGT GGGCTGAATT TAAGCCGTAC AATGGTTTTT CGCTGGATTT CACCATCGAT
TTTAACCATC CGGCTATTGA TTCCAGCAAC CAGCGCTATG CGATGAACTT CTCCGCTGAT
GCGTTTATGC GCCAGATCAG CCGTGCGCGT ACGTTCGGTT TCATGCGTGA TATCGAATAT
CTGCAGTCCC GTGGTTTGTG CCTGGGCGGC AGCTTCGATT GTGCCATCGT TGTTGACGAT
TATCGCGTAC TGAACGAAGA CGGCCTGCGT TTTGAAGACG AATTTGTGCG TCACAAAATG
CTCGATGCGA TCGGTGACTT GTTCATGTGT GGTCACAATA TTATTGGTGC ATTTACCGCT
TATAAATCTG GTCATGCACT GAATAACAAA CTGCTGCAGG CTGTCCTTGC AAAACAGGAA
GCCTGGGAAT ATGTGACCTT CCAGGACGAC GCAGAACTGC CGTTGGCCTT CAAAGCGCCT
TCAGCCGTAC TGGCATAA
 
Protein sequence
MIKQRTLKRI VQATGVGLHT GKKVTLTLRP APANTGVIYR RTDLNPPVDF PADAKSVRDT 
MLCTCLVNEH DVRISTVEHL NAALAGLGID NIVIEVNAPE IPIMDGSAAP FVYLLLDAGI
DELNCAKKFV RIKETVRVED GDKWAEFKPY NGFSLDFTID FNHPAIDSSN QRYAMNFSAD
AFMRQISRAR TFGFMRDIEY LQSRGLCLGG SFDCAIVVDD YRVLNEDGLR FEDEFVRHKM
LDAIGDLFMC GHNIIGAFTA YKSGHALNNK LLQAVLAKQE AWEYVTFQDD AELPLAFKAP
SAVLA