Gene ECH74115_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0104 
SymbollpxC 
ID6967167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp111143 
End bp112060 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content51% 
IMG OID643384181 
ProductUDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase 
Protein accessionYP_002268704 
Protein GI209397750 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0774] UDP-3-O-acyl-N-acetylglucosamine deacetylase 
TIGRFAM ID[TIGR00325] UDP-3-0-acyl N-acetylglucosamine deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000975271 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAC AAAGGACACT TAAACGTATC GTTCAGGCGA CGGGTGTCGG TTTACATACC 
GGCAAGAAAG TCACCCTGAC GTTACGCCCT GCGCCGGCCA ACACCGGGGT CATCTATCGT
CGCACCGACT TGAATCCACC GGTAGATTTC CCGGCCGATG CCAAATCTGT GCGTGATACC
ATGCTCTGTA CGTGTCTGGT CAACGAGCAT GATGTACGGA TTTCAACCGT AGAGCACCTC
AATGCTGCTC TCGCGGGCTT GGGCATCGAT AACATTGTTA TCGAAGTTAA CGCGCCGGAA
ATCCCGATCA TGGACGGCAG CGCCGCTCCG TTTGTATACC TGCTGCTTGA CGCCGGTATC
GACGAGTTGA ACTGCGCCAA AAAATTTGTT CGCATCAAAG AGACTGTTCG TGTCGAAGAT
GGCGATAAGT GGGCTGAATT TAAGCCGTAC AATGGTTTTT CGCTGGATTT CACCATCGAT
TTTAACCATC CGGCTATTGA TTCCAGCAAC CAGCGCTATG CGATGAACTT CTCCGCTGAT
GCGTTTATGC GCCAGATCAG CCGTGCGCGT ACGTTCGGTT TCATGCGTGA TATCGAATAT
CTGCAGTCCC GTGGTTTGTG CCTGGGCGGC AGCTTCGATT GTGCCATCGT TGTTGACGAT
TATCGCGTAC TGAACGAAGA CGGCCTGCGT TTTGAAGACG AATTTGTGCG TCACAAAATG
CTCGATGCGA TCGGTGACTT GTTCATGTGT GGTCACAATA TTATTGGTGC ATTTACCGCT
TATAAATCCG GTCATGCACT GAATAACAAA CTGCTGCAGG CTGTCCTGGC GAAACAGGAA
GCCTGGGAAT ATGTGACCTT CCAGGACGAC GCAGAACTGC CGTTGGCCTT CAAAGCGCCT
TCAGCCGTAC TGGCATAA
 
Protein sequence
MIKQRTLKRI VQATGVGLHT GKKVTLTLRP APANTGVIYR RTDLNPPVDF PADAKSVRDT 
MLCTCLVNEH DVRISTVEHL NAALAGLGID NIVIEVNAPE IPIMDGSAAP FVYLLLDAGI
DELNCAKKFV RIKETVRVED GDKWAEFKPY NGFSLDFTID FNHPAIDSSN QRYAMNFSAD
AFMRQISRAR TFGFMRDIEY LQSRGLCLGG SFDCAIVVDD YRVLNEDGLR FEDEFVRHKM
LDAIGDLFMC GHNIIGAFTA YKSGHALNNK LLQAVLAKQE AWEYVTFQDD AELPLAFKAP
SAVLA