Gene EcHS_A0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0102 
SymbollpxC 
ID5591066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp108312 
End bp109229 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content51% 
IMG OID640919290 
ProductUDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase 
Protein accessionYP_001456885 
Protein GI157159567 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0774] UDP-3-O-acyl-N-acetylglucosamine deacetylase 
TIGRFAM ID[TIGR00325] UDP-3-0-acyl N-acetylglucosamine deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.52337e-17 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAAC AAAGGACACT TAAACGTATC GTTCAGGCGA CGGGTGTCGG TTTACATACC 
GGCAAGAAAG TCACCCTGAC GTTACGCCCT GCGCCGGCCA ACACCGGGGT CATCTATCGT
CGCACCGACT TGAATCCACC GGTAGATTTC CCGGCCGATG CCAAATCTGT GCGTGATACC
ATGCTCTGTA CGTGTCTGGT CAACGAGCAT GATGTACGGA TTTCAACCGT AGAGCACCTC
AATGCTGCTC TCGCGGGCTT GGGCATCGAT AACATTGTTA TCGAAGTTAA CGCGCCGGAA
ATCCCGATCA TGGACGGCAG CGCCGCTCCG TTTGTATACC TGCTGCTTGA CGCCGGTATC
GACGAGTTGA ACTGCGCCAA AAAATTTGTT CGCATCAAAG AGACTGTTCG TGTCGAAGAT
GGCGATAAGT GGGCTGAATT TAAGCCGTAC AATGGTTTTT CGCTGGATTT CACCATCGAT
TTTAACCATC CGGCTATTGA TTCCAGCAAC CAGCGCTATG CGATGAACTT CTCCGCTGAT
GCGTTTATGC GCCAGATCAG CCGTGCGCGT ACGTTCGGTT TCATGCGTGA TATCGAATAT
CTGCAGTCCC GTGGTTTGTG CCTGGGCGGC AGCTTCGATT GTGCCATCGT TGTTGACGAT
TATCGCGTAC TGAACGAAGA CGGCCTGCGT TTTGAAGACG AATTTGTGCG TCACAAAATG
CTCGATGCGA TCGGTGACTT GTTCATGTGT GGTCACAATA TTATTGGTGC ATTTACCGCT
TATAAATCCG GTCATGCACT GAATAACAAA CTGCTGCAGG CTGTCCTGGC GAAACAGGAA
GCCTGGGAAT ATGTGACCTT CCAGGACGAC GCAGAACTGC CGTTGGCCTT CAAAGCGCCT
TCAGCTGTAC TGGCATAA
 
Protein sequence
MIKQRTLKRI VQATGVGLHT GKKVTLTLRP APANTGVIYR RTDLNPPVDF PADAKSVRDT 
MLCTCLVNEH DVRISTVEHL NAALAGLGID NIVIEVNAPE IPIMDGSAAP FVYLLLDAGI
DELNCAKKFV RIKETVRVED GDKWAEFKPY NGFSLDFTID FNHPAIDSSN QRYAMNFSAD
AFMRQISRAR TFGFMRDIEY LQSRGLCLGG SFDCAIVVDD YRVLNEDGLR FEDEFVRHKM
LDAIGDLFMC GHNIIGAFTA YKSGHALNNK LLQAVLAKQE AWEYVTFQDD AELPLAFKAP
SAVLA