Gene EcHS_A1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1022 
SymbollpxK 
ID5592455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1028704 
End bp1029690 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID640920189 
Producttetraacyldisaccharide 4'-kinase 
Protein accessionYP_001457754 
Protein GI157160436 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1663] Tetraacyldisaccharide-1-P 4'-kinase 
TIGRFAM ID[TIGR00682] tetraacyldisaccharide 4'-kinase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.0136979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAA AAATCTGGTC TGGTGAATCC CCTTTGTGGC GGCTATTGCT GCCACTCTCC 
TGGTTGTATG GCCTGGTGAG TGGCGCGATC CGTCTTTGCT ATAAACTAAA ACTGAAGCGC
GCCTGGCGTG CCCCCGTACC GGTTGTCGTG GTTGGTAATC TCACCGCAGG CGGCAACGGA
AAAACCCCGG TCGTTGTCTG GCTGGTGGAA CAGTTGCAAC AGCGCGGTAT TCGCGTGGGG
GTCGTATCGC GGGGATATGG TGGTAAGGCT GAATCTTATC CGCTGTTATT GTCGGCAGAT
ACCACAACAG CACAGGCGGG TGATGAACCT GTGTTGATTT ATCAACGCAC TGATGCGCCT
GTTGCGGTTT CTCCCGTTCG TTCTGATGCG GTAAAAGCCA TTCTGGCGCA ACACCCTGAT
GTGCAGATCA TCGTAACCGA CGACGGTTTA CAGCATTACC GTCTGGCGCG TGATGTGGAA
ATTGTCGTTA TTGATGGTGT GCGTCGCTTT GGCAATGGCT GGTGGTTGCC GGCGGGGCCA
ATGCGTGAGC GAGCGGGGCG CTTAAAGTCG GTTGATGCGG TAATCGTCAA CGGCGGTGTC
CCTCGCAGCG GTGAAATCCC CATGCATCTG CTGCCGGGTC AGGCGGTGAA TTTACGTACC
GGTACGCGTT GTGACGTTGC TCAGCTTGAA CATGTAGTGG CGATGGCGGG GATTGGGCAT
CCGCCGCGCT TTTTTGCCAC GCTGAAGATG TGTGGCGTAC AACCGGAAAA ATGTGTACCG
CTGGCCGATC ATCAGTCTTT GAACCATGCG GATGTCAGTG CGTTGGTAAG CGCCGGGCAA
ACGCTGGTAA TGACTGAAAA AGATGCGGTG AAATGCCGGG CCTTTGCAGA AGAAAATTGG
TGGTATTTGC CTGTAGACGC ACAGCTTTCA GGTGATGAAC CAGCGAAACT GCTTACGCAA
CTAACCTCGC TGGCTTCTGG CAACTAG
 
Protein sequence
MIEKIWSGES PLWRLLLPLS WLYGLVSGAI RLCYKLKLKR AWRAPVPVVV VGNLTAGGNG 
KTPVVVWLVE QLQQRGIRVG VVSRGYGGKA ESYPLLLSAD TTTAQAGDEP VLIYQRTDAP
VAVSPVRSDA VKAILAQHPD VQIIVTDDGL QHYRLARDVE IVVIDGVRRF GNGWWLPAGP
MRERAGRLKS VDAVIVNGGV PRSGEIPMHL LPGQAVNLRT GTRCDVAQLE HVVAMAGIGH
PPRFFATLKM CGVQPEKCVP LADHQSLNHA DVSALVSAGQ TLVMTEKDAV KCRAFAEENW
WYLPVDAQLS GDEPAKLLTQ LTSLASGN