Gene EcHS_A0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0184 
SymbollpxB 
ID5593292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp201505 
End bp202653 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content54% 
IMG OID640919371 
Productlipid-A-disaccharide synthase 
Protein accessionYP_001456965 
Protein GI157159647 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID[TIGR00215] lipid-A-disaccharide synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00000688697 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC AGCGTCCATT AACGATTGCC CTGGTCGCCG GAGAAACCTC CGGCGATATC 
CTGGGGGCCG GTTTAATCCG CGCTCTGAAA GAACATGTGC CCAACGCCCG CTTTGTTGGT
GTTGCCGGGC CACGAATGCA GGCTGAAGGC TGCGAAGCCT GGTACGAAAT GGAAGAACTG
GCGGTGATGG GCATTGTTGA AGTGCTCGGT CGTCTGCGTC GCTTACTGCA TATTCGTGCC
GATCTGACAA AGCGTTTTGG CGAACTGAAG CCAGATGTTT TTGTTGGTAT TGATGCGCCT
GACTTCAATA TTACTCTTGA AGGTAACCTC AAAAAGCAGG GTATCAAAAC CATTCATTAC
GTCAGTCCGT CAGTCTGGGC GTGGCGACAG AAACGTGTTT TCAAAATAGG CAGAGCCACC
GATCTGGTGC TCGCATTTCT GCCTTTCGAA AAAGCGTTTT ATGACAAATA CAACGTACCG
TGCCGCTTTA TCGGTCATAC CATGGCTGAT GCCATGCCAT TAGATCCAGA TAAAAATGCC
GCCCGTGATG TGCTGGGGAT CCCTCACGAT GCCCACTGCC TGGCGTTGCT ACCGGGGAGC
CGTGGTGCAG AAGTTGAAAT GCTTAGTGCC GATTTCCTGA AAACGGCCCA GCTTTTGCGC
CAGACATATC CGGATCTCGA AATCGTGGTG CCACTGGTGA ATGCCAAACG CCGCGAGCAG
TTTGAACGCA TCAAAGCTGA AGTCGCGCCA GACCTTTCAG TTCATTTGCT GGATGGGATG
GGCCGTGAGG CGATGGTCGC CAGCGATGCG GCGCTACTGG CGTCGGGTAC GGCAGCCCTG
GAGTGTATGC TGGCGAAATG CCCGATGGTG GTGGGATATC GCATGAAGCC TTTTACCTTC
TGGTTGGCGA AGCGGCTGGT GAAAACTGAT TATGTCTCGC TGCCAAATCT GCTGGCGGGC
AGAGAGTTAG TCAAAGAATT ATTGCAGGAA GAGTGTGAGC CGCAAAAACT GGCTGCGGCG
CTGTTACCGC TGTTGGCGAA CGGGAAAACC AGCCACGCGA TGCACGATAC CTTCCGTGAA
CTGCATCAGC AGATCCGCTG CAATGCCGAT GAGCAGGCGG CACAAGCCGT TCTGGAGTTA
GCACAATGA
 
Protein sequence
MTEQRPLTIA LVAGETSGDI LGAGLIRALK EHVPNARFVG VAGPRMQAEG CEAWYEMEEL 
AVMGIVEVLG RLRRLLHIRA DLTKRFGELK PDVFVGIDAP DFNITLEGNL KKQGIKTIHY
VSPSVWAWRQ KRVFKIGRAT DLVLAFLPFE KAFYDKYNVP CRFIGHTMAD AMPLDPDKNA
ARDVLGIPHD AHCLALLPGS RGAEVEMLSA DFLKTAQLLR QTYPDLEIVV PLVNAKRREQ
FERIKAEVAP DLSVHLLDGM GREAMVASDA ALLASGTAAL ECMLAKCPMV VGYRMKPFTF
WLAKRLVKTD YVSLPNLLAG RELVKELLQE ECEPQKLAAA LLPLLANGKT SHAMHDTFRE
LHQQIRCNAD EQAAQAVLEL AQ