Gene EcolC_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3478 
SymbollpxB 
ID6068300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3795297 
End bp3796445 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content54% 
IMG OID641602894 
Productlipid-A-disaccharide synthase 
Protein accessionYP_001726419 
Protein GI170021465 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID[TIGR00215] lipid-A-disaccharide synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00171417 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0362862 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAC AGCGTCCATT AACGATTGCC CTGGTCGCCG GAGAAACCTC CGGCGATATC 
CTGGGGGCCG GTTTAATCCG CGCTCTGAAA GAACGTGTGC CCAACGCCCG CTTTGTTGGC
GTTGCCGGGC CACGAATGCA GGCTGAAGGC TGCGAAGCCT GGTACGAAAT GGAAGAACTG
GCGGTGATGG GCATTGTTGA AGTGCTCGGT CGTCTGCGTC GCTTACTGCA TATTCGTGCC
GATCTGACAA AGCGTTTTGG CGAACTGAAG CCAGATGTTT TTGTTGGTAT TGATGCGCCC
GACTTCAATA TTACTCTTGA AGGTAACCTC AAAAAGCAGG GTATCAAAAC CATTCATTAC
GTCAGTCCAT CCGTCTGGGC GTGGCGACAG AAACGCGTTT TCAAAATAGG CAGAGCCACC
GATCTGGTGC TCGCATTTCT GCCTTTCGAA AAAGCGTTTT ATGACAAATA CAATGTACCG
TGCCGCTTTA TCGGTCATAC CATGGCTGAT GCCATGCCGT TAGATCCAGA TAAAAATGGT
GCCCGTGATG TGCTGGGGAT CCCTTACGAT GCCCACTGTC TGGCATTGTT GCCGGGCAGC
CGTGGTGCAG AAGTCGAAAT GCTTAGTGCC GATTTCCTGA AAACTGCCCA GCTTTTGCGC
CAGACATATC CGGATCTCGA AATCGTGGTG CCGCTGGTGA ATGCCAAACG CCGCGAGCAG
TTTGAACGCA TCAAAGCTGA AGTCGCGCCA GATCTGTCGG TTCATTTGCT GGATGGAATG
GGCCGTGAGG CGATGGTCGC CAGCGATGCG GCACTACTGG CGTCGGGGAC GGCAGCCCTG
GAGTGTATGC TGGCGAAATG CCCGATGGTG GTGGGATATC GCATGAAGCC TTTTACCTTC
TGGTTGGCGA AGCGGCTGGT GAAAACTGAT TATGTCTCGC TGCCAAATCT GCTGGCGGGC
AGAGAGTTAG TCAAAGAGTT ATTGCAGGAA GAGTGTGAGC CGCAAAAACT GGCTGCGGCG
CTGTTACCGC TGTTGGCGAA CGGGAAAACC AGCCACGCGA TGCACGATAC CTTCCGTGAA
CTGCATCAGC AGATCCGCTG CAATGCCGAT GAGCAGGCGG CACAAGCCGT TCTGGAGTTA
GCACAATGA
 
Protein sequence
MTEQRPLTIA LVAGETSGDI LGAGLIRALK ERVPNARFVG VAGPRMQAEG CEAWYEMEEL 
AVMGIVEVLG RLRRLLHIRA DLTKRFGELK PDVFVGIDAP DFNITLEGNL KKQGIKTIHY
VSPSVWAWRQ KRVFKIGRAT DLVLAFLPFE KAFYDKYNVP CRFIGHTMAD AMPLDPDKNG
ARDVLGIPYD AHCLALLPGS RGAEVEMLSA DFLKTAQLLR QTYPDLEIVV PLVNAKRREQ
FERIKAEVAP DLSVHLLDGM GREAMVASDA ALLASGTAAL ECMLAKCPMV VGYRMKPFTF
WLAKRLVKTD YVSLPNLLAG RELVKELLQE ECEPQKLAAA LLPLLANGKT SHAMHDTFRE
LHQQIRCNAD EQAAQAVLEL AQ