Gene Smed_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1143 
SymbollpxB 
ID5321989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1211820 
End bp1213019 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content64% 
IMG OID640790084 
Productlipid-A-disaccharide synthase 
Protein accessionYP_001326829 
Protein GI150396362 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID[TIGR00215] lipid-A-disaccharide synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0582133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.762709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA GAGCCTACAG GCTGGCGGTC ATCGCCGGCG AAGTTTCCGG CGACCTGCTC 
GGCGCCGACC TCGTGCGGGC CCTGCGCGAT CGGGCGGACG GTACCGTCGA ACTCGTTGGT
ATCGGCGGCG AGGCACTCGA GGCCGAAGGG CTTCGGCCGT TGTTCGACTA TTCCGAACTC
TCGATCATGG GCTTCTCGCA GGTCCTGGCA AATCTTCCCA AGCTCCTGGC GCGGATTCGA
CAGACCGCGA GCGCGATCAC CGCTGCGCGG CCGGATGCTC TGGTTATAAT CGACAGCCCC
GATTTCACTC ACCGAGTGGC GCAGCGCGTC CGCGCCGCAC TGCCGGACTT GCCGGTGATC
GATTATGTCT GCCCGAGCGT CTGGGCATGG AAGCCGGAAC GGGCGCCGCG CATGCGGGCT
TATGTGGATC ACGTGCTGGC GGTTCTGCCG TTCGAACCGG ACGTGATGGT GAAACTCGGC
GGCCCGCCGA CCACCTATGT CGGTCACAGG CTGGCATTGG ACAGCAACGT GCTTGCCGTA
CGGCAGCGCC AGCGGCTGAA GCAACAGGCG CAGGAACCGG GCGGGGCGAA CGCCTGTCTC
CTGTTACCCG GCTCACGCGG AAGCGAGATC AGCCGCCTGC TGCCTGTCTT TCGCGACACA
GTCGAAGAAC TCGCTGATCG GAACGAAGGC ATCCGCTTCC TTCTGCCAAC TGTGCCTCGG
CAGGAAGAAC GGGTGAGGGC GATGACGGCC TCCTGGAGGG TGCAACCCGC GATCAGTGTA
ACTTCGGAAA GGAAATGGGA GGCTTTCGCC GAAGCCGACG CGGCCATAGC CGCATCGGGT
ACGGTTATCC TCGAGCTGGC GCTCGCCGGC GTTCCCGTCG TCTCCACCTA TTCTGCCGAC
TGGCTCGTAA GCCTCCTGCA TTCCCGCATC CGGATCTGGA CCGCAGCGCT TCCGAACCTG
ATCGCGGATT TTCCGGTCGT TCCGGAATAT TTCAACAAGA TGATACGGCC GGCCTCCCTG
ACGCGCTGGT TCGAGCGGCT TTCCTGTGAC ACGCCTCAAC GACGCGCGAT GCTCGACGGG
TTCGCGCTCG TGCAGCAGCG AATGGAGACG GATCGTCCGC CAGGCGAAAA GGCGGCGGAC
ATCGTTTTGA CCTATATTCA AGCGGGGCGG AAGGGATCCT CGGGTCAAGC CAAAGGATGA
 
Protein sequence
MTDRAYRLAV IAGEVSGDLL GADLVRALRD RADGTVELVG IGGEALEAEG LRPLFDYSEL 
SIMGFSQVLA NLPKLLARIR QTASAITAAR PDALVIIDSP DFTHRVAQRV RAALPDLPVI
DYVCPSVWAW KPERAPRMRA YVDHVLAVLP FEPDVMVKLG GPPTTYVGHR LALDSNVLAV
RQRQRLKQQA QEPGGANACL LLPGSRGSEI SRLLPVFRDT VEELADRNEG IRFLLPTVPR
QEERVRAMTA SWRVQPAISV TSERKWEAFA EADAAIAASG TVILELALAG VPVVSTYSAD
WLVSLLHSRI RIWTAALPNL IADFPVVPEY FNKMIRPASL TRWFERLSCD TPQRRAMLDG
FALVQQRMET DRPPGEKAAD IVLTYIQAGR KGSSGQAKG