Gene Rleg_1788 details


Gene Information

Locus tag: Rleg_1788
Symbol: lpxB
ID: 8012849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1779641
End bp: 1780819
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 63%
IMG OID: 644824379
Product: lipid-A-disaccharide synthase
Protein accession: YP_002975612
Protein GI: 241204516
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0763] Lipid A disaccharide synthetase
TIGRFAM ID: [TIGR00215] lipid-A-disaccharide synthase
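
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The record itself states neither, and the function names are hypothetical.

```python
# Values copied from the record; the coordinate convention (1-based,
# inclusive) is an assumption, not something the record states.
START_BP = 1779641
END_BP = 1780819
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span covers 1179 bp, which is 393 codons: 392 amino acids plus the stop codon.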


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0879744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0112942
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGGG CGTCGCTGAA GATCGCCGTC ATTGCCGGCG AGGTGTCGGG GGATCTGCTC 
GGTGCCGATC TCATCGCCGC CCTGAAGCGG GTTCATAGCG GACCGGTGGA GCTCGTCGGT
GTCGGCGGCG AGGGGCTGCA GGCGGAAGGC TTGCGATCCC TGTTCGATTT CTCCGAGCTG
TCGATCATGG GTATCACCCA GGTGCTGAGC CGGCTGCCAA AGCTCTATAC GCTGATCCGC
CAGACGACGG CTGCCATCAT CGCCGCAAGG CCGGATATTC TTCTGATCAT CGACAGCCCG
GATTTCACCC ATCGCGTCGC AAAGCGTGTC CGCATCGCTC TGCCGGATCT GCCGGTTGTC
AATTATGTTT GTCCAAGCGT CTGGGCCTGG AAGGAATATC GCGCCACGCG CATGCTCGCC
TATGTCGATC ATGTGCTCGC CGTCCTGCCC TTCGAGCCGG CAACAATGCG TGCGCTCGGC
GGGCCTGAGA CCACCTATGT CGGCCATCGC CTGACCGCCG ATCCGGCGCT GCTCGAAGTG
CGGCAGCAGC GCGCCATGCG CGCACCCGTC GAGGGAGCCG GCAAGGCGAT CCTGATGCTT
CCTGGATCAA GATCCTCCGA AATCGCCAAA CTGCTTCCGT TCTTCGAGGA TGCGGCCAAA
GAACTTGTCG CCCGCAACGG CCCGATGCGA TTCCTGCTGC CGACCGTGCC GCACAACGAA
GCGCTGGTGA AAGGGCTCGT CGCCGGCTGG GCCACGCCGC CGGAGGTGGC GGTCGGGCCG
GCGCAGAAGT GGAAGGCCCT TGCCGAGGCG GATGCGGCGA TGGCAGCTTC CGGCACGGTG
ATCCTCGAAC TCGCCCTTGC CGGCGTGCCG ACGGTGTCCG TTTACAAGAC GGATTGGATT
ATCCGCCTGC TCGCCCGGCG CATCAAGGTA TGGACGGGCG CATTGCCCAA TATCATCGCC
GATTATGCCG TCGTGCCGGA ATATCTGAAC GAGATCGTCC GCGGCGCCAG CCTGGCGCGC
TGGATGGAGC GGCTGTCGGC CGACACGTTC CAATTGAAGG CGATGAACGA GGGTTATGAT
CTCGTCTGGC AACGCATGCA GACCGAAAAG CCGCCCGGCG AACACGCTGC CGAGATCCTT
CTCGACGTGC TGAAAAAGAA AAAACCCGGT CGTTTCTGA
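
The GC content field can in principle be recomputed from the gene sequence as displayed. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence. `gc_fraction` is a hypothetical helper, not the method this database uses.

```python
def gc_fraction(sequence_text: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = (
    "ATGAACGGGG CGTCGCTGAA GATCGCCGTC "
    "ATTGCCGGCG AGGTGTCGGG GGATCTGCTC"
)
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

Passing the complete gene sequence text instead of the first line gives a value to compare against the listed GC content.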
 
Protein sequence
MNGASLKIAV IAGEVSGDLL GADLIAALKR VHSGPVELVG VGGEGLQAEG LRSLFDFSEL 
SIMGITQVLS RLPKLYTLIR QTTAAIIAAR PDILLIIDSP DFTHRVAKRV RIALPDLPVV
NYVCPSVWAW KEYRATRMLA YVDHVLAVLP FEPATMRALG GPETTYVGHR LTADPALLEV
RQQRAMRAPV EGAGKAILML PGSRSSEIAK LLPFFEDAAK ELVARNGPMR FLLPTVPHNE
ALVKGLVAGW ATPPEVAVGP AQKWKALAEA DAAMAASGTV ILELALAGVP TVSVYKTDWI
IRLLARRIKV WTGALPNIIA DYAVVPEYLN EIVRGASLAR WMERLSADTF QLKAMNEGYD
LVWQRMQTEK PPGEHAAEIL LDVLKKKKPG RF