Gene RPD_2844 details


Gene Information

Locus tag: RPD_2844
Symbol: lpxB
ID: 4023342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3169561
End bp: 3170742
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 637963042
Product: lipid-A-disaccharide synthase
Protein accession: YP_569973
Protein GI: 91977314
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0763] Lipid A disaccharide synthetase
TIGRFAM ID: [TIGR00215] lipid-A-disaccharide synthase
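
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, that the CDS is unspliced, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3169561, 3170742)
print(length)                     # 1182
print(protein_length_aa(length))  # 393
```

Both results agree with the Gene Length and Protein Length fields of this record.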


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.809049
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGCTG CGCGTTCGAG CCCAGGCGCG CAGCACAGGC TGTTTCTGAT CGCCACCGAA 
GAATCCGGCG ATCGCCTTGG TGCGGCGCTC ATGAAGGAGC TGCAGCAGCG GCTCGGCGTC
TCCGTTCGGT TCGAAGGCGT CGGCGGCCGG GCGATGGCGG AGCAGGGGCT GGTCTCGCTG
TTTCCGATCG AGGAGCTGTC GATCATGGGG ATCTCCGCCG TGGTGCGGCG GCTGCCCTCG
ATCCTGCGCA GGATCCGCAG CACCGCCGAG GCGGTGCATC GCGCCCGGCC CGACATGCTG
ATCATCATCG ACAGTCCCGA CTTTACCCAT CGAGTCGCCA AGCGGGTGCG GCTGCGTGAC
CCGTCGATCG CGATCGTCAA CTACGTATCT CCGACGGTCT GGGCGTGGCG GCCGGGCCGG
GCGCGCGCGA TGCGGCCCTA TGTCGATCAC GTGCTGGCGC TGTTGCCGTT CGAGCCGCAG
GAATATCGCA GGCTGCGCGG TCCGCCCTGC ACCTATGTCG GCCATCCGCT GACCGAGCAG
ATCGACAGTC TGCGTCCGAG CCCGGCCGAG CAGGCCCGCC GCGATTCCGA TCCCCCGGTG
CTGGTGGTGC TGCCTGGCAG CCGGCGCAGC GAGATCTTCC ATCAGATGGC GGTGTTCGGC
GAGACGCTGG GGCGGCTTCA GGCGGAGCAG GGCAATCTCG AACTGATCCT GCCGACGGTT
CCGCATCTGC GCGACGCGGT CGAGGCCGGG GTGCGCGACT GGCCGGTGCA GCCGACCATC
GTGGTCGGCG ATGCCGAGAA AAAGGCCGCG TTCCGGATCG CGCGGGCGGC GTTCGCAAAA
TCCGGCACGG TGACGCTCGA ACTGGCGTTG GCGCATGTGC CGATGGTGGC GGTCTACAAG
GCCGGGGCGA TGGAGGCGTG GATCGGCAAG CGGGTGATCC GCTCGGCCTC GGTGATCCTC
GCCAATCTCG TCGTCGGCGA AAACGTCATC CCGGAGTTCA TCCAGGAAGA CTGCGTGCCC
GACAAGTTGG TCCCGGCATT GCGGGAGGTG TTGGCCGACA CGCCGATGCG GACGCGCCAG
CTCGAAGGCT TCACCCGGAT CGACGACATC ATGTCGACCG GCGCGCAAAC GCCGAGCGCC
TGCGCGGCGG ATGTCGTGCT GGCGGTGCTG CGCAAGGCGT GA
 
Protein sequence
MMAARSSPGA QHRLFLIATE ESGDRLGAAL MKELQQRLGV SVRFEGVGGR AMAEQGLVSL 
FPIEELSIMG ISAVVRRLPS ILRRIRSTAE AVHRARPDML IIIDSPDFTH RVAKRVRLRD
PSIAIVNYVS PTVWAWRPGR ARAMRPYVDH VLALLPFEPQ EYRRLRGPPC TYVGHPLTEQ
IDSLRPSPAE QARRDSDPPV LVVLPGSRRS EIFHQMAVFG ETLGRLQAEQ GNLELILPTV
PHLRDAVEAG VRDWPVQPTI VVGDAEKKAA FRIARAAFAK SGTVTLELAL AHVPMVAVYK
AGAMEAWIGK RVIRSASVIL ANLVVGENVI PEFIQEDCVP DKLVPALREV LADTPMRTRQ
LEGFTRIDDI MSTGAQTPSA CAADVVLAVL RKA
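
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given 5' to 3' on the coding strand and in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs only in the permitted start codons), so the standard codon assignments are used. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Standard codon assignments, ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence above.
first_line = "ATGATGGCTG CGCGTTCGAG CCCAGGCGCG CAGCACAGGC TGTTTCTGAT CGCCACCGAA"
print(translate(first_line))  # MMAARSSPGAQHRLFLIATE
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein and the listed GC content.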