Gene RPD_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1114 
Symbol 
ID4021590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1267829 
End bp1268749 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content64% 
IMG OID637961306 
ProductWecB/TagA/CpsF family glycosyl transferase 
Protein accessionYP_568253 
Protein GI91975594 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1922] Teichoic acid biosynthesis proteins 
TIGRFAM ID[TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.51541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAG CGCAAGATGC CAGCCGAAAT CCAGTTTCTG ATCCGCTGAA TGCCGAGCGG 
CGAGCCGAGG AGCGTCGAGT CGCGCCGTTC CACGTTTCGA CCGACAGCTC CGTTTCGTTC
GAGGAACGGC GGGTGACTGG CGAGCGTCGG CGCGAGCGGT TTCAGCAATG GCAGCGCAAC
ATGATCGGCG GCCTGCCGAT CGTCGTCGCC GACCGTGCCG AAACCGCAAA GGTGATGGTC
GACGAGGCGC TGAAGCGCCG CGGCCAGTGG CGCTACCCGG CCTATATGAC GTCGACCAAC
GGCGAGGTCA CCTATCGCTG CGCAGTCGAT CCGAGCGAAC GTGCGATGTT TCTGGAAGCC
GATGCGATTC ACGCCGACGG CATGCCGCAC GTGTTCGTGT CACGGTTCAA ATGCCAGACT
CCGCTGCCGG AGCGCGTCGC GACCACCGAC CTGTTTCACG ATGTCGCGCG CGAAGCCAGT
GTGCGCGGCG CGACGATGTT CATGCTCGGC GCCGACGAGA CCTCGAACCG TCTCGCGACC
GAATTGGTGA AGCGACGCTA TCCCAAGCTA AAACTGGTCG GGCGGCGCAA CGGCTTCTTC
GCCGACGAGG CGGAAGAGAT CGCGGCCTGC CGGCAGATCG CCGAACTGGC TCCCGATATT
CTCTGGATCT CGATGGGCGT CCCGCGCGAG CAGGTCTTCA TCCGGCGGCA TCGCCATCGG
CTGACCACCG TCGGAATCAT CAAGACGTCG GGCGGCCTGT TCGATTTCCT GTCGGGCTCC
AAGGCGCGGG CGCCGCAGTG GATGCAGCGA ATTGGCCTCG AATGGCTATG GCGGATGGCG
CTCGAGCCGC GACGGCTCGG GATGCGCTAC CTCAAGACCA ACCCTTACGC GATGTATCTG
CTGCTGACCC GGACGCGCTG A
 
Protein sequence
MPKAQDASRN PVSDPLNAER RAEERRVAPF HVSTDSSVSF EERRVTGERR RERFQQWQRN 
MIGGLPIVVA DRAETAKVMV DEALKRRGQW RYPAYMTSTN GEVTYRCAVD PSERAMFLEA
DAIHADGMPH VFVSRFKCQT PLPERVATTD LFHDVAREAS VRGATMFMLG ADETSNRLAT
ELVKRRYPKL KLVGRRNGFF ADEAEEIAAC RQIAELAPDI LWISMGVPRE QVFIRRHRHR
LTTVGIIKTS GGLFDFLSGS KARAPQWMQR IGLEWLWRMA LEPRRLGMRY LKTNPYAMYL
LLTRTR