Gene RPD_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1648 
Symbol 
ID4022128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1856004 
End bp1857089 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content71% 
IMG OID637961843 
Productglycosyl transferase family protein 
Protein accessionYP_568786 
Protein GI91976127 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.206697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGCGA TGCTCCGGGT GCACGATTTG AGCGAGGACG ACGTGCCGCC CGACAGGCCG 
CCACAAGTAT CTGTGATCCT GCCGGTTCGC GACGGCCAGC GCTGGCTGTG CGAAGCGATC
GACAGCGTAC TGACGCAGAC ATTGTCCGAT CTCGAACTCG TGGTGATCGA CGACGGCTCG
ACCGACGCGA CGCCGGCGCT CCTTGATGAA GTCCGCGCCC GCGACCCGCG TGTGATCGCG
CTGCGGCAGG AGCGGGAGGG CCTGGTCGCC GCGCTCAATC GCGGGCTTGC GCAAGCGCGC
GCGCCGCTGA TCGCCCGGCT GGATGCCGAC GACATCGCGC TGCCCGATCG GCTGGCGCGG
CAATGCGATT ATCTGCACGC CCACCCGGAC GTCGTGCTAC TCGGCGGCTG GGCCGAGATC
ATCGACGAAA ACGGCGCATC GCGCGGCAAG CAAATGCGGC CGAACCCGAG CGGCTTGCGC
GAGACGCTGG CGAGGAAAAG CCCCTTCATT CACCCGACGG TGATGGTTCG CGCCGACGCC
GCGCGGCGCG TCGGCGGCTA TCGCTCCGCC TTCGAGGCCG GCGAGGACTA TGACTTCTGG
CTGCGCCTCG CCGATGCGGG CGAGATCGCG ATCCTGCCCG AGGTGCTGAT CCGCTATCGC
GAGCACGGCG GCAGCGTCAC GCGCACGCGC GAGCTGCGTC AGATCTATTC GGCCCGCCTC
GCCAAGCTCG CCAGCGCCGC CCGCCGTGGC GGCGGCCCCG ATCCGTCCGC CGCACTCGCT
GCGCCGCCGG ACTGGCACGA CCCGGCCCCC GGCCCGTTCG AACGCGACAG CTCGCGGCTA
TTCCGGGTGC TCGAACTCGC CGATCCCGAG CTGGCGCGCG CGACGCCGGC GTCGGCGATC
GACCTCGCGG CCATCACCTC GCAGCTCGCG ACGCTGACCA CCGGCGAACG GAAATTCGCG
CAGGTCGCCG TCCTGAACTT GCTGCGCGCT GATCGCAAGC GGCCCGGCGT CTCGCGCGCC
TCGCTGCTGG CGCTGCTGGT GCGGCTCGGA CCGGCCAAGG CGATCAGGCT GCTCTTGAAG
GGTTAG
 
Protein sequence
MHAMLRVHDL SEDDVPPDRP PQVSVILPVR DGQRWLCEAI DSVLTQTLSD LELVVIDDGS 
TDATPALLDE VRARDPRVIA LRQEREGLVA ALNRGLAQAR APLIARLDAD DIALPDRLAR
QCDYLHAHPD VVLLGGWAEI IDENGASRGK QMRPNPSGLR ETLARKSPFI HPTVMVRADA
ARRVGGYRSA FEAGEDYDFW LRLADAGEIA ILPEVLIRYR EHGGSVTRTR ELRQIYSARL
AKLASAARRG GGPDPSAALA APPDWHDPAP GPFERDSSRL FRVLELADPE LARATPASAI
DLAAITSQLA TLTTGERKFA QVAVLNLLRA DRKRPGVSRA SLLALLVRLG PAKAIRLLLK
G