Gene RPC_0672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0672 
Symbol 
ID3970611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp732724 
End bp733950 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID637923788 
Producthypothetical protein 
Protein accessionYP_530563 
Protein GI90422193 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.532138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAA CTTTCTTCGC CCCGGCCGCC GATCGGTCAA GCGTAGCGCC GGCGAAGCTG 
CCTGCCCTGA GCGCCAAGCG CCGATCATGT TTCAGGGTGG AACTGCATTC GGATTTCAGC
AACGCCGAGT TTCGCCACAC CTGGAAATCG TTCGAGCAAT CGGCGACGGC CACGGCGTTC
CAGCGCCTGT CCTACATCCA GGCGCTGCTC GCCAACATCG TGCCGCAGCG AGCGGTGGAA
CCGATCCTGG TCGGGGTGCG CGACACCAGC TGCGGCGCGA TCGTGCTGGT CGCGTCGCTG
ATGCGGACCC GCCGCCACGG CGTGTCGGTC ATCGAGGCGC TCGATCTCGG GCTGTGCGAC
TATTTCGCCC CGTTGATCCG TCCCGGCCTT GAATTCAGCG CGGTGGAGTT TGACGAGCTT
TGGCGCGAGA TTTGCGGCGC GCTGAAGCCG GTCGGCGCGC TCTCGATCGA GAAGATCCCC
GCTGAGATCT TCGGCTATCC CAATCCGCTC GCCAAGCTGC CGTCGGCACG GCCGACCAAC
GACTTTGCGA CGACCTTGCG GATGCGCGCG GCGGATGGCG CCCACTTGGT GGATCTGCAG
AGTTATTCGG TGGTGCGAAA GGCCAACCGG CTGTGCCGAA AGCCGGAGAA CTGGGGCAAC
ATCCAACTGG AATTGGCAGA CACCGCAGCC GCGCTGCAGC AAGCGCTCGA CCTGATGGTC
GCGCATCGGC TGGTTCGGTC CCACGCGCTG GGACGCCATG ACCTGCTCGA CGACCGCGGA
TTCATCGCGT TCTACCGGCA ACTCGCGCAG GATGGGCTGG CGGACGGCTC GGTCCGGGTG
TTCGTGCTGT CCTCCGATGC TGAGCCGATC GCCGTGGTCT ATTCGCTCGT GCATCGCAAT
GCGCTGACGG TGGTGGTGCC TTCGATGACC ACGGAGGAGC GGTGGCGCAA ACTGTCGCCC
GGGCTGGTCG CGATGGTGAA ATGCTGCGAA TGGGCCGACC GCGAGGGCTT TCACAACTTC
GATCTCAGCG TCGGCGCGCT GCAATACAAA ACCCGGTTCG GCGGCGATCA GCGCCGGCTG
TACGAAATCC GTCAGGCGCT CTCGCCGGCC GGCCTGCTGA TCACCGCTGA AGTGACGGCG
AAGCGCCGGC TGCGCGCCTT CGCCGCCCGC CACCCGAAGG CCAAGGCGCT GGTGCGCCGC
ATGCTGCGCC GGCCGCCGGC TACCTGA
 
Protein sequence
MNRTFFAPAA DRSSVAPAKL PALSAKRRSC FRVELHSDFS NAEFRHTWKS FEQSATATAF 
QRLSYIQALL ANIVPQRAVE PILVGVRDTS CGAIVLVASL MRTRRHGVSV IEALDLGLCD
YFAPLIRPGL EFSAVEFDEL WREICGALKP VGALSIEKIP AEIFGYPNPL AKLPSARPTN
DFATTLRMRA ADGAHLVDLQ SYSVVRKANR LCRKPENWGN IQLELADTAA ALQQALDLMV
AHRLVRSHAL GRHDLLDDRG FIAFYRQLAQ DGLADGSVRV FVLSSDAEPI AVVYSLVHRN
ALTVVVPSMT TEERWRKLSP GLVAMVKCCE WADREGFHNF DLSVGALQYK TRFGGDQRRL
YEIRQALSPA GLLITAEVTA KRRLRAFAAR HPKAKALVRR MLRRPPAT