Gene RPC_0664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0664 
Symbol 
ID3970603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp723072 
End bp724052 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content64% 
IMG OID637923780 
Productcellulose biosynthesis protein CelD 
Protein accessionYP_530555 
Protein GI90422185 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.250925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCT ACGCATCAGA GCATTATCTC GACGCCGTCG CGGCGGTCTA TTTCAAGGGC 
CAGCGCGCGC GCATCGAAGA CGTACAAATC GGCGACGAGG TGCTTCGGCT TCTCGTGGTC
AATGACAAGC GTGTCATTAC GCGACATCAG TTCTTGGATT TCCACCAGCC GCTGCTTGAA
ACCGAGACTC GCGGAGCGAC CCGCAACGGC CGGTACGCGC CCGCCGTGGC GCGCCGGGTG
ATCGAGCGGA CGCAGTGGGA TCCGGCGCAG TTTCCCGGCC TGGAGCTCGC GCCCTATGTC
GACTGGTCGC AGTTTCCCGA ATACGACGAC TACAAGGCCT ATCTGCTGAA GCACCACAAG
AGCCTGGTGC GCGATCGCGA GCGCCGCGGG CGCAGCCTCG CCGCGGCGCA TGGCGAACTG
GTGTTCACCA TGAACGACAC GCAGGCCGAC GTGTTCGACG CCGCGCAGCG TTGGAAGAGC
CGGCAGCTGC GCGACAGCGG CCTGGCGGAT TATTTCGCCG CCCCGCACAC CATGGAGTTT
CTCGAGGCAT TGCGCAGCCG CGGCCAATTG GTCGCCTCGA CGCTGCGCGC CTCCGGCCAA
TTGTTGTCGC TGTGGATCGG CTTCGTGCAC GACCGCACCT GGTCCGGCTG GATTTTCACT
TATGATCCGG CGTTCCGGAA ATACTCGGTG GGACACCAGC TGCTCAGCTT CATGCTGAGC
GAGAGCCACC GCCTCGGCCA CCGCGAGTTC GATTTTTCGA TCGGCAGCGA GGACTACAAG
ATGATCTACG CCACGCACGG GCGCGTGCTG GGATCGATCG GCCAGCCGCC GCTCGGCCAG
CGGTTGATCG GCTACGCCAA GGACGAATTG CGGGATCGGA CCCCGAAGCT GTTCGACGCC
GCGCGGAATC TGAAGAAGCG GATCGACGGC ACGCTGCCGA CTCAGCTGGT GGCGGGCCAG
ACCGGCCCGG CCAAGGCGTG A
 
Protein sequence
MNFYASEHYL DAVAAVYFKG QRARIEDVQI GDEVLRLLVV NDKRVITRHQ FLDFHQPLLE 
TETRGATRNG RYAPAVARRV IERTQWDPAQ FPGLELAPYV DWSQFPEYDD YKAYLLKHHK
SLVRDRERRG RSLAAAHGEL VFTMNDTQAD VFDAAQRWKS RQLRDSGLAD YFAAPHTMEF
LEALRSRGQL VASTLRASGQ LLSLWIGFVH DRTWSGWIFT YDPAFRKYSV GHQLLSFMLS
ESHRLGHREF DFSIGSEDYK MIYATHGRVL GSIGQPPLGQ RLIGYAKDEL RDRTPKLFDA
ARNLKKRIDG TLPTQLVAGQ TGPAKA