Gene RPD_3294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3294 
Symbol 
ID4023803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3646023 
End bp3647297 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content68% 
IMG OID637963497 
Productcellulose biosynthesis protein CelD 
Protein accessionYP_570419 
Protein GI91977760 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.421998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0110847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGCC CGAATATCCG GCGCAGGATC GACAGCGAAC CGGCGCGTTG TGGGACGGAT 
TCGGCCCAGA CACAACCCGC ACCACAACCG ACCGATCGCC CCGCTGCCGA TCCATCCGCG
ACCGGAACCG CCCTGCCCGC CGCGCTCTCG ATCGCTGTGG CGGCGAAGCT GTCGGACCTC
CCGTTCTGGC CCGGGATCAG TGAAGCGACG CCCGGCCATC GCTTCGTGTT TCAATGCGCC
GACGTGCTCG AGGTCTGGCG CGACACCATC GGCGCGGCGC AGCGCGTGAA GCCGCTGTTC
GTCGCCGTGA CCACACAGGA CGGCGCACCG CTGCTGCTGC TGCCGCTCGG GATCGAACGC
CGCGACGGAC TGCGCGTGCT CGGCTTCCTC GACGGCACCG TCAGCGACTT CAACGCGCCG
ATCGTATTCG CGCCCGCACA GCATTGGGAC GGCGCTGCGA TGCGTCGGCT GTGGGACGAT
CTGCAGCCGC ATCTGCCGGC CTACGACATC GCGATCTTCC GCAAGATGCC CGAGACCATC
GACGGGATCG CCAATCCGTT CCGGCATCTG GCGACCGCGC GCAGCGCCCA CGGCACGCAT
ACGCTCAAAC TGCCGGCGGA CTGGTCGGCG GCTGCGCGCG ACATCCTGCC CGATCTCTCG
GATTCGCGCC GCAGGCTGCG CAAGCTCGAC CGACTCGGCG CGACGGAGTT CCGCATCGCC
GACAATGTCG ACGAGGCGAT CGCCTTCACG ACGGCGGCGA TGGCGATGAA AGGCCGCCGG
CTGGTCGACA CGATCGGCGT CGACCGCTTC GCCAGTGAAC CCGGCTACGC CGATTACTAC
GTCGAGATCA CCCGCCGGCT GTTCGCAACC GGCGCCGTCC ACGTCTCTGC GCTGATGATC
GACGGCACGC CGCGCGCAGC GCATTGGGGC TTCGTGTTCG CCGGCCGGTT CTATCATCTG
CTGACGGCGT TCGACGTCGA CGCGGCGTGG CGGCCCTACG CGCTCGGCCG GATGCACAAT
GAATTCCTGA TGGAATGGAG CGCCAATGCG GGGCTCGCGA CGTTCGATTT CGGCGTCGGC
GACGAGCCGT ACAAGACCGC CTACAGCAAC GACTATCAGC AACTCGCCGA TGCGATCCTG
CCGCGCACGC TGATCGGCCA CGCCTATGGA TGGCTGGTCG ACCTGCGGCG CTATTCCGCG
CGCACGATCC GCGCCTCCGC TTTCGCCACG ACGGCCGAGC GCATCCGCCG CGATCTCAAC
AAATGGCGGA ATTGA
 
Protein sequence
MHGPNIRRRI DSEPARCGTD SAQTQPAPQP TDRPAADPSA TGTALPAALS IAVAAKLSDL 
PFWPGISEAT PGHRFVFQCA DVLEVWRDTI GAAQRVKPLF VAVTTQDGAP LLLLPLGIER
RDGLRVLGFL DGTVSDFNAP IVFAPAQHWD GAAMRRLWDD LQPHLPAYDI AIFRKMPETI
DGIANPFRHL ATARSAHGTH TLKLPADWSA AARDILPDLS DSRRRLRKLD RLGATEFRIA
DNVDEAIAFT TAAMAMKGRR LVDTIGVDRF ASEPGYADYY VEITRRLFAT GAVHVSALMI
DGTPRAAHWG FVFAGRFYHL LTAFDVDAAW RPYALGRMHN EFLMEWSANA GLATFDFGVG
DEPYKTAYSN DYQQLADAIL PRTLIGHAYG WLVDLRRYSA RTIRASAFAT TAERIRRDLN
KWRN