Gene RPD_3339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3339 
Symbol 
ID4023850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3702215 
End bp3703342 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content59% 
IMG OID637963544 
Productputative OpgC protein 
Protein accessionYP_570464 
Protein GI91977805 
COG category[S] Function unknown 
COG ID[COG4645] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.523015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCA TGAGCGAAGC AACGAAATCC ACCGCTACGA AACGCGATCT GCGTCTCGAT 
CTCTTCCGCG GCATGGCGAA CTGGGCGATC TTTCTGGATC ACATCCCTAA CAACGCGGTC
GCATGGCTGA CGATGCGAAA CTACGGATTC AGCGACGCTG CTGAATTGTT CGTCTACGTC
TCCGGATTCA CGGTCGCATT CGTGTATGCG CGGATGATGC GCGCCAAAGG ATTGCTCGCG
GCGGCAATCG GAATTCTCGG CCGGGTCTGG CAGATCTATG TCGCCTATGT GCTGCTTTTC
GTCTTCTACG TGGTTGCCGT CGGCTATGTG GGGCAGGTCG ATGGGCATGC CCATCTTCTC
GACCAATACA ACATCCGCAG GCTGATCGCC GATCCGGTCG AATTTCTGAA GCACGGTCTT
CTGCTCGAAT ATCGGCCGCT CAATCTGGAT GTCCTGCCGC TCTACATCGC GCTGATGGCG
CCGTTTCCGC CGGTGCTCTG GCTGCTCACG AAGTCGCCTA ACGTCGCATT GGCCGGTTCG
TTCATCCTCT ACATTGTCGC GCGGTGGTTA GGCTGGAATC TCACCGACTA TCCGTCCGGT
TCCTGGTACT TCAACCCGTT CGCCTGGCAG TTTCTGTTCG TGATCGGGGC GTGGACGGCG
ATCGTCGATC GCGACGCGTT GCAGAAGATC TTGCGGTCGA GGATCATCAT GGCGCTGGCC
GTCGCGATCG TGGCTGTGTC TGCGATCGTG ACTGTCGCAC TACGCACCGG GAACGACTGG
CTGTTGCCCG AGACGCTTCG GCTTGCGTTC TCCCTGAACG ACAAGACAAA CCTCGCGCCC
TATCGCATTG TTCACTTTCT GGCATTGGCG ATCATCGTGG CGCGTCTTAT TCCGAGGGAT
GCTCCATCGC TCAACTGGCC GGTGTGGCGG CCGTTGATCG TCAGCGGCCA GCACTCGCTG
GAAGTGTTCT GCGTGGGAAC CTTCTTGGCA GCCATCGCCT ATTTCGCGCT CCACCTGATC
AACGACTCTT TCGCGGCTCA GATCTTCGTG AGCGTCGTCG GCATTGCCGG GATGGTCGCG
GTTGCCTATT TCCGGACTTG GGTGAAGAGC AGGTCGCTCG CCGTGTAG
 
Protein sequence
MDSMSEATKS TATKRDLRLD LFRGMANWAI FLDHIPNNAV AWLTMRNYGF SDAAELFVYV 
SGFTVAFVYA RMMRAKGLLA AAIGILGRVW QIYVAYVLLF VFYVVAVGYV GQVDGHAHLL
DQYNIRRLIA DPVEFLKHGL LLEYRPLNLD VLPLYIALMA PFPPVLWLLT KSPNVALAGS
FILYIVARWL GWNLTDYPSG SWYFNPFAWQ FLFVIGAWTA IVDRDALQKI LRSRIIMALA
VAIVAVSAIV TVALRTGNDW LLPETLRLAF SLNDKTNLAP YRIVHFLALA IIVARLIPRD
APSLNWPVWR PLIVSGQHSL EVFCVGTFLA AIAYFALHLI NDSFAAQIFV SVVGIAGMVA
VAYFRTWVKS RSLAV