Gene RPC_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4103 
Symbol 
ID3973153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4561922 
End bp4563277 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content63% 
IMG OID637927207 
Producthypothetical protein 
Protein accessionYP_533948 
Protein GI90425578 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.901263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCA CCCTGCTCTG TTCGGTCGCC GCGCTGGCGC TCGGCGTCAC CACGGCCGTA 
GCACAAACCG AAGGCGAGTT CCCGGCGACG CTGGCGGGCC ATGCCGTGCT GCCGGCGCAG
AGCTTCATCG ATGCGCCGAA GGACGCGCCC GACGATCTGA AGACCTCCGG CAAATACACC
ACCGGCAAGC GCGTCGAGGC GCTGGGCAGC GTGATGGGCA AGTCCTATGA GCGGCCGACC
GGGGTGTCGC TGCCGTTCAA GGGCCAGCCG CTGCAGGGCC ATTCCGGCAT CAAGGTGATG
CCGGACGGCT CGTTCTGGGT GCTGACCGAC AACGGCATGG GCTCGCGCTA CAATTCCGCC
GACTCGATGC TGTATCTCAA CCGGCACAAG ATCGACTGGG CGGGCGGCAA GATCGAGCGC
CAGGAAACCG TGTTCCTGCA CGACCCCGAC AAGAAGGTGC CGTTCCGCAT CGTCCACGAG
GACACCGCCA AGCGCTATCT GACGGGTGCG GATTTCGACA CCGAGGGCTT TCAGATCATC
GGCGACACTT TCTGGATCGG CGACGAATTC GGCCCCTACA TCCTGAAGAC CGACAAGGCC
GGCAAGGTGC TGGCGGTGTT CGAAACCTTC GCCGACGGCA AGCCGGTGAA ATCGCCCGAC
CATTGGTCGG TGCAGTCGCC GGGCGCGCCG GGCGCCAGCT ACACCGGCGT CAATCTGCGC
CGCTCCAAGG GTTTTGAAGG CTTTGCGTCC TCGAAGGACG GCAAATTCCT CTATGGCCTG
CTGGAAGGCC CGCTGTGGAA CGCCGAGAGC AAGGACTGGG AGAAGGTCGA CGGCCATGAG
GCCTCGCGGA TTGTCGAATT CGACGTCGCC GCGGAAAAAT TCACCGGCCG CTATTGGCAA
TACGTGTTCG AGCAGAACGG CAACGCAATC GGCGATTTCA ACATGATCGA CGCCAGCCAC
GGCCTGATCA TCGAGCGCGA CAACGGCGAA GGCACCAAGG ACAAGGCCTG CGCCGAAGCA
AACCGCGGCG CCGATTGCTT CCCCGACCTC GCCCGGTTCA AGCGCGTGGT GAAGATTGAA
CTCAGCGACG CCAATGTCGG CAAGCCGGTG CGCAAGATCG GTTACATCGA CCTGATGAAG
ATTCGCGACC CCAACCACAA GGCGAAGAAG CCGCTGAACG ACGGCGTGCT GACCTTCCCG
TTCTTCACCA TCGAGAACGT CGATCAGGTC GACGACACGC ACATCATCGT CGGCAACGAC
AACAACCTGC CGTTCTCCTC CAGCCGCGAT CCCAACAAGG CCGACGACAA CGAGTTCGTG
CTGCTCGAAG TCGGCGATTT CCTGAAGGCG AAGTAA
 
Protein sequence
MRITLLCSVA ALALGVTTAV AQTEGEFPAT LAGHAVLPAQ SFIDAPKDAP DDLKTSGKYT 
TGKRVEALGS VMGKSYERPT GVSLPFKGQP LQGHSGIKVM PDGSFWVLTD NGMGSRYNSA
DSMLYLNRHK IDWAGGKIER QETVFLHDPD KKVPFRIVHE DTAKRYLTGA DFDTEGFQII
GDTFWIGDEF GPYILKTDKA GKVLAVFETF ADGKPVKSPD HWSVQSPGAP GASYTGVNLR
RSKGFEGFAS SKDGKFLYGL LEGPLWNAES KDWEKVDGHE ASRIVEFDVA AEKFTGRYWQ
YVFEQNGNAI GDFNMIDASH GLIIERDNGE GTKDKACAEA NRGADCFPDL ARFKRVVKIE
LSDANVGKPV RKIGYIDLMK IRDPNHKAKK PLNDGVLTFP FFTIENVDQV DDTHIIVGND
NNLPFSSSRD PNKADDNEFV LLEVGDFLKA K