Gene RPD_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3903 
Symbol 
ID4024419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4340710 
End bp4342065 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID637964107 
Producthypothetical protein 
Protein accessionYP_571025 
Protein GI91978366 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAC TGTTACTCGG GTCCGTCGCG GTTCTGGCGC TGGCATCATC CGCCGCGCTG 
GCGCAATCCG AAGGTGAGTT TCCGGCCACG CTGGCAGGTC ACGCGGTGCT GCCGGCGACG
TCGTTCGTCG ACGCGCCGGC CGACGCGCCC GCCGATCTGA AAAACGCCGG CAAGTACACC
ACCGGCAAAC GTGTCGAGGC GCTGGGCAGC GTCGAGGGCA AGTCCTATGG CCGGCCGACC
GGCGTGTCGG TGCCGTTCAA GGGGCAGCCG ATGCAGGGCC ATTCCGGCAT CAAGACGATG
TCCGACGGCT CGTTCTGGGT GCTGACCGAC AATGGCTTCG GCTCGCGCTA CAACTCCGCC
GATTCGATGC TGTATCTGAA CCGTCACAAG ATCGATTGGG CGAGCGGCAA GGTGGAGCGG
CAGGAGACCA TCTTCCTGCA CGACCCCGAC AAAAAGGTGC CGTTCCGCAT CGTCCACGAG
GACAGCGCCA AACGCTATCT GACCGGCGCT GATTTCGACA CCGAAGGCTT TCAGATCATC
GGCGATCATT TCTGGATCGG CGAAGAGTTC GGCCCCTACA TCCTGAAGAC CGACAAATCC
GGCAAGGTTC TGGCGGTGTT CGAGACTTTG GTCGACGGCA AGCCGGTGAA GTCGCCGGAG
CATTGGTCGA TCCAGTCGCC GGGCGGGCCG GGCGCGACCT ATAGCGGCGT CAACCTGCGC
CGCTCGAAGG GCTTCGAGGG CTTCGCGGCG TCGACCGACG GCAAGTTCCT GTATCCGCTG
TTCGAGGGCG CGCTGACCGA CCTCGACAAG AAGGACATCG AGAAGCAGGA CGGCGTCGAG
GCCGCGCGGA TTCTCGAATT CGACGTCGCC GCGGAGAAGT TCACCGGCCG CTCCTGGCAG
TACGTGTTCG AGCAGAACGG CAACGCGATC GGCGATTTCA ACATGATCGA CGCCACCCAC
GGGCTGATCA TCGAACGCGA CAACGGCGAA GGCACCAAGG ACAAGGCCTG TCCGGAAGGC
CAGCGCGGCA CCGACTGCTT CCACGACCTC GCCAAGTTCA AGCGGGTGGT GAAGATCGAA
CTCACTGACG CCAATGCCGG AAAGCCGGTG CGCAAGATCG GCTACATCGA CCTGATGAAG
ATCAAGGACC CGAACAAGAA GGCGCGCAAG CCGCTCAACG ACGGCGCGCT GACCTTCCCG
TTCTTCACCA TCGAGAACGT CGACAAGGTC GACGATCGTC ACATCATCGT CGGCAACGAC
AACAACCTGC CGTTCTCGTC GAGCCGCGAT CCCAACAAGG CCGACGACAA CGAGTTCGTG
CTGCTCGAAG TCGCGGATTT CCTGAAGGCG AAGTAA
 
Protein sequence
MRKLLLGSVA VLALASSAAL AQSEGEFPAT LAGHAVLPAT SFVDAPADAP ADLKNAGKYT 
TGKRVEALGS VEGKSYGRPT GVSVPFKGQP MQGHSGIKTM SDGSFWVLTD NGFGSRYNSA
DSMLYLNRHK IDWASGKVER QETIFLHDPD KKVPFRIVHE DSAKRYLTGA DFDTEGFQII
GDHFWIGEEF GPYILKTDKS GKVLAVFETL VDGKPVKSPE HWSIQSPGGP GATYSGVNLR
RSKGFEGFAA STDGKFLYPL FEGALTDLDK KDIEKQDGVE AARILEFDVA AEKFTGRSWQ
YVFEQNGNAI GDFNMIDATH GLIIERDNGE GTKDKACPEG QRGTDCFHDL AKFKRVVKIE
LTDANAGKPV RKIGYIDLMK IKDPNKKARK PLNDGALTFP FFTIENVDKV DDRHIIVGND
NNLPFSSSRD PNKADDNEFV LLEVADFLKA K