Gene RPD_3628 details


Gene Information

Locus tag: RPD_3628
Symbol:
ID: 4024142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4044015
End bp: 4047470
Gene Length: 3456 bp
Protein Length: 1151 aa
Translation table: 11
GC content: 69%
IMG OID: 637963832
Product: hypothetical protein
Protein accession: YP_570752
Protein GI: 91978093
COG category: [S] Function unknown
COG ID: [COG4717] Uncharacterized conserved protein
TIGRFAM ID:
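
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4044015, 4047470)
print(length, protein_length_aa(length))  # 3456 1151
```

Both values agree with the Gene Length and Protein Length fields of this record.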


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0684828
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.861297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAATCG AAGAACTCGC GCTGGAGCGC TACGGCATCT TCACCGACCG CAAAGTCTCT 
TTCGATCCGC AGGCGTCGCT GCATGTGGTG TTCGGCGCCA ATGAAGCGGG CAAGACGTCG
GCGCTGTCGG CGATCGCCGA TCTGCTGTTC GGCTTCGGCG CGCGGACCGA CTACGATTTC
AGGCACGATA GCAAACTGCT GCGACTGGGC GGCACCTTCC GTCACTCCGA CGGCCGCGTG
ATTAAGGCGC GGCGGCGCAA GGGCAACAAG AACACGCTGC TGGACGCCGA CGATCAGCCG
TTGCCCGACG ATCATTTCGC CGCACTTCTC GGAGGCGTTT CGCGCACCGC CTTCAACAGC
GAGTTCGGAA TGACGGCGCA GACGCTGCGG GAAGGCGGCG AAGAGCTGCT CAGCGCAGGC
GGCCGTCTCG CCGAAACACT GGCGGCGAGT TCAGCCGGCA TGGCCGAATT ATCGCGCACC
AAGGACCGCT TGCAGGAGCA GGCCGACCAG TTGTTCAGCG CGCGCAAATC CGCCAGCAAG
CCGTTCTATC TGGCGGCCGA ACGCCGGGAG GCTTCCGACA AAGACCTTCG CGAGGCCATC
GTCACACGCG AGGCACTGCG ACAAACGCAG GCCTCGGTGC AGGACGCGGC CTCCGAGCTG
GAGAAGCTCA AGGTCGCTCA TGCGGCCTGC GGCAGCACGT TGGCGCGCTG GCAACGAATG
TTGAGGGTGC GATCGCAGCT GACGCGGCTG GACCGCATCG AGGCTGAACT GACGGGGCTT
GCCAACTTGC CGGTCGTCGC CGAGCAGACG CTCGCGGATT GGCGCGCCGC GATCGATCGG
CGGGCCGCGC TTGAAGCCGA GATCGCCGCG CTTGACGAGG CGGCGGCGAC GGATGCCGCC
GAGATCGCCA CTCTGGACGT CGATGACGCG CTGCTGGCGG AGGACGCCGC GATCGAAGTG
CTGCGCGAAC GGCTGGGCGC GGTGCGCAAG GCGGCAGCCG ACCTGCCGCG ACGCCGTCAA
GACCGCGCCA GCGCCGAGGC GGCGCTTGAT GATTGCGCGC GGCGACTCGG GCTGGTCTCG
CATGTCGAGC TTCTGGCGAA GCTGCCGACG GACCCGGCGC TCGCAGATGC GCGCGATCGC
ATCGAAATGA TGCGGCGCGC GACACAGGAA TTGGCCGAAA TCGAAGCCCG GCACGGTCGC
GCCCGACGAG ACCTCGAGGC ATTCGCCGCG CGGGAGGGCG AAGAGCAGCA CGTCAGCGAC
ATCGAGCCGC TGCGCCACCG CTTCGAGGCG CTTGGCGACA TTCCTGCGCA GGACGAGCGC
CTGTACCGCG ATCGCGCCGT TTATGTACGC GAGATCGAAT CCATCCGCGA TGCCGTCGCG
GCGCTCGACC CAGCGCCGGG ACCGCTCGAC CGGGTGCGCG CCTTGCCCGT GCCGGACCGC
GCCACGATCA CCAAACACGC CGCCGCCGTC GAGCTCAGCG AGGCGGAGCT GAAGCGGCTC
GATGCGGAGA TTTCGAAACT CGACGATGCG ATCGCCGCGA CCGAGGCCGA GCTGGCGCGG
CTGTCGAGCT CTGGTGCGGT GCCGGCCCGG GCCGATCTGA TCGCGGCCCG ACACGCGCGC
GACGCGAGGC TCGACGCGCT CCGCGCCGGC CTCGACGGCG ATCGTGACAA TCGCCTCGCC
CGCTTCGACG AGGTCACCGA TACGTCGCGC AAGATCGACG GCATCACCGA CTCTCTGCTG
ACCGACACCG AGCGCGCGAC CCGCCACGAA GACGCACAGC GGCGGATCGT CGACGCGCGC
AACGAACGCG CGCGCGCGGA GACCAAGCGC GCCAGCCTGA CGACGGGGCT GGCAGAGATC
GCCGATCGCT GGGCCAAGGC CTGGGCCGCT TCGGGCCTGC AGCCTCGGGG CCCGGCCGAG
ATGCTGCGCT GGCGCGAACG GCTCGACGAT GTCGTCGCGC GGCTCGACAA ATGCGATCTG
CAGCGCGCCG GCATCGACGC GTTGGCCACT GCCCTCGAAG ACGGCAAGTC GGCGATCATA
GCGTTTCTCG ACAGCGTCGG ACGCCGGTCC GTTCCCACCG CGCCGGCCGC GGTGCTGTAT
CGCGAAGCCA AGGGTCGGTT CGACCAATTG CAGGCGGCGT GGAGCGAGAC CAAGGCCCGT
GCGGTGGAGA AAGCCCGCCT CGAACGCGAC CTGACCGAGG CCGATGCAGC CCGTGCGCAG
ATCGAGACCC GCCTTGCAGA TCTTCGGCAG CATTGGCCGC GGACGATGAG CGCGATCGGG
CTCGCCACCG ATGCGACGCC GGTGCAGGCC GAAGCGGCGC TGTCGATCTG GAACGCGGTG
GCCGTGCCGC GCGCCAGCTT CGAGCGCGAG GGGCGCAGCG TCGATACCAT CATGTCCGAT
CTGCAGGATT TCGAGACGGA GGTGACAGCC CTACTCGATC GGGTCGCGCC GGACCTGCGC
GGCGTTTCCG CCCAGGAGGC AATGCCGCGC CTCGCGGAAC GTTTGGCCGA CGCCCGCCGC
GGCAGCGAAG CGCGCAGGCG GTTGCAAGGA AATGCGGCGA AGCGTGCGGC CAATCGAAAT
ACATTGGTCG CGCAACTGGC TGCAAGCACC AGCTTGCTGG AAGGCGCCAG CCGGTCGCTG
CGCGCCGATA TCCCCGCACT GCCGGAGCTG CTGTCGCGTC TCGCCACGCG GCTGGCATTG
CAAGCCGAGC AGTTGGCGTT GCGGAGCCAT CTGCTCGAGA TCGCCGACGG GCACGACGAA
AGCGCGCTGC GCCAGGAGCG CGACGGCGTC GACCTCGACC GCTTGCCGGC CGACATCGCC
AGCGCGACGG TGCAACAAGA CCAACTGCTG AAGGACATCT CGGACGCGGC GGCCAACCAC
AACCAGCGGC AGCGCGAATT GGACGAGCTG ACAAAGGGCC GCGATGCCGC GGGCGCCGCC
GGTCGGCGCC GCGAGGCGGC TGCCGAGATG CTGTCGATCG CGGAAGACTG GCTGCTGCGC
TCGGCGGCTT CCCTGCTGGC CCGCCGTGCC ATCGAGCTTC ATCGCGCCAA GGTGCAGGAT
CCGATGGTCG CGCGCGCCGG CGACTTGTTA GCGCTCGCGA CCGCAGGCGC GTTCGCGGGG
CTCGGCATCG ACTATGGCGA CGATGACCAG CCGACGCTGG TGGCCCGGCG CGCGTCGGGC
GAACGGGTGC CGCTCTCGGG ACTGAGCGAG GGAACGCGCG ATCAGTTGTT CCTGGCGCTG
CGCCTCGCCT TGCTCGAGCG CCGAACCTCA GAGCCGATGC CGTTCATCGG CGACGACCTG
CTGGCGAGCT TCGACGACAG ACGCACGCTG GCAACGCTGC GATTGCTGGC CGCAGCCGGA
GCGCAGCGCC AGATGCTGCT CTTCACGCAC CACCAGCACG TGGCCGATCT CGCATTGTCA
CTCGCGGATC ATCGCATCGA TCTCATCAAT CTGTAG
 
Protein sequence
MRIEELALER YGIFTDRKVS FDPQASLHVV FGANEAGKTS ALSAIADLLF GFGARTDYDF 
RHDSKLLRLG GTFRHSDGRV IKARRRKGNK NTLLDADDQP LPDDHFAALL GGVSRTAFNS
EFGMTAQTLR EGGEELLSAG GRLAETLAAS SAGMAELSRT KDRLQEQADQ LFSARKSASK
PFYLAAERRE ASDKDLREAI VTREALRQTQ ASVQDAASEL EKLKVAHAAC GSTLARWQRM
LRVRSQLTRL DRIEAELTGL ANLPVVAEQT LADWRAAIDR RAALEAEIAA LDEAAATDAA
EIATLDVDDA LLAEDAAIEV LRERLGAVRK AAADLPRRRQ DRASAEAALD DCARRLGLVS
HVELLAKLPT DPALADARDR IEMMRRATQE LAEIEARHGR ARRDLEAFAA REGEEQHVSD
IEPLRHRFEA LGDIPAQDER LYRDRAVYVR EIESIRDAVA ALDPAPGPLD RVRALPVPDR
ATITKHAAAV ELSEAELKRL DAEISKLDDA IAATEAELAR LSSSGAVPAR ADLIAARHAR
DARLDALRAG LDGDRDNRLA RFDEVTDTSR KIDGITDSLL TDTERATRHE DAQRRIVDAR
NERARAETKR ASLTTGLAEI ADRWAKAWAA SGLQPRGPAE MLRWRERLDD VVARLDKCDL
QRAGIDALAT ALEDGKSAII AFLDSVGRRS VPTAPAAVLY REAKGRFDQL QAAWSETKAR
AVEKARLERD LTEADAARAQ IETRLADLRQ HWPRTMSAIG LATDATPVQA EAALSIWNAV
AVPRASFERE GRSVDTIMSD LQDFETEVTA LLDRVAPDLR GVSAQEAMPR LAERLADARR
GSEARRRLQG NAAKRAANRN TLVAQLAAST SLLEGASRSL RADIPALPEL LSRLATRLAL
QAEQLALRSH LLEIADGHDE SALRQERDGV DLDRLPADIA SATVQQDQLL KDISDAAANH
NQRQRELDEL TKGRDAAGAA GRRREAAAEM LSIAEDWLLR SAASLLARRA IELHRAKVQD
PMVARAGDLL ALATAGAFAG LGIDYGDDDQ PTLVARRASG ERVPLSGLSE GTRDQLFLAL
RLALLERRTS EPMPFIGDDL LASFDDRRTL ATLRLLAAAG AQRQMLLFTH HQHVADLALS
LADHRIDLIN L
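
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from such a block. It assumes translation table 11 uses the standard amino-acid assignments for elongation and that the first codon of the CDS (here GTG) is read as methionine. The helper names `clean`, `gc_fraction` and `translate_cds` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS that begins at its start codon; a trailing stop is dropped."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore an incomplete final codon
    residues = [CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: the initiator codon is read as Met
    return "".join(residues).rstrip("*")


first_line = "GTGAGAATCG AAGAACTCGC GCTGGAGCGC TACGGCATCT TCACCGACCG CAAAGTCTCT"
print(translate_cds(first_line))  # MRIEELALERYGIFTDRKVS
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, though that figure has not been recomputed here.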