Gene RPD_3659 details

Gene Information

Locus tag: RPD_3659
Symbol:
ID: 4024173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4083145
End bp: 4085502
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 60%
IMG OID: 637963863
Product: integrase catalytic subunit
Protein accession: YP_570783
Protein GI: 91978124
COG category:
COG ID:
TIGRFAM ID:
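
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that one terminal stop codon is excluded from the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4083145
end_bp = 4085502
gene_length_bp = 2358
protein_length_aa = 785

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the span is 2358 bp, which is 786 codons, giving 785 amino acids plus one stop codon.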


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGGACA CGACGTTCGC GCCAGCACCG AGTGCATTAT CGGGGTCCAC GACGGCATTT 
GACCTTACGC CAACCGACGT CAAGACAGGC GAACGGTACG CCGTCTTCAT CCCCGATTTT
GACACCCCCG TGGTGCTTGA ATTTCGGAAG GAGCTTTCCA ACGGCTCGTT AAGGTTCGCG
CGGTACGGCA CGGATCTGGA GATCCACTAT CTGAAAGCCG AATGGGACGT CATGCGCTCC
AACGGACGCG CGTGCCGAAT TTCGGACCTC GGACGGCGGG GCAAAGATCC GGGTGAAATA
GAAGATATCG ATCCTTCGGC GCTGTTGGAT CCGGACGAAC AGACCATAAC GGTCAAGGAG
CGCGCCAAGC GTCTCCTCGC CGCGAAGCGT TTGGAAGAAG CGCGGACGCT TCGTTTCTAC
GTTCTTCGGT ATGACGATGC ACCGGCTGGA AAGGGACACG TCGGCGTCGC CAATTTCATC
GGCGAGAACT GGACGGAAGC GCGGGACAAA GGATTCAAAT GGATTCCCTC GTCCGGCGCC
CTGCTGCGTG CCGTGAACGA ATGCGGGGCG CCCGGCGAGC GGCCGCTCTC CGCATTCCTG
AACCGACGGG GCCTCCACAA CCGCAGTCTG CGTTGGCCCA AGTTCACCAC GGACCTCGCC
ATCGTAATGA ATTCGGAATT TTGGTCCAAG CGTGCCAAAC GCAAGACCGA CGTTATCTCC
GATTTCTACG CGGAGTTCGA CCGTGAGAAC GCGGGACGCG AGGAAAAGGA AAAACTGACG
CGTCCTACCA AGGAGACTCT GCGACTTTGG ATCTGCGCTG GCGAGAACTA CTGGAGCTGG
CGGAGCAAGT ACGGTGAGAA GGCGGCACGC CGGCGCTTCA AAGGGCACCA TCGGCCGATT
GAGGCAACTT GCCCCCTCGA ATACGTCATG ATCGACCACA CCCGCATAGA TGCCTGGGCC
GCCATCTACG ACGAGAAGGG CACCAAGGTG CTGGTGGAGC GCCCGTGGCT GACGCTGGCC
ATCGACGTCT ACTCGCGCAT GATCCTGGGC GCCGTTCTTA CCTACGAATC CCCATCCGTC
TATAGCGCTC TTCTCTGCTT GAAGCAGGTG GTGCGCCGCA AGTCCTTCCT GATCGACGAG
TTCGGTTACC ACAAGGGAGC CACCGACGGG TGGGGTCGTC CGACCACGGT AATCGCTGAC
AATGGATGGG AATTCGTGGG CATCTCTTTC CAGGTGTGCT GCGAGGCTGC TGGCATTCAC
GTCATCTGGG CCCCGGTGAA AGCGCCGGAG TTCAAGCCCT ACGTCGAGCG CGTGTTCGGG
ATTCTGAACG AGATGCTCTG GCACCGTCTC GATCAGGGCA TCCCGCTCAA GCCCCAGGAG
ATGACGGCGA TGGGGCTGAC GCCGCACACG AAGGCCGTTC ACACCTTGGA ATGGCTGTAC
CACCGAATGT GGACCGGCAT CGTCACGTTG TATCACGTCG AGGAGCACGG CGCGGACAAG
ATCGTTCCTG CGAAGCGTTG GCGCGAGGGC TTGCTCTCCG ACGGTAGGCC TACGATCGAC
GACGTGCGGG ATCTCGACAA GGCGCTGGGA CGCTCGCAGG TCTGTCTGCT CACCACAAGC
GGGATCACTT TCGACAACCA TCGGTTCCAC GACCCGGACA CGACGAGCAT GTTGCTCGAT
AGGCTGCTCA GGAAAGCGAA AAGAGGAACG CAGCGCAACG GAAGGCTGTC GAGTGGAGTC
GTTGCGGTGC TCTGCACGTA CAGTTCGCAG GACTGTTCAT CGATTAACGT ATGGGACTTT
GTACTCCGCA AAAACGTCTC TCTTCCGAAC TGGAACCGGC GATTCTCCGA GGGGCTTTCT
TTCAAGACGG CTGCCGACAT CCGAAATTTC GCTCGCGTCG AGAACAGGGC GTTTCATTCG
GACGCCGACA AGCACGCGGC GCGTGCCGCG TTCGGACGGT TGATCGGGGA AAAAATCAAG
ACGTTGCCGT TCGGACAGGC CCGCAAGCTG GCCGGCGAGA TCGTCGCCCC TCAGTTGGTA
GCCGGCGACT TCGTCGAGCA AAAGCTGATC GGATCTTCGG CGACCGACGA CCATTCGTTG
GACGTGCCGC AGACCGTAGC GGCAGGTGAG CGTACGGACG ATCGCATTCC GGACAAGGGT
TTTCGTCGCG GCGGCAAGGC CGCTACCCGG AAGGCGACGG CTACCCGCCG CGCAAATATC
GCCGCTGGAA AAAAGGAGGC TGCAAAATTG GAGGCGGCAC CGCCGTCTCC GCGCGTGGCG
AAGACGGCGG TCGGTAAAGG CATGACCGAC ACAGAGGCCG CCGAGCTTCT GGCCTCCTTG
TCCGAAGATC TGGATTGA
 
Protein sequence
MTDTTFAPAP SALSGSTTAF DLTPTDVKTG ERYAVFIPDF DTPVVLEFRK ELSNGSLRFA 
RYGTDLEIHY LKAEWDVMRS NGRACRISDL GRRGKDPGEI EDIDPSALLD PDEQTITVKE
RAKRLLAAKR LEEARTLRFY VLRYDDAPAG KGHVGVANFI GENWTEARDK GFKWIPSSGA
LLRAVNECGA PGERPLSAFL NRRGLHNRSL RWPKFTTDLA IVMNSEFWSK RAKRKTDVIS
DFYAEFDREN AGREEKEKLT RPTKETLRLW ICAGENYWSW RSKYGEKAAR RRFKGHHRPI
EATCPLEYVM IDHTRIDAWA AIYDEKGTKV LVERPWLTLA IDVYSRMILG AVLTYESPSV
YSALLCLKQV VRRKSFLIDE FGYHKGATDG WGRPTTVIAD NGWEFVGISF QVCCEAAGIH
VIWAPVKAPE FKPYVERVFG ILNEMLWHRL DQGIPLKPQE MTAMGLTPHT KAVHTLEWLY
HRMWTGIVTL YHVEEHGADK IVPAKRWREG LLSDGRPTID DVRDLDKALG RSQVCLLTTS
GITFDNHRFH DPDTTSMLLD RLLRKAKRGT QRNGRLSSGV VAVLCTYSSQ DCSSINVWDF
VLRKNVSLPN WNRRFSEGLS FKTAADIRNF ARVENRAFHS DADKHAARAA FGRLIGEKIK
TLPFGQARKL AGEIVAPQLV AGDFVEQKLI GSSATDDHSL DVPQTVAAGE RTDDRIPDKG
FRRGGKAATR KATATRRANI AAGKKEAAKL EAAPPSPRVA KTAVGKGMTD TEAAELLASL
SEDLD
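
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below is a minimal illustration, not the database's own method: it uses the standard codon assignments (which table 11 shares for elongation), treats only ATG, GTG and TTG as start codons (an assumed subset), and translates just the first 30 and last 18 bases of the gene. The names `translate`, `ALT_STARTS` and `CODON_TABLE` are hypothetical.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only these start codons are handled; all are read as M at position 0.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna, at_start=False):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and at_start and codon in ALT_STARTS:
            protein.append("M")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


first_30 = "TTGACGGACA CGACGTTCGC GCCAGCACCG"  # first 30 bases of the gene sequence
last_18 = "TCCGAAGATC TGGATTGA"                # last 18 bases of the gene sequence

print(translate(first_30, at_start=True))  # MTDTTFAPAP
print(translate(last_18))                  # SEDLD*
```

The two outputs match the first ten and last five residues of the protein sequence, with the final TGA read as the stop codon.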