Gene RPB_4609 details


Gene Information

Locus tag: RPB_4609
Symbol:
ID: 3912426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5204722
End bp: 5208039
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 71%
IMG OID: 637886513
Product: Sel1-like protein
Protein accession: YP_488203
Protein GI: 86751707
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
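
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5204722, 5208039)
print(length)                  # 3318
print(protein_length(length))  # 1105
```

Both values agree with the Gene Length and Protein Length fields of this record.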


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGCG TATCGTGGAG CGTCGAGGGC ATCGAACCGT CAGTGCGCGA AAGGGCGGAG 
GCCGCCGCCC GGCGCGCTGG CATGTCGCTC GCCGACTGGA TCGACGGTCA GCTCGGCGAC
AGCGCACCTC AGCCGCACCC GACGGCCGAC ACGACCCGGG AGTCGATCCG CGCCCCGCTG
GCGGAAAAGA ACGCGACCGA GGTCGCCGAA ATTCACCAAC GGCTGGATTC CATCGCCCGC
CAGATCGACC AGATTTCACG CCCCCCGGTG CGCAGCGAGC CCGGCGTGGC GCGGCAGCTC
AACGACGCGA TTTCGCGACT CGACGCAAGG CTCGCCCGGA TCACCGAACC CAAGGTCGCT
ACGGCGGCCC CCTCAGCCGT CGCGCCGACG CAGGCCGCGC CGCCCCGAGC CGCTGCCTCC
GCCCCGCAAT CGCCGACCGA ACGCGTCGAG CACGCCGCGG CGCAGGTCTA TCACGCCTCG
CCGCCGCTCG ATCCGAGTGC GCTCGATCGC GCGATCGCCG AGATCGCCGC GCGCCAGTCC
GAGCTCGACG CCCCGGTGGA GCGGATGCCG CTGCGCCAGT CGCCGCCGAT CGCGCCGGCG
ATGGCGCCGC CGCAGGCACG GCCGGGGCCC GACTTTTCCA GCCTCGAGCA ACAGCTCCTC
AAGATCACCA GCCAGATCGA CGCGCTGCAG CGCCCCGACG TGATCGAGCA ATCGATCGCG
GCGTTCCGCG CCGATCTCGC CGATATCCGC CAGACCATCA CCGAAGCATT GCCGCGCAAG
GCGATCGAAT CGCTCGAAAG CGAGATCAGA TCGCTGTCGC AGCGGCTCGA CGAGACCCGT
GCCAACGGCA GCGACGCCGG CGTGATCGCC GGGATCGAAC GCGCGCTCGG CGAAATCCGC
GATGCCTTGC GCTCGCTGAC GCCGGCCGAG CAGCTCGCCG GCTTCGACGA GGCGATCCGC
AATCTCGGCG GCAAGATCGA CATGATCGTG CGCAACAGCG ACGATCCCGG CACGCTGCAA
CAGCTCGAGA ATGCAATCGG CGCATTGCGC GGCATCGTCT CCAACGTCGC CTCCAACGAG
GCGCTGGCGC AGCTCAGCGA CAACGTTCAC ACGCTGGCCG ACAAGGTCGA TCAGCTCGCC
CGCGCCGACA GCCACAGCGA TTCCTTCGCG GCGCTGGAAA GCCGGATTTC GGCGCTGACC
GCGGCGCTGG AGAACCGCGA ACGGCCGATC GCCACCGAAT CCACCGAGCA GCTCGAAGGC
GCGGTGCGGG CGCTGTCGGA GCGGCTCGAT CAGATGCCGG TCGGCAACGA CGGCTCGTCG
GCGTTCGCTC ATCTCGAACA GCGTGTTTCC TACCTGCTCG AGCGGATGGA AGCCGCCGCC
GTCCAACGCG GCAGCGGCGA TCTCGGCCGG GTCGAGGAAG GGCTGCAGGA CATCCTCCGG
ATGCTCGAGC GGCAGCAGGA GAGCTTCCAC CGCATCGCCG ATCTCGGTCG CGCGCCGGCC
GCGCCGCCGT TCGACCCGGG CGTCGTCGAG TCGATCAAGC GCGAAATTTC CGACATCCGC
CTGAGCCAGT CGCAAACCGG ACGCCACACC CAGGATTCGC TGGAAGCGGT TCACAACACG
CTCGGCCACG TCGTCGACCG GCTGGCGATG ATCGAAGGCG ATTTGCGCAA GGCGCGGTCC
GCGCCGCAAC CCGCGGCGGC GCGCGAACCC GCCCAGCCGC AGCAGCCGGT CGTCACGCCT
CCGCCGGCGG CGCCGCAAAT CTCGCTGCCG CCGCGCCCGG AGATGCCGAA TCCCGCAGCC
GCGACCGCGT TCGCGGCCGC TCCGCGGGAG TTCGCGCCGA CGCGGCCCAC GGAACAGCCC
GAGCCGACGC CGGGACCGCG GGCGATCATG GATATTCTGG CGCCGCCCGT CAGCCGCCCC
TCGGCCCCTG AGCCGCAGAT CGCGCCGCAA AGGCTCGCCG CCGATGCCTC GCTGCCACCG
GATCATCCGC TCGAACCCGG CACCCGCCCT CCCGGCCGGG TGGCGTCGCC GTCGGAACGC
ATCGCAGCCT CGGAGAGCGC CATCAGCGAA TTCGCCGGCG CCAAGCCGGA GCCCGCCAGC
AGCTCGAATT TCATCGCGGC CGCCCGCCGC GCCGCGCAGG CCGCGGCAGC GGCCACCACC
CAATCCGGCG ACAAGCCCAA GGGCGACGGC GGCAAGTCCG GCCCGGCTCC CGGCAAGCCG
GGATCGACCA TCGGGTCCAA GATCCGCTCG CTGCTGGTCG GCGCCAGCGT CGTGGTGATC
GTGCTCGGCA CCTTCAAGAT GGCGATGAAT CTGCTCGATG ACGGCCACCC GACCCCGGCC
GCCAGCCTCA GCGAACCGGC GCCGCAGGAC TTGATGCCGC AGAGTGGGGA CGAAGAGATC
GAGCCGCCCG CGGCGAACCC CGCCACGCCG ACGCCCGCCC CGTCGATGAC ATCGCCGACC
CCGATCAATC GGCAGTCGCT GTTCGCGCCG CCGCAAACGC CTGCTGCGCC GCCCGCGCCG
TCCACCGCTC CTGCCTCCGC CGACGTCACC GGGACGATCC CGACGCCGCA GGTCAACGCC
GCCGCAAACA CCACGGTCAC GGCCACGGTC GCGATTCCGG CCGGCGAAAC CCTCCCCGAC
GCGATCGGCG GACAGGCGCT GCGCAAGGCC GCGCTGAAGG GCGACGCCGC GGCGGCCTAC
GAGGTCGGCA ACCGCTACGC CGACGGCAAG GGTATCACCG CGAATTTCGA GGAAGCGGCG
AAGTGGTACG GCCGCGCGGC GCAAGCCGGC ATCGTGCCGG CGATGTTCCG GATGGGGACC
CTCAACGAGA AGGGCCTCGG CGTGAAGAAG GACCTCGACA CGGCCCGGCG CTTCTACATT
CAGGCGGCGG ACCGCGGCAA CGCCAAGGCC ATGCACAATC TCGCGGTGCT CGACGCCGAT
GGCGGCGCCA AGGGCGCCAA CTACAAGAGC GCTGCGGAAT GGTTCCGCAA GGCGGCCGAG
CGCGGCGTCG CCGACAGCCA GTTCAACCTC GGCATCCTCT ATGCCCGCGG CATCGGCGTC
GAACAGAACC TCGCCGAATC GTTCAAATGG TTCAGCCTCG CCGCCGCCCA GGGTGACGCC
GATTCCGCGC GCAAACGCGA CGACGTCGCC AAGCGGCTCG ACCCGCAGTC GTTGTCGGCG
GCCCGGCTCG CGATCCAGAC CTTCACCGTG GAGCCGCAGC CCGACAGCGC CGTCAAGGTC
GCGGCGCCGG CCGGCGGCTG GGACGCGCAG GCGACCGCCA CAGCCAAACC AGCGACCAGC
AAGCGCGCCG CGCGTTAA
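
The GC content field in the gene table can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is simply the fraction of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, and how the source database rounds its figure is not stated in this record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence, used as a small example.
print(f"{gc_content('ATGAATCGCG'):.0%}")  # 50%
```

Passing the full gene sequence, with its 10-base blocks and line breaks left in, works the same way because whitespace is stripped first.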
 
Protein sequence
MNRVSWSVEG IEPSVRERAE AAARRAGMSL ADWIDGQLGD SAPQPHPTAD TTRESIRAPL 
AEKNATEVAE IHQRLDSIAR QIDQISRPPV RSEPGVARQL NDAISRLDAR LARITEPKVA
TAAPSAVAPT QAAPPRAAAS APQSPTERVE HAAAQVYHAS PPLDPSALDR AIAEIAARQS
ELDAPVERMP LRQSPPIAPA MAPPQARPGP DFSSLEQQLL KITSQIDALQ RPDVIEQSIA
AFRADLADIR QTITEALPRK AIESLESEIR SLSQRLDETR ANGSDAGVIA GIERALGEIR
DALRSLTPAE QLAGFDEAIR NLGGKIDMIV RNSDDPGTLQ QLENAIGALR GIVSNVASNE
ALAQLSDNVH TLADKVDQLA RADSHSDSFA ALESRISALT AALENRERPI ATESTEQLEG
AVRALSERLD QMPVGNDGSS AFAHLEQRVS YLLERMEAAA VQRGSGDLGR VEEGLQDILR
MLERQQESFH RIADLGRAPA APPFDPGVVE SIKREISDIR LSQSQTGRHT QDSLEAVHNT
LGHVVDRLAM IEGDLRKARS APQPAAAREP AQPQQPVVTP PPAAPQISLP PRPEMPNPAA
ATAFAAAPRE FAPTRPTEQP EPTPGPRAIM DILAPPVSRP SAPEPQIAPQ RLAADASLPP
DHPLEPGTRP PGRVASPSER IAASESAISE FAGAKPEPAS SSNFIAAARR AAQAAAAATT
QSGDKPKGDG GKSGPAPGKP GSTIGSKIRS LLVGASVVVI VLGTFKMAMN LLDDGHPTPA
ASLSEPAPQD LMPQSGDEEI EPPAANPATP TPAPSMTSPT PINRQSLFAP PQTPAAPPAP
STAPASADVT GTIPTPQVNA AANTTVTATV AIPAGETLPD AIGGQALRKA ALKGDAAAAY
EVGNRYADGK GITANFEEAA KWYGRAAQAG IVPAMFRMGT LNEKGLGVKK DLDTARRFYI
QAADRGNAKA MHNLAVLDAD GGAKGANYKS AAEWFRKAAE RGVADSQFNL GILYARGIGV
EQNLAESFKW FSLAAAQGDA DSARKRDDVA KRLDPQSLSA ARLAIQTFTV EPQPDSAVKV
AAPAGGWDAQ ATATAKPATS KRAAR
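
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is an illustration under stated assumptions rather than the database's own method: `translate` is a hypothetical helper, the codon table is built from the standard assignments (which translation table 11 shares for elongation codons), and the special handling of alternative start codons is ignored because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1 until the first stop codon; whitespace is ignored."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAATCGCG TATCGTGGAG CGTCGAGGGC"))  # MNRVSWSVEG
# Last line of the gene sequence, which ends in the TAA stop codon.
print(translate("AAGCGCGCCG CGCGTTAA"))               # KRAAR
```

The two outputs match the first and last residues of the protein sequence listed above.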