Gene RPC_0738 details


Gene Information

Locus tag: RPC_0738
Symbol:
ID: 3970560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 810538
End bp: 813855
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 69%
IMG OID: 637923853
Product: Sel1
Protein accession: YP_530628
Protein GI: 90422258
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
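
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue to the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 810538
end_bp = 813855
gene_length_bp = 3318
protein_length_aa = 1105

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 3318 0 1105
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length.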


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGGAAG CGGACAAGAG CATGAATTCG CGCGTATCGT GGAGTGTCGA GGGGATCGAA 
CCGTCAGTGC GCGAGCGGGC CGAAGCCGCC GCGCGCCGGG CCGGCCTTTC GCTCGGCGAC
TGGATCAGCG CCCAGGTCGG CGACGTTCCG CCGCAGCTTC GGCCGCAGGA CCCATCGCAG
GCGACTGTGC GGCAGTCATC GCCCGGTTTG GCGGAGAAAG ATGCCCAGGA GGTCGCCGAA
ATCCACCAAC GGCTGGATTC CATCACCCGG CAGATCGAAC AGATTTCGCG GCCGGCCGGC
CGCGGCGAGC CCGGCGTAGC CCGGCAGCTC AACGACGCCA TCTCGCGGCT CGATGCCCGG
TTGTCGCAGA TTACCGCACG GCCCGCTCAA CGCCCCGATC CGCAGCAATC TCAGCCACAG
ATCGACCGGG TCGAGCGCGC CGCTGCCGAT GTCTATCGCT CGTCGCCGCC GCTCAGCCCG
GTCTCGCTGG ATTTTGCGAT CGCCGAGATC GCCGCGCGGC AGAACGAGCT CGACGCCGCC
GCCAACCAGA TCATGCCGCG CCGCGCCGCG CCGCCGATCG CGCCCGCGAT GCCGGCCGCG
CCGGATGTGT CCGGGCTGGA ACGCCAGTTG CACAAGATCA CTACCCAGAT CGACGCATTG
CAGCGTCCTG ATGCGATCGA GCAATCGATC GCCGCTTTCC GCACCGAGCT CGCCGAGATC
CGCCAGACCA TCACCGAAGC GATGCCGCGT CGCGCCATCG AATCGCTGGA GACCGAGATC
CGCTCGCTGG CGCAGCGCAT CGACGAGAGC CGGCACAGCG GCATCGATGG CGGTGCGCTG
GCCGGGGTCG AGCGCGCGCT GGAAGAAATC CGTGACGTGC TGCGCTCGTT GACGCCGGCC
GAGCAGCTCG CCGGGTTCGA CGAGGCGATC CGCAATCTCG GCGGCAAGAT CGACATGATC
GTCCGCTCGA GCGGCGATCC TTCGACGCTG CAACAGCTCG AAGGCGCCAT AGCGGCGCTG
CGCGCCATCG TCTCCAACGT CGCCTCCAAC GAGGCGCTGG CGCGGCTCAG CGAGGACGTC
CACACGCTGT CGTCCAAAGT CGACCAGCTC GCCCGGGCCG ACAGCCACAG CGATTCCTTC
GCCGCCCTCG AACAACGCAT CGCCGCTCTG ACCACGACGC TGGAGAACCG CGAACGGCCG
GTGCAAAGCG ATCCTTCGGC GCAACTGGAA ACCGCGCTGC AGGCGTTATC CGATCGGCTC
GATCATCTGC CGGTCGGCAA TGACGGCGCC GCCACTTTCG CGCATCTCGA GCAGCGGGTC
AGCTACCTGC TCGAGCGCCT GGAAGCCTCG GCCGAGGCTC GCTCGCCGAA TCTCGGGCGG
GTCGAAGAAG GCTTGCAGGA CGTGCTGCGT CAGCTCGAAC GCCAGCAGGA CACCTTCGCC
GCGCTGACCG CGGTCAGCCG CACCACGTCG CCGGACAACG GTCTGGTCGA TACGCTGAAG
CGCGAAATCA CCGATCTGCG GCTGACGCAG TCGCAAGCCG ATCGCCACAC CCAGGACTCC
CTGGAAGCGG TGCACAATAC GCTCGGCCAC GTGGTCGACC GGCTGGCGAT GATCGAAGGC
GATCTACGCA GCGCGCGCTC CGCGCCGCAG CCGGCTCCGA CGCCGCAGCC GCAGCCCGCA
GCAATCCGGC CGATCACGCC GCAGCCCGCG CCGCCGCCCT CGGCGCCCGC GGTCGTGGTA
CCGCCCCGAC CGGAGATGCC GAATCCCGCC GCCGCGCATT TCGACGCAGC CCCCCGTCAA
TTCGCCCCGG TTTCGACGCC GGCCGAGCCG TCGGCGCCGA GTGCCGCGAA GACCATCAAG
GATATTTTGA GCCCGAAGCC GGCGCGCGAC AGCGTCGCCG CGCCGCCCGA GAAGGCGCCT
GCGCGTGCTC CGTCCCGGCT GTCGATCAAT CCCGAACTGC CGCCGGATCA TCCGCTAGAG
CCCGGCACCC GCCCGCAGGG ACGGACCTCA CCGTCGGAGC GGATCGCGGC GTCGGAAAGC
GCGATCAGCG AGATCCCGGC GCCGACCCGC GAACAAGTTA GTTCGTCGAA CTTCATCGCC
GCGGCACGCC GCGCCGCGCA GGCCGCCGCA GCCGCGAGCC AGGTGACCGA GAAGTCGAAA
TCCGCCGCCA AATCCGCCGC CAAGTCGGCG GCGAAGCCGG GCGGCAAAAC CGGCGCTCCG
GGCACGGCGA CGCCGAGCGG TGGCAACTCG ACGCTGGGTT CGAAAATTCG CTCGTTGCTG
GTCGGGGCCA GCGTGGTGGT GATCGTGCTC GGCACCTTCA AAATGGCGAT GACCATGCTC
GACGGCGGCC CGTCGCAGCC CGCCGCCATC GAAAGCGCGG AGCCGGCGCC GTTGCCGCCG
CCCAGCAGCG GCGACGCCGC ACCGGCGCCT GCCACACCGG CCAACCCGTC GATGACCTCG
CCGACCCCGA TCGACCGGCA ATCGTTGTTC GGCCCGGCGC CGGCCGCGCC GGGCGGCAGC
CGCGTCGCCG ATGCCGACGT CACCGGCACG ATTCCGACAC CGCCACCGGC GATTCCGACG
GCGCCGCCGA GCGGCCGTGC GGCGACCATG GTGTCGATCC CGGCAAACGA GAAACTGCCG
GATGCGATCG GCGGAGCGCA ACTGCGCTTC GCTGCGCTGA AAGGCGATCC CGCCGCGGCC
TATGAAATCG GTGTGCGCTA CGCCGAAGGC AAAACCGTGC CGGCGAATTT CGACGAGGCC
GCGAAATGGT ATGAGCGCGC CGCGCAGGCC GGCATCGTGC CGGCGATCTT CCGGATCGGC
ACCCTTTATG AAAAGGGTCT CAGCGTGAAC AAGGATCTCG GCGCCGCGCG ACGCTACTAC
ATCCTCGCGG CGGAGCGCGG CAACGCCAAG GCGATGCACA ATCTCGCGGT GCTGGAAGCC
GATGGCGGCG CGCAGGGCGC CAACTACAAG AGCGCCTCGC AATGGTTCCG CAAGGCTGCC
GAGCGCGGCG TCGCCGACAG CCAATTCAAT CTCGGCATTC TCTATGCGCG CGGCATCGGC
GTCGAGCAGA ACCTCGCCGA ATCCTACAAA TGGTTCAGCC TCGCTGCCGC GCAGGGTGAC
GCCGATGCCG GCCGTAAGCG CGACGATGTC GCCAAACGGC TCGACCCGCA ATCGCTGTCG
GCCGCCAAAT TGGCGATCCA GACCTTCATG CCGGAGCCGC AGCCGGACGA CGCGATCAAC
GTGGCCAGCC CGCCCGGTGG TTGGGACGGT GCCGGCACTG CTGCCAGGCC CGCGGCGGCC
AAGCGCGCGG CGCGGTAA
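
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any per-base statistic such as GC content is computed. The following is a minimal sketch of that step; the function name `gc_content` is hypothetical, and the example input is only the first two blocks of the sequence above, not the full gene.

```python
def gc_content(blocks: str) -> float:
    """Return the G+C fraction of a sequence given as whitespace-separated blocks."""
    # Join the 10-base blocks into one continuous string.
    seq = "".join(blocks.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence above:
# 13 of these 20 bases are G or C.
fraction = gc_content("GTGCCGGAAG CGGACAAGAG")
print(fraction)
```

Passing the complete gene sequence to the same function would give the figure to compare against the GC content field.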
 
Protein sequence
MPEADKSMNS RVSWSVEGIE PSVRERAEAA ARRAGLSLGD WISAQVGDVP PQLRPQDPSQ 
ATVRQSSPGL AEKDAQEVAE IHQRLDSITR QIEQISRPAG RGEPGVARQL NDAISRLDAR
LSQITARPAQ RPDPQQSQPQ IDRVERAAAD VYRSSPPLSP VSLDFAIAEI AARQNELDAA
ANQIMPRRAA PPIAPAMPAA PDVSGLERQL HKITTQIDAL QRPDAIEQSI AAFRTELAEI
RQTITEAMPR RAIESLETEI RSLAQRIDES RHSGIDGGAL AGVERALEEI RDVLRSLTPA
EQLAGFDEAI RNLGGKIDMI VRSSGDPSTL QQLEGAIAAL RAIVSNVASN EALARLSEDV
HTLSSKVDQL ARADSHSDSF AALEQRIAAL TTTLENRERP VQSDPSAQLE TALQALSDRL
DHLPVGNDGA ATFAHLEQRV SYLLERLEAS AEARSPNLGR VEEGLQDVLR QLERQQDTFA
ALTAVSRTTS PDNGLVDTLK REITDLRLTQ SQADRHTQDS LEAVHNTLGH VVDRLAMIEG
DLRSARSAPQ PAPTPQPQPA AIRPITPQPA PPPSAPAVVV PPRPEMPNPA AAHFDAAPRQ
FAPVSTPAEP SAPSAAKTIK DILSPKPARD SVAAPPEKAP ARAPSRLSIN PELPPDHPLE
PGTRPQGRTS PSERIAASES AISEIPAPTR EQVSSSNFIA AARRAAQAAA AASQVTEKSK
SAAKSAAKSA AKPGGKTGAP GTATPSGGNS TLGSKIRSLL VGASVVVIVL GTFKMAMTML
DGGPSQPAAI ESAEPAPLPP PSSGDAAPAP ATPANPSMTS PTPIDRQSLF GPAPAAPGGS
RVADADVTGT IPTPPPAIPT APPSGRAATM VSIPANEKLP DAIGGAQLRF AALKGDPAAA
YEIGVRYAEG KTVPANFDEA AKWYERAAQA GIVPAIFRIG TLYEKGLSVN KDLGAARRYY
ILAAERGNAK AMHNLAVLEA DGGAQGANYK SASQWFRKAA ERGVADSQFN LGILYARGIG
VEQNLAESYK WFSLAAAQGD ADAGRKRDDV AKRLDPQSLS AAKLAIQTFM PEPQPDDAIN
VASPPGGWDG AGTAARPAAA KRAAR
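
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the method used to produce this record: it assumes the gene sequence is given in coding orientation, uses the amino acid assignments of the standard genetic code (which translation table 11 shares), and reads the first codon as methionine because it is the initiation codon (GTG here). The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(blocks: str) -> str:
    """Translate a coding sequence given as whitespace-separated blocks."""
    seq = "".join(blocks.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a trailing stop codon if present.
    if protein.endswith("*"):
        protein = protein[:-1]
    # Assumption: the first codon is the initiation codon and is read as Met.
    return "M" + protein[1:]


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCCGGAAG CGGACAAGAG CATGAATTCG"))  # prints: MPEADKSMNS
```

The ten residues produced match the first block of the protein sequence listed above.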