Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0738 |
Symbol | |
ID | 3970560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 810538 |
End bp | 813855 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637923853 |
Product | Sel1 |
Protein accession | YP_530628 |
Protein GI | 90422258 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGAAG CGGACAAGAG CATGAATTCG CGCGTATCGT GGAGTGTCGA GGGGATCGAA CCGTCAGTGC GCGAGCGGGC CGAAGCCGCC GCGCGCCGGG CCGGCCTTTC GCTCGGCGAC TGGATCAGCG CCCAGGTCGG CGACGTTCCG CCGCAGCTTC GGCCGCAGGA CCCATCGCAG GCGACTGTGC GGCAGTCATC GCCCGGTTTG GCGGAGAAAG ATGCCCAGGA GGTCGCCGAA ATCCACCAAC GGCTGGATTC CATCACCCGG CAGATCGAAC AGATTTCGCG GCCGGCCGGC CGCGGCGAGC CCGGCGTAGC CCGGCAGCTC AACGACGCCA TCTCGCGGCT CGATGCCCGG TTGTCGCAGA TTACCGCACG GCCCGCTCAA CGCCCCGATC CGCAGCAATC TCAGCCACAG ATCGACCGGG TCGAGCGCGC CGCTGCCGAT GTCTATCGCT CGTCGCCGCC GCTCAGCCCG GTCTCGCTGG ATTTTGCGAT CGCCGAGATC GCCGCGCGGC AGAACGAGCT CGACGCCGCC GCCAACCAGA TCATGCCGCG CCGCGCCGCG CCGCCGATCG CGCCCGCGAT GCCGGCCGCG CCGGATGTGT CCGGGCTGGA ACGCCAGTTG CACAAGATCA CTACCCAGAT CGACGCATTG CAGCGTCCTG ATGCGATCGA GCAATCGATC GCCGCTTTCC GCACCGAGCT CGCCGAGATC CGCCAGACCA TCACCGAAGC GATGCCGCGT CGCGCCATCG AATCGCTGGA GACCGAGATC CGCTCGCTGG CGCAGCGCAT CGACGAGAGC CGGCACAGCG GCATCGATGG CGGTGCGCTG GCCGGGGTCG AGCGCGCGCT GGAAGAAATC CGTGACGTGC TGCGCTCGTT GACGCCGGCC GAGCAGCTCG CCGGGTTCGA CGAGGCGATC CGCAATCTCG GCGGCAAGAT CGACATGATC GTCCGCTCGA GCGGCGATCC TTCGACGCTG CAACAGCTCG AAGGCGCCAT AGCGGCGCTG CGCGCCATCG TCTCCAACGT CGCCTCCAAC GAGGCGCTGG CGCGGCTCAG CGAGGACGTC CACACGCTGT CGTCCAAAGT CGACCAGCTC GCCCGGGCCG ACAGCCACAG CGATTCCTTC GCCGCCCTCG AACAACGCAT CGCCGCTCTG ACCACGACGC TGGAGAACCG CGAACGGCCG GTGCAAAGCG ATCCTTCGGC GCAACTGGAA ACCGCGCTGC AGGCGTTATC CGATCGGCTC GATCATCTGC CGGTCGGCAA TGACGGCGCC GCCACTTTCG CGCATCTCGA GCAGCGGGTC AGCTACCTGC TCGAGCGCCT GGAAGCCTCG GCCGAGGCTC GCTCGCCGAA TCTCGGGCGG GTCGAAGAAG GCTTGCAGGA CGTGCTGCGT CAGCTCGAAC GCCAGCAGGA CACCTTCGCC GCGCTGACCG CGGTCAGCCG CACCACGTCG CCGGACAACG GTCTGGTCGA TACGCTGAAG CGCGAAATCA CCGATCTGCG GCTGACGCAG TCGCAAGCCG ATCGCCACAC CCAGGACTCC CTGGAAGCGG TGCACAATAC GCTCGGCCAC GTGGTCGACC GGCTGGCGAT GATCGAAGGC GATCTACGCA GCGCGCGCTC CGCGCCGCAG CCGGCTCCGA CGCCGCAGCC GCAGCCCGCA GCAATCCGGC CGATCACGCC GCAGCCCGCG CCGCCGCCCT CGGCGCCCGC GGTCGTGGTA CCGCCCCGAC CGGAGATGCC GAATCCCGCC GCCGCGCATT TCGACGCAGC CCCCCGTCAA TTCGCCCCGG TTTCGACGCC GGCCGAGCCG TCGGCGCCGA GTGCCGCGAA GACCATCAAG GATATTTTGA GCCCGAAGCC GGCGCGCGAC AGCGTCGCCG CGCCGCCCGA GAAGGCGCCT GCGCGTGCTC CGTCCCGGCT GTCGATCAAT CCCGAACTGC CGCCGGATCA TCCGCTAGAG CCCGGCACCC GCCCGCAGGG ACGGACCTCA CCGTCGGAGC GGATCGCGGC GTCGGAAAGC GCGATCAGCG AGATCCCGGC GCCGACCCGC GAACAAGTTA GTTCGTCGAA CTTCATCGCC GCGGCACGCC GCGCCGCGCA GGCCGCCGCA GCCGCGAGCC AGGTGACCGA GAAGTCGAAA TCCGCCGCCA AATCCGCCGC CAAGTCGGCG GCGAAGCCGG GCGGCAAAAC CGGCGCTCCG GGCACGGCGA CGCCGAGCGG TGGCAACTCG ACGCTGGGTT CGAAAATTCG CTCGTTGCTG GTCGGGGCCA GCGTGGTGGT GATCGTGCTC GGCACCTTCA AAATGGCGAT GACCATGCTC GACGGCGGCC CGTCGCAGCC CGCCGCCATC GAAAGCGCGG AGCCGGCGCC GTTGCCGCCG CCCAGCAGCG GCGACGCCGC ACCGGCGCCT GCCACACCGG CCAACCCGTC GATGACCTCG CCGACCCCGA TCGACCGGCA ATCGTTGTTC GGCCCGGCGC CGGCCGCGCC GGGCGGCAGC CGCGTCGCCG ATGCCGACGT CACCGGCACG ATTCCGACAC CGCCACCGGC GATTCCGACG GCGCCGCCGA GCGGCCGTGC GGCGACCATG GTGTCGATCC CGGCAAACGA GAAACTGCCG GATGCGATCG GCGGAGCGCA ACTGCGCTTC GCTGCGCTGA AAGGCGATCC CGCCGCGGCC TATGAAATCG GTGTGCGCTA CGCCGAAGGC AAAACCGTGC CGGCGAATTT CGACGAGGCC GCGAAATGGT ATGAGCGCGC CGCGCAGGCC GGCATCGTGC CGGCGATCTT CCGGATCGGC ACCCTTTATG AAAAGGGTCT CAGCGTGAAC AAGGATCTCG GCGCCGCGCG ACGCTACTAC ATCCTCGCGG CGGAGCGCGG CAACGCCAAG GCGATGCACA ATCTCGCGGT GCTGGAAGCC GATGGCGGCG CGCAGGGCGC CAACTACAAG AGCGCCTCGC AATGGTTCCG CAAGGCTGCC GAGCGCGGCG TCGCCGACAG CCAATTCAAT CTCGGCATTC TCTATGCGCG CGGCATCGGC GTCGAGCAGA ACCTCGCCGA ATCCTACAAA TGGTTCAGCC TCGCTGCCGC GCAGGGTGAC GCCGATGCCG GCCGTAAGCG CGACGATGTC GCCAAACGGC TCGACCCGCA ATCGCTGTCG GCCGCCAAAT TGGCGATCCA GACCTTCATG CCGGAGCCGC AGCCGGACGA CGCGATCAAC GTGGCCAGCC CGCCCGGTGG TTGGGACGGT GCCGGCACTG CTGCCAGGCC CGCGGCGGCC AAGCGCGCGG CGCGGTAA
|
Protein sequence | MPEADKSMNS RVSWSVEGIE PSVRERAEAA ARRAGLSLGD WISAQVGDVP PQLRPQDPSQ ATVRQSSPGL AEKDAQEVAE IHQRLDSITR QIEQISRPAG RGEPGVARQL NDAISRLDAR LSQITARPAQ RPDPQQSQPQ IDRVERAAAD VYRSSPPLSP VSLDFAIAEI AARQNELDAA ANQIMPRRAA PPIAPAMPAA PDVSGLERQL HKITTQIDAL QRPDAIEQSI AAFRTELAEI RQTITEAMPR RAIESLETEI RSLAQRIDES RHSGIDGGAL AGVERALEEI RDVLRSLTPA EQLAGFDEAI RNLGGKIDMI VRSSGDPSTL QQLEGAIAAL RAIVSNVASN EALARLSEDV HTLSSKVDQL ARADSHSDSF AALEQRIAAL TTTLENRERP VQSDPSAQLE TALQALSDRL DHLPVGNDGA ATFAHLEQRV SYLLERLEAS AEARSPNLGR VEEGLQDVLR QLERQQDTFA ALTAVSRTTS PDNGLVDTLK REITDLRLTQ SQADRHTQDS LEAVHNTLGH VVDRLAMIEG DLRSARSAPQ PAPTPQPQPA AIRPITPQPA PPPSAPAVVV PPRPEMPNPA AAHFDAAPRQ FAPVSTPAEP SAPSAAKTIK DILSPKPARD SVAAPPEKAP ARAPSRLSIN PELPPDHPLE PGTRPQGRTS PSERIAASES AISEIPAPTR EQVSSSNFIA AARRAAQAAA AASQVTEKSK SAAKSAAKSA AKPGGKTGAP GTATPSGGNS TLGSKIRSLL VGASVVVIVL GTFKMAMTML DGGPSQPAAI ESAEPAPLPP PSSGDAAPAP ATPANPSMTS PTPIDRQSLF GPAPAAPGGS RVADADVTGT IPTPPPAIPT APPSGRAATM VSIPANEKLP DAIGGAQLRF AALKGDPAAA YEIGVRYAEG KTVPANFDEA AKWYERAAQA GIVPAIFRIG TLYEKGLSVN KDLGAARRYY ILAAERGNAK AMHNLAVLEA DGGAQGANYK SASQWFRKAA ERGVADSQFN LGILYARGIG VEQNLAESYK WFSLAAAQGD ADAGRKRDDV AKRLDPQSLS AAKLAIQTFM PEPQPDDAIN VASPPGGWDG AGTAARPAAA KRAAR
|
| |