Gene Information |
Locus tag | RPD_0801 |
Symbol | |
ID | 4021275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 900997 |
End bp | 904401 |
Gene Length | 3405 bp |
Protein Length | 1134 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637960991 |
Product | Sel1 |
Protein accession | YP_567940 |
Protein GI | 91975281 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
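The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 900997
END_BP = 904401
GENE_LENGTH_BP = 3405
PROTEIN_LENGTH_AA = 1134


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 904401 - 900997 + 1 = 3405 bp, and 3405 / 3 - 1 = 1134 aa, which matches the listed Gene Length and Protein Length.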
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.835897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.855509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCG TATCGTGGAG CGTAGAAGGT ATCGAGCCGT CGGTGCGCGA GCGGGCGGAA GCCGCCGCCA AGCGCGCCGG CATGTCGCTG GCCGATTGGA TCAACGGTCA GCTCGGCGAT ACCGCGCCCC AGACTTTGGT TCAGTCCCAG CCCCGCTCGG CCGCCGAGGC CGGACATCAG CCGTTCGGCG CCGCATTGGC GGAAAACAGC GCCACCGAAG TTGCCGAGAT CCATCAGCGG CTGGATTCTA TTGCACGCCA GATCGACCAG ATGTCGCGGC CGCCGGTGCG TAACGAGCCC GGCGTGGCGC GGCAGCTCAA CGACGCAATT TCGCGGCTGG ACGCCCGCCT GGCGCGGATC ACCGAGCCCA AGGCTTCTAC GACGGCTTCT ACGACGGCTT CTACGACAGC TTCTACGGCT TCCGCGACGG TTCCTGCGGC GGCTCCCCCG GCGGCGGCAA CCGCCCGGAC CGCACCGCTT CAACACGCCC CTCTCCAGAC CCCGACCGAG CGGGTCGAAC GCGCTGCGGC TCAGGTCTAT CACGCCTCGC CCCCGCTCGA TCCCAATGCG CTCGATCTGG CGATCGCTGA AATCGCCGCG CGACAGAACG AACTCGACAG CACGGTGAAC CGGGTCGCGC CGCGCCAGGC CCCCCCGATC GTCCCGGCGA TGGCGCCGCC GCCGGTCCGA ACCGGGCCGG ACTTCTCCAG CCTGGAGCAG CAACTCCTCA AGATCACCAG CCAGATCGAC GCGCTGCAGC GGCCCGACGT GATCGAGCAG TCGATCGCGG CGTTCCGCGC CGATCTCGCC GAAATCCGTC AGACCATCAC CGAGGCGATG CCCCGCAAGG CGATCGAGTC GCTCGAGAGC GAGATCAAAT CGCTGTCGCG GCGGCTCGAC GAAACCCGCA GCAACGGCAG CGACGCCAGC GTCATCGCCG GCATCGAACG CGCGCTCGGG GAGATCCACG ACGCGCTGCG CTCGCTCACC CCGGCCGAGC AACTCGCCGG CTTCGACGAG GCGATCCGCA ATCTCGGCGG CAAGATCGAC ATGATCGTGC GCAACAGCGA CGATCCCGGC ACGATGCAGC AGCTCGAGAA AGCCATCGGC GCTTTGCGTA GCATCGTCTC CAACGTCGCC TCCAACGAGG CGCTGGCGCA GCTCAGCGAC AATGTTCACA CGCTGGCCGA CAAGGTCGAC CAACTCACCC GCGTCGATCA TCACAGCGAC TCGTTTGCGG CGCTTGAAAA CCGCATCTCG GCGCTGACCG CGGCGCTCGA AAGCCGCGAG CGCCCGGTCG CCGCCGACTC CTCCGAGCAG CTCGAAGGCG CGGTGCGCGC GCTGTCGGAG CGACTCGACC AACTGCCGGT CGGCAACGAC AGCTCGTCGG CCTTCGCGCA TCTCGAGCAG CGCGTCTCCT ATCTCCTGGA ACGGATGGAA ACCGCCGCGA CGCCGCGCGG CAGCGGCGAT TTCGGCCGGG TCGAGGAAGG GCTGCAGGAC ATCCTGCGGA TGCTCGAGCG GCAGCAGGAG AATTTTCATC GCTTGGCCGA CATCGAGCGC GCGCCGCCGC CGCCCGCTTT CGATCCCGGA GTTGTGGAGA CCCTCAAGCG CGAAGTCTCC GACATGCGCT TCAGCCAGTC GGAAACCGGC CGGCACACCC AGGACTCGCT GGAAGCGGTT CACAACACCC TCGGCCATGT CGTCGACCGG CTGGCGATGA TCGAAGGCGA CCTGCGCGCG GCGCGCGCCG CGCCGCAGCC CGCGCCGGAG CCGGCCAGGC CGCTGCCGGT GACCGCCCAG 
CCGGCGGCGT CGCCGCCGGT TTCGCTGCCG CCGCGTCCTG AAATGCCGAA TCCCGCCGCG GCGACTGCAT TCAGCGCTGC GCCACGAGAG TTCGCGCCGG CGCAACCGGC GCCGGCACCC GCACCTGCAC CGCGGGCGAT CCAGGACATT CTCGATCCGG CTGCGAGCCG GGCCGCGGCC GGCCCCTCGA CCGAGCCGCA GATTTCGTCG CCACACGCAT CGATCAATCC CGCATTGCCG CCGGACCACC CGCTGGAGCC AGGCTCCCGC CCGCCGGCCC GGGTCACCTC GCCGTCCGAG CGGATCGCGG CATCGGAAAG CGCGATCAAC GAACTCGGCG GCGCCAAGCC GGAGCCGGCC AGCAGCTCGA ACTTCATCGC CGCCGCCCGC CGCGCGGCGC AAGCCGCAGC GTCGGCGACC GGCCATTCCA CCGACAAGTC CAAAGGCGAC GGCAAAGCCG GCCCGACCCC CGGCAAGGCC GGACCGGGCT CCACCATCGG CTCCAAGATT CGCTCACTTC TGGTTGGCGC CAGCGTCGTC GTGATCGTGC TCGGCACCTT CAAGATGGCG ATGAACTTGC TCGACGGTGG CCAGCCGGTC CCGGCGGCAA GCCTCAGCGA GCCAGCGCCG CAAGGGATGG CGCCGTCCGA CGAGGACGAG GATGACACGC CACCCGCCGC CAGCGCTCCT TCCGCACCGG CGCCATCGAT GACGTCGCCG ACCCCGATCA ATCGCCAGTC GCTGTTCGCG CCGCAACAAC CGCCCGCGGC CGCACCGGCG CCAGCCGCTC CCGCTCCGGC TCCGGCGGTC TCCCCGGCAA CCGCACCCGC AGACATCACC GGCACCATCC CGGCGCCGCA GGCTGGCGCC GGGATGGGCG CCGCCGGCAA GATTGCGATT CCGGCCGGCG AAACCCTGCC CGACGCGATC GGCGGACCGG TGCTGCGCAA GGCCGCGCTG AAAGGCGACG CCGCGGCGGC TTTCGAAATC GGCAACCGCT ACGCCGACGG CAAAGGCATC GCGGCGAATT TCGAAGAGGC CGCCAAATGG TACGGCCGCG CTGCGCAGGC CGGCATCGTG CCGGCGATGT TCCGGATGGG CACCCTCAAC GAGAAGGGGC TCGGGCTGAA AAAGGATCTC GATACGGCGC GGCGCTACTA CGTGCAGGCG GCCGATCGCG GCAACGCCAA GGCGATGCAC AATCTCGCCG TGCTCGACGC CGACGGCGGC TCGAAGGGCG CGAACTACAA GACCGCGGCG GAGTGGTTCA GGAAAGCCGC CGAGCGCGGC GTCGCCGACA GCCAGTTCAA CCTCGGAATC CTGTATGCAC GAGGCATCGG CGTCGAACAG AACCTCGCCG AATCGTTCAA ATGGTTCAGC CTCGCGGCGG CGCAGGGCGA CTCCGACTCC GCCCGCAAGC GCGACGATGT CGCCAAGCGG CTCGATCCGC AATCGCTGTC GGCGGCCAAA CTCGCGATCC AGACGTTTGT CGTCGAGCCG CAGCCCGACG ACGCCGTCAA GGTTGCAGCG CCCGCCGGTG GCTGGGACGC CCAATCCCCG GCGGCGGCGA TCAGTCCTGC GACCAGCAAG CGCGCGGCAC GCTAA
|
Protein sequence | MNRVSWSVEG IEPSVRERAE AAAKRAGMSL ADWINGQLGD TAPQTLVQSQ PRSAAEAGHQ PFGAALAENS ATEVAEIHQR LDSIARQIDQ MSRPPVRNEP GVARQLNDAI SRLDARLARI TEPKASTTAS TTASTTASTA SATVPAAAPP AAATARTAPL QHAPLQTPTE RVERAAAQVY HASPPLDPNA LDLAIAEIAA RQNELDSTVN RVAPRQAPPI VPAMAPPPVR TGPDFSSLEQ QLLKITSQID ALQRPDVIEQ SIAAFRADLA EIRQTITEAM PRKAIESLES EIKSLSRRLD ETRSNGSDAS VIAGIERALG EIHDALRSLT PAEQLAGFDE AIRNLGGKID MIVRNSDDPG TMQQLEKAIG ALRSIVSNVA SNEALAQLSD NVHTLADKVD QLTRVDHHSD SFAALENRIS ALTAALESRE RPVAADSSEQ LEGAVRALSE RLDQLPVGND SSSAFAHLEQ RVSYLLERME TAATPRGSGD FGRVEEGLQD ILRMLERQQE NFHRLADIER APPPPAFDPG VVETLKREVS DMRFSQSETG RHTQDSLEAV HNTLGHVVDR LAMIEGDLRA ARAAPQPAPE PARPLPVTAQ PAASPPVSLP PRPEMPNPAA ATAFSAAPRE FAPAQPAPAP APAPRAIQDI LDPAASRAAA GPSTEPQISS PHASINPALP PDHPLEPGSR PPARVTSPSE RIAASESAIN ELGGAKPEPA SSSNFIAAAR RAAQAAASAT GHSTDKSKGD GKAGPTPGKA GPGSTIGSKI RSLLVGASVV VIVLGTFKMA MNLLDGGQPV PAASLSEPAP QGMAPSDEDE DDTPPAASAP SAPAPSMTSP TPINRQSLFA PQQPPAAAPA PAAPAPAPAV SPATAPADIT GTIPAPQAGA GMGAAGKIAI PAGETLPDAI GGPVLRKAAL KGDAAAAFEI GNRYADGKGI AANFEEAAKW YGRAAQAGIV PAMFRMGTLN EKGLGLKKDL DTARRYYVQA ADRGNAKAMH NLAVLDADGG SKGANYKTAA EWFRKAAERG VADSQFNLGI LYARGIGVEQ NLAESFKWFS LAAAQGDSDS ARKRDDVAKR LDPQSLSAAK LAIQTFVVEP QPDDAVKVAA PAGGWDAQSP AAAISPATSK RAAR
|
| |
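The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the spacing and compute a GC fraction that could be compared with the GC content field. The helper names are hypothetical, and the short input string is a toy example, not the RPD_0801 sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


if __name__ == "__main__":
    toy = "ATGC GGCC"  # toy input for illustration only
    cleaned = clean_sequence(toy)
    print(cleaned, gc_fraction(cleaned))
```

To apply this to the record, paste the full Gene sequence field as the input string and compare the rounded result with the listed GC content.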