Gene Information |
Locus tag | RPB_4609 |
Symbol | |
ID | 3912426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5204722 |
End bp | 5208039 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637886513 |
Product | Sel1-like protein |
Protein accession | YP_488203 |
Protein GI | 86751707 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
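The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for locus RPB_4609.
length = gene_length_bp(5204722, 5208039)
print(length)                     # 3318
print(protein_length_aa(length))  # 1105
```

Both results agree with the Gene Length (3318 bp) and Protein Length (1105 aa) fields listed in the record.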
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCGCG TATCGTGGAG CGTCGAGGGC ATCGAACCGT CAGTGCGCGA AAGGGCGGAG GCCGCCGCCC GGCGCGCTGG CATGTCGCTC GCCGACTGGA TCGACGGTCA GCTCGGCGAC AGCGCACCTC AGCCGCACCC GACGGCCGAC ACGACCCGGG AGTCGATCCG CGCCCCGCTG GCGGAAAAGA ACGCGACCGA GGTCGCCGAA ATTCACCAAC GGCTGGATTC CATCGCCCGC CAGATCGACC AGATTTCACG CCCCCCGGTG CGCAGCGAGC CCGGCGTGGC GCGGCAGCTC AACGACGCGA TTTCGCGACT CGACGCAAGG CTCGCCCGGA TCACCGAACC CAAGGTCGCT ACGGCGGCCC CCTCAGCCGT CGCGCCGACG CAGGCCGCGC CGCCCCGAGC CGCTGCCTCC GCCCCGCAAT CGCCGACCGA ACGCGTCGAG CACGCCGCGG CGCAGGTCTA TCACGCCTCG CCGCCGCTCG ATCCGAGTGC GCTCGATCGC GCGATCGCCG AGATCGCCGC GCGCCAGTCC GAGCTCGACG CCCCGGTGGA GCGGATGCCG CTGCGCCAGT CGCCGCCGAT CGCGCCGGCG ATGGCGCCGC CGCAGGCACG GCCGGGGCCC GACTTTTCCA GCCTCGAGCA ACAGCTCCTC AAGATCACCA GCCAGATCGA CGCGCTGCAG CGCCCCGACG TGATCGAGCA ATCGATCGCG GCGTTCCGCG CCGATCTCGC CGATATCCGC CAGACCATCA CCGAAGCATT GCCGCGCAAG GCGATCGAAT CGCTCGAAAG CGAGATCAGA TCGCTGTCGC AGCGGCTCGA CGAGACCCGT GCCAACGGCA GCGACGCCGG CGTGATCGCC GGGATCGAAC GCGCGCTCGG CGAAATCCGC GATGCCTTGC GCTCGCTGAC GCCGGCCGAG CAGCTCGCCG GCTTCGACGA GGCGATCCGC AATCTCGGCG GCAAGATCGA CATGATCGTG CGCAACAGCG ACGATCCCGG CACGCTGCAA CAGCTCGAGA ATGCAATCGG CGCATTGCGC GGCATCGTCT CCAACGTCGC CTCCAACGAG GCGCTGGCGC AGCTCAGCGA CAACGTTCAC ACGCTGGCCG ACAAGGTCGA TCAGCTCGCC CGCGCCGACA GCCACAGCGA TTCCTTCGCG GCGCTGGAAA GCCGGATTTC GGCGCTGACC GCGGCGCTGG AGAACCGCGA ACGGCCGATC GCCACCGAAT CCACCGAGCA GCTCGAAGGC GCGGTGCGGG CGCTGTCGGA GCGGCTCGAT CAGATGCCGG TCGGCAACGA CGGCTCGTCG GCGTTCGCTC ATCTCGAACA GCGTGTTTCC TACCTGCTCG AGCGGATGGA AGCCGCCGCC GTCCAACGCG GCAGCGGCGA TCTCGGCCGG GTCGAGGAAG GGCTGCAGGA CATCCTCCGG ATGCTCGAGC GGCAGCAGGA GAGCTTCCAC CGCATCGCCG ATCTCGGTCG CGCGCCGGCC GCGCCGCCGT TCGACCCGGG CGTCGTCGAG TCGATCAAGC GCGAAATTTC CGACATCCGC CTGAGCCAGT CGCAAACCGG ACGCCACACC CAGGATTCGC TGGAAGCGGT TCACAACACG CTCGGCCACG TCGTCGACCG GCTGGCGATG ATCGAAGGCG ATTTGCGCAA GGCGCGGTCC GCGCCGCAAC CCGCGGCGGC GCGCGAACCC GCCCAGCCGC AGCAGCCGGT CGTCACGCCT CCGCCGGCGG CGCCGCAAAT CTCGCTGCCG CCGCGCCCGG AGATGCCGAA TCCCGCAGCC 
GCGACCGCGT TCGCGGCCGC TCCGCGGGAG TTCGCGCCGA CGCGGCCCAC GGAACAGCCC GAGCCGACGC CGGGACCGCG GGCGATCATG GATATTCTGG CGCCGCCCGT CAGCCGCCCC TCGGCCCCTG AGCCGCAGAT CGCGCCGCAA AGGCTCGCCG CCGATGCCTC GCTGCCACCG GATCATCCGC TCGAACCCGG CACCCGCCCT CCCGGCCGGG TGGCGTCGCC GTCGGAACGC ATCGCAGCCT CGGAGAGCGC CATCAGCGAA TTCGCCGGCG CCAAGCCGGA GCCCGCCAGC AGCTCGAATT TCATCGCGGC CGCCCGCCGC GCCGCGCAGG CCGCGGCAGC GGCCACCACC CAATCCGGCG ACAAGCCCAA GGGCGACGGC GGCAAGTCCG GCCCGGCTCC CGGCAAGCCG GGATCGACCA TCGGGTCCAA GATCCGCTCG CTGCTGGTCG GCGCCAGCGT CGTGGTGATC GTGCTCGGCA CCTTCAAGAT GGCGATGAAT CTGCTCGATG ACGGCCACCC GACCCCGGCC GCCAGCCTCA GCGAACCGGC GCCGCAGGAC TTGATGCCGC AGAGTGGGGA CGAAGAGATC GAGCCGCCCG CGGCGAACCC CGCCACGCCG ACGCCCGCCC CGTCGATGAC ATCGCCGACC CCGATCAATC GGCAGTCGCT GTTCGCGCCG CCGCAAACGC CTGCTGCGCC GCCCGCGCCG TCCACCGCTC CTGCCTCCGC CGACGTCACC GGGACGATCC CGACGCCGCA GGTCAACGCC GCCGCAAACA CCACGGTCAC GGCCACGGTC GCGATTCCGG CCGGCGAAAC CCTCCCCGAC GCGATCGGCG GACAGGCGCT GCGCAAGGCC GCGCTGAAGG GCGACGCCGC GGCGGCCTAC GAGGTCGGCA ACCGCTACGC CGACGGCAAG GGTATCACCG CGAATTTCGA GGAAGCGGCG AAGTGGTACG GCCGCGCGGC GCAAGCCGGC ATCGTGCCGG CGATGTTCCG GATGGGGACC CTCAACGAGA AGGGCCTCGG CGTGAAGAAG GACCTCGACA CGGCCCGGCG CTTCTACATT CAGGCGGCGG ACCGCGGCAA CGCCAAGGCC ATGCACAATC TCGCGGTGCT CGACGCCGAT GGCGGCGCCA AGGGCGCCAA CTACAAGAGC GCTGCGGAAT GGTTCCGCAA GGCGGCCGAG CGCGGCGTCG CCGACAGCCA GTTCAACCTC GGCATCCTCT ATGCCCGCGG CATCGGCGTC GAACAGAACC TCGCCGAATC GTTCAAATGG TTCAGCCTCG CCGCCGCCCA GGGTGACGCC GATTCCGCGC GCAAACGCGA CGACGTCGCC AAGCGGCTCG ACCCGCAGTC GTTGTCGGCG GCCCGGCTCG CGATCCAGAC CTTCACCGTG GAGCCGCAGC CCGACAGCGC CGTCAAGGTC GCGGCGCCGG CCGGCGGCTG GGACGCGCAG GCGACCGCCA CAGCCAAACC AGCGACCAGC AAGCGCGCCG CGCGTTAA
Protein sequence | MNRVSWSVEG IEPSVRERAE AAARRAGMSL ADWIDGQLGD SAPQPHPTAD TTRESIRAPL AEKNATEVAE IHQRLDSIAR QIDQISRPPV RSEPGVARQL NDAISRLDAR LARITEPKVA TAAPSAVAPT QAAPPRAAAS APQSPTERVE HAAAQVYHAS PPLDPSALDR AIAEIAARQS ELDAPVERMP LRQSPPIAPA MAPPQARPGP DFSSLEQQLL KITSQIDALQ RPDVIEQSIA AFRADLADIR QTITEALPRK AIESLESEIR SLSQRLDETR ANGSDAGVIA GIERALGEIR DALRSLTPAE QLAGFDEAIR NLGGKIDMIV RNSDDPGTLQ QLENAIGALR GIVSNVASNE ALAQLSDNVH TLADKVDQLA RADSHSDSFA ALESRISALT AALENRERPI ATESTEQLEG AVRALSERLD QMPVGNDGSS AFAHLEQRVS YLLERMEAAA VQRGSGDLGR VEEGLQDILR MLERQQESFH RIADLGRAPA APPFDPGVVE SIKREISDIR LSQSQTGRHT QDSLEAVHNT LGHVVDRLAM IEGDLRKARS APQPAAAREP AQPQQPVVTP PPAAPQISLP PRPEMPNPAA ATAFAAAPRE FAPTRPTEQP EPTPGPRAIM DILAPPVSRP SAPEPQIAPQ RLAADASLPP DHPLEPGTRP PGRVASPSER IAASESAISE FAGAKPEPAS SSNFIAAARR AAQAAAAATT QSGDKPKGDG GKSGPAPGKP GSTIGSKIRS LLVGASVVVI VLGTFKMAMN LLDDGHPTPA ASLSEPAPQD LMPQSGDEEI EPPAANPATP TPAPSMTSPT PINRQSLFAP PQTPAAPPAP STAPASADVT GTIPTPQVNA AANTTVTATV AIPAGETLPD AIGGQALRKA ALKGDAAAAY EVGNRYADGK GITANFEEAA KWYGRAAQAG IVPAMFRMGT LNEKGLGVKK DLDTARRFYI QAADRGNAKA MHNLAVLDAD GGAKGANYKS AAEWFRKAAE RGVADSQFNL GILYARGIGV EQNLAESFKW FSLAAAQGDA DSARKRDDVA KRLDPQSLSA ARLAIQTFTV EPQPDSAVKV AAPAGGWDAQ ATATAKPATS KRAAR
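The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, with two basic sanity checks. The helper names are hypothetical, the stop-codon set is the one used by translation table 11, and the toy input is made up for illustration rather than taken from the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in NCBI translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop_in_frame(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input (not from the record) formatted like the blocks in the table.
toy = clean_sequence("ATGGCC TAA")
print(toy, gc_fraction(toy), ends_with_stop_in_frame(toy))
```

Applied to the full gene sequence from the record, the same helpers could be used to compare the computed GC fraction and length against the GC content and Gene Length fields.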