Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4688 |
Symbol | |
ID | 3912506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5304639 |
End bp | 5305727 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886593 |
Product | hypothetical protein |
Protein accession | YP_488282 |
Protein GI | 86751786 |
COG category | [R] General function prediction only |
COG ID | [COG5621] Predicted secreted hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTA GCCACCCGAT CTCACGCCGT GCGGTCGCCG GCGGCCTGCT CGCGCTCAGC CTCGGCGGAT CGCGCGCGCT CGCGCAAGGC TTCGCGGGGC TCGGCAGCGA GGCCGGCGAA TTCGCCCCCG TCGTTCCCGG ACGGGTGCTG AGCTTCCCGG CCGACCACGG CGCCCACCCG GATTTCCGCA TCGAATGGTG GTACCTGACG GCGAATCTGC AGGGCGCGGA CGGCAAGCCC TACGGCGTGC AGTGGACGCT GTTCCGGCAG GCGATGACGC CGGGACCGCA GCGCGAAGGC TGGGCCAATC AGCAGGTCTG GATGGCGCAT GCGGCGCTCT CCAGCGCCGA GACGCATCGC TTCGCCGAGA AATTTTCCCG CGGCGGCATC GGTCAGGCCG GCGTGTCCGC CGCGCCGTTT CGCGCCTTCA TCGACGACTG GCAGATGACC GGCGGCGACG CGATGGACGC GGCGACGCTG TCGCCGCTCG ACGTCACCGC GACCGGTGCG GATTTCGGCT ACCGGCTGCG CCTCACCGCC GAGCGGCCGC TGGTGCTGCA GGGCGACGCC GGCTATTCGC GCAAATCCGA GCGCGGGCAG GCCTCGTATT ATTACAGTCA GCCCTATTTC GCCGCGCGCG GGACGCTGAC GTTGGACGGC CGGCCGATCG AGGTCAGCGG CACCGCCTGG ATGGACCGCG AATTCTCCAG CCAGCCGCTG GCGTCGGACC AGACCGGCTG GGACTGGTTC TCGCTGCATC TCGCCTCCGG CGAGAAAGTG ATGCTGTTCC GGCTGCGCCA GAGCGACGGC AACGCCTATT TCGCCGGCAA CTGGATCGGG CTCGACGGGC GGTCCGAACA GCTCGCGCCG GACGCCATCG TGCTCGATCC GATCGGCTTC ACCGACACCG CCGGCCGCAA ACTGCCGACA TCCTGGCGTG TCCGTGTGCC GGTGCGCGGT CTTGCGATCG AGACCGCACC GCTCAACCCG AACGCCTGGA TGGGCACCAG CTTTCCCTAT TGGGAGGGGC CGATTTCGTT CAGCGGTAGC CAGAGCGGCA ACGGATATCT TGAGATGACC GGCTATTGA
|
Protein sequence | MSASHPISRR AVAGGLLALS LGGSRALAQG FAGLGSEAGE FAPVVPGRVL SFPADHGAHP DFRIEWWYLT ANLQGADGKP YGVQWTLFRQ AMTPGPQREG WANQQVWMAH AALSSAETHR FAEKFSRGGI GQAGVSAAPF RAFIDDWQMT GGDAMDAATL SPLDVTATGA DFGYRLRLTA ERPLVLQGDA GYSRKSERGQ ASYYYSQPYF AARGTLTLDG RPIEVSGTAW MDREFSSQPL ASDQTGWDWF SLHLASGEKV MLFRLRQSDG NAYFAGNWIG LDGRSEQLAP DAIVLDPIGF TDTAGRKLPT SWRVRVPVRG LAIETAPLNP NAWMGTSFPY WEGPISFSGS QSGNGYLEMT GY
|
| |