Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3272 |
Symbol | |
ID | 4023781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3627009 |
End bp | 3628136 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637963475 |
Product | rhomboid-like protein |
Protein accession | YP_570397 |
Protein GI | 91977738 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00126234 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGGTTTG CGGCCGCCGC TGCGGTGCCT AGAGTGGGTC CGTTCTCGGC GCGTAGGCCT TTTCCAGGCT CCTGCGCGAT CTTTACGGGG TGGCTGCAAA ACAGACCGGC GCGCGCGCGG ATCATCGCGG CGGACAGCAC CGGCAGGCTT GAAGGGGAGG GCTTTCTCAG AGTCCTCCCC TTTTTTGTGG GTGCCAGGGC TCGGCAGGGG ACGGCGATCA TCGGGCTCCG TTGGGCGTCG CCGGGGGCGC CGGCGGACCG GCTTGCGCCG TTCAGCCAAG CCGCCTATGG GATCGGATCG CCTCCGGGGC CGTGCGCCGC GGTCGGGCCG CTCGCAGCAG GTCGTAAAAC GTTGGATTCC TCCCCCGAAA CGCAGCCGCT GCCACCGCCC CCGCCGGAGC CGCCGCGCGA ACCGATCCTG AACCTTCCGG CCGCGCTCGC CGCCTATGTC GCGCTGCTGG CGGTGATCCA TCTGCGCGTG TTGCTGCCGC CGGACATCGA ATATTGGACC ATCGAAGTGT TCGGCTTCAT TCCGAAGCGC TATGACGCGA CCCTGCTGGC GACGCCGTTC GCCGGGGGCA GCGGCGCCAA GGTCTGGAGT TTCGTGACCT ATTCGCTGCT CCACGCCAAT CTCAGCCACA TCATCTTCAA CGTGCTGTGG CTGCTGCCGT TCGGCAGCGC AGTGGCGCGG CGGTTCGGCG CGGCGCGGTT CTTCCTGTTC ATGGCGGTCA CCGCGGTCGG CGGCGCGCTC GCCCATCTCG TCACCCACGA GCACGAGATC GCGCCGATGA TCGGCGCTTC GGCCTCGGTG TCCGGCGCGA TGGCGGCGGC GATCCGGTTT GCGTTTGCGC GCGGCAGTTT CCTGTCGCTG CGCAGCGGCG ACGCCGATGC GGCGGCGCGG GTGCCGGCGC AGCCCTTGAT CCGCGCGCTG CGCGATCCGC GCGTGCTCGC CTTCCTCGCG ATCTGGTTCG GCATCAACAT CATCTTCGGC GTCGGCTCGA TCGCGGTCGG CAACGAAGGC GCGAGCGTCG CCTGGCAGGC GCATATCGGC GGCTTCTTCG CGGGCCTGCT GCTGTTCTCG TTGTTCGACC CGGTGCCGCG ATCGGCGCAG ACCTCCGCTC ACAACTAA
|
Protein sequence | MRFAAAAAVP RVGPFSARRP FPGSCAIFTG WLQNRPARAR IIAADSTGRL EGEGFLRVLP FFVGARARQG TAIIGLRWAS PGAPADRLAP FSQAAYGIGS PPGPCAAVGP LAAGRKTLDS SPETQPLPPP PPEPPREPIL NLPAALAAYV ALLAVIHLRV LLPPDIEYWT IEVFGFIPKR YDATLLATPF AGGSGAKVWS FVTYSLLHAN LSHIIFNVLW LLPFGSAVAR RFGAARFFLF MAVTAVGGAL AHLVTHEHEI APMIGASASV SGAMAAAIRF AFARGSFLSL RSGDADAAAR VPAQPLIRAL RDPRVLAFLA IWFGINIIFG VGSIAVGNEG ASVAWQAHIG GFFAGLLLFS LFDPVPRSAQ TSAHN
|
| |