Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3458 |
Symbol | |
ID | 5085874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 333463 |
End bp | 335268 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640485024 |
Product | hypothetical protein |
Protein accession | YP_001169640 |
Protein GI | 146279482 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.55725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0698709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA GCTTTGGATG TCCTTGCTGC AGTGGCGCCT TCGACGGCTT CCTGAAAGGC GTGCGCCGCA CGGTGGCGGC ACAACTTCCA CCGACACTTC CGATCAACCG CCGGTCCTTC ATGTCGGGCA CCGCCGCCTT GACGGGCCTC GTCGCAGCGC TTCCGACGTG GAGCAGGGCG CAGACCGCCC CTGCGAGGAT CTTCAGCGGC GGGACGATCC TGACTGTCGA TGCGGCGTTC TCCGAGGCTG AGGCCATTGC CATCCGGGGC GACAGGATCA TCGCGGTCGG GAGCATTCAG GACGTGCGCG CCGCCGCCGG CGAAGATGCG GTTGAGGTCG ATCTCGCGGG GCGCACGATG CTGCCGGGCT TCATCGAGCC GCACACGCAT GTCGTTTCGG GGGCCGCCGT CGGCGGCATC ATGACGAACG TCGGCATATC GCGCTTTGGC ACGGCGGCGG AGATCCTCGA CCACCTGCGA TCGCTGGTGC CAGGGACGCC CCCGGGAGAG TGGATACTGG CCCGCAATTT CGACCCCGCA TTGCAGGAAG GACCGGAAGC GCTGACCTTC GCGGAACTCG ATGCCGTCTC GACAGAGGTG CCGGTCTTCG TCATGAACGC CTCGGGTCAC CTTGCCTATG CCAATCGCAA GGCCTTCGAG GCGGCTGGCA TCCCGGAGGA CATCCCCGAC CCGCCGGGCG CCGAGTTCGT CCGCGACGCC GAGGGCCGGC TGACCGGCGT CATGAAGAAC AACGTCTCCT TCCTCAAGGT TGTCTCGGCA GCACCGGCGA TGGGACGGCT CGATCCGGTG ACGGCTCTGA TCGACCTGCT GTCGGACTGG AGCCGCCTCG GTCTGACCAC GGTCAGCGAA CTGGCGCTGG GAACGCTGAC CGGCTCACCC GAGGACGCAG CCATCGTTCT TGGCGCGGCG GCCTCGGGTC GGCTCAAGGC GCGCATCCGG GCCTATCCCT TCTACACCGT CGGTTCCTCT GCCTGGGACG AGGCGGGGAT CGGGCCGGGC ACAGGCGACG CCCTCGCACG GATCACGGGC TACAAGCTCG TCGCGGACGG ATCGAACCAG GGCTTCACCG GCCTCCAGCG AGAGCCCTAT CTCGGCTCGG ACAGCCGCGG AACCGCCTAC ATGACGCCCG AGGAGATGAC CAGCATCGCC CTCGACCGCG CCGCGAAGGG TTGGCCTCTC GCCCTTCACG CGAACGGCGA CGCCGGGATC GACATGGTGC TCGATGCCTG CGAGGCCGTG CGCGATGCCG GCGTCGACAT GAGCCTCATC CGCACCCGGA TCGAGCATTG TTCCATGCTG CATGACGACC AGATCGCGCG CATGAAGGAC CTCGGCGTCT CGGCGAGCTT CCTGATCGGG CATGTCCATT TCTGGGGTGT CTGGATGCGC GACCGCGTCT TCGGCCCGGA GCGCGTCAAC AACCTTGACC GCTGCCGCAG CGTGGAGGAG GCCGGTGTCG GCTTCACGCT CCATTCCGAC TTCACGGTGA CCGAACCCGA TCCGCTTCAC ATGATCCAGA TGGCCGTGAC CCGCCGGACG TGGAAGGAAC CGGACTTTGT CCTCAATCCG GGAGAGCGCA TATCCGTCGA ATCCGCGATC CGCGCGATGA CCTCGGAGGC CGCCTGGCAG TTGCTCTCGG ACCATGAGGT CGGCAGCCTC GAAGTCGGGA AGATGGCGGA TCTCGTGATC CTGGAACAGG ACCCAAGGCG GGTCGACCCC GATACCATCA GGAACATCCG GGTGCTCGAG ACATGGATGA ACGGAGATCA GGTGTTCGTC GCCTGA
|
Protein sequence | MNESFGCPCC SGAFDGFLKG VRRTVAAQLP PTLPINRRSF MSGTAALTGL VAALPTWSRA QTAPARIFSG GTILTVDAAF SEAEAIAIRG DRIIAVGSIQ DVRAAAGEDA VEVDLAGRTM LPGFIEPHTH VVSGAAVGGI MTNVGISRFG TAAEILDHLR SLVPGTPPGE WILARNFDPA LQEGPEALTF AELDAVSTEV PVFVMNASGH LAYANRKAFE AAGIPEDIPD PPGAEFVRDA EGRLTGVMKN NVSFLKVVSA APAMGRLDPV TALIDLLSDW SRLGLTTVSE LALGTLTGSP EDAAIVLGAA ASGRLKARIR AYPFYTVGSS AWDEAGIGPG TGDALARITG YKLVADGSNQ GFTGLQREPY LGSDSRGTAY MTPEEMTSIA LDRAAKGWPL ALHANGDAGI DMVLDACEAV RDAGVDMSLI RTRIEHCSML HDDQIARMKD LGVSASFLIG HVHFWGVWMR DRVFGPERVN NLDRCRSVEE AGVGFTLHSD FTVTEPDPLH MIQMAVTRRT WKEPDFVLNP GERISVESAI RAMTSEAAWQ LLSDHEVGSL EVGKMADLVI LEQDPRRVDP DTIRNIRVLE TWMNGDQVFV A
|
| |