Gene Information |
Locus tag | Sare_2059 |
Symbol | |
ID | 5704731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2357678 |
End bp | 2358832 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271546 |
Product | purine phosphorylase family 1 |
Protein accession | YP_001536917 |
Protein GI | 159037664 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase |
TIGRFAM ID | |
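
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and makes two assumptions: the start and end positions are inclusive, and the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2357678
end_bp = 2358832

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1155
print(protein_length_aa)  # 384
```

Both values agree with the Gene Length and Protein Length rows, which suggests the record is internally consistent under these assumptions.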
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00192453 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCACGA ACAGCGGTCT CGTCGTGATC CTCACGGCGC TGGACCTCGA ATACGCAGCG GTCCGCGACC AACTGACCGA CCTACGTGTA CGCCGGCACC CCGCCGGCAC CCGCTTCGAG GTCGGCCGGA TCGGCCAAAG CGACTGCCGG GTCGCCCTGG GGCTGGTCGG TAAGGGCAAT CATCCGGCCG CTGTGCTCGC CGAACGGGCG ATGGCCGAGT TCTCCCCGGC CGCTGTGCTG TTCGTGGGAG TCGCCGGTGG CCTTTGGCCC AATATCCGAC TCGGTGACGT CGTCGTCGCC AGCAAAATCT ACGCCTACCA CGGCGGCACC AGCGAGGACG ACGGTTTGAA GGCCCGACCG AAGGCGTGGG AGATCCCCCA CGAGGCCGAC CAGATCGCCC ACCACGTCGA CAGGTCCGCC GCGTGGCGCC GTGGTCTTTC CGTAGGCGCA GCGCCCAAAG TCCACTTCGG GCCGATTGCG GCAGGAGAAG TGGTGCAGGA CTCGGGAATC TCGGAACAGG CCCGCTGGAT CCGCCAGCAC TACAACGACG CGGTAGCGAT CGAGATGGAA GCGGCCGGTG TGGCCCAGGC GGGCCATCTC AACCGTGCCC TGCCCGTGGT GGTCGTGCGC GGCATCAGCG ACCACGCCGA CGGCAGCAAG GCAGCCACGG ACGGACAGGA CTGGCAACCG AAGGCCGCAC GCCACGCCGC CGCGTTCGCC ACCGCACTGG CGCGAGAACT GACTATCGAC GGGCCGGCCA GCCGAGGCGG CGCGGACCGG GACGGGAGCC CCACGATGCC GACGACCAAC CACAACATCG CCACCGGGAA CGCTTACGTC GGGGTGCAAG CCGGGCAAAT CTACGGCAAC GTCACCGTCG GCGTTGGCGC CGATCAGCCG ATCGACTTGG CAGCGAGCAT CGCGGACCTG CGAACCCACC TCAAGCAGGC CCACCTCGAC GGACAGTTGG ACGAAGAGAC CTACGCCGCC GCAGAGGCGG AACTGGAAGC GGCAACGGTA TGCGTCTCGG CGGGCACACC CGAGAAGAAG AGCGGCCTGA TGGTCGCGCT CAAGCGGTTG CGGGGCCTGG TAGCCGACGT GTCTGAGCTG GCCGCGCGGC TCGCGGCGAT CATCGCCGTG GTGCGGGACC TGTGA
|
Protein sequence | MSTNSGLVVI LTALDLEYAA VRDQLTDLRV RRHPAGTRFE VGRIGQSDCR VALGLVGKGN HPAAVLAERA MAEFSPAAVL FVGVAGGLWP NIRLGDVVVA SKIYAYHGGT SEDDGLKARP KAWEIPHEAD QIAHHVDRSA AWRRGLSVGA APKVHFGPIA AGEVVQDSGI SEQARWIRQH YNDAVAIEME AAGVAQAGHL NRALPVVVVR GISDHADGSK AATDGQDWQP KAARHAAAFA TALARELTID GPASRGGADR DGSPTMPTTN HNIATGNAYV GVQAGQIYGN VTVGVGADQP IDLAASIADL RTHLKQAHLD GQLDEETYAA AEAELEAATV CVSAGTPEKK SGLMVALKRL RGLVADVSEL AARLAAIIAV VRDL
|
| |
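
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names `join_blocks`, `gc_fraction`, and `codons` are hypothetical, and only the first and last few blocks of the gene sequence are used as input. Although the record lists the strand as "-", the gene sequence as shown begins with GTG and ends with TGA, so it appears to be given in coding orientation already. Under translation table 11, GTG is an accepted alternative start codon that is read as methionine at initiation, which is consistent with the protein starting with M.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def codons(dna: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# First two blocks and last two blocks of the gene sequence in the record.
head = join_blocks("GTGAGCACGA ACAGCGGTCT")
tail = join_blocks("GTGCGGGACC TGTGA")

print(codons(head)[0])    # GTG, the start codon
print(codons(tail)[-1])   # TGA, the stop codon
print(gc_fraction(head))  # GC fraction of the first 20 bases only
```

Applying `gc_fraction` to the full joined gene sequence would give a value to compare against the GC content row of the record.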