Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0908 |
Symbol | |
ID | 4895181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 933127 |
End bp | 934584 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640111493 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001042791 |
Protein GI | 126461677 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGG TGGTCCTGCC GCGTCTGGGA ACCGTGGCGG CCTGGCGCGC CGAGGCCCGG CGGCTGGCGC AGGCGGGTGT TCCGGCGGAA AGCGTCGTCT GGCGCGTGGG CGCGGGCGAA GCCGACCTTT TTGCCGACCT GCCCGCCCTG CCCGCGGGCC CGGCGCGCCA GATCCGGCTG TCGCGCGAGG CGGTCGGCTC ATTGGAGACT GCCCTCTGCC ACGCCGATCC CGAGCGGTTC GGGCGTGCCT ACGGTCTTCT CCTGCGGCTG GCGGACGGCA CTTTGCGCTG GGGCGACCGG AGCGATCCCG CGCTGCGCAA GCTCCTTGCG CAGGAGAAGA TGGTCCGGCG CGAGATCCAC AAGATGCACG CCTTCGTCCG CTTCCGCGAG CTTCCCTCGG AAGGCTCCCG CCGGGCCTTC GCGGCCTGGT TCGAGCCGGA CCATCCGGTC GAGGAGGCGG CGACGCCCTT CTTCGCCCGC CGGTTCGGCG ACATGGACTG GGCCATCGTC ACGCCCGAGG TCACCGCGCG GTTCGTGGCG GGGCAGCTCG ATTTCGCCCC GACCGAGGAG CGCGCCGCGC CGCCGGCCGA TGGAACGGAA GAGCTGTGGC GGACCTACTA CGCCAACATC TTCAATCCGG CGCGGCTGAT GGTGAAGGCG ATGCAGTCCG AAATGCCGAA ACGCTACTGG AAGAACCTGC CCGAGGCGGA GCTGATTCCG GGCCTGATCC GGGGTGCGGC CGAACGGGCG GCCGAGATGC AGGCCGCAGC GCCGACCGAG CCGCCCGCGC GAACGGCGGC CGTGGCGCGG CAGCGCGCGG CGGCGGCGGG CGGCCCCCCT GCGGCCAGCG ACGGTTCAGC GCCCGGCTCC TTGGCTGAAG CGAAGACCGC AGCCGAGGGA TGCCGGCGCT GCGGCCTCTG GGCCAATGCC ACGCAGGTCG TGTTCGGGGA GGGACCGGCC ACGGCTCGCA TGATGGTCGT GGGCGAGCAG CCCGGGGATC GCGAGGATCT GGCCGGACGG CCCTTCGTGG GCCCCGCGGG GCAACTCTTC GACGAGGAGG CGGCAGCGGC CGGCCTCGAT CGAGGGTCGG TCTATGTCAC CAACGCGGTC AAGCACTTCA AGTTCACCCC GCGCGGCAAG CGCCGCATCC ACCAGAAGCC CGACGCGGGC GAGGTGACTG CCTGCCGCTG GTGGCTCGAT CTCGAGCGGG ATCTGGTGCG CCCCCGCCTG ATCGTCGCGA TGGGCGCAAC GGCGCTCGCC TCGCTCACCG GCTCCGGAGC GGGGATCCTC AAGCGGCGCG GGTCGCTCGA GAGGCTCGAC GACGGGACGC CCCTGTTCGT GACGGTCCAT CCCTCCTACA TCCTGCGCCT GCCGGACGAG GCCGCGCGCG CCGAGGAACG CAGGCGGTTT CGTGCAGATC TTGAGGAGGC CCGACAGCTG CTGGAGCGTC TGGACTGA
|
Protein sequence | MPEVVLPRLG TVAAWRAEAR RLAQAGVPAE SVVWRVGAGE ADLFADLPAL PAGPARQIRL SREAVGSLET ALCHADPERF GRAYGLLLRL ADGTLRWGDR SDPALRKLLA QEKMVRREIH KMHAFVRFRE LPSEGSRRAF AAWFEPDHPV EEAATPFFAR RFGDMDWAIV TPEVTARFVA GQLDFAPTEE RAAPPADGTE ELWRTYYANI FNPARLMVKA MQSEMPKRYW KNLPEAELIP GLIRGAAERA AEMQAAAPTE PPARTAAVAR QRAAAAGGPP AASDGSAPGS LAEAKTAAEG CRRCGLWANA TQVVFGEGPA TARMMVVGEQ PGDREDLAGR PFVGPAGQLF DEEAAAAGLD RGSVYVTNAV KHFKFTPRGK RRIHQKPDAG EVTACRWWLD LERDLVRPRL IVAMGATALA SLTGSGAGIL KRRGSLERLD DGTPLFVTVH PSYILRLPDE AARAEERRRF RADLEEARQL LERLD
|
| |