Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0266 |
Symbol | |
ID | 3834586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 330933 |
End bp | 331880 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637824346 |
Product | Phage SPO1 DNA polymerase-related protein |
Protein accession | YP_425358 |
Protein GI | 83591606 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG ACGCCCTTCC CCCGCCGCCG GCCCGACAGG CCGGCCCGCT TTCCGCTCCC GGTTCGTTCC TCGGCCAGGG GGGCGAAGAG GGGCATGGCT TTGATCTCGC CGGGCTTCTT GGCTGGTATC TTGAAGCCGG GGTGGATCTG GCGGTGGGTG ACAGCCCGCT TGACCGCTAT GCCCTGAGCG AACGCGCCCC CGTCCCCCGG GTTGCGCCGC CCGCCGCCCG CCCGCCGACG CCAGCGGCTT CGCGTCCAAC CGGGCGAACC CCGGGCGGAC CGCGGATCGA GGATGGTGGC CCCAATGACG AACCGCCGAC CCAGGCCGAA AGCCAAAGGG CCGCGTCGTT TCTCGCCGCC CAGGCGGGCT CGCTCGAGGA CCTGCGAACC GCGCTCGATT CCTTCCGCTC CTGCGCCCTG CGCCGCACCG CGACGCAAAC GGTCTTTGCC GACGGTTGTC CCCAATCGGG GATGATGATC ATCGGCGAGG CCCCCGGGGC CGACGAGGAT CGGCTGGGAC GACCCTTCGT CGGCGTCTCG GGCAAGCTGC TTGATGCCAT GCTGGGCTCG ATCGGTCTTG ATCGCGCCGA CAGTTGCTAC ATCACCAACG TCGTGCCCTG GCGCCCGCCC GGCAACCGCA AGCCCACCGC CGACGAGGTG GCGCTGTGCC TGCCCTTTCT CGAACGCCAT ATCGCCCTGG TCCGCCCCCG GCTGATCCTG GCGGTGGGCG GGCTGGCCGC CCAGGCGCTG TTTGGGCGGA GCGAGGGCAT CACCCGCCTA CGCGGCCACT GGCATGACTA TGAGGGCCCC GGGCTTGACC GGTCGGTGCC CGTCCTCGCC ACCTTCCACC CTGCCTATCT GCTGCGCACC CCGGCGCAAA AGCGCTTGGC TTGGCGCGAC CTGCTGGCGC TGCGCCAGCG GCTCGATAGC CTGCCCAGCC TGTCATAA
|
Protein sequence | MSTDALPPPP ARQAGPLSAP GSFLGQGGEE GHGFDLAGLL GWYLEAGVDL AVGDSPLDRY ALSERAPVPR VAPPAARPPT PAASRPTGRT PGGPRIEDGG PNDEPPTQAE SQRAASFLAA QAGSLEDLRT ALDSFRSCAL RRTATQTVFA DGCPQSGMMI IGEAPGADED RLGRPFVGVS GKLLDAMLGS IGLDRADSCY ITNVVPWRPP GNRKPTADEV ALCLPFLERH IALVRPRLIL AVGGLAAQAL FGRSEGITRL RGHWHDYEGP GLDRSVPVLA TFHPAYLLRT PAQKRLAWRD LLALRQRLDS LPSLS
|
| |