Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0117 |
Symbol | |
ID | 4895677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 131901 |
End bp | 133004 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640110700 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001042009 |
Protein GI | 126460895 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.571395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.180189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGTGACA CCGACGAAAG CCCGGACGAC GGGGACATCT CCGCCCGGCT TCTGGCCTGG TACGACCGGC ACGCGCGCGT CATGCCCTGG CGGGTGGGCC CGGCCGAGCG GCGCGCGGGG CACAGGCCCG ATCCCTACCG GGTCTGGCTG TCCGAGATCA TGCTCCAGCA GACGACGGTG GCCGCGGTGC GCGACTATTT CCGCCGCTTC ACCGACCGCT GGCCGGATGT GGAGGCGCTG GCCGCCGCGC CGGATGCCGA TGTCATGGCC GAATGGGCGG GCCTTGGCTA TTACGCGCGG GCGCGCAACC TGTTGAAGGG CGCGCGGGCG GTGGTGGCGC TGCACGGCGG CCGCTTTCCC GAGACGCGGG ACGGGCTGCT CTCGCTGCCG GGGGTCGGGC CCTATACGGC CGCGGCGGTG GCCTCGATCG CCTTCGACGA GCCCGCGACC GTGGTCGACG GCAATGTCGA GCGCGTGGTC TCGCGCCTCT TTGCGGTCGA GACGCCGCTG CCCGCGGCCA AGCCCGAGCT CACGCGGCTC GCCGCCACCC TCACCCCGCA GGAGCGGCCG GGGGATCATG CGCAGGCGAT GATGGATCTC GGCGCCACGA TCTGCACGCC GCGAAAGCCC GTCTGCAGCC TCTGCCCGCT CAGGCCCGAT TGCGAGGGCC ACCGCGCAGG CCTCGAGGCG GAGCTGCCGC GCAAGGCGCC GAAGGCGGAA AAGCCCGTGC GCGAGGGCAG GCTCTGGATC GCGGTTCGCG CCGACGGGGC GGTGCTGCTC GAGACCCGGC CCGAGCGCGG GATGCTCGGC GGCATGCTGG GCTGGCCCGG CACCGACTGG GACCGGAGCG GCGGCCCCGC GGGCGCGCCG CTCGAGGCCG ACTGGCGCGA GACGGGGGTC GAGGTGCGCC ACACCTTCAC CCACTTCCAC CTGCGGCTCG AGGTGCTGGT GGCGCAGGTG GCCGAAGGGG CGGTCCCCGC CCGCGGGAGC TTCGTGCCCC GCGCGGAGTT CCGGCCCGCG GCCCTTCCGA CCCTGATGCG CAAGGGCTGG TCCGTTGCCG CAGCGGCGAT CCGTCACCCG ACCGGAGAGG AACGACTCGC TTAG
|
Protein sequence | MRDTDESPDD GDISARLLAW YDRHARVMPW RVGPAERRAG HRPDPYRVWL SEIMLQQTTV AAVRDYFRRF TDRWPDVEAL AAAPDADVMA EWAGLGYYAR ARNLLKGARA VVALHGGRFP ETRDGLLSLP GVGPYTAAAV ASIAFDEPAT VVDGNVERVV SRLFAVETPL PAAKPELTRL AATLTPQERP GDHAQAMMDL GATICTPRKP VCSLCPLRPD CEGHRAGLEA ELPRKAPKAE KPVREGRLWI AVRADGAVLL ETRPERGMLG GMLGWPGTDW DRSGGPAGAP LEADWRETGV EVRHTFTHFH LRLEVLVAQV AEGAVPARGS FVPRAEFRPA ALPTLMRKGW SVAAAAIRHP TGEERLA
|
| |