Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3251 |
Symbol | |
ID | 3836698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3746569 |
End bp | 3747648 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637827367 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_428333 |
Protein GI | 83594581 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCC CAACCCCCGA TTCGCTGCCC CCCGCCAGCG TGCTTGGCGA GCGCCTGCTT GACTGGTACC GGCGCAACGC CCGGACCCTG CCCTGGCGCG CGCCCTTTGG CGAGCGCACC GACCCCTATC GGGTCTGGCT ATCCGAGGTC ATGCTCCAGC AAACGACGGT GCCGGCCGTC ATCCCCTATT TCCAGGCCTT CCTCGCCCGC TGGCCCACCG TCACCGATCT TGCCGCCGCC CCCCTGGACG AGGTGCTGAC CGCCTGGGCC GGCCTTGGCT ATTACGCCCG GGCCCGCAAT CTGCACAAAT GCGCCCAGAC CATCGCCACT TGGCGCGACG GAACCTTCCC GGCGACCGAG GACGAGTTGC ACACCCTGCC CGGCATCGGC ACCTATACCG CCGCCGCCAT CGCCGCCATC GCCTTCGGCC AGCCCGCCGT GGTCATGGAT GGCAATATCG AACGGGTAAT GGCCCGACTG TTCGCCGAGA CCGAGCCTTT GCCCCAGGGC AAAAAGGCGC TCTACGCCCG CGCCGCCCAA TTGACGCCGA CCGCCCATCC CGGCGAGCAC GCCCAGGCGC TGATGGATCT GGGGGCGACG TTGTGCACCC CGCGCAAGCC GGCCTGCGGC CTGTGTCCCT GGCGCGATCC CTGCCTGGGC CGCCGGCTGG GTCTGGCCGA GACCCTGCCG GCCAAGGCGC CGAAAAAACT GAAGCCGACG CGCTGTGGCA TCGCCTTTTG GGTCACCCGA CCCGATGGCA CCGTGCTGTT GCGCCGCCGC CCGGAAAGCG GCCTGCTGGG CGGCATGATC GAAGTGCCCT CGACCCCTTG GCGGGAAGAC CCCTGGACCC TGGCCGAGGC GCGGGCCGAA GCGCCGCTGC CCGCCGAGTG GGTGCCGCTG GCCGGACGCG TTCGCCATAC CTTCACCCAT TTCCACCTCG ACCTCGATGT CGTCGCCGGC CGGGTCGGCG CCAGGGCCAA TGCCCGGGGG CTGTGGGTGC CCTTTGACCA GTTCGATCGC CATGCCTTGC CCGCCGTGAT GCTCAAAGTC GTGCGACTGG CTTTGGCGCG CACCCATTGA
|
Protein sequence | MTAPTPDSLP PASVLGERLL DWYRRNARTL PWRAPFGERT DPYRVWLSEV MLQQTTVPAV IPYFQAFLAR WPTVTDLAAA PLDEVLTAWA GLGYYARARN LHKCAQTIAT WRDGTFPATE DELHTLPGIG TYTAAAIAAI AFGQPAVVMD GNIERVMARL FAETEPLPQG KKALYARAAQ LTPTAHPGEH AQALMDLGAT LCTPRKPACG LCPWRDPCLG RRLGLAETLP AKAPKKLKPT RCGIAFWVTR PDGTVLLRRR PESGLLGGMI EVPSTPWRED PWTLAEARAE APLPAEWVPL AGRVRHTFTH FHLDLDVVAG RVGARANARG LWVPFDQFDR HALPAVMLKV VRLALARTH
|
| |