Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0028 |
Symbol | mutY |
ID | 5711650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 31815 |
End bp | 32909 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641265922 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001531378 |
Protein GI | 159042584 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.598212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0354214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAA AAATCGCGAG CGACGCGGCG GCGGCGGCGC TGAGCAGCAC CCTGCTCGAC TGGTACGATG CCCATGCCCG CGTGATGCCC TGGCGCGTGG GGCCTGCCGA GCGCGCCGCG GGCACGCGCC CGGACCCCTA TCGCGTCTGG CTCTCCGAGG TGATGCTGCA ACAGACCACC GTGGCCGCCG TGCGCGACTA CTTCCGCGCC TTCACCGACC GCTGGCCCCG GGTCACCGAC CTGGCGGCGG CCGCGGATGC CGATGTCATG GCCGCCTGGG CGGGGCTCGG GTACTACGCC CGCGCGCGCA ACCTGCTGAA ATGCGCGCGG GTGGTGACGC AGGACCATGG CGGGCGCTTT CCCGACACCG CCGAAGGCTT GCGCGCCCTG CCCGGCATCG GCCCCTATAC CTCCGCCGCC ATCGCCGCGA TTGCCTTCGA CCGGCCCGAA ACGGTGGTCG ACGGCAATGT CGAACGGGTC ATGGCCCGGC TGCGCGGGAT CGAAACGCCC CTGCCGCCCG CCAAGCCGGA ACTGACCGAG GCCGCCGCGG CCCTCACGCC GGACAAACGC CCCGGCGACT ACGCCCAGGC GGTGATGGAT CTCGGCGCCA CGATCTGCAC CCCGCGCAAC CCCGCCTGCG GCATCTGCCC GTGGCGGGAC CCTTGTGTCG CCCGCGCCAC CGGCATCGCC GCCGAACTGC CCCGCAAACT GCCGAAGAAG CCCAAACCGA CCCGTTTCGG GCTGGTCTAT GTCGCGCGCC ACCCGGACGG TGCCTGGCTG CTGGAAACCC GTCCCGACCG GGGCCTTCTG GGCGGCATGC TCGCCTATCC CTCCACCGAC TGGACCGAGG AGGCGCCCGC CGACGCCCCC CCGGTCGCCG CGGACTGGCA CGACCCGGCC CTCGAAGTGC GCCACACCTT CACCCATTTC CACCTGCGCC TGGCCCTGCG CACCGCGATC ACGGACGCCC CGCCCGCAAG GGGCCGCTTC GTGCCGCGCG CCGCCTTTCG CCCCGCCGAT CTGCCCACGG TGTTCCGCAA GGCCCATGAC CTCGCGGCGG CACATCTTGA CACAGGCTTC CCTTCCCCGC TCTGA
|
Protein sequence | MHKKIASDAA AAALSSTLLD WYDAHARVMP WRVGPAERAA GTRPDPYRVW LSEVMLQQTT VAAVRDYFRA FTDRWPRVTD LAAAADADVM AAWAGLGYYA RARNLLKCAR VVTQDHGGRF PDTAEGLRAL PGIGPYTSAA IAAIAFDRPE TVVDGNVERV MARLRGIETP LPPAKPELTE AAAALTPDKR PGDYAQAVMD LGATICTPRN PACGICPWRD PCVARATGIA AELPRKLPKK PKPTRFGLVY VARHPDGAWL LETRPDRGLL GGMLAYPSTD WTEEAPADAP PVAADWHDPA LEVRHTFTHF HLRLALRTAI TDAPPARGRF VPRAAFRPAD LPTVFRKAHD LAAAHLDTGF PSPL
|
| |