Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2267 |
Symbol | |
ID | 3705059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2615063 |
End bp | 2616127 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637738746 |
Product | A/G-specific adenine glycosylase MutY |
Protein accession | YP_344255 |
Protein GI | 77165730 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA GCGACTTCAG CCAACGCCTG CTTACCTGGT TTGACGCCTA TGGACGCAAG GATCTGCCTT GGCAACAAAA TCCTACCCTC TACCGGGTCT GGGTCTCGGA AATTATGCTG CAGCAAACCC AGGTAGCTAC CGTCATCCCC TATTACCAGC GATTTATCGA GCGTTTCCCT AGCCTGCCAG CCCTCGCCCA CGCCTCTGTG GATGAAATTC TCGGACTCTG GGCAGGACTA GGCTACTATG CCCGGGCCCG GCGCCTGCAC CAGGCGGCCC GGATAGCCTG GGAAACCCAT GGAGGGGAAT TGCCCGCTAC TCTTGAGGCG CTCATGGAAT TACCAGGAAT TGGCCGTTCC ACCGGCGGCG CTATCTTGGC CCTGGCCCTG GGTCAGCGTT ACCCCATTCT GGATGGCAAC GTCAAGCGGG TACTGACCCG GCAGGAGGCC ATCGAGCATT GGCCGGGACA ACCAAAGGTG GAAAAGCAGC TCTGGCAGCG GGCCGCAACG CTGCTTCCTC GAACCCGCCT GGCGGACTAC ACCCAAGCCA TCATGGATCT GGGGGCTACC GTCTGCACCC GTCATCGCCC TCATTGTCCC TCCTGCCCCG TCAAGAAAAC TTGTCAAGCC CATCTCCAAG AAAACCCGGA AGCTTATCCC CGTTCCCGCC CTCGCAAGCG TTTACCCCTG CGCGCTACCT GTATGTTAAT TCTGCTTAAT GATCAGGGAG AGGTTCTATT GGAGCGCCGC CCGCCCGTAG GAATTTGGGG TGGGCTATGG AGTTTCCCCG AGTGTCCTCC TCAAACGGAG GCGGCTCTGT GGTGCCAGGA ACAGTTCGGC TGGCCTATTG GAGAAGTTCA GCATTGGCCT CCTTTACGTC ACCATTTCAC CCACTTTACC CTTGATATCC AACCAGTCAT AGCTCGAATT CGCGGCGAGG CCCGGCAAGT CATGGAACCT AACAGCCAGG TCTGGTATAA AATGGAGCCC ATGTATAAGC GCGGTCTCCC GGCCCCCACC CTCCGGCTAT TAAAGCGGCT GCGCGAACCA TCAAAAGGTG AATAA
|
Protein sequence | MNKSDFSQRL LTWFDAYGRK DLPWQQNPTL YRVWVSEIML QQTQVATVIP YYQRFIERFP SLPALAHASV DEILGLWAGL GYYARARRLH QAARIAWETH GGELPATLEA LMELPGIGRS TGGAILALAL GQRYPILDGN VKRVLTRQEA IEHWPGQPKV EKQLWQRAAT LLPRTRLADY TQAIMDLGAT VCTRHRPHCP SCPVKKTCQA HLQENPEAYP RSRPRKRLPL RATCMLILLN DQGEVLLERR PPVGIWGGLW SFPECPPQTE AALWCQEQFG WPIGEVQHWP PLRHHFTHFT LDIQPVIARI RGEARQVMEP NSQVWYKMEP MYKRGLPAPT LRLLKRLREP SKGE
|
| |