Gene Information |
Locus tag | Avin_04530 |
Symbol | mutY |
ID | 7759411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 428702 |
End bp | 429790 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643803375 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_002797683 |
Protein GI | 226942610 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCCG AGCAATTCGG CAGCGCCGTG CTCGCCTGGT ACGACGACCA CGGCCGCAAG GACCTGCCCT GGCAGCGGGA CATCACCCCC TACCGGGTAT GGGTCTCGGA AATCATGCTG CAGCAGACCC AGGTCGCCAC CGTGCTCGGT TACTACGAGC GTTTCATGGC CGCCCTGCCG ACGGTGCAGA CCCTGGCCGC GGCACCCGAG GACGAGGTGC TGCACCTGTG GACCGGACTC GGCTACTACA GCCGCGCGCG CAACCTGCAC AAGACCGCGA AGATCCTGGT CGCCGAGCAT GCCGGCGAAT TCCCCCGCTC GGTGGAGGCC CTCGCCGAAC TGCCCGGAAT CGGCCGCTCC ACCGCCGGCG CCATCGCCAG CATCGGCATG GGGCTGCGCG CGCCGATCCT CGACGGCAAC GTCAAGCGCG TGCTGGCCCG CTACCTGGCC GAGGACGGCC ATCCCGGCGA GCCCAGGGCG GCGAAGCGCC TGTGGGAAGC CGCCGAACGC TTCACCCCCG AGGCACGGGT CAACCACTAC ACCCAGGCGA TGATGGACCT CGGCGCCACC CTCTGCACCC GTACGCGCCC GAGCTGCCTG CTCTGCCCGC TGGCGAGCGG CTGCCGTGCC CACCTGCTCG GCCGCGAGAC CGACTACCCG ACGCCCAGGC CGCGCCGGGA ACTGCCGCGC AAGCGCACCC TGATGCCGCT GCTGGCCAGC CGCGACGGCG CCATCCTGCT CTACCGGCGC CCGTCCAGCG GGCTCTGGGG CGGGCTCTGG AGCCTGCCGG AACTGGACGA CCTGGCCGCC CTCGAAACCC TCGCCGCCCG CCATGCCCTG CGTCTCGGCG AGCGCCGCGC GCTGCCGGGC CTGACCCACA CCTTCAGCCA TTTCCAGTTG GCCATCGAAC CCTGGCTGGT CACGGTGGAA AGCGCCGGCC CCGCCGTGGC CGAGGCCGAC TGGCTCTGGT ATAACCTCGC CGCGCCGCCG CGCCTGGGCC TCGCCGCCCC GGTGAAGAAG CTGCTCAAAC GCGCCCAGGG CGAACTGCAG CCCCCGGCGG ACCGGCCGAT TTCGAGGAGA AGCCCATGA
|
Protein sequence | MTPEQFGSAV LAWYDDHGRK DLPWQRDITP YRVWVSEIML QQTQVATVLG YYERFMAALP TVQTLAAAPE DEVLHLWTGL GYYSRARNLH KTAKILVAEH AGEFPRSVEA LAELPGIGRS TAGAIASIGM GLRAPILDGN VKRVLARYLA EDGHPGEPRA AKRLWEAAER FTPEARVNHY TQAMMDLGAT LCTRTRPSCL LCPLASGCRA HLLGRETDYP TPRPRRELPR KRTLMPLLAS RDGAILLYRR PSSGLWGGLW SLPELDDLAA LETLAARHAL RLGERRALPG LTHTFSHFQL AIEPWLVTVE SAGPAVAEAD WLWYNLAAPP RLGLAAPVKK LLKRAQGELQ PPADRPISRR SP
|
| |
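The length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the record above (Avin_04530, mutY).
    length = gene_length(428702, 429790)
    assert length == 1089                           # matches "Gene Length"
    assert expected_protein_length(length) == 362   # matches "Protein Length"
    # Pass the full "Gene sequence" field to gc_fraction to compare
    # against the reported "GC content" value.
    print(length, expected_protein_length(length))
```

Because `gc_fraction` strips whitespace, the sequence can be pasted in exactly as shown in the record, with its 10-base blocks separated by spaces.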