Gene Information |
Locus tag | Msil_1600 |
Symbol | |
ID | 7090957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 1724866 |
End bp | 1725945 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643464926 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_002361911 |
Protein GI | 217977764 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
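The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions, though they agree with the listed values. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1724866, 1725945)
print(length, protein_length(length))  # prints: 1080 359
```

Both results match the Gene Length (1080 bp) and Protein Length (359 aa) fields.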
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.05241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTGACG CCGCAATCGC AGACGCCGCA ATCCCAGACG CAGTTCTTGC CTGGTACGAC CGTCATCGCC GCGTTCTGCC CTGGCGCGCG CCGCCCGGCG CGGCGGCCGA CCCTTACGCC GTCTGGCTCT CGGAAATCAT GCTGCAGCAG ACGACGGTCG CGGCGGTCAA ATCCTATTTC TCCGCGTTTC TGGCGCGCTG GCCCAACGTC GACGCCCTCG CGCGGGCGCC GGCCGAGGAG GTGATGCGGC AGTGGGCCGG GCTTGGCTAT TATTCGCGGG CGCGCAATTT GCACGCCTGC GCCAAGACCG TGTCCGCAAA ATTTGGCGGA CAATTTCCGG ACGAGGAGGC GGCCCTGCGC GCCCTGCCAG GTCTTGGCCC TTATACCGCC GCGGCGGTCG CCGCGATCGC CTTTTGCCGC AAGGCCGCCG TCGTCGACGG CAATGTCGAG CGCGTCCTGT CGCGCCTCTA CGCGATCGAG GCGCCACCGC CGGCGGGAAA ACGCCTGATC TACGCACGGG CCGAAGCGCT GACGCCGGCG GAGCGTCCCG GCGATTATGC GCAGGCGATG ATGGATCTTG GCGCGACGAT CTGCACGCCG AAAAGCCCCG CCTGCGCGAT CTGCCCCCTG AACGGAGCCT GCGCCGCGTT CAGGATCGGC GATCCAGCGC GTTTTCCGGT GAAGGCCGCG AAACCGGAGC GGCCGCTCCG GCGAGGCGCC GCCTTTTATG TGGCGCGGCC CGACGGCGCG GTCCTGGTGC GAACGCGCCC GCCGAAAGGG CTGCTCGGCG GCATGACGGA GATCCCGGGC TCACCCTGGA CCGAGGATTT CGACGAGGCC GGCGCGCCGC GCCATGCCCC GGTCGAGGCG CGCTATCGCC GGCTGGCGCG CCCGGTCGAG CACAGTTTCA CGCATTTTGC CTTGCAGCTT TCGGTGTATG TGGGGGAGGC TGGGGCAAAC ATGCCGGCGC CCGACGGTTG CCGCTGGGCG GCGGCCGATC TTGAGAATGA GGCGCTGCCG ACTCTCATGC GCAAACTCGT CAGCGCGGCG AGGCGGCGGG AATTTGGGGG AGATCTGTGA
|
Protein sequence | MCDAAIADAA IPDAVLAWYD RHRRVLPWRA PPGAAADPYA VWLSEIMLQQ TTVAAVKSYF SAFLARWPNV DALARAPAEE VMRQWAGLGY YSRARNLHAC AKTVSAKFGG QFPDEEAALR ALPGLGPYTA AAVAAIAFCR KAAVVDGNVE RVLSRLYAIE APPPAGKRLI YARAEALTPA ERPGDYAQAM MDLGATICTP KSPACAICPL NGACAAFRIG DPARFPVKAA KPERPLRRGA AFYVARPDGA VLVRTRPPKG LLGGMTEIPG SPWTEDFDEA GAPRHAPVEA RYRRLARPVE HSFTHFALQL SVYVGEAGAN MPAPDGCRWA AADLENEALP TLMRKLVSAA RRREFGGDL
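The gene sequence begins with ATG and ends with TGA, so it appears to be listed in coding orientation even though the gene lies on the minus strand. Under that assumption, the protein sequence can be reproduced from the gene sequence with translation table 11, and the GC content can be recomputed. The following is a minimal sketch using only the Python standard library; the function names are hypothetical, and the codon table is built from the standard amino acid assignments, which table 11 shares with table 1 (the two differ only in permitted start codons).

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons and drop any trailing stop codon.

    Alternative start codons (allowed by table 11) are not special-cased;
    this record starts with ATG, so that does not matter here.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First three blocks and final twelve bases of the gene sequence above.
print(translate("ATGTGTGACG CCGCAATCGC AGACGCCGCA"))  # prints: MCDAAIADAA
print(translate("GGAGATCTGTGA"))                      # prints: GDL
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.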