Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3033 |
Symbol | |
ID | 7092710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3347320 |
End bp | 3348198 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643466343 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_002363305 |
Protein GI | 217979158 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAT TGCCCGAAGT CGAAACCGTG CGGCGCGGAC TTGAGCCTGT GATGGTCGGA GCGCGCATTC TGAGCGTCGA TCAGCGGCGG CCCGATCTCC GCTTTCCCTT TCCAGACCGC TTTCCCGAGC GCCTCGCCGG CCGGCGCATT CTGGCGCTCG GCCGCAGGGC AAAATATCTT CTGGCCGATC TCGATGACGG CGATGTCCTG ATCATGCATC TTGGCATGTC GGGCTCCTTT CGCGTTGAGC AAGCCGGCCC CGCCAAAACC CTCTCGCCCC GCGGGGCGCC GCCAAAAAAC GCCGCGCATG ACCACGTCGT CTTCACCCTC ACCAGCGGAG GGCGAATTGT CTACAATGAT CCGCGCCGAT TTGGCTTCAT GCAGATCGCA GCGCGCGCCG ATCTTGCGGC GCATCCGCTG TTCCGATCGC TTGGCGTCGA ACCTCTTGGC AATGAATTGA GCGGCGCCGC GCTCGCCCGG CTATTCGCCG GGAAAACAAC GTCGCTAAAA GCCGCGCTGC TCGACCAAAG CCTCGTCGCC GGGCTGGGCA ATATTTATGT CTGCGAGGCC CTGCATCGCG CCGGCCTATC GCCGCTGCGG CAGGCGGGCA GCCTCACCAA GAAATCGGGG CGGCCAACGG AGCGGGCGAA CCGGCTCGCC GACACGATCC GCGAGGTGCT TGAAGAGGCG GTCGCGGCCG GCGGCTCCTC GCTGCGCGAT CATCGCCAGA CCAATGGCGC TCTGGGTTAT TTTCAGCACA ACTTTCGGGT CTACGACCGC GCGCTGCATC CTTGTCCGAC GCCCGGCTGC AAAGGCGAAA TCTCGCGAAT CACGCAAGGT GGCCGGTCGA GTTTTTTCTG CAGCATGTGT CAAAAATAA
|
Protein sequence | MPELPEVETV RRGLEPVMVG ARILSVDQRR PDLRFPFPDR FPERLAGRRI LALGRRAKYL LADLDDGDVL IMHLGMSGSF RVEQAGPAKT LSPRGAPPKN AAHDHVVFTL TSGGRIVYND PRRFGFMQIA ARADLAAHPL FRSLGVEPLG NELSGAALAR LFAGKTTSLK AALLDQSLVA GLGNIYVCEA LHRAGLSPLR QAGSLTKKSG RPTERANRLA DTIREVLEEA VAAGGSSLRD HRQTNGALGY FQHNFRVYDR ALHPCPTPGC KGEISRITQG GRSSFFCSMC QK
|
| |