Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4046 |
Symbol | |
ID | 5834516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4502787 |
End bp | 4503686 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641369837 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001641487 |
Protein GI | 163853444 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.2977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.800255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC TGCCCGAAGT CGAGACCGTG CGCCGGGGGC TCGCCCCCGC GATGGTCGGG GCGCGCGTCG CCCGCGTCAC CCTGCGTAGG CCGAACCTGC GCTTCCCCTT CCCCGAGCGC TTCGCCGAGC GGCTGGAGGG CACCACGGTG CTGGAACTGG CGCGCCGGGC CAAATACCTC ACGGCGCATC TCGATTCCGG CGAGAGCCTG ATCCTGCATC TCGGCATGAG CGGGCGCTTC GACGTGCGTC TGCCCGACGG CTCGAACCTC TCGCCGGGCG ACTTCTACCT TGAAGGGGCG CTCGGCACGC CCAAGCACGA CCACGTGGTG ATGGCCTTCG CCAACGGTGC CACCGTCACC TACAACGACG CCCGCCGCTT CGGCTTCATG GATCTCGTGG CCACGCGCGA TCTCGAGACC TGCCGCCACT TCGCCAGCAT GGGCGTCGAG CCGCTCTCCG ACGCCCTCGA CGCACCCCGG CTCGCGCGCC TGTTCGCCCG GAAGATCACG CCGTTGAAGG CGGCACTGCT CGACCAGCGC CTGATCGCGG GCCTGGGCAA CATCTATGTC TGCGAGGCGC TGCACCGCTC GGGCCTCCAC CCGGCCCTGC CGGCGGGCGC GCTCGCCAAG CCCGACGGTT CGCCGGCGCC CAAGGCGAAG ACACTCGTCA AGGAGATCAA GGCGGTGCTG ACGGAGGCAG TGGCGGCCGG CGGCTCCACC TTGCGCGACT ACGCCCGGCC GGACGGGGAG CGCGGCGCCT TCCAGCACGG CTTCCGCGTC TACGACCGGG TGGGCCATGC CTGCCCGACC AAGGGCTGTA CCGGCCGAAT CGGCCGGATC GTGCAGGGTG GACGCTCGAC CTTCTTCTGC GAAACCTGCC AGGTCCTGCC GGTCCGGTAA
|
Protein sequence | MPELPEVETV RRGLAPAMVG ARVARVTLRR PNLRFPFPER FAERLEGTTV LELARRAKYL TAHLDSGESL ILHLGMSGRF DVRLPDGSNL SPGDFYLEGA LGTPKHDHVV MAFANGATVT YNDARRFGFM DLVATRDLET CRHFASMGVE PLSDALDAPR LARLFARKIT PLKAALLDQR LIAGLGNIYV CEALHRSGLH PALPAGALAK PDGSPAPKAK TLVKEIKAVL TEAVAAGGST LRDYARPDGE RGAFQHGFRV YDRVGHACPT KGCTGRIGRI VQGGRSTFFC ETCQVLPVR
|
| |