Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4416 |
Symbol | |
ID | 7117730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4673226 |
End bp | 4674125 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643527115 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_002423120 |
Protein GI | 218532304 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAC TGCCCGAAGT CGAGACCGTG CGCCGGGGGC TCGCCCCCGC GATGGTCGGG GCGCGCGTCG CCCGCGTCAC CCTGCGTAGG CCGAACCTGC GCTTCCCCTT CCCCGAGCGC TTCGCCGAGC GGCTGGAGGG CACCACGGTG CTGGAGCTGG CGCGCCGGGC CAAATACCTC ACGGCGCATC TCGATTCCGG CGAGAGCCTG ATCCTGCATC TCGGCATGAG CGGGCGCTTC GATGTACGGC TGCCCGACGG CTCGAACCTC TCGCCGGGCG ACTTCTACCT CGAGGGGGCG CTCGGCACGC CCAAGCACGA CCACGTGGTG ATGGCCTTCG CCAACGGTGC CACCGTCACC TACAACGACG CCCGCCGCTT CGGCTTCATG GATCTCGTGG CCACGCGCGA TCTCGAGACC TGCCGCCACT TCGCCAGCAT GGGCGTCGAG CCGCTCTCCG ACGCCCTCGA CGCGCCCCTG CTCGCGCGCC TGTTCGCCCG GAAGATCACG CCGCTGAAGG CGGCACTGCT CGACCAGCGC CTGATCGCGG GCCTGGGCAA CATCTATGTC TGCGAGGCGC TGCACCGCTC GGGCCTCCAC CCGGCCCTGC CGGCGGGCGC GCTCGCCAAG CCCGACGGTT CGCCGGCGCC CAAGGCGAAG ACACTCGTCA AGGAGATCAA GGCGGTGCTG ACGGAGGCAG TGGCGGCCGG CGGCTCCACC TTGCGCGACT ACGCCCGGCC GGACGGGGAG CGCGGCGCCT TCCAGCACGG CTTCCGCGTC TACGACCGGG TGGGCCATGC CTGCCCGACC AAGGGCTGTA CCGGCCGGGT CGGCCGGATC GTGCAGGGTG GACGCTCGAC CTTCTTCTGC GAAACCTGCC AGGTCCTGCC GGTCCGGTAA
|
Protein sequence | MPELPEVETV RRGLAPAMVG ARVARVTLRR PNLRFPFPER FAERLEGTTV LELARRAKYL TAHLDSGESL ILHLGMSGRF DVRLPDGSNL SPGDFYLEGA LGTPKHDHVV MAFANGATVT YNDARRFGFM DLVATRDLET CRHFASMGVE PLSDALDAPL LARLFARKIT PLKAALLDQR LIAGLGNIYV CEALHRSGLH PALPAGALAK PDGSPAPKAK TLVKEIKAVL TEAVAAGGST LRDYARPDGE RGAFQHGFRV YDRVGHACPT KGCTGRVGRI VQGGRSTFFC ETCQVLPVR
|
| |