Gene Information |
Locus tag | Mext_4246 |
Symbol | |
ID | 5834352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4722708 |
End bp | 4723679 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641370037 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001641686 |
Protein GI | 163853643 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
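The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 972 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 4722708
END_BP = 4723679


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by a CDS of n_bp bases, excluding one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 972 323
```

Both results agree with the Gene Length and Protein Length fields in the table.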
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.558315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAAACGA TGACGGCAAA CCACGCGGAT ACCGACGCCA GAGGCGACCT GATCTCCTTC CTCGACTTCC ATGTCGCGGC AGGCGTCGAC GCGGCCCTCG ACGAGCAGCC CCACGACCGC TTCGCCGAGG CCGAGTCCGT TCGGGCTGAC GCCGCCTCGC AGGCCGAGCC GGCCACGCGC CGCGAGCCCC TTGAACCACG CCGCGAACCT CTTCCACCGC GCAGCGAGCC GCCGCGGCGT GACGCTCCCT CCGCGGAAGA GCCGCCCCTG GCGTTCCGGC GTGAGCTGCC GAGCCGCGAC GCGCCCACGC GCGAGCCCCC TCGCACCTAT GGCAACGCGG CCGGCGCCAA GCCCGGCGAG GCGGCGGACG ATGCGCGGGC ACGGGCCGCG CAGATGAAGA CTCTCGACGA ACTGGAAGAG CTGCTGCGCG GCTTCGAGGG CTGCGGCCTG CGCTTCACCG CCAAGAACCT CGTCTTCGCC GACGGCAATC CGCAGGCGCG GGTGATGTTC GTCGGCGAGG CGCCGGGGGC CGACGAGGAC CGGATCGGCA AGCCGTTCAT GGGACGCTCC GGCCAGCTTC TCGACCGGAT GATGGCGGCG ATCGGCCTCG ACCGGACCAG CGCCTACATC TCCAACGTGG TGCCCTGGCG CCCGCCGGGC AACCGCAATC CGACCCCGCA GGAGATCTCG ATCTGCCGCC CCTTCGTCGA GCGTCAGATC GAACTCGCCA ACCCCGACAT CCTCGTCTGT CTCGGTGCGC CCGCGACGCA GACGCTGACC GGCACCAAGG ACGGCATCCT CAAGGCGCGC GGACGCTTCT ACCCGTATCG CCTCGGCGAC GGCCGCGAGA TCCGGGCGCT CGCCACCCTC CATCCGGCCT ACCTGCTGCG CCAGCCGGTG CAGAAGCGCC TCGCTTGGCG CGACTTCCGG ATGCTGAAGA GTGCCCTCGA CGGGAGCGCC GCGGGCAAGT AG
Protein sequence | METMTANHAD TDARGDLISF LDFHVAAGVD AALDEQPHDR FAEAESVRAD AASQAEPATR REPLEPRREP LPPRSEPPRR DAPSAEEPPL AFRRELPSRD APTREPPRTY GNAAGAKPGE AADDARARAA QMKTLDELEE LLRGFEGCGL RFTAKNLVFA DGNPQARVMF VGEAPGADED RIGKPFMGRS GQLLDRMMAA IGLDRTSAYI SNVVPWRPPG NRNPTPQEIS ICRPFVERQI ELANPDILVC LGAPATQTLT GTKDGILKAR GRFYPYRLGD GREIRALATL HPAYLLRQPV QKRLAWRDFR MLKSALDGSA AGK
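The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute GC content, and translate a coding sequence. The codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares for elongation; table 11's alternative start codons are not handled here, which is an assumption that suffices for this gene because it begins with ATG. All names (`clean`, `gc_percent`, `translate`) are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments apart from alternative start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGAAACGA TGACGGCAAA CCACGCGGAT"))  # METMTANHAD
```

Applied to the first three blocks of the gene sequence, `translate` reproduces the first block of the protein sequence; passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.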