Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0801 |
Symbol | |
ID | 4784485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 838538 |
End bp | 839875 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089362 |
Product | guanine deaminase |
Protein accession | YP_001019998 |
Protein GI | 124265994 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAAA GGCTGGCCCT GTTCGGCGAC CTGCTCGACA TCGAGGTCGA CCCCGGCTTC GCTGCCCCTG GCGACGCCGC CGGCGTGCGC TACCGGCCCG ACCACTGGCT GCTGGTCGAG GACGGCCGCA TCGTCGGTGC CGAGCCGGCC CGCGCCGGCA GCGGCCCCGA CGCGAGCTGG CAGCGTGTGG ACCACGCCGG GCGGCTGATC ACGCCGGGCT TCATCGACAC CCATGTGCAC TGCCCGCAGC TCGACGTGAT CGCGAGCTAC GGCACCGCGT TGCTCGAGTG GCTGAACACC TACACCTTCC CGGCCGAGCT GCGCTATGCC GATCCGCTGG TGGCGGCCAG CGGTGCCGAG CGCTTCGTCG ATGCGCTGCT GGCGCACGGC ACCACCTCGG CGGTGGTGTT CCCGACTGTC CACAAGGGCG CGACCGAGGC GCTGTTCACC TCGGCCCGCG CACGCGGCAT GCGGCTGGTG GCCGGCAAGG TGCTGATGGA CCGCCACGCG CCCGACGGCC TGCGCGACGA CGTGCTGCAG GCCGAGCGAG ATTGCGCCGA TCTGATCGCG CGCTGGCACG GCAACGGCCG CCTGTCGTAC GCGGTGACGG TGCGCTTCGC GGCCACCAGC ACGCCGGAGC AGCTGGCGAT GGCCGGTCGG CTGTGCCGCG AACACCCGGG CGTGTACATG CAGACCCACG TGGCCGAGAA CACCGACGAG GTGCGCTGGA TCGCCGAGCT GTTCCCCGAG GCTCGCAGCT ACCTCGATGT CTACCACCGG CACGGTCTGC TGCACGAGCG CGCGGTGCTG GCGCACGGCA TCTGGCTCGA CGACACCGAC CGCGCGTTGC TGCGCGACAC CGGTGCGCAG ATCGCCTTCT GCCCGTCGAG CAACCTGTTC CTCGGCAGTG GTTTGTTCGA CTGGCAGGCC GCGGTCGACA CCGGCTACCG CGTGTCGATG GCCAGCGACG TGGGCGGCGG CACCAGCCTG TCGATGCTGC GCACGCTGGC CGATGCCTAC AAGGTGCAGG CGCTGCGCGG CGTGAAGCTC AGCGCCTGGA AGGCGCTGCA TGCCGCGACG CGCGGCGCCG CCGAGGCGCT GGGCCTGGCG CACGAGATGG GTCACCTCGG ACATGGTGCG CTGGCTGACC TGGCGGTGTG GGACTGGGCG GTCGGCCCGG TCGCCACGCA CCGCGATGCG GTGGCGCGCC GCGGTCGTGC CGGCGTGTCG CCGCTGACTG CGCTGCACGA GCGCGTGTTC GCGTGGATGA CGCTAGGCGA CGAGCGCAAT CTCGTCGCGA CCTACGTGGC CGGCGCGTGC CGCCACGAGC GCGGCTGA
|
Protein sequence | MSQRLALFGD LLDIEVDPGF AAPGDAAGVR YRPDHWLLVE DGRIVGAEPA RAGSGPDASW QRVDHAGRLI TPGFIDTHVH CPQLDVIASY GTALLEWLNT YTFPAELRYA DPLVAASGAE RFVDALLAHG TTSAVVFPTV HKGATEALFT SARARGMRLV AGKVLMDRHA PDGLRDDVLQ AERDCADLIA RWHGNGRLSY AVTVRFAATS TPEQLAMAGR LCREHPGVYM QTHVAENTDE VRWIAELFPE ARSYLDVYHR HGLLHERAVL AHGIWLDDTD RALLRDTGAQ IAFCPSSNLF LGSGLFDWQA AVDTGYRVSM ASDVGGGTSL SMLRTLADAY KVQALRGVKL SAWKALHAAT RGAAEALGLA HEMGHLGHGA LADLAVWDWA VGPVATHRDA VARRGRAGVS PLTALHERVF AWMTLGDERN LVATYVAGAC RHERG
|
| |