Gene Mpe_A0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0801 
Symbol 
ID4784485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp838538 
End bp839875 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content72% 
IMG OID640089362 
Productguanine deaminase 
Protein accessionYP_001019998 
Protein GI124265994 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAA GGCTGGCCCT GTTCGGCGAC CTGCTCGACA TCGAGGTCGA CCCCGGCTTC 
GCTGCCCCTG GCGACGCCGC CGGCGTGCGC TACCGGCCCG ACCACTGGCT GCTGGTCGAG
GACGGCCGCA TCGTCGGTGC CGAGCCGGCC CGCGCCGGCA GCGGCCCCGA CGCGAGCTGG
CAGCGTGTGG ACCACGCCGG GCGGCTGATC ACGCCGGGCT TCATCGACAC CCATGTGCAC
TGCCCGCAGC TCGACGTGAT CGCGAGCTAC GGCACCGCGT TGCTCGAGTG GCTGAACACC
TACACCTTCC CGGCCGAGCT GCGCTATGCC GATCCGCTGG TGGCGGCCAG CGGTGCCGAG
CGCTTCGTCG ATGCGCTGCT GGCGCACGGC ACCACCTCGG CGGTGGTGTT CCCGACTGTC
CACAAGGGCG CGACCGAGGC GCTGTTCACC TCGGCCCGCG CACGCGGCAT GCGGCTGGTG
GCCGGCAAGG TGCTGATGGA CCGCCACGCG CCCGACGGCC TGCGCGACGA CGTGCTGCAG
GCCGAGCGAG ATTGCGCCGA TCTGATCGCG CGCTGGCACG GCAACGGCCG CCTGTCGTAC
GCGGTGACGG TGCGCTTCGC GGCCACCAGC ACGCCGGAGC AGCTGGCGAT GGCCGGTCGG
CTGTGCCGCG AACACCCGGG CGTGTACATG CAGACCCACG TGGCCGAGAA CACCGACGAG
GTGCGCTGGA TCGCCGAGCT GTTCCCCGAG GCTCGCAGCT ACCTCGATGT CTACCACCGG
CACGGTCTGC TGCACGAGCG CGCGGTGCTG GCGCACGGCA TCTGGCTCGA CGACACCGAC
CGCGCGTTGC TGCGCGACAC CGGTGCGCAG ATCGCCTTCT GCCCGTCGAG CAACCTGTTC
CTCGGCAGTG GTTTGTTCGA CTGGCAGGCC GCGGTCGACA CCGGCTACCG CGTGTCGATG
GCCAGCGACG TGGGCGGCGG CACCAGCCTG TCGATGCTGC GCACGCTGGC CGATGCCTAC
AAGGTGCAGG CGCTGCGCGG CGTGAAGCTC AGCGCCTGGA AGGCGCTGCA TGCCGCGACG
CGCGGCGCCG CCGAGGCGCT GGGCCTGGCG CACGAGATGG GTCACCTCGG ACATGGTGCG
CTGGCTGACC TGGCGGTGTG GGACTGGGCG GTCGGCCCGG TCGCCACGCA CCGCGATGCG
GTGGCGCGCC GCGGTCGTGC CGGCGTGTCG CCGCTGACTG CGCTGCACGA GCGCGTGTTC
GCGTGGATGA CGCTAGGCGA CGAGCGCAAT CTCGTCGCGA CCTACGTGGC CGGCGCGTGC
CGCCACGAGC GCGGCTGA
 
Protein sequence
MSQRLALFGD LLDIEVDPGF AAPGDAAGVR YRPDHWLLVE DGRIVGAEPA RAGSGPDASW 
QRVDHAGRLI TPGFIDTHVH CPQLDVIASY GTALLEWLNT YTFPAELRYA DPLVAASGAE
RFVDALLAHG TTSAVVFPTV HKGATEALFT SARARGMRLV AGKVLMDRHA PDGLRDDVLQ
AERDCADLIA RWHGNGRLSY AVTVRFAATS TPEQLAMAGR LCREHPGVYM QTHVAENTDE
VRWIAELFPE ARSYLDVYHR HGLLHERAVL AHGIWLDDTD RALLRDTGAQ IAFCPSSNLF
LGSGLFDWQA AVDTGYRVSM ASDVGGGTSL SMLRTLADAY KVQALRGVKL SAWKALHAAT
RGAAEALGLA HEMGHLGHGA LADLAVWDWA VGPVATHRDA VARRGRAGVS PLTALHERVF
AWMTLGDERN LVATYVAGAC RHERG