Gene Mpe_A0782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0782 
Symbol 
ID4784170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp817660 
End bp819042 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content72% 
IMG OID640089343 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_001019979 
Protein GI124265975 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.026744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACGC TGCTGATCCA CAACGCCCGC CTCGTCGTCA CGATGGACGA GCAGCGCCGC 
GAGATCGCCG ACGGCAGCGT GTTCATCCGC GACCACGTGA TCGAGGCGGT CGGTCCGGCG
GCCGAACTGC CAGGCACCGC CGACGAGGTG ATCGATGCCC GTGACCACGT CGTCCTGCCC
GGGCTGATCA ACACCCACCA CCACATGACC CAGTCCTTGA CGCGCGTGAT CGCGCAGGAC
TGCGAGCTGT TCGACTGGCT GGGGACGCTC TACCCGATCT GGGCCGGGCT CACGCCCGAG
ATGGTGCGGG TGTCGACGCA GACCGCGATG GCCGAGCTGC TGCTGGCGGG CTGCACGACC
AGCAGCGACC ATCTCTATCT CTACCCCAAC GGGGTGATGC TCGACGACAG CATCGAGGCG
GCCACGGAGA TCGGCATGCG CTTCCACGCG GCGCGCGGCT CGATGAGCGT GGGCCAGAGC
CAGGGCGGCC TGCCACCCGA CCGCGTGGTG GAGGCCGAGC CGGCGATCCT GAAGGAGACG
CAGCGGCTGA TCGAGCGCTG GCACGACCCG GCGCGCTTCG CGATGCGGCG CATCGTCGTC
GCGCCGTGCT CGCCGTTCTC GGTGAGCCGC ACGCTGATGC GCGAGTCGGC GGCGCTGGCA
CGCAGCTTCG GCGCGGACCA CCGCGTCTCG CTGCACACCC ACCTGGCCGA GAACGACAAG
GACATCGACT ACTCGCGCGA GAAGTTCGGC ATGACGCCGG CCGAGTACGC CGAGGACCTG
GGCTGGGTCG GCCGCGACGT GTGGCATGCG CACTGCGTCA AACTCGACGC GCCCGGCATC
GGCCTGTTCG CGCGCACCGG CACCGGCGTG GCGCATTGCC CCTGCTCGAA CATGCGGCTG
GCCTCCGGCA TCGCACCGGT GCGCGCGATG CGCGACGCCG GCGTGCCGGT GGGCCTGGGC
GTGGATGGCT CGGCCTCGAA CGACGGCGGC CACCTGCTGG CCGAGGCGCG CATGGCGATG
CTGCTGCAGC GCGTGGCGCA CGGCCCCGAG CGCGGGCCAT CGGCGATGGG CGCGCGCGAG
GCTCTCGAGC TGGCCACGCG CGGCGGCGCC GCGGTGCTGA ACCGCGACGA CATCGGCGTG
CTCGCACCCG GCATGGCGGC CGACCTGGCG ATCTTCGGGC TCGACGACGT GGGCCTGGCC
GGCGCGCTGC ACGACCCGCT GGCCGCGTTG CTGTTCTGCC AGCCGCCGCG CGCTCGCCAC
ACCCTCGTGC ACGGCCGCGT GGTGGTGCGC GACTGCGAGC TGACCACGCT GGAACTGCCG
GCCCTGGTGC GGCGGCACAA CCGGCTGGCG CGGCAACTCG TCGATGGAGC CGGCCGCGCC
TGA
 
Protein sequence
MPTLLIHNAR LVVTMDEQRR EIADGSVFIR DHVIEAVGPA AELPGTADEV IDARDHVVLP 
GLINTHHHMT QSLTRVIAQD CELFDWLGTL YPIWAGLTPE MVRVSTQTAM AELLLAGCTT
SSDHLYLYPN GVMLDDSIEA ATEIGMRFHA ARGSMSVGQS QGGLPPDRVV EAEPAILKET
QRLIERWHDP ARFAMRRIVV APCSPFSVSR TLMRESAALA RSFGADHRVS LHTHLAENDK
DIDYSREKFG MTPAEYAEDL GWVGRDVWHA HCVKLDAPGI GLFARTGTGV AHCPCSNMRL
ASGIAPVRAM RDAGVPVGLG VDGSASNDGG HLLAEARMAM LLQRVAHGPE RGPSAMGARE
ALELATRGGA AVLNRDDIGV LAPGMAADLA IFGLDDVGLA GALHDPLAAL LFCQPPRARH
TLVHGRVVVR DCELTTLELP ALVRRHNRLA RQLVDGAGRA