Gene Information |
Locus tag | Mpe_A0782 |
Symbol | |
ID | 4784170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 817660 |
End bp | 819042 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089343 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_001019979 |
Protein GI | 124265975 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
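The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 817660
end_bp = 819042
gene_length_bp = 1383
protein_length_aa = 460


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the three fields agree: 819042 - 817660 + 1 = 1383 bp, and 1383 / 3 - 1 = 460 aa.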
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.026744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCACGC TGCTGATCCA CAACGCCCGC CTCGTCGTCA CGATGGACGA GCAGCGCCGC GAGATCGCCG ACGGCAGCGT GTTCATCCGC GACCACGTGA TCGAGGCGGT CGGTCCGGCG GCCGAACTGC CAGGCACCGC CGACGAGGTG ATCGATGCCC GTGACCACGT CGTCCTGCCC GGGCTGATCA ACACCCACCA CCACATGACC CAGTCCTTGA CGCGCGTGAT CGCGCAGGAC TGCGAGCTGT TCGACTGGCT GGGGACGCTC TACCCGATCT GGGCCGGGCT CACGCCCGAG ATGGTGCGGG TGTCGACGCA GACCGCGATG GCCGAGCTGC TGCTGGCGGG CTGCACGACC AGCAGCGACC ATCTCTATCT CTACCCCAAC GGGGTGATGC TCGACGACAG CATCGAGGCG GCCACGGAGA TCGGCATGCG CTTCCACGCG GCGCGCGGCT CGATGAGCGT GGGCCAGAGC CAGGGCGGCC TGCCACCCGA CCGCGTGGTG GAGGCCGAGC CGGCGATCCT GAAGGAGACG CAGCGGCTGA TCGAGCGCTG GCACGACCCG GCGCGCTTCG CGATGCGGCG CATCGTCGTC GCGCCGTGCT CGCCGTTCTC GGTGAGCCGC ACGCTGATGC GCGAGTCGGC GGCGCTGGCA CGCAGCTTCG GCGCGGACCA CCGCGTCTCG CTGCACACCC ACCTGGCCGA GAACGACAAG GACATCGACT ACTCGCGCGA GAAGTTCGGC ATGACGCCGG CCGAGTACGC CGAGGACCTG GGCTGGGTCG GCCGCGACGT GTGGCATGCG CACTGCGTCA AACTCGACGC GCCCGGCATC GGCCTGTTCG CGCGCACCGG CACCGGCGTG GCGCATTGCC CCTGCTCGAA CATGCGGCTG GCCTCCGGCA TCGCACCGGT GCGCGCGATG CGCGACGCCG GCGTGCCGGT GGGCCTGGGC GTGGATGGCT CGGCCTCGAA CGACGGCGGC CACCTGCTGG CCGAGGCGCG CATGGCGATG CTGCTGCAGC GCGTGGCGCA CGGCCCCGAG CGCGGGCCAT CGGCGATGGG CGCGCGCGAG GCTCTCGAGC TGGCCACGCG CGGCGGCGCC GCGGTGCTGA ACCGCGACGA CATCGGCGTG CTCGCACCCG GCATGGCGGC CGACCTGGCG ATCTTCGGGC TCGACGACGT GGGCCTGGCC GGCGCGCTGC ACGACCCGCT GGCCGCGTTG CTGTTCTGCC AGCCGCCGCG CGCTCGCCAC ACCCTCGTGC ACGGCCGCGT GGTGGTGCGC GACTGCGAGC TGACCACGCT GGAACTGCCG GCCCTGGTGC GGCGGCACAA CCGGCTGGCG CGGCAACTCG TCGATGGAGC CGGCCGCGCC TGA
Protein sequence | MPTLLIHNAR LVVTMDEQRR EIADGSVFIR DHVIEAVGPA AELPGTADEV IDARDHVVLP GLINTHHHMT QSLTRVIAQD CELFDWLGTL YPIWAGLTPE MVRVSTQTAM AELLLAGCTT SSDHLYLYPN GVMLDDSIEA ATEIGMRFHA ARGSMSVGQS QGGLPPDRVV EAEPAILKET QRLIERWHDP ARFAMRRIVV APCSPFSVSR TLMRESAALA RSFGADHRVS LHTHLAENDK DIDYSREKFG MTPAEYAEDL GWVGRDVWHA HCVKLDAPGI GLFARTGTGV AHCPCSNMRL ASGIAPVRAM RDAGVPVGLG VDGSASNDGG HLLAEARMAM LLQRVAHGPE RGPSAMGARE ALELATRGGA AVLNRDDIGV LAPGMAADLA IFGLDDVGLA GALHDPLAAL LFCQPPRARH TLVHGRVVVR DCELTTLELP ALVRRHNRLA RQLVDGAGRA
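The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be removed before the sequence is measured or analysed. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, copied from the record.
prefix = "ATGCCCACGC TGCTGATCCA"

print(clean_sequence(prefix))       # blocks joined into one 20-base string
print(len(clean_sequence(prefix)))  # 20
print(gc_fraction(prefix))          # 12 of 20 bases are G or C
```

Passing the full gene sequence to the same functions gives its total length and GC fraction, which can then be compared with the Gene Length and GC content fields.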