Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlut_21740 |
Symbol | |
ID | 7985689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Micrococcus luteus NCTC 2665 |
Kingdom | Bacteria |
Replicon accession | NC_012803 |
Strand | + |
Start bp | 2329467 |
End bp | 2332844 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644807108 |
Product | haloacid dehalogenase superfamily protein, subfamily IA, variant 3 with third motif having DD or ED/beta-phosphoglucomutase family hydrolase |
Protein accession | YP_002958196 |
Protein GI | 239918638 |
COG category | [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR02009] beta-phosphoglucomutase family hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.976778 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAGA CGCTCACCCC CGCCGCTCCG CAGGACCTGA ACGCCCGCCG CCACCGGCTC CGCCGCTACC GCGCCGTCAT CTTCGACATG GACGGGGTCA TCACGGACAC CGCCGGCGTG CACGCCGCCG CGTGGAAGGA GCTCTTCGAC GCGGCCCTGC CCGACGTCGG CGCGCTGCCC GCCAACGCCG CCGTCGTCGC CGCGGACCCG GACGTGCTGC GTCCCTTCGA CGCCGCCGCC GACTACCTGC ATCACGTGGA CGGGCGCCCC CGCGAGGACG GGGTCCGCAC GTTCTTCGCC TCCCGCGGCC TGCACGTCCC GGAGGCCGAC TCCCCCGAGG CGGACGCGAT GCCGGAGCTG ACGGTCCTCG CCCTCGCCGA GCGCAAGCAG GGCTACTTCG AGCAGGTGCT CGAGCGCGAC GGCGTCCGCG TGTTCCCGGA GGCCCAGGAC CTCCTCGAGC GGCTGCGGGC CAAGGGCGTG CCGGTCGCGC TGGTCACCAG CTCCAAGAAC TCCCGCGCCG TGCTCACGGC CGGCGGCGTG CTGGACTTCT TCCCCGTGAT CGTGGACGGG AACACCGCCG TCGAGCGCGA GCTGCCGGGC AAGCCGGACC CGGCCATGTT CTGGGAGGCC GCCCGCGAGC TGGGCGTCGA CGTCGCGGAC GCGATGGTCC TCGAGGACGC CGTCTCCGGT GTGAAGGCCG CCTCGGACGG CCGCTTCGGC CTCGTGATCG GCGTGGACCG CGAGCCCGAG CTCGGCAAGG GCCGACTCAA GGCGGCCGGC GCCCACCTGG TGGTCCAGGA CTACGGGACC CTGCATCTGG AGGACCGCAC CACCACCCCG TTCGACCCCG CGTGGGTGCT GCGCTGGGAC CGCTTCGACC CGGCCTCCGA GGGCACGCGC GAGGTGCTGT GCACCCTGGC CAACGGGTAC TGGGGCACCC GCGGCGCCGT GCCCGGCACG CGTATCTCCT CCGTCCACTA CCCCGGCACC TACATGGCCG GGGTGTTCAA CCGGCTGACC TCGATGGTCC AGGGCCGGGT CGTGGAGACC GAGCACATGG TCAACATCCA GGACTGGACC CCGCTCGTGG TGACGCCGCG CCACGGCCGC CCGCTGCTGC CGGGCGAGGA GAACCTCGTG GAGTACGGCC AGGAGATGGA CCTGCGCCGC GGCGTGCTCT CCCGCACCAT GACCTTTGAG GACGAGCAGG GCCGGCGCAC CACGCTCCAC ACCCGGCAGT TCACCTCGCT GGCCAACCGG CACCTCGCCG CGATCGAGCT GACCGTGGTG GCGGAGAACT GGTCCGGGGA CCTCACAGTG CGCTCGAAGA TCGAGGGCCG CGTGGCCAAC CTCAACGTCT CCGACGACCG CACCCTCGCC AATCAGCACC TCGAGCCGGT CCAGGCCCGG GAGATCGACG GCGAGACCGT CCTGCTCGAG ACCGCCACCA ACCAGTCCGG CATCCACGTG GCCCTGGCCA CCCGCACCCG CCAGGTGGCG CCGGTGGGGC ACCACGAGCC CATCCGCCGC CCCGTGGACG GCTCGGACCT GGTGGTCGGC CAGGACATCC TCCTCCACGT GGACGAGGGC GTGCCGCTCG TGCTGGAGAA GATCGCCGCC GTCGCCACGA GCCACGACCA CGCCAACGCC TCCGTGTGGG AGTCGGCGGT GAAGGACGTC CAGCGCGCCC AGAACTTCCG CAACCTGCTG ACCCTGCACG AGCAGCGCTG GGGCACCAAC TGGGACCGGT TCTCCGTGCG CATCGACCTG GCCGAGCCGT ACCGGCACCA CCGCCGCTCC ACGGCAGCGG AGGCCGGCGG CGAGTACGCG CCGCCCGTCG TCGACGCCGG CCACTCGGCC CCGGTGGGGT CCGCGGTCCC GATGGGCAAG GACGGCGCGT CGCTGCGCCA GCAGCTCGCG CTGAACCTGC ACACGTTCCA CGTGCTGCAG ACGGCCTACG GCCGCCGCCG GGACCTGGAC GCCTCCGTGG GGGCGCGCGG CCTGCACGGC GAGGGCTACC GCGGCCACAT CTTCTGGGAC GAGATCTACG TCTACCCCAT GCTCACGCTG CGCCGCCCCG AGATCACCCG CGGCCTGCTG ATGTACCGCT ACCGGCGTCT CAACGAGGCC CGCGCCAACG CCCAGGCCGC GGGCTGGGCC GGCGCCATGT ACCCGTGGCA GTCCGGGGCA GACGGCTCCG AGGAGACGCC CACCGAGCTG TGGAACCCGC GCTCGCGCAT GTGGATGCCG GACAACTCGC ACAACCAGCG CCACGTCTCC CTGGACATCG CCTACTCGGT GCTGCGGTAC ATCGAGATCA CGAAGGACAC CTCCTTCATC TCGGACTACG GCGCGGAGAT GCTCGTGGAG ATCTCCCGTT TCTTCATGTC CATGACCCTG CACAACGCCG TCACGGACCG CTACGAGATC CACGGCGTCA TGGGCCCGGA CGAGTTCCAC GACGGCTACC CGGAGACCCC CGGCTCGGGC CTGCGCAACA ACGCGTACAC CAACGTGCTG ACCTCCTGGG TGCTCTCCGA GACCGCCCGG CTGGTGCGCT GGCTGGACAC CCTCGACGAC GGCCTGCCCG AGATCATGGA GATCACCGAG GAGGAGATCG AGCGCTGGGA GGACGTCAGC GCCCGCCTCA CCGTGCCGTT CTTCGAGGAC GGCGAGGAGG CCGGCATCCT CGCGCAGTTC GAGGGCTACC AGGACCTCAA GGAGTTCGAC TGGGAGGCCT ACCGGGCCAA GTACGGCAAC ATCGGCCGCA TGGACCTGAT CCTGCAGGCC GAGGGCGACG CCACCAACCG GTACAAGCTC TCCAAGCAGG CGGACACCCT CATGCTCGGC TACCTGTTCT CCGCCGAGGA GCTGGACGGG ATCCTGCGCC GCATGGGCTA CGAGCTGCCG CAGGAGGCGT TCGAGCGCAT GGTGACGTAC TACGAGGCCC GCTCCACGCA CGGCTCCACG CTCTCCCGGC TGGTGCACGC CTGGGTGGCG GCCCGCACCG ATCCGGACCG CTCGTGGGAC CTGTTCACGG AGGCGCTCGA GTCGGACCTC TCCGACACCC AGGGCGGCAC CACCCGCGAG GGCATCCACC TGGGCCTGAT GGCCGGCACC GTGGACACCG TGATCCGCTG CTACGCGGGC CTGGAGACGC GCGAGGACGT GGTGCGCGTG GACCCGCGCA TGCCCTCCCA GCTGCCCGGC GCCAGCTTCA CCATCCGGTT CCGCCAGCAG CCGGTGCAGA TCCACATGCG CCGCTCGGAG GTCACGGTCC GCGCCGGCAG CGGCATGTGG CACGACGTGC CGATGATCAT CGCCGGCCAG GAGCACACCC TGTCCCCGGG CGCCGAGATC ACGGTCCCGC TGGGCTGA
|
Protein sequence | MSQTLTPAAP QDLNARRHRL RRYRAVIFDM DGVITDTAGV HAAAWKELFD AALPDVGALP ANAAVVAADP DVLRPFDAAA DYLHHVDGRP REDGVRTFFA SRGLHVPEAD SPEADAMPEL TVLALAERKQ GYFEQVLERD GVRVFPEAQD LLERLRAKGV PVALVTSSKN SRAVLTAGGV LDFFPVIVDG NTAVERELPG KPDPAMFWEA ARELGVDVAD AMVLEDAVSG VKAASDGRFG LVIGVDREPE LGKGRLKAAG AHLVVQDYGT LHLEDRTTTP FDPAWVLRWD RFDPASEGTR EVLCTLANGY WGTRGAVPGT RISSVHYPGT YMAGVFNRLT SMVQGRVVET EHMVNIQDWT PLVVTPRHGR PLLPGEENLV EYGQEMDLRR GVLSRTMTFE DEQGRRTTLH TRQFTSLANR HLAAIELTVV AENWSGDLTV RSKIEGRVAN LNVSDDRTLA NQHLEPVQAR EIDGETVLLE TATNQSGIHV ALATRTRQVA PVGHHEPIRR PVDGSDLVVG QDILLHVDEG VPLVLEKIAA VATSHDHANA SVWESAVKDV QRAQNFRNLL TLHEQRWGTN WDRFSVRIDL AEPYRHHRRS TAAEAGGEYA PPVVDAGHSA PVGSAVPMGK DGASLRQQLA LNLHTFHVLQ TAYGRRRDLD ASVGARGLHG EGYRGHIFWD EIYVYPMLTL RRPEITRGLL MYRYRRLNEA RANAQAAGWA GAMYPWQSGA DGSEETPTEL WNPRSRMWMP DNSHNQRHVS LDIAYSVLRY IEITKDTSFI SDYGAEMLVE ISRFFMSMTL HNAVTDRYEI HGVMGPDEFH DGYPETPGSG LRNNAYTNVL TSWVLSETAR LVRWLDTLDD GLPEIMEITE EEIERWEDVS ARLTVPFFED GEEAGILAQF EGYQDLKEFD WEAYRAKYGN IGRMDLILQA EGDATNRYKL SKQADTLMLG YLFSAEELDG ILRRMGYELP QEAFERMVTY YEARSTHGST LSRLVHAWVA ARTDPDRSWD LFTEALESDL SDTQGGTTRE GIHLGLMAGT VDTVIRCYAG LETREDVVRV DPRMPSQLPG ASFTIRFRQQ PVQIHMRRSE VTVRAGSGMW HDVPMIIAGQ EHTLSPGAEI TVPLG
|
| |