Gene Information |
Locus tag | Msil_3723 |
Symbol | |
ID | 7093077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 4081139 |
End bp | 4083793 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643467009 |
Product | Erythromycin esterase |
Protein accession | YP_002363968 |
Protein GI | 217979821 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG0412] Dienelactone hydrolase and related enzymes; [COG1926] Predicted phosphoribosyltransferases; [COG2312] Erythromycin esterase homolog |
TIGRFAM ID | |
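
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Sketch: consistency checks for the record's coordinate and length fields.
# Assumption: Start/End are 1-based, inclusive positions on the replicon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4081139, 4083793)
    print(length, protein_length(length))
```

Under these assumptions the values reproduce the Gene Length (2655 bp) and Protein Length (884 aa) fields of this record.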
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.146777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGCT TGCCCTTCAA ATTTCTCGAC CGGGCCGACG CTGGCCGCCA GTTGGCGGCC AAATTGGTCA CGATGCAGCT TGATCGCCCG GCAGTCTACG CGTTGCCCCG TGGCGGCGTG CCGGTAGCTC TCGAAATCGC GCGCGCCCTG CGCGCCCCTC TCGATCTCAT TCTGGTGCGT AAGATTGGCG CTCCGGGCGC GCCGGAATTG GCGCTTGGCG CCGTCGTCGA AGGCGAAATT CCGCAGACCG TGATCAACGA AGACGTGCGG CGAGCTTCCG GCGCGGACGA CGCTTATCTT GAGCGCGCGC GGCAGCGTGA ACTGGCGGAG CTCGAACGCC GGCGCGTCCG ATATTTGGGC GACCGCGCGC GACTGCAGCC AACCGGACGC ACCGCCATCA TCGTCGATGA TGGGCTTGCC ACCGGGGCCA CGGCGAAAGC CGCCCTCATC GCGATCAAAC GCCAGGGCGC CGCCAGGATC ATACTCGCCA TTCCCGCCGC GCCGGAGGAA ACGCTGGCCG AAATGCGCCA GTATGCAGAT CTTGTTGTGT GTCTACATCC CGCCAGGCAT TTCCAGGGGG TCGGCGCATT CTACAGCGAT TTCCACCAAC TCACCGATGA AGAGACAATC GGGTTGCTGC GTCAGGGCTG GGCGGAGACC GGCGAGGCCG AGTCCGGTTC AGCCAGGCGT CAGATCGCGG TCCCGCCGCT CGGTTTGGTC GGCGATCTCT ACGTGCCGCC GGACCCGCGC GGCGTCATTC TGTTCGCCCA TGGAAGCGGA TCGAGCCGCC TCAGTCCGCG CAACGCCGCC GTCGCCCACA CTTTGAACGC ACAGGGCTTC GCCACGCTTC TGCTGGATCT GTTGACCGAG AAAGAGGCGA AGGACCGCCG TAACGTCTTC GATATTCCAC TGCTCGCGGA GCGCCTGCTG GAGGCGGCGA TATGGATCCG CGCCGAGCCC GACATCGCCG ATCTGCCGCT GGGCCTGTTT GGCGCGAGCA CCGGCGCGGG GGCAGCTATG CTGGCCGCGG CGGAGCTTCT GGGCGGCGTG GCGGCGGTGG TCTCGCGCGG CGGACGCCCC GATCTGGCCG GCCCCCGGCT GGCGGAAGTC CGCGCGCCGA GCCTGCTGAT TGTCGGCGGC GACGATCGGC AGGTCCTCGC GCTGAACCGG CAGGCGCTGG CTGCGCTGAA ATGTGAAAGG CTGCTGAAGA TCGTTCCCAA CGCCACCCAT CTGTTTGAGG AGCCTGGCGC GCTGGAGCTG GCGACCGACA TGGCGAGCGC CTGGTTCCAG CACTATCTGA CGCCCCCGGC GCCCAGCCGC GCGCCAGCGC CCCCAGCGGC GCGGCCGACG CCACAGATAT CGACAGCGGC CGTGCTGCGC GCCGCCGCCG AGCCGTTGCT TTCGTTGGAC GACCCGACCT TCGCCGCCGC CTTTGACCGC TTTGCCCAGT CCCGCGTGGT GTTGCTTGGC GAGGCTTCGC ACGGGACCTC GGAATTCTAC CGCGCCCGCG CCGCCATCAC GCGGCGCCTG ATCGAGCGGC ACGGTTTTAC GATCGTGGCT GTCGAGGCGG ATTGGCCGGA CGCCGCTGCA ATCGACCGCT ATGTGCGGCG CAGCCCGCAT CAGCCGATGA GCTCAACGCC ATTCGCCCGC TTCCCAAGCT GGATGTGGCG CAACAAGGAT GTCGACGCCT TCGTCGGCTT TCTGCGCGGC CACAACGCCG CGACGTCGCC CGGCGACGAG GTTGGGTTCT ATGGCCTCGA CCTGTACAAC ATGACGGCCT CGATCGCCGC CGTGCTTGCC 
TATCTCGATC GCATCGATCC CAAGGCTGCG GAGGCGGCGC GGGCGCGCTA CGCCTGCCTG TCGCCGTGGT CGCGGGAGCC GGCGGCCTAT GGACGGGCCT CGCTCACCGA AGGCTATGCG CTGTGCGAAC AGCCGGTGAC GCGTATTCTC GTTGATCTTT TGGAGAAGGA ACTCCAATAC GCGCGCCTCG ATGGCGGCCA TTTCTTCGAC GCAACGCAGA ACGCCCGACT GGTGGCCGAC GCGGAGCGCT ACTACCGGGC CATGTATTAC GGCGCCCACG AATCCTGGAA TCTGCGCGAC CGGCACATGT TCGACACACT GCAGAACATC CTCGCGCAGG CCGGTCCAGA CAAGAAGGCG GTTGTCTGGG CGCATAATTC CCACATCGGG GACGCGCGAT TCACGGACAT GGGCGCGGAG CGCGGCGAGC TGAACATCGG CCAGTTGTGC CGCCAGACAT ATGGACGCGG CGCGGCCTTG ATCGGCTTTG GCACGCATAC CGGCACCGTC GCGGCTGCGT CCGAATGGGA CGCGCCGATG GAGGTCAAGG CCGTCAGGCC ATCGCGTCCG GACAGTTACG AGGCCTTGTG CCACGAGGTA GGCTCCGAAC GATTCCTGCT GGACCTACGA GCGGGGCAGC ATGACGATCT CCGCCGCGTC ATGGCCGAAC CACGACTTGA ACGCTACATT GGCGTCATCT ATCGGCCCGA GACCGAGCGC TGGAGCCACT ACAGCTACGC GACGCTTCCC GACCAGTACG ACGCCTTCGT GTGGTTCGAC GAGACTCATG CGGTGATCCC GTTGCCGACG AAGGTAACCG CCGGCGAGGA TGAGACCTAT CCCTTCGGCC TCTGA
|
Protein sequence | MAGLPFKFLD RADAGRQLAA KLVTMQLDRP AVYALPRGGV PVALEIARAL RAPLDLILVR KIGAPGAPEL ALGAVVEGEI PQTVINEDVR RASGADDAYL ERARQRELAE LERRRVRYLG DRARLQPTGR TAIIVDDGLA TGATAKAALI AIKRQGAARI ILAIPAAPEE TLAEMRQYAD LVVCLHPARH FQGVGAFYSD FHQLTDEETI GLLRQGWAET GEAESGSARR QIAVPPLGLV GDLYVPPDPR GVILFAHGSG SSRLSPRNAA VAHTLNAQGF ATLLLDLLTE KEAKDRRNVF DIPLLAERLL EAAIWIRAEP DIADLPLGLF GASTGAGAAM LAAAELLGGV AAVVSRGGRP DLAGPRLAEV RAPSLLIVGG DDRQVLALNR QALAALKCER LLKIVPNATH LFEEPGALEL ATDMASAWFQ HYLTPPAPSR APAPPAARPT PQISTAAVLR AAAEPLLSLD DPTFAAAFDR FAQSRVVLLG EASHGTSEFY RARAAITRRL IERHGFTIVA VEADWPDAAA IDRYVRRSPH QPMSSTPFAR FPSWMWRNKD VDAFVGFLRG HNAATSPGDE VGFYGLDLYN MTASIAAVLA YLDRIDPKAA EAARARYACL SPWSREPAAY GRASLTEGYA LCEQPVTRIL VDLLEKELQY ARLDGGHFFD ATQNARLVAD AERYYRAMYY GAHESWNLRD RHMFDTLQNI LAQAGPDKKA VVWAHNSHIG DARFTDMGAE RGELNIGQLC RQTYGRGAAL IGFGTHTGTV AAASEWDAPM EVKAVRPSRP DSYEALCHEV GSERFLLDLR AGQHDDLRRV MAEPRLERYI GVIYRPETER WSHYSYATLP DQYDAFVWFD ETHAVIPLPT KVTAGEDETY PFGL
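
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed on the coding strand (it begins with ATG and ends with TGA) and uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs mainly in which start codons are permitted, and that case is not handled here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Sketch: translate a coding-strand DNA string and compute its GC percentage.
# Assumption: standard codon assignments (shared by NCBI translation table 11);
# alternative start codons such as GTG or TTG are NOT converted to M here.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCTGGCT TGCCCTTCAA ATTTCTCGAC"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields MAGLPFKFLD, which matches the start of the listed protein sequence.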