Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_8070 |
Symbol | |
ID | 7295899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011887 |
Strand | + |
Start bp | 291084 |
End bp | 292757 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643592594 |
Product | sulfatase |
Protein accession | YP_002490226 |
Protein GI | 220914918 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.883043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATC CGTTACCGTC GAACGCCGCC TTGGATGAGG AAGCAGCCGC CAAAACGATC AGCCGCCGGA GGATGCTGCT TGGGGGAACT GCCCTGGCCG CCTCGGTCGC CGGCACCGCA CCCACGATTG CGCAGGCGCA GCAACCGGCC CCGGCCCCGC AGCCGGCCCC CGTCAGGACC GGCGCCACAG GCCGGCCAGT CAACATCCTG GTGATGTTCG GCGACGACAT CGGGCAGTCG AACATCAGCG CCTACACCTT CGGCCTGATG GGTTACCGCA CACCCAACAT CGACCGCATC GCCCGCGAGG GCATGATGTT CACCGACTAC TACGCCGAGC AAAGCTGTAC GGCCGGCCGC TCCTCCTTTA TCACCGGCCA GTCGACCCTG CGGACCGGCC TGTCGAAGGT TGGCCTACCC GGCGCGACGG TGGGTCTTCA GAAGGAAGAT CCAACGCTGG CGGAACTCCT CAAGCCGCTC GGCTACGCCA CGGGGCAGTT CGGCAAGAAC CATCTCGGCG ACCGCGACGA GTACCTGCCG ACGAACCACG GCTTCGACGA GTTCTTCGGC AACCTCTATC ATCTCAACGC CGAGGAGGAG CCGGAGCAGC GGACCTACCC GCGCGATCCC GAGTTCCGCA AGCGGTTCGG CCCGCGCGGG GTGATCCGCT CGTCGGCGGA CGGCAAGATC GAGGACACGG GTCCGCTCAC CAAAAAGCGG ATGGAAACAA TTGACGACGA GACCTCGGCC GCCGCTATGG ATTTTATCGA GCGCCAGGTC CGGGCGAACA AGCCGTTCTT CTGCTGGTTC AACGCGACCC GGATGCACCT CCGGACGCAT GTCGCGGAGA ACCATCGCAG CCCGCCGGGC CTGACTGCCC GGACCGAGTA CGCGGACGGC ATGGTCGAGC ATGACGGGCA CATCGGGCAG CTCCTGAAGA AACTCGACGA CCTCGGCATC GCGAACGACA CCATCGTGCT CTACACCACC GACAACGGCC CGCACATGAA CTCGTGGCCG GACAGCGCCA TGACGCCGTT CCGCAGCGAG AAGGACACGA ACTGGGAGGG CGCCTTCCGG GTGCCTTGCA TGATCCGCTG GCCGGGCCAC ATCCAGGCGG GCTCCGTCTC GAACGAGATC GTCAGTGGGC TCGACTGGGT GCCGACCCTG GTGGCCGCCG CGGGTGATCC CAACATCGTG GACAAGCTGC TCAAGGGCCA CACGGCCGGA GCGAAGTCCT TCAAGGTCCA CCTCGACGGC TACAACCAGC TCCCGTACCT GACCGGCCAG CAGGACCGCG GCGCCCGCAA GGGGTTCTTC TACTTCAACG ACGACGGCGA CCTCGTCGGG ATGCGCTACG AGAACTGGAA GATCGTCTTC GAGGAGCAGC GCGCCCCCGG AACGATGCGG ATCTGGGCCG AGCCGTTCAC GCCGCTGCGG GTGCCGAAAC TGTTTGACCT GAGGGCTGAC CCCTACGAGC GGGCCGACAT CACCTCGAAC ACCTACTACG ACTGGCTCAT TTCAAATGTG TACGTCCTCG TTCCTGCTCA GGCGGAGGTC GCGAAGTTCC TCGACACGTT CCGCGAGTTC CCGCCCCGAC AGCGGGCGGC AAGCTTCAGC GTCGACCAGA TCGTTGAGAA GATGAAGCGG GCGACGGAGG TCCCCAGCCG GTGA
|
Protein sequence | MTDPLPSNAA LDEEAAAKTI SRRRMLLGGT ALAASVAGTA PTIAQAQQPA PAPQPAPVRT GATGRPVNIL VMFGDDIGQS NISAYTFGLM GYRTPNIDRI AREGMMFTDY YAEQSCTAGR SSFITGQSTL RTGLSKVGLP GATVGLQKED PTLAELLKPL GYATGQFGKN HLGDRDEYLP TNHGFDEFFG NLYHLNAEEE PEQRTYPRDP EFRKRFGPRG VIRSSADGKI EDTGPLTKKR METIDDETSA AAMDFIERQV RANKPFFCWF NATRMHLRTH VAENHRSPPG LTARTEYADG MVEHDGHIGQ LLKKLDDLGI ANDTIVLYTT DNGPHMNSWP DSAMTPFRSE KDTNWEGAFR VPCMIRWPGH IQAGSVSNEI VSGLDWVPTL VAAAGDPNIV DKLLKGHTAG AKSFKVHLDG YNQLPYLTGQ QDRGARKGFF YFNDDGDLVG MRYENWKIVF EEQRAPGTMR IWAEPFTPLR VPKLFDLRAD PYERADITSN TYYDWLISNV YVLVPAQAEV AKFLDTFREF PPRQRAASFS VDQIVEKMKR ATEVPSR
|
| |