Gene Mnod_8070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_8070 
Symbol 
ID7295899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011887 
Strand
Start bp291084 
End bp292757 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content65% 
IMG OID643592594 
Productsulfatase 
Protein accessionYP_002490226 
Protein GI220914918 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.883043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC CGTTACCGTC GAACGCCGCC TTGGATGAGG AAGCAGCCGC CAAAACGATC 
AGCCGCCGGA GGATGCTGCT TGGGGGAACT GCCCTGGCCG CCTCGGTCGC CGGCACCGCA
CCCACGATTG CGCAGGCGCA GCAACCGGCC CCGGCCCCGC AGCCGGCCCC CGTCAGGACC
GGCGCCACAG GCCGGCCAGT CAACATCCTG GTGATGTTCG GCGACGACAT CGGGCAGTCG
AACATCAGCG CCTACACCTT CGGCCTGATG GGTTACCGCA CACCCAACAT CGACCGCATC
GCCCGCGAGG GCATGATGTT CACCGACTAC TACGCCGAGC AAAGCTGTAC GGCCGGCCGC
TCCTCCTTTA TCACCGGCCA GTCGACCCTG CGGACCGGCC TGTCGAAGGT TGGCCTACCC
GGCGCGACGG TGGGTCTTCA GAAGGAAGAT CCAACGCTGG CGGAACTCCT CAAGCCGCTC
GGCTACGCCA CGGGGCAGTT CGGCAAGAAC CATCTCGGCG ACCGCGACGA GTACCTGCCG
ACGAACCACG GCTTCGACGA GTTCTTCGGC AACCTCTATC ATCTCAACGC CGAGGAGGAG
CCGGAGCAGC GGACCTACCC GCGCGATCCC GAGTTCCGCA AGCGGTTCGG CCCGCGCGGG
GTGATCCGCT CGTCGGCGGA CGGCAAGATC GAGGACACGG GTCCGCTCAC CAAAAAGCGG
ATGGAAACAA TTGACGACGA GACCTCGGCC GCCGCTATGG ATTTTATCGA GCGCCAGGTC
CGGGCGAACA AGCCGTTCTT CTGCTGGTTC AACGCGACCC GGATGCACCT CCGGACGCAT
GTCGCGGAGA ACCATCGCAG CCCGCCGGGC CTGACTGCCC GGACCGAGTA CGCGGACGGC
ATGGTCGAGC ATGACGGGCA CATCGGGCAG CTCCTGAAGA AACTCGACGA CCTCGGCATC
GCGAACGACA CCATCGTGCT CTACACCACC GACAACGGCC CGCACATGAA CTCGTGGCCG
GACAGCGCCA TGACGCCGTT CCGCAGCGAG AAGGACACGA ACTGGGAGGG CGCCTTCCGG
GTGCCTTGCA TGATCCGCTG GCCGGGCCAC ATCCAGGCGG GCTCCGTCTC GAACGAGATC
GTCAGTGGGC TCGACTGGGT GCCGACCCTG GTGGCCGCCG CGGGTGATCC CAACATCGTG
GACAAGCTGC TCAAGGGCCA CACGGCCGGA GCGAAGTCCT TCAAGGTCCA CCTCGACGGC
TACAACCAGC TCCCGTACCT GACCGGCCAG CAGGACCGCG GCGCCCGCAA GGGGTTCTTC
TACTTCAACG ACGACGGCGA CCTCGTCGGG ATGCGCTACG AGAACTGGAA GATCGTCTTC
GAGGAGCAGC GCGCCCCCGG AACGATGCGG ATCTGGGCCG AGCCGTTCAC GCCGCTGCGG
GTGCCGAAAC TGTTTGACCT GAGGGCTGAC CCCTACGAGC GGGCCGACAT CACCTCGAAC
ACCTACTACG ACTGGCTCAT TTCAAATGTG TACGTCCTCG TTCCTGCTCA GGCGGAGGTC
GCGAAGTTCC TCGACACGTT CCGCGAGTTC CCGCCCCGAC AGCGGGCGGC AAGCTTCAGC
GTCGACCAGA TCGTTGAGAA GATGAAGCGG GCGACGGAGG TCCCCAGCCG GTGA
 
Protein sequence
MTDPLPSNAA LDEEAAAKTI SRRRMLLGGT ALAASVAGTA PTIAQAQQPA PAPQPAPVRT 
GATGRPVNIL VMFGDDIGQS NISAYTFGLM GYRTPNIDRI AREGMMFTDY YAEQSCTAGR
SSFITGQSTL RTGLSKVGLP GATVGLQKED PTLAELLKPL GYATGQFGKN HLGDRDEYLP
TNHGFDEFFG NLYHLNAEEE PEQRTYPRDP EFRKRFGPRG VIRSSADGKI EDTGPLTKKR
METIDDETSA AAMDFIERQV RANKPFFCWF NATRMHLRTH VAENHRSPPG LTARTEYADG
MVEHDGHIGQ LLKKLDDLGI ANDTIVLYTT DNGPHMNSWP DSAMTPFRSE KDTNWEGAFR
VPCMIRWPGH IQAGSVSNEI VSGLDWVPTL VAAAGDPNIV DKLLKGHTAG AKSFKVHLDG
YNQLPYLTGQ QDRGARKGFF YFNDDGDLVG MRYENWKIVF EEQRAPGTMR IWAEPFTPLR
VPKLFDLRAD PYERADITSN TYYDWLISNV YVLVPAQAEV AKFLDTFREF PPRQRAASFS
VDQIVEKMKR ATEVPSR