Gene M446_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1195 
Symbol 
ID6133775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1316410 
End bp1318068 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content67% 
IMG OID641641483 
Productsulfatase 
Protein accessionYP_001768154 
Protein GI170739499 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.031071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA CACGGCGGCT GAACGACCTC GACGTGAAGC AGGAGGCGGG CCGGGCGTTC 
AGCCGCCGCG CGATGCTCCT GGGGGGGACC GCGCTGGCCG CATCCACGAT GGGCGCCCAG
GCACAGGCGC CCGCGCCGTC CCCGCAGCCC GCGCCCGCCA GGACCGGCGG CTCGGGCCGA
CCGGTCAACA TCCTGGTCAT GTTCGGCGAC GATATCGGGC AGTCGAACAT CAGCGCCTAC
ACCTTCGGCC TGATGGGATA CCGCACCCCC AACATCGACC GCATCGCCCG CGAGGGCATG
ATGTTCACCG ACTACTATGC CGAGCAGAGC TGCACGGCCG GCCGCTCGTC CTTCATCACG
GGGCAGTCGA CCCTGCGTAC GGGGCTGTCG AAGGTCGGCG TGCCCGGCGC GACGGTGGGC
CTGCAGAAGG AGGACCCAAC CCTCGCCGAG CTCCTCAAGC CGCTCGGCTA CGCCACGGGG
CAGTTCGGCA AGAACCACCT CGGCGACCGG GACGAGTACC TGCCGACCAA CCACGGCTTC
GACGAGTTCT TCGGCAACCT CTACCACCTC AACGCCGAGG AGGAGCCCGA GCAGCGGACC
TACCCGCGCG ATCCCGAGTT CCGGAAGAAG TTCGGTCCCC GCGGCGTCCT CAAGGCGTCG
GCGGACGGCA AGATCGAGGA CACGGGCCCG CTCACCAAGA AGCGGATGGA GACCATCGAC
GACGAGACCT CGGCCGCCGC GATGGATTTC ATGGAGCGCC AGGTTCGGGC CAGCAAGCCG
TTCTTCTGCT GGTTCAACTC GACCCGGATG CACTTCCGCA CCCACGTCGC GGCGGACCGC
CGGAGCCCCC CGGGCCTCAC AGCCCGGACC GAGTACGCCG ACGGGATGGT CGAGCACGAC
GGGCACATCG GGCAGCTCTT GAAGAAGCTC GACGACCTCG GCATCGCGAA CGACACCATC
GTCCTCTACA CGACCGACAA CGGCCCGCAC ATGAACTCCT GGCCGGACGC CGCGATGACG
CCGTTCCGGA GCGAGAAGGA CACGAACTGG GAAGGCGCCT TCCGGGTGCC CTGCATGATC
CGCTGGCCCG GGCACATCCA GGCCGGATCG GTTTCGAACG AGATGGTCAG CGGGCTCGAC
TGGGTGCCGA CGCTCATGGC GGCCGCAGGT GATCCCGACA TCACGGGCAA GCTCCTGAAG
GGTCACACGG CCGGTGCGAA GACCTTCAAG GTTCACCTGG ACGGCTACAA CCAGCTTCCG
TATCTCACCG GGCAGCAGGA CCGTAGCGCC CGCAACAAGT TCTTCTACTT CAACGACGAC
GGCGACCTCG TGGCGATGCG CTACGAGAAC TGGAAAATCG TCTTCGAGGA ACAGCGTGCC
CCGGGCACGA TGCGGATCTG GGCCGAGCCC TTCACTACGC TGCGCATGCC GAAGCTGTTC
GACCTGCGGG CGGACCCTTA CGAGCGCGCC GACATCACAT CGAACACCTA CTACGACTGG
TTCGTCTCGC AGCCCTACCT GATCTTTCCC GCCCAGGAGG AAGTCGCCAA GTTCCTCGCG
ACGTTCCGCG AATTCCCCCC TCGCCAGCGG GCGGCGAGCT TCAGCATCGA CCAGATCATC
GAGAAGATGC GCCGTGCGAC AGAGGCGCAT AGCCAGTAG
 
Protein sequence
MSETRRLNDL DVKQEAGRAF SRRAMLLGGT ALAASTMGAQ AQAPAPSPQP APARTGGSGR 
PVNILVMFGD DIGQSNISAY TFGLMGYRTP NIDRIAREGM MFTDYYAEQS CTAGRSSFIT
GQSTLRTGLS KVGVPGATVG LQKEDPTLAE LLKPLGYATG QFGKNHLGDR DEYLPTNHGF
DEFFGNLYHL NAEEEPEQRT YPRDPEFRKK FGPRGVLKAS ADGKIEDTGP LTKKRMETID
DETSAAAMDF MERQVRASKP FFCWFNSTRM HFRTHVAADR RSPPGLTART EYADGMVEHD
GHIGQLLKKL DDLGIANDTI VLYTTDNGPH MNSWPDAAMT PFRSEKDTNW EGAFRVPCMI
RWPGHIQAGS VSNEMVSGLD WVPTLMAAAG DPDITGKLLK GHTAGAKTFK VHLDGYNQLP
YLTGQQDRSA RNKFFYFNDD GDLVAMRYEN WKIVFEEQRA PGTMRIWAEP FTTLRMPKLF
DLRADPYERA DITSNTYYDW FVSQPYLIFP AQEEVAKFLA TFREFPPRQR AASFSIDQII
EKMRRATEAH SQ