Gene M446_6751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_6751 
Symbol 
ID6130345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp7422270 
End bp7423979 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content66% 
IMG OID641646833 
Productsulfatase 
Protein accessionYP_001773432 
Protein GI170744777 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0169669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATG ATCGTGCATT CACCCCGGGT CCGTCGGGCC GGAGCCGCCC GCGCTCCACG 
AGGGCGGCGC TGATCGGGGC GACGGCCCTG GTCGCGGCGA CGCCGGCCGT GCCGAGCTTC
GCCCAGGCGC CGCAGCAGCA GAAGCCCAAC ATCCTCTTCA TCGTCTCCGA CGACACCGGC
TACGGCGACC TCGGGCCCTA CGGCGGCGGG GAGGGACGGG GCATGCCCAC GCCCAACATC
GACCGCCTCG CCGAGGACGG GATGACCTTC TTCTCGTTCT ACGCCCAGCC CAGCTGCACC
CCGGGCCGGG CCGCGATGCA GACGGGGCGG ATCCCGAACC GCAGCGGGAT GACGACGGTA
GCGTTCCAGG GCCAGGGCGG CGGCTTGCCG GCCGCCGAGT GGACGTTGGG CTCGGTGCTG
AAGCAGGGCG GCTACAAGAC CTACTTCACG GGGAAATGGC ACCTCGGCGA GGCCGACTAC
GCGCTGCCCA ACGCCCAGGG CTACGACGTC ATGCAGTATT GCGGCCTCTA TCACCTCAAC
GCCTACACCT ACGCCGACCC GACCTGGTTC CCCGACATGG ACCCCGAGCT CAGGGCCATG
TTCCAGAGGG TCACCAGGGG AGCCCTGTCC GGCAAGGCTG GCGAGAAGGC CGTCGAGGAT
TTCAAAGTCA ACGGTCAGTA CGTGAACACC CCCGTCGTCG ACGGCAAGGC CGGCGTGGTC
GGCATCCCAT TCTTCGACAG CTACGTCGAG AAAGCCGCGC TCGGCTTCCT CGACGACGCC
GCGAAGGCGG GCAGCCCTTT CTACATCAAC GTCAACTTCA TGAAGGTGCA CCAACCGAAC
ATGCCGGCCC CCGAGTTCGA GCACAAATCG CTCTCCAAGA GTAAGTACGC CGACTCGGTC
GTCGAGCTCG ATGCCCGGAT CGGGCGGATC ATGGACAAGC TGCGCTCGCT CGGGCTCGAT
AAGAACACGC TCGTCTTCTA CACGACTGAC AACGGCGCGT GGCAGGACGT CTACCCCGAC
GCGGGCTACA CCCCCTTCCG GGGCACGAAG GGCACCGTGC GCGAAGGCGG CAACAGGGTG
CCGGCAATGG CGGTCTGGCC GGGCAAGATC AAGCCCGGCA CGAAGAACCA CGACATCGTT
GGGGGCCTCG ACTTGATGGC CACCTTCGCC TCGGTCGCGG GCCTCACGCT GCCGGACAAA
GACCGCGACG GCCAGCCGAT GATCTTCGAC AGCTACGACA TGTCGCCGGT GCTACTCGGG
ACGGGTAAGT CCGCGCGTAA ATCGTGGTTC TACTTCACCG AGGACGAGCT GAGCCCGGGC
GCGGTCCGCG TCGGCAACTA CAAGGCGGTG TTCAACCTGC GCGGCGACGA CGGCGCCGCC
ACTGGCGCCC TCGCGGTCGA CACCAATCTG GGCTGGAAGG GATCCAGCAA GTACGTCGCG
ACGGTTCCGC AGATTTTCGA TCTCTGGCAG GACCCGCAGG AGCGCTACGA CGTCTTCATG
AACAACTACA CCGAGCGGAC GTGGACGCTC GTGACAATGA GCGCGGCAGT GAAGAACTTG
ATGAAGACGT ACGTGCAGTA CCCACCGCGT AAGCTGCAGA GCGAGGTCTA CACAGGTCCT
ATCACGATCT CGCAGTACGA GCGGCTGCAA TCCGTCCGTG ACGCGCTCGC GAAGGAGGGG
ATCACCCTTC CGATGCCCAC GGGCCAGTAG
 
Protein sequence
MTHDRAFTPG PSGRSRPRST RAALIGATAL VAATPAVPSF AQAPQQQKPN ILFIVSDDTG 
YGDLGPYGGG EGRGMPTPNI DRLAEDGMTF FSFYAQPSCT PGRAAMQTGR IPNRSGMTTV
AFQGQGGGLP AAEWTLGSVL KQGGYKTYFT GKWHLGEADY ALPNAQGYDV MQYCGLYHLN
AYTYADPTWF PDMDPELRAM FQRVTRGALS GKAGEKAVED FKVNGQYVNT PVVDGKAGVV
GIPFFDSYVE KAALGFLDDA AKAGSPFYIN VNFMKVHQPN MPAPEFEHKS LSKSKYADSV
VELDARIGRI MDKLRSLGLD KNTLVFYTTD NGAWQDVYPD AGYTPFRGTK GTVREGGNRV
PAMAVWPGKI KPGTKNHDIV GGLDLMATFA SVAGLTLPDK DRDGQPMIFD SYDMSPVLLG
TGKSARKSWF YFTEDELSPG AVRVGNYKAV FNLRGDDGAA TGALAVDTNL GWKGSSKYVA
TVPQIFDLWQ DPQERYDVFM NNYTERTWTL VTMSAAVKNL MKTYVQYPPR KLQSEVYTGP
ITISQYERLQ SVRDALAKEG ITLPMPTGQ