Gene Information |
Locus tag | M446_1195 |
Symbol | |
ID | 6133775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 1316410 |
End bp | 1318068 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641641483 |
Product | sulfatase |
Protein accession | YP_001768154 |
Protein GI | 170739499 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
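The coordinate and length fields above are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1316410, 1318068)
print(length)                     # 1659
print(protein_length_aa(length))  # 552
```

Both results match the Gene Length (1659 bp) and Protein Length (552 aa) fields.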
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.031071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA CACGGCGGCT GAACGACCTC GACGTGAAGC AGGAGGCGGG CCGGGCGTTC AGCCGCCGCG CGATGCTCCT GGGGGGGACC GCGCTGGCCG CATCCACGAT GGGCGCCCAG GCACAGGCGC CCGCGCCGTC CCCGCAGCCC GCGCCCGCCA GGACCGGCGG CTCGGGCCGA CCGGTCAACA TCCTGGTCAT GTTCGGCGAC GATATCGGGC AGTCGAACAT CAGCGCCTAC ACCTTCGGCC TGATGGGATA CCGCACCCCC AACATCGACC GCATCGCCCG CGAGGGCATG ATGTTCACCG ACTACTATGC CGAGCAGAGC TGCACGGCCG GCCGCTCGTC CTTCATCACG GGGCAGTCGA CCCTGCGTAC GGGGCTGTCG AAGGTCGGCG TGCCCGGCGC GACGGTGGGC CTGCAGAAGG AGGACCCAAC CCTCGCCGAG CTCCTCAAGC CGCTCGGCTA CGCCACGGGG CAGTTCGGCA AGAACCACCT CGGCGACCGG GACGAGTACC TGCCGACCAA CCACGGCTTC GACGAGTTCT TCGGCAACCT CTACCACCTC AACGCCGAGG AGGAGCCCGA GCAGCGGACC TACCCGCGCG ATCCCGAGTT CCGGAAGAAG TTCGGTCCCC GCGGCGTCCT CAAGGCGTCG GCGGACGGCA AGATCGAGGA CACGGGCCCG CTCACCAAGA AGCGGATGGA GACCATCGAC GACGAGACCT CGGCCGCCGC GATGGATTTC ATGGAGCGCC AGGTTCGGGC CAGCAAGCCG TTCTTCTGCT GGTTCAACTC GACCCGGATG CACTTCCGCA CCCACGTCGC GGCGGACCGC CGGAGCCCCC CGGGCCTCAC AGCCCGGACC GAGTACGCCG ACGGGATGGT CGAGCACGAC GGGCACATCG GGCAGCTCTT GAAGAAGCTC GACGACCTCG GCATCGCGAA CGACACCATC GTCCTCTACA CGACCGACAA CGGCCCGCAC ATGAACTCCT GGCCGGACGC CGCGATGACG CCGTTCCGGA GCGAGAAGGA CACGAACTGG GAAGGCGCCT TCCGGGTGCC CTGCATGATC CGCTGGCCCG GGCACATCCA GGCCGGATCG GTTTCGAACG AGATGGTCAG CGGGCTCGAC TGGGTGCCGA CGCTCATGGC GGCCGCAGGT GATCCCGACA TCACGGGCAA GCTCCTGAAG GGTCACACGG CCGGTGCGAA GACCTTCAAG GTTCACCTGG ACGGCTACAA CCAGCTTCCG TATCTCACCG GGCAGCAGGA CCGTAGCGCC CGCAACAAGT TCTTCTACTT CAACGACGAC GGCGACCTCG TGGCGATGCG CTACGAGAAC TGGAAAATCG TCTTCGAGGA ACAGCGTGCC CCGGGCACGA TGCGGATCTG GGCCGAGCCC TTCACTACGC TGCGCATGCC GAAGCTGTTC GACCTGCGGG CGGACCCTTA CGAGCGCGCC GACATCACAT CGAACACCTA CTACGACTGG TTCGTCTCGC AGCCCTACCT GATCTTTCCC GCCCAGGAGG AAGTCGCCAA GTTCCTCGCG ACGTTCCGCG AATTCCCCCC TCGCCAGCGG GCGGCGAGCT TCAGCATCGA CCAGATCATC GAGAAGATGC GCCGTGCGAC AGAGGCGCAT AGCCAGTAG
|
Protein sequence | MSETRRLNDL DVKQEAGRAF SRRAMLLGGT ALAASTMGAQ AQAPAPSPQP APARTGGSGR PVNILVMFGD DIGQSNISAY TFGLMGYRTP NIDRIAREGM MFTDYYAEQS CTAGRSSFIT GQSTLRTGLS KVGVPGATVG LQKEDPTLAE LLKPLGYATG QFGKNHLGDR DEYLPTNHGF DEFFGNLYHL NAEEEPEQRT YPRDPEFRKK FGPRGVLKAS ADGKIEDTGP LTKKRMETID DETSAAAMDF MERQVRASKP FFCWFNSTRM HFRTHVAADR RSPPGLTART EYADGMVEHD GHIGQLLKKL DDLGIANDTI VLYTTDNGPH MNSWPDAAMT PFRSEKDTNW EGAFRVPCMI RWPGHIQAGS VSNEMVSGLD WVPTLMAAAG DPDITGKLLK GHTAGAKTFK VHLDGYNQLP YLTGQQDRSA RNKFFYFNDD GDLVAMRYEN WKIVFEEQRA PGTMRIWAEP FTTLRMPKLF DLRADPYERA DITSNTYYDW FVSQPYLIFP AQEEVAKFLA TFREFPPRQR AASFSIDQII EKMRRATEAH SQ
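The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and the demo runs only on the first two groups of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace between display groups and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two display groups of the gene sequence above.
head = clean_sequence("ATGAGCGAGA CACGGCGGCT")
print(head)                    # ATGAGCGAGACACGGCGGCT
print(head.startswith("ATG"))  # True: the fragment begins with a start codon
print(gc_fraction(head))
```

Applying `clean_sequence` to the full Gene sequence field should yield a string whose length equals the Gene Length field.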