Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6751 |
Symbol | |
ID | 6130345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 7422270 |
End bp | 7423979 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641646833 |
Product | sulfatase |
Protein accession | YP_001773432 |
Protein GI | 170744777 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0169669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATG ATCGTGCATT CACCCCGGGT CCGTCGGGCC GGAGCCGCCC GCGCTCCACG AGGGCGGCGC TGATCGGGGC GACGGCCCTG GTCGCGGCGA CGCCGGCCGT GCCGAGCTTC GCCCAGGCGC CGCAGCAGCA GAAGCCCAAC ATCCTCTTCA TCGTCTCCGA CGACACCGGC TACGGCGACC TCGGGCCCTA CGGCGGCGGG GAGGGACGGG GCATGCCCAC GCCCAACATC GACCGCCTCG CCGAGGACGG GATGACCTTC TTCTCGTTCT ACGCCCAGCC CAGCTGCACC CCGGGCCGGG CCGCGATGCA GACGGGGCGG ATCCCGAACC GCAGCGGGAT GACGACGGTA GCGTTCCAGG GCCAGGGCGG CGGCTTGCCG GCCGCCGAGT GGACGTTGGG CTCGGTGCTG AAGCAGGGCG GCTACAAGAC CTACTTCACG GGGAAATGGC ACCTCGGCGA GGCCGACTAC GCGCTGCCCA ACGCCCAGGG CTACGACGTC ATGCAGTATT GCGGCCTCTA TCACCTCAAC GCCTACACCT ACGCCGACCC GACCTGGTTC CCCGACATGG ACCCCGAGCT CAGGGCCATG TTCCAGAGGG TCACCAGGGG AGCCCTGTCC GGCAAGGCTG GCGAGAAGGC CGTCGAGGAT TTCAAAGTCA ACGGTCAGTA CGTGAACACC CCCGTCGTCG ACGGCAAGGC CGGCGTGGTC GGCATCCCAT TCTTCGACAG CTACGTCGAG AAAGCCGCGC TCGGCTTCCT CGACGACGCC GCGAAGGCGG GCAGCCCTTT CTACATCAAC GTCAACTTCA TGAAGGTGCA CCAACCGAAC ATGCCGGCCC CCGAGTTCGA GCACAAATCG CTCTCCAAGA GTAAGTACGC CGACTCGGTC GTCGAGCTCG ATGCCCGGAT CGGGCGGATC ATGGACAAGC TGCGCTCGCT CGGGCTCGAT AAGAACACGC TCGTCTTCTA CACGACTGAC AACGGCGCGT GGCAGGACGT CTACCCCGAC GCGGGCTACA CCCCCTTCCG GGGCACGAAG GGCACCGTGC GCGAAGGCGG CAACAGGGTG CCGGCAATGG CGGTCTGGCC GGGCAAGATC AAGCCCGGCA CGAAGAACCA CGACATCGTT GGGGGCCTCG ACTTGATGGC CACCTTCGCC TCGGTCGCGG GCCTCACGCT GCCGGACAAA GACCGCGACG GCCAGCCGAT GATCTTCGAC AGCTACGACA TGTCGCCGGT GCTACTCGGG ACGGGTAAGT CCGCGCGTAA ATCGTGGTTC TACTTCACCG AGGACGAGCT GAGCCCGGGC GCGGTCCGCG TCGGCAACTA CAAGGCGGTG TTCAACCTGC GCGGCGACGA CGGCGCCGCC ACTGGCGCCC TCGCGGTCGA CACCAATCTG GGCTGGAAGG GATCCAGCAA GTACGTCGCG ACGGTTCCGC AGATTTTCGA TCTCTGGCAG GACCCGCAGG AGCGCTACGA CGTCTTCATG AACAACTACA CCGAGCGGAC GTGGACGCTC GTGACAATGA GCGCGGCAGT GAAGAACTTG ATGAAGACGT ACGTGCAGTA CCCACCGCGT AAGCTGCAGA GCGAGGTCTA CACAGGTCCT ATCACGATCT CGCAGTACGA GCGGCTGCAA TCCGTCCGTG ACGCGCTCGC GAAGGAGGGG ATCACCCTTC CGATGCCCAC GGGCCAGTAG
|
Protein sequence | MTHDRAFTPG PSGRSRPRST RAALIGATAL VAATPAVPSF AQAPQQQKPN ILFIVSDDTG YGDLGPYGGG EGRGMPTPNI DRLAEDGMTF FSFYAQPSCT PGRAAMQTGR IPNRSGMTTV AFQGQGGGLP AAEWTLGSVL KQGGYKTYFT GKWHLGEADY ALPNAQGYDV MQYCGLYHLN AYTYADPTWF PDMDPELRAM FQRVTRGALS GKAGEKAVED FKVNGQYVNT PVVDGKAGVV GIPFFDSYVE KAALGFLDDA AKAGSPFYIN VNFMKVHQPN MPAPEFEHKS LSKSKYADSV VELDARIGRI MDKLRSLGLD KNTLVFYTTD NGAWQDVYPD AGYTPFRGTK GTVREGGNRV PAMAVWPGKI KPGTKNHDIV GGLDLMATFA SVAGLTLPDK DRDGQPMIFD SYDMSPVLLG TGKSARKSWF YFTEDELSPG AVRVGNYKAV FNLRGDDGAA TGALAVDTNL GWKGSSKYVA TVPQIFDLWQ DPQERYDVFM NNYTERTWTL VTMSAAVKNL MKTYVQYPPR KLQSEVYTGP ITISQYERLQ SVRDALAKEG ITLPMPTGQ
|
| |