Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4467 |
Symbol | |
ID | 4649083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4794156 |
End bp | 4796489 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639807937 |
Product | sulfatase |
Protein accession | YP_955248 |
Protein GI | 120405419 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCGGG ACATCCTGCC GATTCCGGAT CCCCAGCACG TCGGACTGAC GACATATGAC GCCAAGGACC CCGACACCAC CTACCCGCCC ATCACTCCGC TGCGCCCGCC GCAGGGTGCG CCCAACGTCC TGATCGTCCT GCTCGACGAC GTCGGCTTCG GCGCGAGCTC GGCCTTCGGC GGACCCTGCG CCACCCCGAC CGCGGAACGC CTGGCCGCGA ACGGGCTCAA GCTCAACAGG TTCCACACCA CGGCGCTCTG CTCTCCGACG CGTCAGGCGT TGCTCACCGG CCGGAACCAC CACTCAGTGG GAATGGGTGG CGTCACCGAG ATCGCCACGT CGGCGCCGGG CTACTCCAGC ATCCGGCCCA AGGACAAGGC GCCGGTCGCC GAAACCCTTC GGCTCAACGG GTACTCGACC AGTCAGTTCG GCAAGTGTCA CGAGGTGCCG GTCTGGGAGG TGTCGCCCGT CGGGCCGTTC GGACAGTGGC CGACGGGTTC GGGGTTCGAG CACTTCTACG GGTTCATCGG TGGCGAGGCC AACCAGTACT ATCCCGGCCT GTACGAGGGC ACCAAACCGG TGGAGCCGGA GAAGACGCCG GAGCAGGGCT ACACCCTCAC CGAGGACCTG GCCGATCGCG CGATCACCTG GGTGCGTCAG CAGCAGGCGC TGACACCGGA CAAGCCCTTC TTCATGTACT TCGCCCCCGG CGCCACGCAC GCTCCGCACC ATGTCCCCAA ACAGTGGTCC GACAAGTACC GCGGCAAGTT CGACGACGGC TGGGATGTGT TGCGGGAGAG CATGCTCGAC AACCAGAAAG CGCTCGGTGT CGTCCCTGAG GATGCGCAGT TGACCGCGCG TCACGACGAG ATACCGGCGT GGGACGACAT GCCCGATGTG CTCAAGCCGG TGCTCGCGCG GCAGATGGAG ATCTATGCCG GATTCCTCGA ACAGACCGAC CACGAGATCG GCCGGCTGGT CGACGCGATC GACGACCTCG GTGCGCTCGA CAACACGTTG ATCTACTACA TCATCGGCGA CAACGGGGCC TCGGCCGAGG GCACACCGAA CGGCTGCTTC AACGAGATGT GCACGCTGAA CGGCCTGGCG GGCATCGAGA CACCGGAGTT TCTGCTGTCG AAGATCGACG ACTTCGGTAC ACCCGACGCG TACAACCACT ACGCCGTCGG TTGGGCGCAC GCGCTGTGCG GACCGTATCA ATGGACCAAG CAGGTCGCGT CGCATTGGGG CGGCACCCGA AACGGAACGA TCGTGCACTG GCCGAACGGG ATTGCCGCCA AGGGAGAAAC CCGGAACCAG TTCCATCATG TGATCGACGT CGTGCCGACG ATCCTCGAGG CCGCGAAGCT TCCCGCGCCC ACGGTCGTGA ACAGCATCCA GCAGGCACCA CTGGAGGGTG TCAGCATGAT GTCCACCCTG CGGGACCGCG ACGCGGACGA GACGCACACC GTGCAGTACT TCGAGATGTT CGGCAACCGC GGGATCTATC ACAAGGGCTG GACGGCGGTC ACCAAACACC GAACGCCCTG GATCGCCGAC CAGCCGCCCC TCGACGAGGA CGTCTGGGAG CTCTATGCAC CCGACGACTG GACGCAGGCC CACGACCTCG CGGCGGAGCA GCCGGAGAGA CTGGCTGCGC TTCAGCGCCT TTGGTTGATC GAAGCCGTCA AGTACAACGT GGTGCCCCTC GACGACCGGT CCTTCGAACG ATTCAATCCC GACATCGCCG GCCGGCCGCA GCTGATCAAA GGGACCACCC AGACCCTGTT CTCCGGCATG AGGCTGCTGG AGAACTGTGT GCTGAACATC AAGAACAGAT CGCATGCGGT GAGCGCGTTG ATCTCGGTGC CCGACAGCGG CGCGCAGGGC GTGATCGTCA GTCAGGGTGG CGGAGTGGGC GGTTGGTGCG TGTACGCCCA CGAGAACACG CTGAAGTACT GCTACAACTT CTTCGGCATC GAGTACTACT TCGTCACCGC TGAACTCCCG CTCCCTGGGG GCCAGCACCT CGTCGGTTTC GAGTTCGCTT ACGACGGCGG GGGTCTCGGC AAGGGCGGTA CCGTCACGCT CTACTGCGAC GGAGAGCCAG TCGGCACCGG ACGAGTCGAG CGGACCGAAC CGATGGCATT CTCGGCCGAC GAGGCCTGCG ATGTCGGTTC GGACACCGGC TCACCGACGT CGCCGGATTA CGGCCCGCAC GGAAACGGAT TCAACGGCCG GATCGATTGG GTGAAGATCG ACATCAGCAC CGACGATCAT GAGCACCTCA TCACCCCGCA GGACAGATTC AACATCTCGA TGGCGCGGCA GTAA
|
Protein sequence | MRRDILPIPD PQHVGLTTYD AKDPDTTYPP ITPLRPPQGA PNVLIVLLDD VGFGASSAFG GPCATPTAER LAANGLKLNR FHTTALCSPT RQALLTGRNH HSVGMGGVTE IATSAPGYSS IRPKDKAPVA ETLRLNGYST SQFGKCHEVP VWEVSPVGPF GQWPTGSGFE HFYGFIGGEA NQYYPGLYEG TKPVEPEKTP EQGYTLTEDL ADRAITWVRQ QQALTPDKPF FMYFAPGATH APHHVPKQWS DKYRGKFDDG WDVLRESMLD NQKALGVVPE DAQLTARHDE IPAWDDMPDV LKPVLARQME IYAGFLEQTD HEIGRLVDAI DDLGALDNTL IYYIIGDNGA SAEGTPNGCF NEMCTLNGLA GIETPEFLLS KIDDFGTPDA YNHYAVGWAH ALCGPYQWTK QVASHWGGTR NGTIVHWPNG IAAKGETRNQ FHHVIDVVPT ILEAAKLPAP TVVNSIQQAP LEGVSMMSTL RDRDADETHT VQYFEMFGNR GIYHKGWTAV TKHRTPWIAD QPPLDEDVWE LYAPDDWTQA HDLAAEQPER LAALQRLWLI EAVKYNVVPL DDRSFERFNP DIAGRPQLIK GTTQTLFSGM RLLENCVLNI KNRSHAVSAL ISVPDSGAQG VIVSQGGGVG GWCVYAHENT LKYCYNFFGI EYYFVTAELP LPGGQHLVGF EFAYDGGGLG KGGTVTLYCD GEPVGTGRVE RTEPMAFSAD EACDVGSDTG SPTSPDYGPH GNGFNGRIDW VKIDISTDDH EHLITPQDRF NISMARQ
|
| |