Gene Mvan_4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4467 
Symbol 
ID4649083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4794156 
End bp4796489 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content65% 
IMG OID639807937 
Productsulfatase 
Protein accessionYP_955248 
Protein GI120405419 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCGGG ACATCCTGCC GATTCCGGAT CCCCAGCACG TCGGACTGAC GACATATGAC 
GCCAAGGACC CCGACACCAC CTACCCGCCC ATCACTCCGC TGCGCCCGCC GCAGGGTGCG
CCCAACGTCC TGATCGTCCT GCTCGACGAC GTCGGCTTCG GCGCGAGCTC GGCCTTCGGC
GGACCCTGCG CCACCCCGAC CGCGGAACGC CTGGCCGCGA ACGGGCTCAA GCTCAACAGG
TTCCACACCA CGGCGCTCTG CTCTCCGACG CGTCAGGCGT TGCTCACCGG CCGGAACCAC
CACTCAGTGG GAATGGGTGG CGTCACCGAG ATCGCCACGT CGGCGCCGGG CTACTCCAGC
ATCCGGCCCA AGGACAAGGC GCCGGTCGCC GAAACCCTTC GGCTCAACGG GTACTCGACC
AGTCAGTTCG GCAAGTGTCA CGAGGTGCCG GTCTGGGAGG TGTCGCCCGT CGGGCCGTTC
GGACAGTGGC CGACGGGTTC GGGGTTCGAG CACTTCTACG GGTTCATCGG TGGCGAGGCC
AACCAGTACT ATCCCGGCCT GTACGAGGGC ACCAAACCGG TGGAGCCGGA GAAGACGCCG
GAGCAGGGCT ACACCCTCAC CGAGGACCTG GCCGATCGCG CGATCACCTG GGTGCGTCAG
CAGCAGGCGC TGACACCGGA CAAGCCCTTC TTCATGTACT TCGCCCCCGG CGCCACGCAC
GCTCCGCACC ATGTCCCCAA ACAGTGGTCC GACAAGTACC GCGGCAAGTT CGACGACGGC
TGGGATGTGT TGCGGGAGAG CATGCTCGAC AACCAGAAAG CGCTCGGTGT CGTCCCTGAG
GATGCGCAGT TGACCGCGCG TCACGACGAG ATACCGGCGT GGGACGACAT GCCCGATGTG
CTCAAGCCGG TGCTCGCGCG GCAGATGGAG ATCTATGCCG GATTCCTCGA ACAGACCGAC
CACGAGATCG GCCGGCTGGT CGACGCGATC GACGACCTCG GTGCGCTCGA CAACACGTTG
ATCTACTACA TCATCGGCGA CAACGGGGCC TCGGCCGAGG GCACACCGAA CGGCTGCTTC
AACGAGATGT GCACGCTGAA CGGCCTGGCG GGCATCGAGA CACCGGAGTT TCTGCTGTCG
AAGATCGACG ACTTCGGTAC ACCCGACGCG TACAACCACT ACGCCGTCGG TTGGGCGCAC
GCGCTGTGCG GACCGTATCA ATGGACCAAG CAGGTCGCGT CGCATTGGGG CGGCACCCGA
AACGGAACGA TCGTGCACTG GCCGAACGGG ATTGCCGCCA AGGGAGAAAC CCGGAACCAG
TTCCATCATG TGATCGACGT CGTGCCGACG ATCCTCGAGG CCGCGAAGCT TCCCGCGCCC
ACGGTCGTGA ACAGCATCCA GCAGGCACCA CTGGAGGGTG TCAGCATGAT GTCCACCCTG
CGGGACCGCG ACGCGGACGA GACGCACACC GTGCAGTACT TCGAGATGTT CGGCAACCGC
GGGATCTATC ACAAGGGCTG GACGGCGGTC ACCAAACACC GAACGCCCTG GATCGCCGAC
CAGCCGCCCC TCGACGAGGA CGTCTGGGAG CTCTATGCAC CCGACGACTG GACGCAGGCC
CACGACCTCG CGGCGGAGCA GCCGGAGAGA CTGGCTGCGC TTCAGCGCCT TTGGTTGATC
GAAGCCGTCA AGTACAACGT GGTGCCCCTC GACGACCGGT CCTTCGAACG ATTCAATCCC
GACATCGCCG GCCGGCCGCA GCTGATCAAA GGGACCACCC AGACCCTGTT CTCCGGCATG
AGGCTGCTGG AGAACTGTGT GCTGAACATC AAGAACAGAT CGCATGCGGT GAGCGCGTTG
ATCTCGGTGC CCGACAGCGG CGCGCAGGGC GTGATCGTCA GTCAGGGTGG CGGAGTGGGC
GGTTGGTGCG TGTACGCCCA CGAGAACACG CTGAAGTACT GCTACAACTT CTTCGGCATC
GAGTACTACT TCGTCACCGC TGAACTCCCG CTCCCTGGGG GCCAGCACCT CGTCGGTTTC
GAGTTCGCTT ACGACGGCGG GGGTCTCGGC AAGGGCGGTA CCGTCACGCT CTACTGCGAC
GGAGAGCCAG TCGGCACCGG ACGAGTCGAG CGGACCGAAC CGATGGCATT CTCGGCCGAC
GAGGCCTGCG ATGTCGGTTC GGACACCGGC TCACCGACGT CGCCGGATTA CGGCCCGCAC
GGAAACGGAT TCAACGGCCG GATCGATTGG GTGAAGATCG ACATCAGCAC CGACGATCAT
GAGCACCTCA TCACCCCGCA GGACAGATTC AACATCTCGA TGGCGCGGCA GTAA
 
Protein sequence
MRRDILPIPD PQHVGLTTYD AKDPDTTYPP ITPLRPPQGA PNVLIVLLDD VGFGASSAFG 
GPCATPTAER LAANGLKLNR FHTTALCSPT RQALLTGRNH HSVGMGGVTE IATSAPGYSS
IRPKDKAPVA ETLRLNGYST SQFGKCHEVP VWEVSPVGPF GQWPTGSGFE HFYGFIGGEA
NQYYPGLYEG TKPVEPEKTP EQGYTLTEDL ADRAITWVRQ QQALTPDKPF FMYFAPGATH
APHHVPKQWS DKYRGKFDDG WDVLRESMLD NQKALGVVPE DAQLTARHDE IPAWDDMPDV
LKPVLARQME IYAGFLEQTD HEIGRLVDAI DDLGALDNTL IYYIIGDNGA SAEGTPNGCF
NEMCTLNGLA GIETPEFLLS KIDDFGTPDA YNHYAVGWAH ALCGPYQWTK QVASHWGGTR
NGTIVHWPNG IAAKGETRNQ FHHVIDVVPT ILEAAKLPAP TVVNSIQQAP LEGVSMMSTL
RDRDADETHT VQYFEMFGNR GIYHKGWTAV TKHRTPWIAD QPPLDEDVWE LYAPDDWTQA
HDLAAEQPER LAALQRLWLI EAVKYNVVPL DDRSFERFNP DIAGRPQLIK GTTQTLFSGM
RLLENCVLNI KNRSHAVSAL ISVPDSGAQG VIVSQGGGVG GWCVYAHENT LKYCYNFFGI
EYYFVTAELP LPGGQHLVGF EFAYDGGGLG KGGTVTLYCD GEPVGTGRVE RTEPMAFSAD
EACDVGSDTG SPTSPDYGPH GNGFNGRIDW VKIDISTDDH EHLITPQDRF NISMARQ