Gene Mvan_4274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4274 
Symbol 
ID4648324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4583446 
End bp4585269 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content70% 
IMG OID639807741 
Productsulfatase 
Protein accessionYP_955057 
Protein GI120405228 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.374649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG AACGGCCCGA CATCGTCGTC GTCATGACCG ACGAGGAGCG CGCGACACCT 
CCCTACGAAC CCGACACCGT TCGCGCATGG CGCAGTCGGA CACTGGGCGG CAGAAGGTGG
TTCGACGAGA ACGGCGTCAG CTTCCTACGC CACTACACAG GCTCACTGGC CTGCGTGCCG
AGTCGCCCGA CCATCTTCAC CGGCCAGTAC CCCGACCTGC ACGGGGTGAC CCAGACCGAC
GGCATCGGCA AGGCACACGA CGATTCCCGG CTGCGCTGGC TGCGCCGCGG CGAGGTGCCC
ACCCTGGGCA ACTGGTTGCG CGCCGCCGGC TACGACACCC ACTACGACGG CAAGTGGCAC
ATCTCGCACG CCGATCTCAT CGACCCCGGC ACCGGCCGGT CGCTCGACAC CAACGACGAC
GACGGCGTCG TCGACCCGGC CGCGGTGCAC AGGTATCTGG AGGCGGATCC GCTGTCCCCA
TACGGCTTTT CGGGTTGGGT CGGCCCGGAA CCGCACGGCG CCAAGCTTTC CAACGCCGGG
ATCCGCCGTG ACCCGCTGAT CGCCGACCGC GTGGTCGCCT GGCTGAAAGA CCGCTACGCG
CGGCGACGCG CGGGTGACCC CGACGCGATG CGGCCGTTCC TGCTGGTGGC CAGCTTCGTC
AATCCCCATG ACATCGTGCT GTTTCCGGCC TGGGCGCGCC GCAACCCGCT GCCCGCCTCA
CCGCTGGACC CTCCCCCGGT TCCTGCCGCA CCGACCGCGG ACGAGGACCT GTCCACCAAA
CCCGCCGCGC AGATCGCGTT CCGCGAGGCG TACTACTCCG GATACGGCCC GGCATGGTCG
ATCGAGCGGA CCTACCGGCG CAACGCCCAG CGCTACCGCG ACCTGTACTA CCGACTGCAC
GCCGAGGTCG ACACCCCCAT CGACCGGGTG CGCCGCGCGG TCACCGAGGG TGGATCCGGC
GACGGACCTG ACGACACCGT GCTGGTGCGC ACCGCTGACC ACGGCGACCT GCTCGGCGCC
CACGGCGGGT TGCACCAGAA ATGGTTCAAC CTCTACGACG AGGCCACCCG GGTCCCGTTC
GTCATCGCCA GGGTGGGCGC CCGTCCGACC ACGGCGCGCA CCGTGACGGC GCCGACGTCG
CACGTCGACC TGGTGCCGAC CCTGCTGTCG GCGGCGGGCG TCGACGTCGA CGCGGCCGCA
ACCGTGCTGG CAGAGTCGTT CTCGGAGGTG CATCCGCTGC CGGGAAGCGA CCTGATGCCC
GTGGTCGACG GCGCGCCGGC CGACGATCAT CGGTGCGTGT ACCTGATGAC GCGGGACAAC
GTCCTCGAAG GCGACACCGG TGCGTCCGGG TTGGCCCGCG CATTGAAGCT GACCTCGAAA
GTGCCTGCAC CCCTGCGCAT CCGGATTCCG GCCCACACGG CCGCCAACTT CGAGGGTCTG
GTGATCCGGG TCGACGAAGA TGCCGCGCCG GGCGGACGCG GACACCTGTG GAAACTGGTG
CGTTCGTTCG ACGATCCCGG CACCTGGACA GAACCGGGTG TGCGCCATCT GGCGGCCGAC
GGCATCGGCG GACCCATGTA CCGCACCGAT CCGCTCGACG ATCAGTGGGA GCTCTACGAC
CTCACCGATG ATCCCGTGGA GCAGCACAAC CGCTGGACCG ATCCCGACCT GCACGAACTG
CGCGCGTATC TACGGGCCCA GCTCAAATCC GTGCGGGCGG AATCCATCCC GGAGCGCAAC
CGCCCGTGGC CCTACGTCCG GCGTCAGCCC CCGCAGCCGG CACGATGGTC ACCGGGGCGC
GCACTGCGCC GGCTGCGGGA CTGA
 
Protein sequence
MTTERPDIVV VMTDEERATP PYEPDTVRAW RSRTLGGRRW FDENGVSFLR HYTGSLACVP 
SRPTIFTGQY PDLHGVTQTD GIGKAHDDSR LRWLRRGEVP TLGNWLRAAG YDTHYDGKWH
ISHADLIDPG TGRSLDTNDD DGVVDPAAVH RYLEADPLSP YGFSGWVGPE PHGAKLSNAG
IRRDPLIADR VVAWLKDRYA RRRAGDPDAM RPFLLVASFV NPHDIVLFPA WARRNPLPAS
PLDPPPVPAA PTADEDLSTK PAAQIAFREA YYSGYGPAWS IERTYRRNAQ RYRDLYYRLH
AEVDTPIDRV RRAVTEGGSG DGPDDTVLVR TADHGDLLGA HGGLHQKWFN LYDEATRVPF
VIARVGARPT TARTVTAPTS HVDLVPTLLS AAGVDVDAAA TVLAESFSEV HPLPGSDLMP
VVDGAPADDH RCVYLMTRDN VLEGDTGASG LARALKLTSK VPAPLRIRIP AHTAANFEGL
VIRVDEDAAP GGRGHLWKLV RSFDDPGTWT EPGVRHLAAD GIGGPMYRTD PLDDQWELYD
LTDDPVEQHN RWTDPDLHEL RAYLRAQLKS VRAESIPERN RPWPYVRRQP PQPARWSPGR
ALRRLRD