Gene Mvan_1317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1317 
Symbol 
ID4643775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1406153 
End bp1408507 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content65% 
IMG OID639804817 
Productsulfatase 
Protein accessionYP_952157 
Protein GI120402328 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.242414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACGA CGGAGTTCAA CGGCAAGATC GCGGTAGACA TCCGGGATTC CGAACCGGAC 
TGGGGGCCGT TCGCTGCCCC GACTGCACAA CCGGACGCGC CCAACGTCCT CTATCTGGTC
TGGGACGACA TCGGCATCGC CACCTGGGAC TGTTTCGGCG GGTTGGTCAA CATGCCCGCA
ATGAGCCGCA TTGCCGAACG AGGAGTGCGG CTTTCGCAGT TTCACACCAC CGCGTTGTGC
TCACCGACCA GGGCGTCGCT GCTGACCGGC CGAAACGCCA CCACGGTCGG AATGGCAACC
ATCGAGGAGT TCACCGACGG CTTCCCGAAT TGCAGCGGCC GAATCCCGTT CGACACCGCC
CTGATCTCCG AGGTGCTGGC CGAGAACGGT TACAACACTT ACTGCGTCGG CAAGTGGCAT
CTGACCCCGC TGGAGGAGTC CAACCTCGCG GCGACGAAGC GGCACTGGCC GCTGTCGCGT
GGCTTCGAGC GGTTCTACGG ATTCATGGGC GGCGAAACCG ATCAGTGGTA TCCCGACCTG
GTCTACGACA ACCATCCCGT CCCGCCGCCT GCCGGCCCGG AAGAGGGCTA CCACCTGTCG
AAGGATCTCG CGGACAAGAC CATCGAGTTC ATCCGCGACT CGAAGGTCAT CGCGCCCGAC
AAACCGTGGT TCTCCTACGT GTGTCCCGGC GCGGGGCACG CGCCCCATCA CGTCTTCAAG
GAGTGGGCGG ACCGCTACTC CGGGGTCTTC GACATGGGCT ATGAGCGCTA CCGCGAGATC
GTGCTGGAGA ACCAGAAGCG TCTCGGCATC GTCCCGCCCG AGACCGAACT CTCGCCGGTG
AACCCGTATC TGGACGTCAA GGGCCCCGAC GGGCAGGAGT GGCCGGCGCA GGACACGGTG
CGGCCGTGGG ATTCGTTGAG CGAGGAGGAG AAGCGCCTTT TCGCCAGGAT GGCCGAGGTG
TTCGCCGGGT TCCTCTCCTA CACCGATGCC CAGATCGGGC GTGTCCTGGA CTATCTCGAC
GAATCGGGCC AACTCGACAA CACCATCATC GTGGTGATCT CGGACAACGG CGCCAGCGGG
GAGGGTGGGC CCAACGGTTC GGTGAACGAG GTCAAGTTCT TCAACGGCTA CATCGACTCG
GTCGAAGAGA GCCTGAAGGC CTTCGACGAG CTCGGTGGCA CCCAGACCTA CAACCATTAC
CCGATCGGCT GGGCGATGGC GTTCAACACC CCGTACAAGT TGTTCAAGCG CTACGCGTCT
CATGAAGGCG GCATCGCCGA CACCGCAATC ATCTCGTGGC CCAACGGCAT TGCCGCCCAC
GGTGAGGTGC GCGACAACTA CGTCAACGTC TGCGACATCA CCCCGACGGT GTTCGACCTG
CTTGACATCA CGCCTCCCGC CACGGTGCGC GGCGTGGCGC AGAAGCCGAT GGACGGTGTC
AGTTTCAAGG TGGCCCTGGA CAATCCGACG GCGCCGACCG GCAAAGAGAC CCAGTTCTAC
ACCATGCTCG GTACCCGGGG GATCTGGCAC AAGGGGTGGT TCGCCAGTGC CGTGCACGCG
GCCTCACCCT CCGGATGGTC GCATTTCGAC GACGACCGCT GGGAGCTGTT CCACATCGAG
GCCGACCGCA GCCAGTGCCA CGATCTGGCC GCCGAACACC CGGACAAGGT CGAGGAGCTC
AAGGCGCTGT GGTTCGCCGA GGCGGCGAAG TACAACGGTC TTCCGTTGGG GGACCTCGAC
ATCCTGGAGA CCATCACGCG GTGGCGGCCC TACCTGACCG GGGAACGTAA TTCGTACGCC
TACTACCCGG GTACGGCCGA CGTCGGTATG GGTGCCGTCG TGGAGCTGCG TGGCCGCTCC
TTCGCGGTGC TCGCCGAGGT CGCCGTCGAC CCGGACGGCG CCGACGGTGT AGTGGTCAAA
CACGGTGGGG CACACGGTGG ATACGTGATG TACGTGCAGG GCGGGCGGCT GCACTTCTGC
TACAACTTCC TCGGCGAGTA CGAGCAGACG CTGGCCTCAG CGGACCCGGT GAGCGCCGGT
CTGCACACGC TCGGGTTCAC GTTCACCCTC ACCGGAACCG CCGAGGGCAG CCACACCCCG
GTAGGTGACG CCGCGCTGTT CATCGACAGC GCCCAGGTGG CGTCACTGGC CGAAATGCGG
GTCCACCCGG GTACTTTCGG ACTCGCGGGG GCGACCTTGA GCGTGGGCCG CAACAGTGGG
TCGCCGGTGT CGCAGGCCTA TCAGGCGCCG TACCCGTTCA CCGGGGGAAC CATCGCCCGG
GTCAACATCG ACGTCTCCGG CGCCCCGTAT CTCGATCTGG AGCGCGAGTT CGCGCGGGCG
TTCGCCAGGG ACTGA
 
Protein sequence
MATTEFNGKI AVDIRDSEPD WGPFAAPTAQ PDAPNVLYLV WDDIGIATWD CFGGLVNMPA 
MSRIAERGVR LSQFHTTALC SPTRASLLTG RNATTVGMAT IEEFTDGFPN CSGRIPFDTA
LISEVLAENG YNTYCVGKWH LTPLEESNLA ATKRHWPLSR GFERFYGFMG GETDQWYPDL
VYDNHPVPPP AGPEEGYHLS KDLADKTIEF IRDSKVIAPD KPWFSYVCPG AGHAPHHVFK
EWADRYSGVF DMGYERYREI VLENQKRLGI VPPETELSPV NPYLDVKGPD GQEWPAQDTV
RPWDSLSEEE KRLFARMAEV FAGFLSYTDA QIGRVLDYLD ESGQLDNTII VVISDNGASG
EGGPNGSVNE VKFFNGYIDS VEESLKAFDE LGGTQTYNHY PIGWAMAFNT PYKLFKRYAS
HEGGIADTAI ISWPNGIAAH GEVRDNYVNV CDITPTVFDL LDITPPATVR GVAQKPMDGV
SFKVALDNPT APTGKETQFY TMLGTRGIWH KGWFASAVHA ASPSGWSHFD DDRWELFHIE
ADRSQCHDLA AEHPDKVEEL KALWFAEAAK YNGLPLGDLD ILETITRWRP YLTGERNSYA
YYPGTADVGM GAVVELRGRS FAVLAEVAVD PDGADGVVVK HGGAHGGYVM YVQGGRLHFC
YNFLGEYEQT LASADPVSAG LHTLGFTFTL TGTAEGSHTP VGDAALFIDS AQVASLAEMR
VHPGTFGLAG ATLSVGRNSG SPVSQAYQAP YPFTGGTIAR VNIDVSGAPY LDLEREFARA
FARD