Gene TBFG_13328 details


Gene Information

Locus tag: TBFG_13328
Symbol:
ID: 5224017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 3694766
End bp: 3697678
Gene Length: 2913 bp
Protein Length: 970 aa
Translation table: 11
GC content: 65%
IMG OID: 640608097
Product: arylsulfatase atsB (aryl-sulfate sulphohydrolase)
Protein accession: YP_001289255
Protein GI: 148824501
COG category: [P] Inorganic ion transport and metabolism; [S] Function unknown
COG ID: [COG3119] Arylsulfatase A and related enzymes; [COG4803] Predicted membrane protein
TIGRFAM ID:
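
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3694766, 3697678)
print(length)                     # 2913
print(protein_length_aa(length))  # 970
```

Both results match the Gene Length and Protein Length fields reported for TBFG_13328.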


Plasmid Coverage information

Num covering plasmid clones: 137
Plasmid unclonability p-value: 0.00105728
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 213
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGAGTG AAGACAACGC GCTGGTGCTC GTCGCCGGCT ATCAGGACCT CGATTCGGCT 
CGTCACGATT TTCAAACCCT CGTCGATGCC GCCAAGGACA AAAGCATTCC GCTGCAGGGT
GCGGTGCTGA TCGGCAAGGA CGCCGAGGGC AGTCCGGTTT TGGTCGACAC CGGAAATCGG
CTCGGCCGGC GCGGCGCCGC GTGGGGCGCC GGGGTGGGCC TGGCGATCGG CCTGTTCTCG
CCGGCACTGT TGGCCTCGGC GGCGCTCGGC GCCGCGACCG GAGCATTGGC CGGCACCTTC
GCCCACCATC GGATCAAGAC CGGGCTGGCC GACAAGATCG GCCAAGCGCT GGCGGCGGGC
CGAGCAGTGG TCATCGCCGT GACGGAGGCA CAAGGCCGAC TGGAAGCGGG GCAGGCGTTG
GCTAGCTCTC CAATGAAATC GGTTGCGGAG CTGAGTCGTT CGACGTTGCG AAGCCTGGGT
GCGGCGTTGC GAGAGGCGAT GGGCAAGTTC AACCCAGACC GCACCCGGCT GCCGCTACCG
CAGCGCCGCT TTGGTGGCGT GGTTGGCCGC ACCATGGCAG AGTCGGTCGG CGACTGGTCG
ATTGTCCCCG GTCCCTTTCC GCCCGACGAC GCACCGAATG TGCTGATCGT GTTGATCGAT
GACGCTGGGT TCGGCGGACC GGATACATTC GGCGGCGCGA TCCGAACCCC GACGCTGTCC
CGGCTAGCCC AGAATGGGTT GATCTACAAC CGTTTTCATG TGACCGCGGT GTGCTCGCCG
ACCCGTGCGG CGCTGTTGAC CGGGCGTAAC CATCACCGGG TGGGCTTCGG GTCGGTCTGC
GAGTTCCCCG GCCCGTACCC GGGGTATTCG GCGGTCAGGC CACGCAGTTG CGCAGCGCTG
CCGCGTATTC TGCGCGACAA CGGTTATGTG ACTGGCGCTT TCGGCAAGTG GCATCTGACC
CCGGACAATG TCCAGGGAGC CGCGGGGCCG TTCGACAACT GGCCGCTGGG TTGGGGATTC
GACCATTTCT GGGGCTTCCC GAGCGGCGCC GCGGGTCAGT ACGACCCGAT CATCAGTCAG
GACAACTCCG TCATAGGCAT ACCCGAGGGT TCTGGGGAAG ACGGCCGTCC CTACTATTTC
CCCGACGACC TCACCGACAA GGCTATCGAG TGGCTGCACA CCGTGCGGGC CCAGAATGCC
ACCAAGCCGT GGATGCTGTA CTACGCGACC GGCGCCACCC ACGCGCCACA CCACGTATTC
AAGGAATGGG CCGACAAGTA CCGAGGTGAG TTCGATGATG GCTGGGATGT GTACCGGCAG
AAGACATTCG AACGGCAAAA GCGACTCGGG ATCATTCCAC CCGACGCCGA ACTCACCGAG
CGGCCCGACC TATTCCCCGC GTGGGACAGT ATGTCGGAGG CGCAAAAACG GCTCTTTGCC
CGCCAGATGG AGGTGTTCGC CGGGTTCTCG GAAAATGCGG ACTGGAATGT TGGCCGGCTG
CTGGACGCGA TCGAGGATCT CGGCGAGTCC GACAACACGT TGGTGTTCTA CATCTGGGGC
GACAATGGCG CCAGCATGGA GGGCACCAAC ACCGGTTCGT TCAATGAGAT GACGTTCCTT
AACGGCCTGG ATCTGGATGC CGAGCGGCAA TTGGAGCTGA TCGAACAATA CGGCGGCATC
GCCGCACTCG GCGACGAGTT CACCGCACCG CATTTCGCCA GCGCGTGGGC GCATGCGAGC
AACACCCCGT TGCAGTGGGG CAAGCAGATG GCCAGCCACC TGGGCGGCAC GCGCGATCCA
TTGGTGGTCG CTTGGCCGGC CCGGATCCGG CCAGACGGCC GTGTTCGTAG CCAGTTCACC
CACTGCATCG ACATCGCGCC GACCGTGTTG GCGGCCATCG GTTTACCGGA GCCGACCCAT
GTCGACGGCT TCGAGCAGGA ACCGATGGAC GGAACCAGTT TCGTGCGGAC CTTCGACGAC
GCTGAAGCCG AAGACCGCCA CACCGTGCAG TACTTCGAAA ACTTCGGCAG CCGTGCCATC
TACAAAGACG GCTGGTGGGC GTGCGCTCGC TTGGACAAGG CGCCCTGGGA TCTGTCACCG
GAGACGATGC GACGGTTCGC GCCGGGGACC TACGACCCGG ACCAGGACGT CTGGGAGCTG
TACTACCTAC CAGATGACTT CTCCCAGGCG AAAAACCTGG CAGCCGAGCA TCCCGACAAG
GTCGCCGAGC TCACCCAGCT GTGGTGGCAG GAGGCCGAAC GAAACCGGGT GCTGCCGCTG
CTGGGCGGGC TCGCGGTAAT GTTCGGCGAC CTGCCGCCCC TGCCCACCAC CGCACGGTTC
AGTTTCAAAG GTGACGTGCA GAACATTCAG CGCGGCATGG TCCCCCGTAT CTGCGGTCGT
TCTTACGCGA TCGAGGCACG GCTGCACATC CCCGACGGCG GCGCGCAGGG TGTGATCGTC
GCCAACGCCG ACTTCATGGG AGGGTTCGCG CTATGGGTCG ACGAACAGCG GCACCTGCAC
CACACCTACT CCTTCCTGGG CGTCGAAACC TACCGGCAGG TGTCCAGCGA GCCGCTCCCC
ACCGGGGATG TCACGGTGCG GATGCTGTTC GATTCCCATC AACCCGTCGC CGCCTCCGGT
GGTCGGGTGA CGCTCTGGGC CGACGATCGG TTGATCGGAG AGGGTGAGCT GCCCCAGACG
GTGCCGCTGG CCTTTACCTC CTATGCCGGC ATGGACATCG GCCGCGACAA CGGCCTGGTC
GTTGACCGCG GCTATGAGGA CAAGGCGCCC TATGCGTTCA CCGGGACCGT CACCGAGGTC
ATCTTCGACC TCAAGCCCGT ACATCCCGAA GCCGCCAGGG CGTTGCACGA GCACGCATCG
GTCCAAGCGG TGGGACAGGG CGCCGCGGGC TGA
 
Protein sequence
MMSEDNALVL VAGYQDLDSA RHDFQTLVDA AKDKSIPLQG AVLIGKDAEG SPVLVDTGNR 
LGRRGAAWGA GVGLAIGLFS PALLASAALG AATGALAGTF AHHRIKTGLA DKIGQALAAG
RAVVIAVTEA QGRLEAGQAL ASSPMKSVAE LSRSTLRSLG AALREAMGKF NPDRTRLPLP
QRRFGGVVGR TMAESVGDWS IVPGPFPPDD APNVLIVLID DAGFGGPDTF GGAIRTPTLS
RLAQNGLIYN RFHVTAVCSP TRAALLTGRN HHRVGFGSVC EFPGPYPGYS AVRPRSCAAL
PRILRDNGYV TGAFGKWHLT PDNVQGAAGP FDNWPLGWGF DHFWGFPSGA AGQYDPIISQ
DNSVIGIPEG SGEDGRPYYF PDDLTDKAIE WLHTVRAQNA TKPWMLYYAT GATHAPHHVF
KEWADKYRGE FDDGWDVYRQ KTFERQKRLG IIPPDAELTE RPDLFPAWDS MSEAQKRLFA
RQMEVFAGFS ENADWNVGRL LDAIEDLGES DNTLVFYIWG DNGASMEGTN TGSFNEMTFL
NGLDLDAERQ LELIEQYGGI AALGDEFTAP HFASAWAHAS NTPLQWGKQM ASHLGGTRDP
LVVAWPARIR PDGRVRSQFT HCIDIAPTVL AAIGLPEPTH VDGFEQEPMD GTSFVRTFDD
AEAEDRHTVQ YFENFGSRAI YKDGWWACAR LDKAPWDLSP ETMRRFAPGT YDPDQDVWEL
YYLPDDFSQA KNLAAEHPDK VAELTQLWWQ EAERNRVLPL LGGLAVMFGD LPPLPTTARF
SFKGDVQNIQ RGMVPRICGR SYAIEARLHI PDGGAQGVIV ANADFMGGFA LWVDEQRHLH
HTYSFLGVET YRQVSSEPLP TGDVTVRMLF DSHQPVAASG GRVTLWADDR LIGEGELPQT
VPLAFTSYAG MDIGRDNGLV VDRGYEDKAP YAFTGTVTEV IFDLKPVHPE AARALHEHAS
VQAVGQGAAG
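
The sequences above are printed in blocks of ten characters, so they need whitespace stripped before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation reproduced under translation table 11. It assumes the standard codon assignments (which table 11 shares with table 1) and treats any table 11 start codon in the first position as methionine. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Alternative initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above; the trailing "GC" is ignored.
print(translate("GTGATGAGTG AAGACAACGC"))  # MMSEDN
```

The six residues printed agree with the start of the protein sequence listed above (MMSEDN), including the leading GTG start codon being read as methionine.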