Gene Information |
Locus tag | TBFG_13328 |
Symbol | |
ID | 5224017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 3694766 |
End bp | 3697678 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640608097 |
Product | arylsulfatase atsB (aryl-sulfate sulphohydrolase) |
Protein accession | YP_001289255 |
Protein GI | 148824501 |
COG category | [P] Inorganic ion transport and metabolism; [S] Function unknown |
COG ID | [COG3119] Arylsulfatase A and related enzymes; [COG4803] Predicted membrane protein |
TIGRFAM ID | |
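The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3694766
end_bp = 3697678
reported_gene_length = 2913      # bp
reported_protein_length = 970    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases):
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length)     # 2913
print(computed_protein_length)  # 970
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.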
Plasmid Coverage information |
Num covering plasmid clones | 137 |
Plasmid unclonability p-value | 0.00105728 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 213 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGATGAGTG AAGACAACGC GCTGGTGCTC GTCGCCGGCT ATCAGGACCT CGATTCGGCT CGTCACGATT TTCAAACCCT CGTCGATGCC GCCAAGGACA AAAGCATTCC GCTGCAGGGT GCGGTGCTGA TCGGCAAGGA CGCCGAGGGC AGTCCGGTTT TGGTCGACAC CGGAAATCGG CTCGGCCGGC GCGGCGCCGC GTGGGGCGCC GGGGTGGGCC TGGCGATCGG CCTGTTCTCG CCGGCACTGT TGGCCTCGGC GGCGCTCGGC GCCGCGACCG GAGCATTGGC CGGCACCTTC GCCCACCATC GGATCAAGAC CGGGCTGGCC GACAAGATCG GCCAAGCGCT GGCGGCGGGC CGAGCAGTGG TCATCGCCGT GACGGAGGCA CAAGGCCGAC TGGAAGCGGG GCAGGCGTTG GCTAGCTCTC CAATGAAATC GGTTGCGGAG CTGAGTCGTT CGACGTTGCG AAGCCTGGGT GCGGCGTTGC GAGAGGCGAT GGGCAAGTTC AACCCAGACC GCACCCGGCT GCCGCTACCG CAGCGCCGCT TTGGTGGCGT GGTTGGCCGC ACCATGGCAG AGTCGGTCGG CGACTGGTCG ATTGTCCCCG GTCCCTTTCC GCCCGACGAC GCACCGAATG TGCTGATCGT GTTGATCGAT GACGCTGGGT TCGGCGGACC GGATACATTC GGCGGCGCGA TCCGAACCCC GACGCTGTCC CGGCTAGCCC AGAATGGGTT GATCTACAAC CGTTTTCATG TGACCGCGGT GTGCTCGCCG ACCCGTGCGG CGCTGTTGAC CGGGCGTAAC CATCACCGGG TGGGCTTCGG GTCGGTCTGC GAGTTCCCCG GCCCGTACCC GGGGTATTCG GCGGTCAGGC CACGCAGTTG CGCAGCGCTG CCGCGTATTC TGCGCGACAA CGGTTATGTG ACTGGCGCTT TCGGCAAGTG GCATCTGACC CCGGACAATG TCCAGGGAGC CGCGGGGCCG TTCGACAACT GGCCGCTGGG TTGGGGATTC GACCATTTCT GGGGCTTCCC GAGCGGCGCC GCGGGTCAGT ACGACCCGAT CATCAGTCAG GACAACTCCG TCATAGGCAT ACCCGAGGGT TCTGGGGAAG ACGGCCGTCC CTACTATTTC CCCGACGACC TCACCGACAA GGCTATCGAG TGGCTGCACA CCGTGCGGGC CCAGAATGCC ACCAAGCCGT GGATGCTGTA CTACGCGACC GGCGCCACCC ACGCGCCACA CCACGTATTC AAGGAATGGG CCGACAAGTA CCGAGGTGAG TTCGATGATG GCTGGGATGT GTACCGGCAG AAGACATTCG AACGGCAAAA GCGACTCGGG ATCATTCCAC CCGACGCCGA ACTCACCGAG CGGCCCGACC TATTCCCCGC GTGGGACAGT ATGTCGGAGG CGCAAAAACG GCTCTTTGCC CGCCAGATGG AGGTGTTCGC CGGGTTCTCG GAAAATGCGG ACTGGAATGT TGGCCGGCTG CTGGACGCGA TCGAGGATCT CGGCGAGTCC GACAACACGT TGGTGTTCTA CATCTGGGGC GACAATGGCG CCAGCATGGA GGGCACCAAC ACCGGTTCGT TCAATGAGAT GACGTTCCTT AACGGCCTGG ATCTGGATGC CGAGCGGCAA TTGGAGCTGA TCGAACAATA CGGCGGCATC GCCGCACTCG GCGACGAGTT CACCGCACCG CATTTCGCCA GCGCGTGGGC GCATGCGAGC AACACCCCGT TGCAGTGGGG CAAGCAGATG GCCAGCCACC TGGGCGGCAC GCGCGATCCA 
TTGGTGGTCG CTTGGCCGGC CCGGATCCGG CCAGACGGCC GTGTTCGTAG CCAGTTCACC CACTGCATCG ACATCGCGCC GACCGTGTTG GCGGCCATCG GTTTACCGGA GCCGACCCAT GTCGACGGCT TCGAGCAGGA ACCGATGGAC GGAACCAGTT TCGTGCGGAC CTTCGACGAC GCTGAAGCCG AAGACCGCCA CACCGTGCAG TACTTCGAAA ACTTCGGCAG CCGTGCCATC TACAAAGACG GCTGGTGGGC GTGCGCTCGC TTGGACAAGG CGCCCTGGGA TCTGTCACCG GAGACGATGC GACGGTTCGC GCCGGGGACC TACGACCCGG ACCAGGACGT CTGGGAGCTG TACTACCTAC CAGATGACTT CTCCCAGGCG AAAAACCTGG CAGCCGAGCA TCCCGACAAG GTCGCCGAGC TCACCCAGCT GTGGTGGCAG GAGGCCGAAC GAAACCGGGT GCTGCCGCTG CTGGGCGGGC TCGCGGTAAT GTTCGGCGAC CTGCCGCCCC TGCCCACCAC CGCACGGTTC AGTTTCAAAG GTGACGTGCA GAACATTCAG CGCGGCATGG TCCCCCGTAT CTGCGGTCGT TCTTACGCGA TCGAGGCACG GCTGCACATC CCCGACGGCG GCGCGCAGGG TGTGATCGTC GCCAACGCCG ACTTCATGGG AGGGTTCGCG CTATGGGTCG ACGAACAGCG GCACCTGCAC CACACCTACT CCTTCCTGGG CGTCGAAACC TACCGGCAGG TGTCCAGCGA GCCGCTCCCC ACCGGGGATG TCACGGTGCG GATGCTGTTC GATTCCCATC AACCCGTCGC CGCCTCCGGT GGTCGGGTGA CGCTCTGGGC CGACGATCGG TTGATCGGAG AGGGTGAGCT GCCCCAGACG GTGCCGCTGG CCTTTACCTC CTATGCCGGC ATGGACATCG GCCGCGACAA CGGCCTGGTC GTTGACCGCG GCTATGAGGA CAAGGCGCCC TATGCGTTCA CCGGGACCGT CACCGAGGTC ATCTTCGACC TCAAGCCCGT ACATCCCGAA GCCGCCAGGG CGTTGCACGA GCACGCATCG GTCCAAGCGG TGGGACAGGG CGCCGCGGGC TGA
|
Protein sequence | MMSEDNALVL VAGYQDLDSA RHDFQTLVDA AKDKSIPLQG AVLIGKDAEG SPVLVDTGNR LGRRGAAWGA GVGLAIGLFS PALLASAALG AATGALAGTF AHHRIKTGLA DKIGQALAAG RAVVIAVTEA QGRLEAGQAL ASSPMKSVAE LSRSTLRSLG AALREAMGKF NPDRTRLPLP QRRFGGVVGR TMAESVGDWS IVPGPFPPDD APNVLIVLID DAGFGGPDTF GGAIRTPTLS RLAQNGLIYN RFHVTAVCSP TRAALLTGRN HHRVGFGSVC EFPGPYPGYS AVRPRSCAAL PRILRDNGYV TGAFGKWHLT PDNVQGAAGP FDNWPLGWGF DHFWGFPSGA AGQYDPIISQ DNSVIGIPEG SGEDGRPYYF PDDLTDKAIE WLHTVRAQNA TKPWMLYYAT GATHAPHHVF KEWADKYRGE FDDGWDVYRQ KTFERQKRLG IIPPDAELTE RPDLFPAWDS MSEAQKRLFA RQMEVFAGFS ENADWNVGRL LDAIEDLGES DNTLVFYIWG DNGASMEGTN TGSFNEMTFL NGLDLDAERQ LELIEQYGGI AALGDEFTAP HFASAWAHAS NTPLQWGKQM ASHLGGTRDP LVVAWPARIR PDGRVRSQFT HCIDIAPTVL AAIGLPEPTH VDGFEQEPMD GTSFVRTFDD AEAEDRHTVQ YFENFGSRAI YKDGWWACAR LDKAPWDLSP ETMRRFAPGT YDPDQDVWEL YYLPDDFSQA KNLAAEHPDK VAELTQLWWQ EAERNRVLPL LGGLAVMFGD LPPLPTTARF SFKGDVQNIQ RGMVPRICGR SYAIEARLHI PDGGAQGVIV ANADFMGGFA LWVDEQRHLH HTYSFLGVET YRQVSSEPLP TGDVTVRMLF DSHQPVAASG GRVTLWADDR LIGEGELPQT VPLAFTSYAG MDIGRDNGLV VDRGYEDKAP YAFTGTVTEV IFDLKPVHPE AARALHEHAS VQAVGQGAAG
|
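The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the first position, although it encodes valine elsewhere. The following is a minimal sketch of that behaviour, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, and it simply assumes the first codon is a valid start codon without checking it.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for internal codons and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding-strand CDS, reading the first codon as Met.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator, so it is read as M.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_block = "GTGATGAGTG AAGACAACGC GCTGGTGCTC"
print(translate_cds(first_block))  # MMSEDNALVL
```

The output matches the first ten residues of the protein sequence listed in the record.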