Gene Information |
Locus tag | TBFG_10725 |
Symbol | |
ID | 5221393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 810204 |
End bp | 812567 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640605470 |
Product | arylsulfatase atsA (aryl-sulfate sulphohydrolase) |
Protein accession | YP_001286670 |
Protein GI | 148821916 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
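The length fields above follow from the coordinates, assuming Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the values listed). A minimal sketch of that arithmetic, with hypothetical helper names that are not part of the record:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(810204, 812567)   # Start bp / End bp from the record
print(length_bp, protein_length(length_bp))
```

For TBFG_10725 this gives 2364 bp and 787 aa, matching the Gene Length and Protein Length fields.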
Plasmid Coverage information |
Num covering plasmid clones | 222 |
Plasmid unclonability p-value | 0.64546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 187 |
Fosmid unclonability p-value | 0.178722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCCG AGGCCACCGA GGCGTTCAAC GGCACCATCG AGCTGGATAT TCGTGATTCG GAGCCGGATT GGGGCCCATA CGCAGCGCCG GTGGCACCGG AGCACTCACC AAACATCCTG TATCTGGTCT GGGACGACGT CGGCATCGCG ACCTGGGACT GCTTTGGCGG CCTGGTCGAG ATGCCCGCGA TGACGCGCGT CGCCGAGCGT GGCGTGCGAC TGTCGCAATT TCACACCACC GCACTGTGCT CGCCGACCCG GGCGTCGCTG CTGACCGGTC GCAACGCCAC CACCGTAGGC ATGGCTACCA TCGAAGAGTT CACCGACGGG TTCCCCAACT GCAACGGGCG GATCCCGGCT GACACCGCGT TGCTCCCAGA GGTGCTGGCC GAACATGGCT ACAACACCTA CTGTGTGGGC AAGTGGCACC TGACGCCACT CGAAGAATCC AATATGGCGT CGACGAAGCG GCACTGGCCG ACCTCGCGTG GGTTCGAGCG GTTCTACGGA TTCCTAGGCG GGGAGACCGA CCAGTGGTAT CCCGACCTGG TATACGACAA CCACCCAGTG AGTCCTCCCG GCACACCCGA GGGTGGCTAC CACCTGTCAA AAGACATCGC CGACAAGACG ATCGAGTTCA TTCGTGATGC CAAGGTGATC GCGCCCGACA AGCCGTGGTT CAGCTACGTG TGCCCAGGCG CCGGGCATGC GCCGCACCAC GTCTTCAAGG AATGGGCGGA CAGATACGCC GGCCGATTCG ACATGGGGTA TGAGCGCTAT CGCGAGATCG TGCTGGAAAG GCAAAAGGCG CTAGGGATCG TGCCACCCGA CACCGAACTG TCGCCCATAA ACCCTTATCT GGATGTGCCG GGGCCAAACG GCGAGACCTG GCCGCTGCAG GACACGGTGC GGCCGTGGGA CTCGCTGAGC GATGAAGAAA AGAAGCTGTT TTGCCGGATG GCCGAGGTGT TCGCCGGCTT TCTGAGCTAC ACCGACGCCC AGATCGGACG GATCCTGGAC TACCTCGAGG AATCCGGCCA GCTGGACAAC ACCATCATCG TGGTGATCTC CGACAACGGC GCCAGCGGCG AGGGCGGACC CAACGGATCG GTCAACGAAG GCAAGTTCTT CAACGGCTAC ATCGACACCG TCGCTGAAAG CATGAAGCTC TTCGACCACC TCGGTGGCCC GCAGACCTAC AACCACTACC CCATCGGGTG GGCAATGGCC TTCAACACCC CCTACAAGCT GTTCAAGCGC TACGCCTCGC ATGAAGGCGG CATTGCCGAC CCGGCAATCA TCTCCTGGCC CAACGGCATT GCCGCACACG GTGAAATCCG CGACAACTAC GTCAATGTCA GCGACATCAC GCCCACCGTC TACGACCTGT TGGGCATGAC ACCGCCGGGG ACCGTCAAGG GGATTCCGCA GAAACCGATG GACGGCGTGA GCTTCATAGC GGCCCTTGCC GACCCGGCCG CCGACACCGG CAAGACCACC CAGTTCTACA CCATGCTGGG CACCCGCGGG ATCTGGCATG AAGGTTGGTT CGCCAACACC ATTCACGCGG CCACGCCCGC CGGCTGGTCG AATTTCAACG CTGACCGCTG GGAACTGTTC CACATCGCAG CAGACCGCAG CCAGTGCCAC GACCTGGCCG CCGAGCATCC CGACAAACTT GAGGAGCTCA AGGCGCTGTG GTTCTCCGAA GCCGCCAAGT ACAACGGGCT GCCGCTGGCC GATCTGAACC TCCTGGAAAC GATGACTCGG TCGCGGCCTT ACCTGGTCAG CGAACGAGCC 
AGCTACGTCT ACTATCCCGA CTGCGCTGAC GTCGGCATCG GCGCGGCCGT AGAGATTCGC GGGCGCTCGT TCGCCGTGCT GGCCGATGTG ACCATCGATA CCACCGGCGC CGAGGGCGTG CTGTTCAAGC ACGGCGGCGC CCATGGCGGG CACGTGCTGT TCGTCCGGGA CGGACGCTTG CACTACGTCT ACAACTTCCT CGGTGAGCGC CAGCAGCTGG TCAGCTCGTC GGGTCCGGTC CCGTCGGGAA GACATCTACT CGGGGTTCGT TATTTGCGGA CCGGAACCGT GCCCAACAGT CACACGCCGG TGGGCGATCT TGAGCTGTTC TTCGACGAGA ACCTGGTCGG CGCCCTGACC AATGTGCTGA CCCACCCTGG AACGTTCGGG TTGGCCGGCG CCGCTATCAG CGTTGGCCGC AACGGCGGTT CGGCTGTGTC CAGCCACTAC GAAGCGCCGT TCGCGTTCAC CGGCGGTACC ATCACCCAGG TCACCGTCGA CGTGTCAGGC CGACCGTTCG AAGATGTGGA ATCCGATCTT GCGCTTGCTT TTTCGCGTGA CTGA
|
Protein sequence | MAPEATEAFN GTIELDIRDS EPDWGPYAAP VAPEHSPNIL YLVWDDVGIA TWDCFGGLVE MPAMTRVAER GVRLSQFHTT ALCSPTRASL LTGRNATTVG MATIEEFTDG FPNCNGRIPA DTALLPEVLA EHGYNTYCVG KWHLTPLEES NMASTKRHWP TSRGFERFYG FLGGETDQWY PDLVYDNHPV SPPGTPEGGY HLSKDIADKT IEFIRDAKVI APDKPWFSYV CPGAGHAPHH VFKEWADRYA GRFDMGYERY REIVLERQKA LGIVPPDTEL SPINPYLDVP GPNGETWPLQ DTVRPWDSLS DEEKKLFCRM AEVFAGFLSY TDAQIGRILD YLEESGQLDN TIIVVISDNG ASGEGGPNGS VNEGKFFNGY IDTVAESMKL FDHLGGPQTY NHYPIGWAMA FNTPYKLFKR YASHEGGIAD PAIISWPNGI AAHGEIRDNY VNVSDITPTV YDLLGMTPPG TVKGIPQKPM DGVSFIAALA DPAADTGKTT QFYTMLGTRG IWHEGWFANT IHAATPAGWS NFNADRWELF HIAADRSQCH DLAAEHPDKL EELKALWFSE AAKYNGLPLA DLNLLETMTR SRPYLVSERA SYVYYPDCAD VGIGAAVEIR GRSFAVLADV TIDTTGAEGV LFKHGGAHGG HVLFVRDGRL HYVYNFLGER QQLVSSSGPV PSGRHLLGVR YLRTGTVPNS HTPVGDLELF FDENLVGALT NVLTHPGTFG LAGAAISVGR NGGSAVSSHY EAPFAFTGGT ITQVTVDVSG RPFEDVESDL ALAFSRD
|
| |
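The gene and protein sequences above can be cross-checked against each other. The sketch below is one way to do that with the Python standard library only; the function names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons (this gene starts with ATG, so that is not needed here).

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCACCCG AGGCCACCGA GGCGTTCAAC"))
```

The first three blocks of the gene sequence translate to MAPEATEAFN, the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields.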