Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_10676 |
Symbol | |
ID | 5221344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 759898 |
End bp | 762261 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640605421 |
Product | arylsulfatase atsD (aryl-sulfate sulphohydrolase) |
Protein accession | YP_001286621 |
Protein GI | 148821867 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 254 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 206 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAGC CAAGAACGCA TCTGCCGATT CCCAGTGCTG CTCGCACCGG GCTGATCACG TATGACGCGA AGGATCCCGA CAGCACCTAT CCGCCGATCG AGCAGCTGCG CCCACCGGCG GGTGCCCCGA ATGTGTTGCT GATCCTGCTT GACGATGTCG GGTTCGGTGC GTCGAGCGCG TTCGGAGGCC CATGCAGGAC GTCGACGGCG GAACTGCTTG CCGGTAACGG GTTGCGGTAC AACCGGTTTC ACACCACCGC GCTGTGCTCG CCGACGCGTC AGGCGTTGTT AACTGGACGC AACCATCACT CCGCCGGCAT GGGCGGTATC ACCGAAATCG CCACCGGTGC ACCGGGATAC AGCTCAGTAC TACCGAACAC CATGTCGCCG ATCGCGCGGA CGCTAAAGCT CAACGGCTAC AACACCGCCC AGTTCGGCAA GTGCCACGAA GTCCCGGTCT GGCAGACCAG CCCGGTCGGG CCGTTCGACG CGTGGCCCAG CGGCGGCGGT GGTTTCGAAT ACTTCTACGG GTTTATCGGT GGCGAGGCTA ACCAGTGGTA TCCGAGTCTG TACGAGGGCA CCACGCCGGT CGAGGTGAAC CGCACGCCCG AGGAGGGTTA CCATTTCATG GCGGACATGA CCGACAAGGC CCTCGGCTGG ATCGGACAGC AGAAGGCACT GGCCCCCGAC CGGCCGTTCT TCGTGTACTT CGCCCCGGGC GCCACCCACG CGCCCCACCA CGTTCCGCGG GAGTGGGCCG ACAAGTACCG GGGCCGCTTC GATGTGGGCT GGGACGCACT GCGAGAGGAA ACCTTCGCCC GGCAAAAGGA ACTCGGGGTG ATCCCGGCGG ACTGCCAGCT GACCGCGCGG CACGCCGAAA TCCCGGCGTG GGACGACATG CCGGAGGACC TCAAACCCGT GCTATGCCGG CAGATGGAGG TCTACGCGGG CTTTCTGGAA TACACCGACC ACCACGTCGG CCGGCTCGTC GACGGCCTGC AGCGCCTCGG TGTGCTCGAC GACACGCTGG TGTTCTACAT CATCGACGAC AACGGCGCCT CGGCCGAGGG CACGATCAAC GGCACCTACA ACGAGATGTT GAACTTCAAC GGCCTGGCCG ACATCGAGAC GCCGCGGTTC ATGACCGACC GGCTCGACAA GTTCGGCGGG CCGGAGTCCT ACAACCACTA TTCGGTGGGT TGGGCGCATG CGATGGATAC CCCCTATCAG TGGACCAAAC AAGTGGCCTC GCACTGGGGT GGCACGCGTA ACGGCACGAT TGTGCACTGG CCCAACGGAA TTGCCGCCAA GGGGGAGATG CGCTGGCAGT TTCACCACGT CATCGACGTG GCGCCGACCA TCCTGGAGGC GGCGGGGTTG CCGGAACCGT TATTCGTCAA CGGCGTGCAG CAACACCCCA TCGAAGGGGT CAGCATGGCC TATTCGTTCG ACGACGCGCA GGCGCCGGAT CGGCACGAGA CGCAGTATTT CGAGATGTTC GGAAACCGGG GCATCTACCA CAAGGGTTGG ACCGCGGTGA CCAAGCACAA GACGCCGTGG ATTTTGGTTG GCGAGCAGAC CGTCGCGTTC GACGACGACG TGTGGGAGCT CTACGACACC ACCAAGGATT GGAGCCAGGC CAAAGACTTG GCCAAGGAGA TGCCGGAAAA GCTGCATGAG CTGCAGCGGC TGTGGCTGAT CGAGGCGACG CGCTACAACG TGCTTCCGCT GGACGACGAC ACCGCCAGCC GCATCAACCC CGATCTGGCG GGCAGGCCGG TGCTCATCAG GGGCAACACC CAGGTGCTGT TTTCGAACAT GGGCCGGTTG TCGGAGAACT GTGTGCTCAA CCTCAAGAAC AAATCGCACA CGGTGACCGC TGAGGTCGAG GTGCCCGAGA CCGGTGCTGA GGGCGTGATC GTCGCGCAGG GCGCCAGCAT CGGCGGCTGG AGCCTGTATG CCAACGACGG CAAGCTCAAG TACTGCTACA ACCTGGGTGG TATCAAGCAC TTCTACGCCG AGTCCGCCGA CCCGCTGCCG GCCGGCGCCC ATCAGGTGCG CATGGAATTC GCTTATGCCG GTGGCGGTTT GGGCAAGGGC GGCGAGGTAA CTCTTTATGT CGACGGCCAA CAGGTCGGCG AAGGACATGT CGAAGCCACC CTTGCCATCG TCTTCTCGGC CGACGACGGC TGCGATGTCG GCATGGATTC GGGCTCGCCC GTCTCACCCG ACTATGCCCC GGGGAGTAAC GCGTTCAACG GGCGGATCAA GGGCGTGCAG CTCGCGATCG CCGAGGCCGC CGCTGCTGCG GGCCATCTGG TCGACCCGGA GCACGCGATC CGCATCGCGC TGGCGCGCCA ATAG
|
Protein sequence | MPQPRTHLPI PSAARTGLIT YDAKDPDSTY PPIEQLRPPA GAPNVLLILL DDVGFGASSA FGGPCRTSTA ELLAGNGLRY NRFHTTALCS PTRQALLTGR NHHSAGMGGI TEIATGAPGY SSVLPNTMSP IARTLKLNGY NTAQFGKCHE VPVWQTSPVG PFDAWPSGGG GFEYFYGFIG GEANQWYPSL YEGTTPVEVN RTPEEGYHFM ADMTDKALGW IGQQKALAPD RPFFVYFAPG ATHAPHHVPR EWADKYRGRF DVGWDALREE TFARQKELGV IPADCQLTAR HAEIPAWDDM PEDLKPVLCR QMEVYAGFLE YTDHHVGRLV DGLQRLGVLD DTLVFYIIDD NGASAEGTIN GTYNEMLNFN GLADIETPRF MTDRLDKFGG PESYNHYSVG WAHAMDTPYQ WTKQVASHWG GTRNGTIVHW PNGIAAKGEM RWQFHHVIDV APTILEAAGL PEPLFVNGVQ QHPIEGVSMA YSFDDAQAPD RHETQYFEMF GNRGIYHKGW TAVTKHKTPW ILVGEQTVAF DDDVWELYDT TKDWSQAKDL AKEMPEKLHE LQRLWLIEAT RYNVLPLDDD TASRINPDLA GRPVLIRGNT QVLFSNMGRL SENCVLNLKN KSHTVTAEVE VPETGAEGVI VAQGASIGGW SLYANDGKLK YCYNLGGIKH FYAESADPLP AGAHQVRMEF AYAGGGLGKG GEVTLYVDGQ QVGEGHVEAT LAIVFSADDG CDVGMDSGSP VSPDYAPGSN AFNGRIKGVQ LAIAEAAAAA GHLVDPEHAI RIALARQ
|
| |