Gene TBFG_10676 details

Gene Information

Locus tag: TBFG_10676
Symbol:
ID: 5221344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 759898
End bp: 762261
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 64%
IMG OID: 640605421
Product: arylsulfatase atsD (aryl-sulfate sulphohydrolase)
Protein accession: YP_001286621
Protein GI: 148821867
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
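
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be sketched in a few lines of Python. This is only an illustrative check, assuming the coordinates are 1-based and inclusive and that the gene span includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(759898, 762261)
print(length)                     # 2364
print(protein_length_aa(length))  # 787
```

Both results match the Gene Length and Protein Length values listed in the record.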


Plasmid Coverage information

Num covering plasmid clones: 254
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 206
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAGC CAAGAACGCA TCTGCCGATT CCCAGTGCTG CTCGCACCGG GCTGATCACG 
TATGACGCGA AGGATCCCGA CAGCACCTAT CCGCCGATCG AGCAGCTGCG CCCACCGGCG
GGTGCCCCGA ATGTGTTGCT GATCCTGCTT GACGATGTCG GGTTCGGTGC GTCGAGCGCG
TTCGGAGGCC CATGCAGGAC GTCGACGGCG GAACTGCTTG CCGGTAACGG GTTGCGGTAC
AACCGGTTTC ACACCACCGC GCTGTGCTCG CCGACGCGTC AGGCGTTGTT AACTGGACGC
AACCATCACT CCGCCGGCAT GGGCGGTATC ACCGAAATCG CCACCGGTGC ACCGGGATAC
AGCTCAGTAC TACCGAACAC CATGTCGCCG ATCGCGCGGA CGCTAAAGCT CAACGGCTAC
AACACCGCCC AGTTCGGCAA GTGCCACGAA GTCCCGGTCT GGCAGACCAG CCCGGTCGGG
CCGTTCGACG CGTGGCCCAG CGGCGGCGGT GGTTTCGAAT ACTTCTACGG GTTTATCGGT
GGCGAGGCTA ACCAGTGGTA TCCGAGTCTG TACGAGGGCA CCACGCCGGT CGAGGTGAAC
CGCACGCCCG AGGAGGGTTA CCATTTCATG GCGGACATGA CCGACAAGGC CCTCGGCTGG
ATCGGACAGC AGAAGGCACT GGCCCCCGAC CGGCCGTTCT TCGTGTACTT CGCCCCGGGC
GCCACCCACG CGCCCCACCA CGTTCCGCGG GAGTGGGCCG ACAAGTACCG GGGCCGCTTC
GATGTGGGCT GGGACGCACT GCGAGAGGAA ACCTTCGCCC GGCAAAAGGA ACTCGGGGTG
ATCCCGGCGG ACTGCCAGCT GACCGCGCGG CACGCCGAAA TCCCGGCGTG GGACGACATG
CCGGAGGACC TCAAACCCGT GCTATGCCGG CAGATGGAGG TCTACGCGGG CTTTCTGGAA
TACACCGACC ACCACGTCGG CCGGCTCGTC GACGGCCTGC AGCGCCTCGG TGTGCTCGAC
GACACGCTGG TGTTCTACAT CATCGACGAC AACGGCGCCT CGGCCGAGGG CACGATCAAC
GGCACCTACA ACGAGATGTT GAACTTCAAC GGCCTGGCCG ACATCGAGAC GCCGCGGTTC
ATGACCGACC GGCTCGACAA GTTCGGCGGG CCGGAGTCCT ACAACCACTA TTCGGTGGGT
TGGGCGCATG CGATGGATAC CCCCTATCAG TGGACCAAAC AAGTGGCCTC GCACTGGGGT
GGCACGCGTA ACGGCACGAT TGTGCACTGG CCCAACGGAA TTGCCGCCAA GGGGGAGATG
CGCTGGCAGT TTCACCACGT CATCGACGTG GCGCCGACCA TCCTGGAGGC GGCGGGGTTG
CCGGAACCGT TATTCGTCAA CGGCGTGCAG CAACACCCCA TCGAAGGGGT CAGCATGGCC
TATTCGTTCG ACGACGCGCA GGCGCCGGAT CGGCACGAGA CGCAGTATTT CGAGATGTTC
GGAAACCGGG GCATCTACCA CAAGGGTTGG ACCGCGGTGA CCAAGCACAA GACGCCGTGG
ATTTTGGTTG GCGAGCAGAC CGTCGCGTTC GACGACGACG TGTGGGAGCT CTACGACACC
ACCAAGGATT GGAGCCAGGC CAAAGACTTG GCCAAGGAGA TGCCGGAAAA GCTGCATGAG
CTGCAGCGGC TGTGGCTGAT CGAGGCGACG CGCTACAACG TGCTTCCGCT GGACGACGAC
ACCGCCAGCC GCATCAACCC CGATCTGGCG GGCAGGCCGG TGCTCATCAG GGGCAACACC
CAGGTGCTGT TTTCGAACAT GGGCCGGTTG TCGGAGAACT GTGTGCTCAA CCTCAAGAAC
AAATCGCACA CGGTGACCGC TGAGGTCGAG GTGCCCGAGA CCGGTGCTGA GGGCGTGATC
GTCGCGCAGG GCGCCAGCAT CGGCGGCTGG AGCCTGTATG CCAACGACGG CAAGCTCAAG
TACTGCTACA ACCTGGGTGG TATCAAGCAC TTCTACGCCG AGTCCGCCGA CCCGCTGCCG
GCCGGCGCCC ATCAGGTGCG CATGGAATTC GCTTATGCCG GTGGCGGTTT GGGCAAGGGC
GGCGAGGTAA CTCTTTATGT CGACGGCCAA CAGGTCGGCG AAGGACATGT CGAAGCCACC
CTTGCCATCG TCTTCTCGGC CGACGACGGC TGCGATGTCG GCATGGATTC GGGCTCGCCC
GTCTCACCCG ACTATGCCCC GGGGAGTAAC GCGTTCAACG GGCGGATCAA GGGCGTGCAG
CTCGCGATCG CCGAGGCCGC CGCTGCTGCG GGCCATCTGG TCGACCCGGA GCACGCGATC
CGCATCGCGC TGGCGCGCCA ATAG
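
The record does not state how the GC content figure was derived. A minimal sketch, assuming it is simply the percentage of G and C bases in the gene sequence, could read the 10-base blocks shown above like this (`gc_content` is a hypothetical helper name):

```python
def gc_content(seq_text):
    """Percentage of G and C bases in a sequence written in whitespace-separated blocks."""
    # Remove the spaces and line breaks used for display.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_content("ATGCCGCAGC"))  # 70.0
```

Passing the full gene sequence text to the same function gives the figure for the whole gene under this assumption.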
 
Protein sequence
MPQPRTHLPI PSAARTGLIT YDAKDPDSTY PPIEQLRPPA GAPNVLLILL DDVGFGASSA 
FGGPCRTSTA ELLAGNGLRY NRFHTTALCS PTRQALLTGR NHHSAGMGGI TEIATGAPGY
SSVLPNTMSP IARTLKLNGY NTAQFGKCHE VPVWQTSPVG PFDAWPSGGG GFEYFYGFIG
GEANQWYPSL YEGTTPVEVN RTPEEGYHFM ADMTDKALGW IGQQKALAPD RPFFVYFAPG
ATHAPHHVPR EWADKYRGRF DVGWDALREE TFARQKELGV IPADCQLTAR HAEIPAWDDM
PEDLKPVLCR QMEVYAGFLE YTDHHVGRLV DGLQRLGVLD DTLVFYIIDD NGASAEGTIN
GTYNEMLNFN GLADIETPRF MTDRLDKFGG PESYNHYSVG WAHAMDTPYQ WTKQVASHWG
GTRNGTIVHW PNGIAAKGEM RWQFHHVIDV APTILEAAGL PEPLFVNGVQ QHPIEGVSMA
YSFDDAQAPD RHETQYFEMF GNRGIYHKGW TAVTKHKTPW ILVGEQTVAF DDDVWELYDT
TKDWSQAKDL AKEMPEKLHE LQRLWLIEAT RYNVLPLDDD TASRINPDLA GRPVLIRGNT
QVLFSNMGRL SENCVLNLKN KSHTVTAEVE VPETGAEGVI VAQGASIGGW SLYANDGKLK
YCYNLGGIKH FYAESADPLP AGAHQVRMEF AYAGGGLGKG GEVTLYVDGQ QVGEGHVEAT
LAIVFSADDG CDVGMDSGSP VSPDYAPGSN AFNGRIKGVQ LAIAEAAAAA GHLVDPEHAI
RIALARQ
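
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a simplified illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, a difference that does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq_text):
    """Translate a DNA sequence written in whitespace-separated blocks.

    Translation stops at the first stop codon. Alternative start codons
    (for example GTG or TTG) are NOT converted to M in this sketch.
    """
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCCGCAGC CAAGAACGCA TCTGCCGATT"))  # MPQPRTHLPI
```

The output matches the first block of the protein sequence, and translating the final line of the gene sequence in the same way yields RIALARQ before the terminal TAG stop codon.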