Gene TBFG_10303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10303 
Symbol 
ID5220966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp362899 
End bp364296 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content65% 
IMG OID640605043 
Productsulfatase 
Protein accessionYP_001286248 
Protein GI148821494 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones206 
Plasmid unclonability p-value0.000000000455468 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones199 
Fosmid unclonability p-value0.448256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAGTG AGCGTGCCAC AGGGCAGCGC GAGAACCTGC TGATCGTGCA CTGGCACGAC 
CTGGGGCGCT ATCTCGGCGT CTACCACCAT CCGGACGTCT ACAGCCCGCG GCTGGACCGG
CTTGCCGCCG AGGGCATCCT GTTCACCAGG GCACATGCCA CCGCGCCGCT GTGCACACCA
TCGCGGGGCT CGCTGTTCAC CGGCCGCTAC CCGCAAAGCA ACGGGTTGGT CGGCCTGGCC
CATCACGGCT GGGAATACCG CACCGGGGTC CAAACCCTAC CGCAATTGCT ATCCGAATCG
GGTTGGTACT CAGCTCTTTT CGGTATGCAG CATGAGACGT CCTACCCAAA GCGGCTGGGC
TTCGACGAAT TCGACGTGTC GAACTCCTAC TGCGAATACG TGGTCGCCAA AGCCCAGGAC
TGGCTGCATA ATCGCGTGCC CGCGTTAGAC GGACAACGGT TCCTGTTGAC CGCCGGCTTC
TTCGAAACCC ACCGGCCCTA TCCGCATGAG CGCTACCGGC CGGCCGACAG CGCGGCCGTC
GAGCTGCCCG ACTATCTGCC CGATACCCCC GAGGTGCGCC AAGACGTCGC CGAGTTCTAC
GGTTCTATCG CCACAGCCGA CGAGGCGGTT GGCCGGCTAC TTGACACACT GGCCGATACC
GGCCTAGACG CCAGCACCTG GGTGGTGTTC GTCACCGATC ACGGTCCGGC ATTTCCGCGG
GCGAAGTCCA CACTGTATGA CGCCGGAACC GGTATCGCGC TGATCATCCG CCCGCCCACT
CGCCGGGCGA TGGCGCCTCG CGTCTATGAC GAGCTTTTCA GCGGCGTCGA TCTGGTTCCG
ACGCTATTGG ACCTGCTGAG ACTCGAGGTA CCCGCCGATG TCGAGGGTGT GTCACACGCA
CCGGCCCTCC TCGCGCCGGA CACTGAAAAC GCTGCGGTGC GTGACCACGT ATACACCGCC
AAGACCTATC ACGACTCGTT CGATCCGATT CGGGCAATCC GCACCAAGGA ATACAGCTAC
ATCGAGAATT ACGCGCCCCG GCCGCTGCTG GACCTACCGT GGGATATCCA GGAAAGCCCG
GCCGGCATGG CCGTCGCACC GTTGGTCAAG GCGCCCCGCC CGCAGCGGGA ACTCTACGAT
CTACGCGCCG ATCCCACCGA GACCAATAAC CTGTTAGCCG GCGACGACAG CACCCAGGGC
GTGGCCGCGA TCGCGGCCGA TCTGGCCGTG CGACTGCATG ATTGGCGACA GCGCACGGCC
GACGTCATTC CGTCGGACTT CGCCGGTTCC CGCATCGCCG AGCGCTACAC CGAAACGTAT
CTGCGGATCC ACCGCAAGAC GCCAACGGGC CGGTCAGCGA TCGCCGCCGA CCGCGGCATC
GACGAACACT GCAGCTAG
 
Protein sequence
MTSERATGQR ENLLIVHWHD LGRYLGVYHH PDVYSPRLDR LAAEGILFTR AHATAPLCTP 
SRGSLFTGRY PQSNGLVGLA HHGWEYRTGV QTLPQLLSES GWYSALFGMQ HETSYPKRLG
FDEFDVSNSY CEYVVAKAQD WLHNRVPALD GQRFLLTAGF FETHRPYPHE RYRPADSAAV
ELPDYLPDTP EVRQDVAEFY GSIATADEAV GRLLDTLADT GLDASTWVVF VTDHGPAFPR
AKSTLYDAGT GIALIIRPPT RRAMAPRVYD ELFSGVDLVP TLLDLLRLEV PADVEGVSHA
PALLAPDTEN AAVRDHVYTA KTYHDSFDPI RAIRTKEYSY IENYAPRPLL DLPWDIQESP
AGMAVAPLVK APRPQRELYD LRADPTETNN LLAGDDSTQG VAAIAADLAV RLHDWRQRTA
DVIPSDFAGS RIAERYTETY LRIHRKTPTG RSAIAADRGI DEHCS