Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_10303 |
Symbol | |
ID | 5220966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 362899 |
End bp | 364296 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640605043 |
Product | sulfatase |
Protein accession | YP_001286248 |
Protein GI | 148821494 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 206 |
Plasmid unclonability p-value | 0.000000000455468 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 199 |
Fosmid unclonability p-value | 0.448256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGAGTG AGCGTGCCAC AGGGCAGCGC GAGAACCTGC TGATCGTGCA CTGGCACGAC CTGGGGCGCT ATCTCGGCGT CTACCACCAT CCGGACGTCT ACAGCCCGCG GCTGGACCGG CTTGCCGCCG AGGGCATCCT GTTCACCAGG GCACATGCCA CCGCGCCGCT GTGCACACCA TCGCGGGGCT CGCTGTTCAC CGGCCGCTAC CCGCAAAGCA ACGGGTTGGT CGGCCTGGCC CATCACGGCT GGGAATACCG CACCGGGGTC CAAACCCTAC CGCAATTGCT ATCCGAATCG GGTTGGTACT CAGCTCTTTT CGGTATGCAG CATGAGACGT CCTACCCAAA GCGGCTGGGC TTCGACGAAT TCGACGTGTC GAACTCCTAC TGCGAATACG TGGTCGCCAA AGCCCAGGAC TGGCTGCATA ATCGCGTGCC CGCGTTAGAC GGACAACGGT TCCTGTTGAC CGCCGGCTTC TTCGAAACCC ACCGGCCCTA TCCGCATGAG CGCTACCGGC CGGCCGACAG CGCGGCCGTC GAGCTGCCCG ACTATCTGCC CGATACCCCC GAGGTGCGCC AAGACGTCGC CGAGTTCTAC GGTTCTATCG CCACAGCCGA CGAGGCGGTT GGCCGGCTAC TTGACACACT GGCCGATACC GGCCTAGACG CCAGCACCTG GGTGGTGTTC GTCACCGATC ACGGTCCGGC ATTTCCGCGG GCGAAGTCCA CACTGTATGA CGCCGGAACC GGTATCGCGC TGATCATCCG CCCGCCCACT CGCCGGGCGA TGGCGCCTCG CGTCTATGAC GAGCTTTTCA GCGGCGTCGA TCTGGTTCCG ACGCTATTGG ACCTGCTGAG ACTCGAGGTA CCCGCCGATG TCGAGGGTGT GTCACACGCA CCGGCCCTCC TCGCGCCGGA CACTGAAAAC GCTGCGGTGC GTGACCACGT ATACACCGCC AAGACCTATC ACGACTCGTT CGATCCGATT CGGGCAATCC GCACCAAGGA ATACAGCTAC ATCGAGAATT ACGCGCCCCG GCCGCTGCTG GACCTACCGT GGGATATCCA GGAAAGCCCG GCCGGCATGG CCGTCGCACC GTTGGTCAAG GCGCCCCGCC CGCAGCGGGA ACTCTACGAT CTACGCGCCG ATCCCACCGA GACCAATAAC CTGTTAGCCG GCGACGACAG CACCCAGGGC GTGGCCGCGA TCGCGGCCGA TCTGGCCGTG CGACTGCATG ATTGGCGACA GCGCACGGCC GACGTCATTC CGTCGGACTT CGCCGGTTCC CGCATCGCCG AGCGCTACAC CGAAACGTAT CTGCGGATCC ACCGCAAGAC GCCAACGGGC CGGTCAGCGA TCGCCGCCGA CCGCGGCATC GACGAACACT GCAGCTAG
|
Protein sequence | MTSERATGQR ENLLIVHWHD LGRYLGVYHH PDVYSPRLDR LAAEGILFTR AHATAPLCTP SRGSLFTGRY PQSNGLVGLA HHGWEYRTGV QTLPQLLSES GWYSALFGMQ HETSYPKRLG FDEFDVSNSY CEYVVAKAQD WLHNRVPALD GQRFLLTAGF FETHRPYPHE RYRPADSAAV ELPDYLPDTP EVRQDVAEFY GSIATADEAV GRLLDTLADT GLDASTWVVF VTDHGPAFPR AKSTLYDAGT GIALIIRPPT RRAMAPRVYD ELFSGVDLVP TLLDLLRLEV PADVEGVSHA PALLAPDTEN AAVRDHVYTA KTYHDSFDPI RAIRTKEYSY IENYAPRPLL DLPWDIQESP AGMAVAPLVK APRPQRELYD LRADPTETNN LLAGDDSTQG VAAIAADLAV RLHDWRQRTA DVIPSDFAGS RIAERYTETY LRIHRKTPTG RSAIAADRGI DEHCS
|
| |