Gene TBFG_11195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_11195 
Symbol 
ID5221872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp1304068 
End bp1304979 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content68% 
IMG OID640605949 
ProductN-acetyl-1-D-myo-inosityl-2-amino-2-deoxy-alpha- D-glucopyranoside deacetylase mshB 
Protein accessionYP_001287140 
Protein GI148822386 
COG category[S] Function unknown 
COG ID[COG2120] Uncharacterized proteins, LmbE homologs 
TIGRFAM ID[TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase 


Plasmid Coverage information

Num covering plasmid clones420 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones262 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGA CGCCGCGGCT GCTGTTTGTT CATGCACACC CCGACGATGA GAGCCTGAGC 
AACGGCGCAA CCATCGCGCA CTACACCTCC CGTGGCGCAC AGGTCCATGT CGTCACGTGC
ACCCTGGGTG AGGAGGGCGA GGTCATTGGC GATCGCTGGG CTCAACTCAC CGCCGATCAT
GCGGACCAAC TCGGTGGCTA CCGCATCGGC GAGCTCACCG CGGCGTTGCG AGCGCTCGGG
GTCAGCGCAC CGATCTACCT TGGCGGCGCG GGTCGCTGGC GCGACTCCGG CATGGCCGGC
ACAGACCAGC GGAGTCAGCG GAGATTCGTC GATGCTGACC CCCGGCAGAC CGTCGGGGCA
TTGGTCGCGA TCATTCGCGA GCTGCGGCCG CATGTCGTGG TGACCTATGA CCCCAATGGC
GGTTACGGTC ATCCTGACCA CGTGCACACC CACACCGTCA CTACCGCCGC GGTGGCCGCA
GCGGGTGTTG GGTCCGGTAC CGCAGATCAC CCCGGCGACC CGTGGACGGT GCCGAAGTTC
TACTGGACGG TCTTGGGTCT GAGCGCGCTC ATTTCGGGCG CGCGAGCCCT GGTCCCCGAC
GATCTGCGAC CCGAATGGGT GTTGCCGCGG GCCGACGAGA TTGCATTCGG GTACTCCGAC
GACGGTATCG ACGCCGTCGT CGAGGCCGAT GAGCAGGCGC GAGCCGCCAA GGTTGCGGCA
CTGGCTGCCC ATGCCACCCA AGTTGTCGTC GGCCCGACCG GCCGGGCCGC CGCCTTGTCG
AACAACCTGG CACTGCCCAT CCTGGCCGAT GAGCATTACG TGCTCGCCGG CGGCTCCGCG
GGCGCCCGCG ATGAACGTGG CTGGGAAACT GATCTGCTCG CCGGTCTGGG CTTCACCGCG
TCCGGCACGT AG
 
Protein sequence
MSETPRLLFV HAHPDDESLS NGATIAHYTS RGAQVHVVTC TLGEEGEVIG DRWAQLTADH 
ADQLGGYRIG ELTAALRALG VSAPIYLGGA GRWRDSGMAG TDQRSQRRFV DADPRQTVGA
LVAIIRELRP HVVVTYDPNG GYGHPDHVHT HTVTTAAVAA AGVGSGTADH PGDPWTVPKF
YWTVLGLSAL ISGARALVPD DLRPEWVLPR ADEIAFGYSD DGIDAVVEAD EQARAAKVAA
LAAHATQVVV GPTGRAAALS NNLALPILAD EHYVLAGGSA GARDERGWET DLLAGLGFTA
SGT