Gene TBFG_10059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10059 
Symbol 
ID5220722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp64013 
End bp65071 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content60% 
IMG OID640604799 
Producthypothetical protein 
Protein accessionYP_001286004 
Protein GI148821250 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones103 
Plasmid unclonability p-value5.22819e-51 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones210 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACGT ACGGCTCTGG CGACCTCCTT CGGGCTGACA CCGAAGCGCT CGTCAACACC 
GTCAACTGTG TTGGGGTGAT GGGCAAGGGA ATTGCGCTGC AGTTCAAACG CCGCTACCCC
GAGATGTTCA CCGCCTACGA AAAGGCGTGC AAACGCGGCG AAGTTACCAT CGGCAAGATG
TTCGTCGTCG ACACCGGACA GCTCGACGGA CCGAAACACA TCATCAACTT CCCCACCAAG
AAACACTGGC GTGCACCGTC GAAGCTGGCC TATATCGACG CCGGCCTCAT TGATCTCATC
CGCGTGATCC GTGAACTCAA CATTGCTTCT GTGGCAGTTC CCCCGCTGGG GGTGGGCAAC
GGAGGTCTGG ATTGGGAAGA TGTCGAGCAA CGGCTCGTAT CAGCATTCCA GCAGCTGCCC
GACGTTGACG CCGTGATCTA CCCCCCATCA GGTGGATCTC GCGCCATCGA GGGCGTCGAA
GGACTTCGGA TGACCTGGGG GCGCGCCGTC ATACTCGAAG CGATGCGGCG ATATCTCCAG
CAGCGCCGCG CGATGGAGCC GTGGGAAGAC CCTGCAGGGA TCTCGCATCT GGAGATTCAG
AAGCTCATGT ACTTCGCCAA CGAGGCCGAT CCCGATCTTG CGCTAGATTT CACGCCCGGC
CGATACGGGC CATACAGCGA ACGTGTCCGT CACTTACTGC AAGGAATGGA GGGCGCATTC
ACAGTCGGCC TGGGTGACGG CACCGCAAGA GTTCTTGCGA ACCAACCGAT CTCGTTGACT
ACTAAGGGAA CTGACGCCAT AACGGACTAT CTGGCCACCG ATGCGGCAGC TGACCGGGTG
AGCGCCGCAG TCGACACGGT GTTGCGCGTC ATCGAAGGCT TTGAAGGCCC ATACGGGGTT
GAGCTGCTCG CCAGTACGCA TTGGGTGGCC ACACGTGAGG GCGCCAAGGA ACCAGCCACG
GCAGCGGCCG CGGTCCGAAA GTGGACAAAA CGCAAGGGTC GGATCTACAG CGACGATCGC
ATCGGTGTTG CCCTCGACCG CATTCTTATG ACTGCCTGA
 
Protein sequence
MITYGSGDLL RADTEALVNT VNCVGVMGKG IALQFKRRYP EMFTAYEKAC KRGEVTIGKM 
FVVDTGQLDG PKHIINFPTK KHWRAPSKLA YIDAGLIDLI RVIRELNIAS VAVPPLGVGN
GGLDWEDVEQ RLVSAFQQLP DVDAVIYPPS GGSRAIEGVE GLRMTWGRAV ILEAMRRYLQ
QRRAMEPWED PAGISHLEIQ KLMYFANEAD PDLALDFTPG RYGPYSERVR HLLQGMEGAF
TVGLGDGTAR VLANQPISLT TKGTDAITDY LATDAAADRV SAAVDTVLRV IEGFEGPYGV
ELLASTHWVA TREGAKEPAT AAAAVRKWTK RKGRIYSDDR IGVALDRILM TA