Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TBFG_13470 |
Symbol | |
ID | 5224159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 3865503 |
End bp | 3866924 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640608239 |
Product | hypothetical protein |
Protein accession | YP_001289397 |
Protein GI | 148824643 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 329 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 161 |
Fosmid unclonability p-value | 0.000625106 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCACT ACTACTCTGT CGACACCATC CGCGCGGCTG AGGCGCCCCT GTTGGCCAGC CTGCCCGACG GTGCGCTGAT GCGACGCGCG GCCTTCGGGC TGGCCACCGA GATCGGCCGT GAGTTGACCG CTCGCACGGG TGGGGTGGTC GGCCGCCGGG TGTGCGCGGT CGTCGGATCC GGCGACAACG GTGGCGACGC GCTGTGGGCG GCGACGTTCC TGCGACGCCG CGGCGCCGCC GCCGACGCGG TGCTGCTCAA CCCGGACCGC ACGCATCGCA AGGCGCTGGC GGCGTTCACC AAATCCGGGG GTCGCCTCGT CGAGAGTGTC TCGGCGGCGA CCGATCTCGT CATCGACGGG GTGGTCGGCA TCTCCGGCTC GGGGCCGCTG CGACCGGCGG CCGCGCAGGT GTTCGCCGCG GTTCAGGCCG CCGCCATACC GGTGGTCGCC GTCGACATCC CCAGCGGCAT CGATGTGGCG ACCGGGGCGA TCACCGGCCC CGCCGTGCAC GCCGCGCTGA CCGTCACCTT TGGCGGGCTC AAACCGGTGC ACGCGCTGGC CGACTGCGGC CGCGTCGTCC TTGTCGATAT CGGGCTGGAC CTGGCGCACA CCGACGTGTT GGGTTTCGAG GCTACCGACG TGGCCGCGCG CTGGCCGGTG CCCGGTCCCC GCGACGACAA ATACACCCAG GGCGTGACCG GCGTGCTGGC CGGGTCGTCG ACGTATCCGG GTGCGGCCGT GCTGTGCACC GGGGCGGCCG TCGCCGCCAC CTCCGGCATG GTCCGCTACG CCGGGACCGC CCATGCGGAA GTCCTCGCGC ACTGGCCGGA GGTCATCGCC TCGCCCACCC CGGCGGCGGC CGGGCGGGTG CAGGCCTGGG TCGTCGGGCC GGGCCTGGGC ACCGACGAAG CCGGGGCCGC CGCGTTGTGG TTCGCGCTGG ACACCGACCT GCCGGTGTTG GTCGACGCCG ACGGGCTGAC CATGCTGGCG GACCACCCCG ATCTGGTGGC GGGCCGCAAC GCCCCGACGG TCTTGACGCC GCACGCCGGT GAGTTCGCCC GGCTGGCCGG GGCGCCGCCC GGTGACGACC GCGTGGGGGC CTGCCGCCAG CTGGCCGACG CGCTGGGCGC CACCGTGCTG CTCAAGGGCA ATGTCACCGT CATCGCCGAT CCCGGCGGCC CGGTCTATCT CAATCCGGCC GGCCAGTCCT GGGCGGCCAC CGCCGGGTCC GGTGACGTGC TGTCCGGGAT GATCGGTGCG CTGCTGGCGT CGGGATTGCC GTCTGGGGAG GCGGCCGCGG CCGCGGCGTT CGTGCACGCC CGGGCATCGG CGGCCGCGGC CGCCGATCCC GGCCCCGGCG ATGCGCCCAC GTCGGCGTCG CGCATCAGCG GCCACATTCG GGCCGCTCTG GCTGCCCTGT AG
|
Protein sequence | MRHYYSVDTI RAAEAPLLAS LPDGALMRRA AFGLATEIGR ELTARTGGVV GRRVCAVVGS GDNGGDALWA ATFLRRRGAA ADAVLLNPDR THRKALAAFT KSGGRLVESV SAATDLVIDG VVGISGSGPL RPAAAQVFAA VQAAAIPVVA VDIPSGIDVA TGAITGPAVH AALTVTFGGL KPVHALADCG RVVLVDIGLD LAHTDVLGFE ATDVAARWPV PGPRDDKYTQ GVTGVLAGSS TYPGAAVLCT GAAVAATSGM VRYAGTAHAE VLAHWPEVIA SPTPAAAGRV QAWVVGPGLG TDEAGAAALW FALDTDLPVL VDADGLTMLA DHPDLVAGRN APTVLTPHAG EFARLAGAPP GDDRVGACRQ LADALGATVL LKGNVTVIAD PGGPVYLNPA GQSWAATAGS GDVLSGMIGA LLASGLPSGE AAAAAAFVHA RASAAAAADP GPGDAPTSAS RISGHIRAAL AAL
|
| |