Gene Information |
Locus tag | TBFG_13412 |
Symbol | |
ID | 5224101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 3802631 |
End bp | 3803284 |
Gene Length | 654 bp |
Protein Length | 217 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640608181 |
Product | hypothetical protein |
Protein accession | YP_001289339 |
Protein GI | 148824585 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E; [TIGR02247] Epoxide hydrolase N-terminal domain-like phosphatase |
|
|
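The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3802631, 3803284)
print(length)                  # 654
print(protein_length(length))  # 217
```

Both results agree with the Gene Length (654 bp) and Protein Length (217 aa) fields.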
Plasmid Coverage information |
Num covering plasmid clones | 558 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 224 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCATTA GCGCGGTTGT TTTCGACCGT GACGGTGTGC TCACCAGCTT TGACTGGACA CGTGCCGAGG AGGATGTGCG GCGAATCACG GGCCTACCAT TGGAGGAGAT CGAACGCCGC TGGGGTGGGT GGCTCAACGG ATTGACTATC GACGACGCGT TCGTTGAAAC CCAGCCAATT AGCGAGTTCC TCTCGAGCCT GGCGCGCGAG CTCGAGCTCG GTTCGAAGGC AAGAGACGAG CTAGTGCGCC TCGACTACAT GGCGTTCGCC CAGGGATATC CAGACGCGCG TCCAGCCCTT GAAGAAGCCC GGCGCCGTGG CCTCAAGGTC GGTGTTCTCA CAAACAACAG CCTGTTGGTC AGCGCCCGCA GCCTCCTTCA GTGCGCCGCT CTGCACGACC TCGTCGACGT CGTGCTGAGT TCGCAGATGA TCGGAGCTGC CAAGCCTGAC CCGCGGGCCT ATCAAGCGAT CGCGGAAGCC CTCGGCGTCT CGACAACGTC ATGCCTGTTC TTCGACGACA TCGCCGACTG GGTTGAGGGC GCACGGTGCG CGGGCATGCG CGCGTACCTC GTGGACCGTT CCGGACAAAC TCGCGACGGC GTCGTTCGCG ATTTGTCCAG CCTTGGAGCG ATCCTGGACG GCGCGGGACC ATGA
|
Protein sequence | MSISAVVFDR DGVLTSFDWT RAEEDVRRIT GLPLEEIERR WGGWLNGLTI DDAFVETQPI SEFLSSLARE LELGSKARDE LVRLDYMAFA QGYPDARPAL EEARRRGLKV GVLTNNSLLV SARSLLQCAA LHDLVDVVLS SQMIGAAKPD PRAYQAIAEA LGVSTTSCLF FDDIADWVEG ARCAGMRAYL VDRSGQTRDG VVRDLSSLGA ILDGAGP
|
| |
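The relationship between the gene sequence, the translation table, and the protein sequence can be illustrated with a minimal sketch. It hard-codes the NCBI genetic code used by translation table 11 (the amino acid assignments are the same as in the standard code) and treats any table 11 initiator codon in the first position as methionine, which is why the leading TTG above corresponds to M. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and written for this example only; this is not the database's own pipeline.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "TTGAGCATTA GCGCGGTTGT TTTCGACCGT"
print(translate(prefix))  # MSISAVVFDR
```

Passing the full Gene sequence field to `translate` and `gc_content` is one way to compare the result against the Protein sequence and GC content fields of this record.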