Gene TBFG_10957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10957 
Symbol 
ID5221631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp1052238 
End bp1054172 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content67% 
IMG OID640605708 
Productbifunctional 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase/cyclase/dehydrase 
Protein accessionYP_001286902 
Protein GI148822148 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
[COG0491] Zn-dependent hydrolases, including glyoxylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones334 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones200 
Fosmid unclonability p-value0.761321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTGGG TGACGTATCG AAGTGACCAC GGCGAACGAA CGGGAGTGCT TTCCGGTGAC 
GCCATCTACG CGATGCCGCC GGACGTGTCG TTGCTGGATC TGGTCGGGCG CGGCGCCGAC
GGTCTGCGCA CGGCGGGCGA ACGGGCAGTG CGCTCACCGG CCGCGGTGGT AGCGCTCGAC
GAGGTTACGC TGGCGGCGCC GATTCCGCGC CCGCCGTCGA TCCGGGACTC GTTGTGCTTT
CTGGACCACA TGCGTAACTG CCAGGAAGCG ATGGGGGGCG GCCGGGTGCT CATGGATACT
TGGTACCGCA TCCCGGCGTT CTACTTCGCG TGCCCGTCAA CGGTTTTGGG ACCGTACGAC
GACGCACCCA CCGCACCCGG AAGTGCGTGG CAGGACTTCG AATTGGAGAT CGCGGCGGTT
ATCGGAACCA GCGGCAAAGA CTTGACCGTC GAGCAGGCCG AACGGTCGAT CATCGGCTAT
ACCATTTTCA ACGACTGGTC CGCACGGGAC CTGCAGATGC TGGAGGGCCA GCTGCGCATC
GGACAGGCCA AGGGCAAAGA CAGCGGTATC ACCCTGGGCC CCTATCTGGT CACACCGGAT
GAGCTGGAGC CCTATTGCCG GGGCGGGAAG CTAAGCTTGC GGGTGATCGC CTTGGTCAAC
GGCACCGTGA TCGGATCGGG GTCGACCGCA CAGATGGACT GGAGCTTCGG CGAAGTCATC
GCCTATGCCT CGCGGGGGGT GACGCTGACC CCGGGTGACG TGTTCGGCTC GGGCACGGTG
CCCACCTGCA CGCTCGTCGA GCACCTCAGG CCACCGGAAT CATTCCCGGG CTGGCTGCAC
GACGGCGACG TGGTCACCCT CCAGGTCGAA GGGCTGGGCG AGACGAGGCA GACCGTCCGG
ACGAGCGGCA CTCCTTTTCC GTTGGCTCTT CGGCCGAATC CGGACGCCGA ACCCGACCGG
CGCGGGGTCA ACCCGGCACC GACGCGGGTG CCGTTTACCC GCGGGCTGCA CGAAGTCGCC
GACCGGGTAT GGGCGTGGAC GCTGCCCGAC GGGGGATACG GCTTCAGCAA CGCCGGGCTG
GTCGCCGGGG ACGGCGCGTC GCTGCTCGTG GATACCCTGT TCGACCTGGC ACTGACACGC
GAGATGTTGG CCGCGATGAA GCCGGTCACC GAGCGGGCGC CCATCACCGA CGCCCTGATC
ACGCACTCCA ACGGCGACCA CACGCACGGC ACTCAACTGT TGGACCGCTC AGTGCGCATC
ATCGCCGCCA AGGGCACCTC CGAGGAGATC GAGCATGGCC CGGCACCGGA GATGCTAGCC
CGGATCCAAA CCGCCGACCT GGGCCCCGTT GCGACGCGGT ATCTGCGTGA TCGCTTCGGT
CACTTTGACT TCAGCGGCAT CAAGCTGCGC AACGCCGACC TGACGTTCGA CCGCGACCTG
GCCATCGAGC TCGGCGGCCG GCGAGTCGAC CTGCTCAACC TCGGTCCCGC GCACACCACC
GCCGACTCGG TCGTGCACGT GGCCGACGCC GGTGTGCTGT TCGCCGGGGA TCTGCTGTTC
ATCGGTTGCA CCCCGATTGT GTGGGCGGGC CCGATCGCCA ACTGGGTGGC GGCCTGCGAC
GCGATGATCG CGCTGGACGC GCCCACGGTG GTGCCTGGGC ATGGTCCGGT CACCGGCCCG
GACGGGATCC GTGCCGTCCG TGGCTATCTG GCGCACATCG CCGAACAGGC CGAGGCGGCC
TACCGCAAGG GGCTATCGTT GCCCGAGGCC GTCGAGACCA TCGACCTGGG CGAGTACGCG
AGCTGGCTGG ACTCCGAACG GGTAGTGGTC AACGTCTACC AGCGTTACCG CGAATTGGAT
CCCGACACCC CGCGCCAGGA CTTGCTGGCG TTGCTGGTGA TGCAGGCCGA ATGGGCGGCG
CGCCACTGTA CGTAG
 
Protein sequence
MKWVTYRSDH GERTGVLSGD AIYAMPPDVS LLDLVGRGAD GLRTAGERAV RSPAAVVALD 
EVTLAAPIPR PPSIRDSLCF LDHMRNCQEA MGGGRVLMDT WYRIPAFYFA CPSTVLGPYD
DAPTAPGSAW QDFELEIAAV IGTSGKDLTV EQAERSIIGY TIFNDWSARD LQMLEGQLRI
GQAKGKDSGI TLGPYLVTPD ELEPYCRGGK LSLRVIALVN GTVIGSGSTA QMDWSFGEVI
AYASRGVTLT PGDVFGSGTV PTCTLVEHLR PPESFPGWLH DGDVVTLQVE GLGETRQTVR
TSGTPFPLAL RPNPDAEPDR RGVNPAPTRV PFTRGLHEVA DRVWAWTLPD GGYGFSNAGL
VAGDGASLLV DTLFDLALTR EMLAAMKPVT ERAPITDALI THSNGDHTHG TQLLDRSVRI
IAAKGTSEEI EHGPAPEMLA RIQTADLGPV ATRYLRDRFG HFDFSGIKLR NADLTFDRDL
AIELGGRRVD LLNLGPAHTT ADSVVHVADA GVLFAGDLLF IGCTPIVWAG PIANWVAACD
AMIALDAPTV VPGHGPVTGP DGIRAVRGYL AHIAEQAEAA YRKGLSLPEA VETIDLGEYA
SWLDSERVVV NVYQRYRELD PDTPRQDLLA LLVMQAEWAA RHCT