Gene TBFG_10074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10074 
Symbol 
ID5220737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp82881 
End bp84116 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content66% 
IMG OID640604814 
Producthypothetical protein 
Protein accessionYP_001286019 
Protein GI148821265 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones356 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones178 
Fosmid unclonability p-value0.0244263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGATC TGAGCATTAG CCAGGTGTCG GCGCGTCCGG GACGGATCGG GATTCGCGCT 
AGGCAAATGT TCGACGGATA CCGGTTTCAG CGTGGTCCCG TGCTGGTCGT GGTCGAGGAT
GGTCGGATCA GCGCGGTCGA TTTTGCTGGC TCCGCCTGCC CCGATATGAA CCTGGTTGAT
CTGGGTGAAT CGACTTTGTT GCCGGGTCTG GTGGATGCGC ATGCGCATTT GTGCTGGGAC
CCCGACGGTA GGCCAGAGGA TTTGGCCGGC GACCCCCATG CGGTGCTGGT GGGACGGGCG
CGACGGCACG CCGCGGCCGC GTTGCGCTCC GGGATCACCA CGATTCGCGA TCTCGGCGAC
CGTGACTATG CGGCCTTGGC GCTGCGGGAG GAGTATCGGC AGAAAACGAC GGTGGGGCCG
GAACTGGTGG TTTCTGGGCC ACCATTGACT CGCAGCGGCG GGCATTGCTG GTTCCTCGGC
GGCGTGGCCG ATAGCGTCGA GGAGCTGGTT GATGCGGTGC AGGAGCGGGC CGCGCGGGGA
GCGGATTGGA TCAAGGTGAT GGCCACGGGC GGATTCGTTA CCACAGCATC CGATCCGTGG
CAGCCGCAGT ACGGCAGCGG CCAACTGGCC GCGGTGGTGG CGGCCGCCGA GCAGGTAGGT
CTACCGGTGA CCGCACATGC ACATGCCACC GCAGGGATCG CCGCGGCGGT CGCCGCGGGT
GTTGACGGCA TCGAGCACTG CACGTTCTTG AGCGAAGGCA GCGCCGCCGC CAGCCCGGAT
GTTGTTGAAG CGATTGTTGC CCAAGGTGTG TGGTGCGGTA TGACGATTCC CCGGGTGTAT
CCGGAGATGC CGGAGAACCT TGTCGCGGTT GTGCAGGATG GATGGCGAAA CATCCGCCGG
CTCATCGACG CCGGTGCGCG TGTCGCCCTG TCCACCGACG CTGGAGTCGC CCCGGGCAGA
CGCCATGACG TGCTCCCCGA CGATTTGGTG TATCTGTCTC GACACGGGTT CACCAGCACA
GAGGTGCTGA CCGGCGCCAC CGCAGCGGCC GCTGCCAGCT GTGGGCTCGG CCACCGCAAG
GGTCGCATCG CGCCGGGCTA CGACGCTGAT CTGCTGGCTG TTGCGGCAGG TGTGGACCAT
GACCCCGCCG GACTCTGCGA CGTCAAAGCC GTCTGGCGCA GCGGAACCCA GGTACCGCTA
CAAGCATCCG CTGTGGGCTA CAACACCCCG TCATAA
 
Protein sequence
MGDLSISQVS ARPGRIGIRA RQMFDGYRFQ RGPVLVVVED GRISAVDFAG SACPDMNLVD 
LGESTLLPGL VDAHAHLCWD PDGRPEDLAG DPHAVLVGRA RRHAAAALRS GITTIRDLGD
RDYAALALRE EYRQKTTVGP ELVVSGPPLT RSGGHCWFLG GVADSVEELV DAVQERAARG
ADWIKVMATG GFVTTASDPW QPQYGSGQLA AVVAAAEQVG LPVTAHAHAT AGIAAAVAAG
VDGIEHCTFL SEGSAAASPD VVEAIVAQGV WCGMTIPRVY PEMPENLVAV VQDGWRNIRR
LIDAGARVAL STDAGVAPGR RHDVLPDDLV YLSRHGFTST EVLTGATAAA AASCGLGHRK
GRIAPGYDAD LLAVAAGVDH DPAGLCDVKA VWRSGTQVPL QASAVGYNTP S