Gene TBFG_10547 details


Gene Information

Locus tag: TBFG_10547
Symbol:
ID: 5221211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium tuberculosis F11
Kingdom: Bacteria
Replicon accession: NC_009565
Strand:
Start bp: 630476
End bp: 631516
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 70%
IMG OID: 640605288
Product: UDP-glucose 4-epimerase galE3
Protein accession: YP_001286492
Protein GI: 148821738
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
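
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(630476, 631516)
print(length)                  # 1041
print(protein_length(length))  # 346
```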


Plasmid Coverage information

Num covering plasmid clones: 263
Plasmid unclonability p-value: 0.00000700436
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 208
Fosmid unclonability p-value: 0.976314
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGGGTGC TGCTGACCGG CGCGGCCGGC TTCATCGGGT CGCGCGTGGA TGCGGCGTTA 
CGGGCTGCGG GTCACGACGT GGTGGGCGTC GACGCGCTGC TGCCCGCCGC GCACGGGCCA
AACCCGGTGC TGCCACCGGG CTGCCAGCGG GTCGACGTGC GCGACGCCAG CGCGCTGGCC
CCGTTGTTGG CCGGTGTCGA TCTGGTGTGT CACCAGGCCG CCATGGTGGG TGCCGGCGTC
AACGCCGCCG ACGCACCCGC CTATGGCGGC CACAACGATT TCGCCACCAC GGTGCTGCTG
GCGCAGATGT TCGCCGCCGG GGTCCGCCGT TTGGTGCTGG CGTCGTCGAT GGTGGTTTAC
GGGCAGGGGC GCTATGACTG TCCCCAGCAT GGACCGGTCG ACCCGCTGCC GCGGCGGCGA
GCCGACCTGG ACAATGGGGT CTTCGAGCAC CGTTGCCCGG GGTGCGGCGA GCCAGTCATC
TGGCAATTGG TCGACGAGGA TGCCCCGTTG CGCCCGCGCA GCCTGTACGC GGCCAGCAAG
ACCGCGCAGG AGCACTACGC GCTGGCGTGG TCGGAAGCGA GTGGCGGTTC GGTGGTGGCG
TTGCGCTACC ACAACGTCTA CGGCCCCGGC ATGCCGCGCG ACACCCCCTA CTCCGGAGTG
GCCGCGATCT TCCGCTCGGC GGTTGAAAAA GGCAAGCCAC CAAAGGTTTT CGAAGACGGC
GGCCAGATGC GGGACTTCGT GCACGTGGAC GACGTGGCCG CGGCGAACCT CGCCGCGGTG
CATCTGGGTG AAGCGGACCG CGACGGGTTT ACCGCGGTCA ACGTCTGTTC CGGGCGCCCC
ATCTCGATCC TTCAGGTGGC AACCGCGATA TGCGACGCCC GCGGTGGCTC GATGTCCCCG
GCCATCACCG GGCACTACCG CAGCGGCGAC GTGCGCCACA TTGTCGCCGA TCCCGCGCGG
GCCGCCCGCG TGCTCGGGTT CCGCGCGGCC GTCGATCCAG GCGAAGGACT GCGTGAGTTC
GCGTTCGCGC CGCTTCGCTG A
 
Protein sequence
MRVLLTGAAG FIGSRVDAAL RAAGHDVVGV DALLPAAHGP NPVLPPGCQR VDVRDASALA 
PLLAGVDLVC HQAAMVGAGV NAADAPAYGG HNDFATTVLL AQMFAAGVRR LVLASSMVVY
GQGRYDCPQH GPVDPLPRRR ADLDNGVFEH RCPGCGEPVI WQLVDEDAPL RPRSLYAASK
TAQEHYALAW SEASGGSVVA LRYHNVYGPG MPRDTPYSGV AAIFRSAVEK GKPPKVFEDG
GQMRDFVHVD DVAAANLAAV HLGEADRDGF TAVNVCSGRP ISILQVATAI CDARGGSMSP
AITGHYRSGD VRHIVADPAR AARVLGFRAA VDPGEGLREF AFAPLR
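
The gene and protein sequences can be compared by translating the gene sequence with translation table 11, as listed in the record. The following is a minimal sketch, not the method used to produce this record. It builds the codon table from the standard NCBI ordering (bases T, C, A, G), strips the blank-separated 10-base blocks used in the listing, and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because the sequence begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGAGGGTGC TGCTGACCGG CGCGGCCGGC"))  # MRVLLTGAAG
print(translate("GCGTTCGCGC CGCTTCGCTG A"))           # AFAPLR
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content value given in the record.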