Gene Mmcs_5020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5020 
Symbol 
ID4113849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5321658 
End bp5322944 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content66% 
IMG OID638034178 
ProductUDP-galactopyranose mutase 
Protein accessionYP_642180 
Protein GI108801983 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0562] UDP-galactopyranose mutase 
TIGRFAM ID[TIGR00031] UDP-galactopyranose mutase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGACCTG GACTCGCGGG CCCGTTACGC CGCCCGGCGA GGCAGCATGT CGGGAAGAGT 
GGCCGCGGTC GGATACCCTG GCTGCCGATG CGCTCCCAGT ACGATCTTTT CGTCGTCGGC
TCAGGCTTCT TCGGTCTGAC GATCGCCGAG CGTGTGGCGA CCCAGTTGGA CAAACGGGTC
CTCGTCCTCG AGCGACGGCC ACACCTCGGG GGTAACGCCT ACTCCGAACC GGAACCACAG
ACCGGCATCG AGGTGCACCG TTACGGTGCG CACCTGTTCC ACACCTCCAA TCAGCGGGTG
TGGGACTACG TGCGTCAGTT CACCGAGTTC ACCGGCTACC AGCACCGGGT GTTCGCCCTG
CACAACGGGC AGGCCTACCA GTTCCCGATG GGGCTGGGCC TGGTCAGCCA GTTCTTCGGG
CGCTACTTCA CCCCCGACGA GGCGCGCGCG CTGATCGCCG AACAGGCCGC CGAGATCGAC
ACCGAGGACG CCCAGAACCT CGAGGAGAAG GCGATCTCGC TGATCGGCCG CCCGCTCTAC
GAGGCGTTCG TCAAGCACTA CACCGCCAAG CAGTGGCAGA CCGACCCCAA GGAACTGCCC
GCCTCCACCA TCAACCGCCT GCCGGTCCGC TACACCTTCG ACAACCGCTA TTTCAACGAC
ACCTACGAGG GTCTACCCGT CGAGGGCTAC ACGAAGTGGC TGGAGAACAT GGCCGCCGAC
GACCGGATCG AGGTCCGGCT GGACACCGAC TGGTTCGACG TCCGCGACGA ACTCCGGGCC
GCCAACCCCG ACGCTCCGGT GGTCTACACC GGCCCGGTGG ACCGTTACTT CGACTACGAC
GAGGGCCGGC TGGGCTGGCG CACCCTCGAC TTCGAACTCG AGGTGCTCGA GACCGGGGAT
TTCCAGGGCA CCCCCGTCAT GAACTACAAC GACGCCGACG TGCCCTACAC GCGTATCCAC
GAGTTCCGGC ACTTCCACCC GGAGCGGGCC TACCCCACGG ACAAGACCGT GATCATGCGC
GAGTTCTCCC GGTTCGCCGA CGAGGACGAC GAGCCCTACT ATCCGATCAA CACCGAATCC
GACCGCGCGC TGCTGGCCGC CTACCGGGCC AAGGCCAAGG CCGAGACGGC GTCGGCCAAG
GTCCTGTTCG GGGGGAGACT GGGCACCTAC CAGTACCTCG ACATGCACAT GGCCATCGCC
AGCGCGCTCA ACATGTACGA CAACACCCTG GCGCCACACC TGCGCGACGG CGCCGCCCTG
ACCCAGAGCG AGAGCACCAA CCCATGA
 
Protein sequence
MRPGLAGPLR RPARQHVGKS GRGRIPWLPM RSQYDLFVVG SGFFGLTIAE RVATQLDKRV 
LVLERRPHLG GNAYSEPEPQ TGIEVHRYGA HLFHTSNQRV WDYVRQFTEF TGYQHRVFAL
HNGQAYQFPM GLGLVSQFFG RYFTPDEARA LIAEQAAEID TEDAQNLEEK AISLIGRPLY
EAFVKHYTAK QWQTDPKELP ASTINRLPVR YTFDNRYFND TYEGLPVEGY TKWLENMAAD
DRIEVRLDTD WFDVRDELRA ANPDAPVVYT GPVDRYFDYD EGRLGWRTLD FELEVLETGD
FQGTPVMNYN DADVPYTRIH EFRHFHPERA YPTDKTVIMR EFSRFADEDD EPYYPINTES
DRALLAAYRA KAKAETASAK VLFGGRLGTY QYLDMHMAIA SALNMYDNTL APHLRDGAAL
TQSESTNP