Gene Mmcs_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1789 
Symbol 
ID4110623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1930076 
End bp1931116 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID638030909 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_638954 
Protein GI108798757 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0412358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATCGG CGATGGACGG CGCCCAGGCG CGCAAGAGGG CACTGGTGCT CGGAGCCAGT 
GGAAACGTGG GCGCCGCAGT CGTCCGGCAC CTGGTGGCTG ACGGCGACGA CGTGCGAGTC
TTGTTGCGGC GCAGCAGTTC CACCAGGGGT ATCGACGGAC TCGACGTGGA CCGGCGCTAC
GGCGACATCT TCGACACCGA GGCGGTCGCC GCCGCGATGG CCGACCGCGA TGTCGTCTTC
TACTGCGTGG TGGACACCAG GGCGCATCTG GCCGATCCCG CACCGCTGTT CGCGACCAAC
GTGGAGGGTC TGCGCGGGGT GCTCGACATC GCCGCACGGG CGGATCTGAA GCGCTTCGTG
TTCCTCAGCA CCATCGGGAC CATCGCGGTC GGCGCCGACG GTGCGGCGGT GGACGAGGAC
ACACCGTTCA ACTGGAGCGG TAAGGGCGGA CCGTACATCG AATCCCGCCG TCAGGCCGAA
GACCTGGTGC TGCGCTGCGC CCGCGAGCGG GGACTGCCCG CGGTGGCGAT GTGTGTGTCC
AACCCGTACG GCCCGCCGGA CTGGAACCCC AGACAGGGTG CCCTCGTTGC GCTGGCCGCG
TTCGGCAAGA TGCCCTGCTA CATCCGCGGG GTGGGTGCGG AGGTGGTGGA CATCGACGAC
GCCGCACGGG CGTTGGTGTC GGCCGCCGAA CGCGGCCGGG TCGGCGAGCG CTACATCGTG
TCGGAGCGCT ACATGTCCCA GCGCGAGATG CTCACCCTCG CCGCGGAGGC GGCGGGTGCC
ACCCCGCCGA GGTTCGGCAT CCCGATGGCA CTGGTCCACG CCTTCGCCGC AGTCGCCGGG
ATGTCCAACC GGCTGTTCGG CACCGACCTC CCGATCAATC CGGCCGCGGC GCGGCTGATC
GCGCTGACCT CGCCGGCCGA CCACGGCAAG GCGACGCGTG ACCTCGGGTG GCGCCCCGGA
CCCACCGCCG ACGCGATCCG CCGCGCCGCC CGGTCCTACG TCGAACGGCG CGACCGCAAC
GAGCAGGTGG TCGCGCTGTG A
 
Protein sequence
MGSAMDGAQA RKRALVLGAS GNVGAAVVRH LVADGDDVRV LLRRSSSTRG IDGLDVDRRY 
GDIFDTEAVA AAMADRDVVF YCVVDTRAHL ADPAPLFATN VEGLRGVLDI AARADLKRFV
FLSTIGTIAV GADGAAVDED TPFNWSGKGG PYIESRRQAE DLVLRCARER GLPAVAMCVS
NPYGPPDWNP RQGALVALAA FGKMPCYIRG VGAEVVDIDD AARALVSAAE RGRVGERYIV
SERYMSQREM LTLAAEAAGA TPPRFGIPMA LVHAFAAVAG MSNRLFGTDL PINPAAARLI
ALTSPADHGK ATRDLGWRPG PTADAIRRAA RSYVERRDRN EQVVAL