Gene Mmcs_1701 details

Gene Information

Locus tag: Mmcs_1701
Symbol:
ID: 4110535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1839053
End bp: 1840234
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 638030820
Product: 4-carboxymuconolactone decarboxylase
Protein accession: YP_638866
Protein GI: 108798669
COG category: [R] General function prediction only
              [S] Function unknown
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
        [COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit
TIGRFAM ID: [TIGR00778] alkylhydroperoxidase AhpD family core domain
            [TIGR02425] 4-carboxymuconolactone decarboxylase
            [TIGR02427] 3-oxoadipate enol-lactonase
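
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates (a common convention for records of this kind, but not stated here) and a complete CDS whose final codon is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1839053
END_BP = 1840234


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```

Both results match the Gene Length (1182 bp) and Protein Length (393 aa) fields.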


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0186237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATC CGCCTCTCAC CGCCATCCAC CTCGGCGGAC CCGACGACGG GCCGCTGCTT 
TTGCTGGGCC CCTCGCTGGG GACAACGACG GCCACCCTGT GGACAGGCGT GGCGCAACGA
CTGGTCGATC ATGTGCGCGT AGTCGGATGG GACCTTCCCG GTCACGGCCG CGGCCGTCGG
GCCCACCCGT TCACCATCGC CGACCTCGCG GCAGCGGTGC TGGTGATCGC GGACGACCTT
AACGTGGAGA CATTCCACTA CGCCGGTGAT TCGGTGGGTG GTTGCGTCGG TCTGCAGCTG
CTGCTCGATG CTCCGCAACG GGTCAGCTCG GCGACCCTGC TGTGCACCGG CGCCGCCATC
GGCACCCCGG ACGGCTGGCT CGCACGTGCC GCTACCGTCC GCGCCGGCGG TGTCGACACG
ATGCTGACCG GCGCAGCCGA GCGCTGGTTC GCGCCGGGCT TTGTCGACCG CGAGCCGGGG
ACCGCCTCGG CGCTGCTGGA TGCCCTGAGT CACACCGATG CGGAGTCCTA TGCGCAGGTA
TGCGAAGCGT TAGCAGTGTT TGATGTAACC GATAGGTTGT CCGAGATCGT CACTCCGGTC
CTGGCCGTTG CGGGTAGCGC TGACAGCCCC ACGCCGCCGG AATCGTTGCG GCGCATCGCC
TCCGACGTAA AGGACGGGGA CCTGGTGGTG CTCGAAGGCG TCGGACACCT GGCCCCCGCC
GAAGCGCCGG AGCGCGTGGC CGGCCTCATC GCAGAGATCG TCGGTGTTCC GCAGCCCCCG
AGCAAGTCCC TCGAAGACGT GCACCGTGCA GGAATGGCGG TACGGCGGGA GGTGCTGGGC
CATGCGCACG TCGACCGGGC AGTGGCCGGT ACCACCGACC TGACCGCCGA CTTCCAGCAC
ATGATTACGC AGTATGCCTG GGGCAGCATC TGGACCCGCC CGGGTCTCGA CTTCCGCAGC
CGCTCGATGA TCACGCTGAC GGCGCTGGTC GCGCGCGGTC ACCACGAGGA ACTGGCGATG
CACCTGCGGG CGGCCCGCCG GAACGGTCTG AGCAACGACG AGATCAAAGA GCTGCTCTTG
CAGACCGCGA TCTACTGCGG AGTTCCCGAC GCCAACTCCG CCTTCCGCAT CGCCGCCGAG
GTCTTGCCGG AGTTTGACGA GCACCCAGGT GCGCCGTCAT GA
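
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as the gene sequence is here. The following is a minimal sketch; `gc_percent` is a hypothetical helper rather than part of any database tooling, and it is demonstrated on the first line of the sequence only. Passing the full gene sequence would give the whole-gene figure for comparison with the reported value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, used here as a short demonstration input.
first_line = "ATGAGCAATC CGCCTCTCAC CGCCATCCAC CTCGGCGGAC CCGACGACGG GCCGCTGCTT"
print(round(gc_percent(first_line), 1))
```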
 
Protein sequence
MSNPPLTAIH LGGPDDGPLL LLGPSLGTTT ATLWTGVAQR LVDHVRVVGW DLPGHGRGRR 
AHPFTIADLA AAVLVIADDL NVETFHYAGD SVGGCVGLQL LLDAPQRVSS ATLLCTGAAI
GTPDGWLARA ATVRAGGVDT MLTGAAERWF APGFVDREPG TASALLDALS HTDAESYAQV
CEALAVFDVT DRLSEIVTPV LAVAGSADSP TPPESLRRIA SDVKDGDLVV LEGVGHLAPA
EAPERVAGLI AEIVGVPQPP SKSLEDVHRA GMAVRREVLG HAHVDRAVAG TTDLTADFQH
MITQYAWGSI WTRPGLDFRS RSMITLTALV ARGHHEELAM HLRAARRNGL SNDEIKELLL
QTAIYCGVPD ANSAFRIAAE VLPEFDEHPG APS