Gene Mvan_0563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0563 
Symbol 
ID4644299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp604943 
End bp606124 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID639804066 
Product4-carboxymuconolactone decarboxylase 
Protein accessionYP_951411 
Protein GI120401582 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
[COG0599] Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit 
TIGRFAM ID[TIGR00778] alkylhydroperoxidase AhpD family core domain
[TIGR02425] 4-carboxymuconolactone decarboxylase
[TIGR02427] 3-oxoadipate enol-lactonase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.260619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATC CGCCTCTCAC CGCCATCCAC CTCGGCGGAC CCGACGACGG GCCGCTACTT 
TTGCTGGGCC CGTCGCTGGG GACAACGACG GCCACCCTGT GGACAGGCGT GGCGCAACGA
CTGGTCGATC ATGTGCGCGT AGTCGGATGG GACCTTCCCG GTCACGGCCG CGGCCGTCGG
GCCCACCCGT TCACCATCGC CGACCTCGCA GCAGCGGTGT TGGTGATCGC GGACGACCTT
AACGTGGAGA CATTCCACTA CGCCGGTGAT TCGGTGGGTG GTTGCGTCGG TCTGCAGCTG
CTGCTCGATG CTCCGCAACG GGTCAGTTCG GCGACCCTGC TGTGCACCGG CGCCGCCATC
GGCACCCCGG ACGGCTGGCT CGCACGTGCC GCTACCGTCC GCGCCGGCGG TGTCGACACG
ATGCTGGCCG GCGCGGCCGA GCGCTGGTTC GCGCCGGGCT TTGTCGACCG CGAGCCGGGG
ACCGCCTCGG CGCTGCTGGA TGCCCTGAGT CACACCGATG CGGAGTCCTA TGCGCAGGTA
TGCGAAGCGT TAGCAGTGTT CGATGTAACC AATCAGTTGT CCGAAATCGT CACTCCGGTC
CTGGCCGTTG CGGGTAGCGT CGACGTCCCC ACGCCGCCGG AATCGTTGCG GCGCATCGCC
TCCGACGTAA AAGACGGGGA CCTGGTGGTG CTCGAAGGCG TCGGACACCT GGCCCCCGCC
GAAGCGCCGC AGCGCGTGGC CGGCCTCATC GCAGAGATCG TCGGTGTTCC GCAGCCCCCG
AGCAAGACCC TCGAAGACGT GCACCGTGCA GGAATGGCAG TACGGCGCGA GGTGCTCGGC
GAGGCGCACG TCGACCGGGC AGTGGCCGGT ACCACCGACC TGACCGCCGA CTTCCAGCAC
ATGATCACCC AGTACGCCTG GGGCACCATC TGGACCCGCC CCGGTCTTGA CCTTCGCAGC
CGCTCGATGA TCACGCTGAC CGCGTTGGTC GCGCGCGGTC ACCACGAGGA ACTGGCGATG
CACCTGCGGG CGGCCCGCCG GAACGGTCTG AGCAACGACG AAATCAAGGA GCTACTAATG
CAAACCGCCA TCTACTGCGG TGTCCCCGAC GCCAACTCCG CCTTCCGCAT CGCCGCCGAG
GTCCTGCCCG AGTTCGACGA GCACCCAGGT GCGCCGTCAT GA
 
Protein sequence
MSNPPLTAIH LGGPDDGPLL LLGPSLGTTT ATLWTGVAQR LVDHVRVVGW DLPGHGRGRR 
AHPFTIADLA AAVLVIADDL NVETFHYAGD SVGGCVGLQL LLDAPQRVSS ATLLCTGAAI
GTPDGWLARA ATVRAGGVDT MLAGAAERWF APGFVDREPG TASALLDALS HTDAESYAQV
CEALAVFDVT NQLSEIVTPV LAVAGSVDVP TPPESLRRIA SDVKDGDLVV LEGVGHLAPA
EAPQRVAGLI AEIVGVPQPP SKTLEDVHRA GMAVRREVLG EAHVDRAVAG TTDLTADFQH
MITQYAWGTI WTRPGLDLRS RSMITLTALV ARGHHEELAM HLRAARRNGL SNDEIKELLM
QTAIYCGVPD ANSAFRIAAE VLPEFDEHPG APS