Gene Mvan_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1029 
Symbol 
ID4644250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1080142 
End bp1081401 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content69% 
IMG OID639804530 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_951873 
Protein GI120402044 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.250337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.268513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCCGA TCCGCGTCAC CGGTGCGCGT ATCACCCCCG TCGCGTTCGC CGACCCTCCG 
CTGCTCAACA CCGTCGGGGT GCATCAGCCC TACGCGTTGC GCGCGATCAT CCAGCTGGAC
ACCGACGCCG GTCTGGTGGG CCTGGGCGAG ACCTACGCCG ACACGCGGCA CCTGGCGCGT
CTACGGGCCG CCGCGGAGGC CATCACCGGC CTGGATGTGT TCGCACTGAA CAGGATCCGC
GCATCCATCG GTTCCCGACT CGAAGGCGAC ACCACCGCGG TCGGCACCGC GGGAATGATC
ACCTCGGCGA GCGTCGTCGA CCAGGTGCTG TCCCCGTTCG AGGTTGCCTG CCTGGACGTG
CAGGGGCAGT CGCTGGGGCG GCCGGTGTCG GATCTGCTGG GCGGCGCGGT CCGTGATGCG
GTCCCGTTCA GCGCCTATCT GTTCTACAAG TGGGCGGGAC ACCCGAACGC CGAACCCGAC
CGGTTCGGCG AGGCCATGGA CCCGAACGGG CTGGTCGCGC AGGCCCGCCG GATCATCGAT
GAATACGGCT TCACCGCAAT CAAACTCAAG GGTGGGGTGT TCCCGCCCGA AGAGGAGATG
GCGGCCATCG AGGCGCTCTC GCGTAACTTC CCCGGTCTCC CGCTGCGGCT CGACCCGAAC
GCGGCGTGGA CGCCGCACAC CGCCGTGAAG GTCGCCTCCG GCCTGGCCGG GATCCTCGAA
TACCTGGAGG ATCCGACGCC GGGGCTGGCC GGGATGGCCG AGGTGGCACA GCAGGCGCCG
ATGCCGCTGG CGACCAACAT GTGCGTCGTC GCATTCGATC AGCTGGCGCC CGCTGTCACG
AAGAACGCAG TGCGCGTGGT GCTTTCGGAT CACCACTACT GGGGCGGGTT GCAGCGCTCG
CGGTTGCTGG CGGGCATCTG CGACACGTTC GGGCTGGGGC TGTCGATGCA CTCCAATTCG
CATCTGGGTA TCAGCCTGGC CGCGATGGTG CACCTGGCCG GTGCCACACC GAACCTCACC
TACGCGTGTG ACACGCACTG GCCGTGGAAG ACGGAGGACG TCGTCAAGGA CGGGGCGCTG
GCCTTCGTCG ACGGGGCCGT GCCGGTGCCC ACCTCCCCCG GCCTGGGTGT CGAGATCGAC
GACGACGCAC TCGACGCGCT GCACGAGCAG TACGTGCGCT GCGGCATCCG CGACCGCGAC
GACACCGGCT ACATGCGCAC CGTAGATCCG TCGTTCGAGC CGGCCGGCCC CCGCTGGTGA
 
Protein sequence
MAPIRVTGAR ITPVAFADPP LLNTVGVHQP YALRAIIQLD TDAGLVGLGE TYADTRHLAR 
LRAAAEAITG LDVFALNRIR ASIGSRLEGD TTAVGTAGMI TSASVVDQVL SPFEVACLDV
QGQSLGRPVS DLLGGAVRDA VPFSAYLFYK WAGHPNAEPD RFGEAMDPNG LVAQARRIID
EYGFTAIKLK GGVFPPEEEM AAIEALSRNF PGLPLRLDPN AAWTPHTAVK VASGLAGILE
YLEDPTPGLA GMAEVAQQAP MPLATNMCVV AFDQLAPAVT KNAVRVVLSD HHYWGGLQRS
RLLAGICDTF GLGLSMHSNS HLGISLAAMV HLAGATPNLT YACDTHWPWK TEDVVKDGAL
AFVDGAVPVP TSPGLGVEID DDALDALHEQ YVRCGIRDRD DTGYMRTVDP SFEPAGPRW