Gene Mvan_3123 details

Gene Information

Locus tag: Mvan_3123
Symbol:
ID: 4646153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3297119
End bp: 3301531
Gene Length: 4413 bp
Protein Length: 1470 aa
Translation table: 11
GC content: 70%
IMG OID: 639806600
Product: beta-ketoacyl synthase
Protein accession: YP_953931
Protein GI: 120404102
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
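
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3297119
END_BP = 3301531
GENE_LENGTH_BP = 4413
PROTEIN_LENGTH_AA = 1470


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assumption does not hold for a given record (for example, a partial CDS without a stop codon), the second check would need adjusting.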


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGAGCG GTGCCTCGAA CAACACCGAC GAACTGCCCG ATAATGCGAT AGCCGTCATC 
GGCATGGCGG GGAGGTTCCC CGGCGCGGGC TCGGTGTCGG AGTTCTGGCG CAACCTGCGC
AACGGAGTGG AGTCGATCGT CGACCTCCCC GAGGACGAGC TGCTCGCCAA CGGAGTCACC
GAGCGGACCC TGTCGAACCG CTCGTACGTG CGTCGTGCCG GCCTGATGCC GGGCATCGAC
GAGTTCGACG CGGACTTCTT CGGTTTCACG CCCTACGCGG CCCGGATGCT CGATCCGCAG
CACCGGCTGT TCCTGCAGAC GGTGTTCCAC GCCATGGAGG ACGCCGGCTA CGACCCGAAG
GGGCTGGAGG CCACCGTCGG CGTGTTCGGC ACCAGCAGCT CCAGCGGATA CCTGCTGCAC
AACCTGATGT CGAACTTCGA CCCGATGATG GTGATCGGGC AAGGCGCCAG CTTCGAGATG
GTCAACCTGT CGCTGCAGAA CGACAAGGAC CACCTGGCCA CCCGCGCGGC CCACCAGTTC
GACTTCCGCG GCCCCGCGCT GTCGGTGGCC ACGGCGTGTT CGTCGTCGCT GGTCGCCGTG
CATCTGGCCT GCCAGTCCCT GCTCAACGGC GAGTGCGACA TCGCGTTGGC GGGCGGCTCG
TCGCTGCGCA TCCCGCACCA TGTCGGCTAC TGGTACGAGC AGGGCGCGAT GGTGTCGCCC
ACCGGTCAGT GCCGCCCGTT CGACGTGCGC TCCGACGGCA CGATCTTCGC CAGCGGCGTC
GGCGTGGTGG TGCTCAAGGC GCTTGCCGAC GCCATCGACG ACGGCGACCA CATCCACGCG
GTGATCCGCG GCTCGGCGCT GAACAACGAC GGCTCGACGA AGATGACCTA TGCCGCGCCG
AACGCGCTGG GGCAGGCCGA GGTCATCGCC GAGGCGCACG CCATCGCCGG CGTCGACGCA
TCGTCGATCA CCTATGTCGA GACGCACGGC ACCGGCACCC CGCTGGGCGA CCCGATCGAG
ATCGAAGGCC TGCGCCAGGC GTTCGAGCTG TCCGAGGAGA CGCGCAGTGC GCCCTGCTAT
CTCGGGTCGG TCAAGTCCAA CATCGGCCAC CTGGAGACCG CCGCAGGCAT CGCCGGTCTG
ATCAAGGCCA TCCTGTGCCT CGAGCACAAG GCGATCCCGG CGACGCTGCA CTACACCAGC
CCGAACCCCG AGCTGCACCT CGACCGTGGC CCGTTCCGCA TCCGCAGCTC CGACGGGCCG
TGGGAGTCCG ACGGCATCCG TCGGGCGGGG GTCAGCTCGT TCGGTGTCGG CGGCACCAAC
GCCCACATCG TCCTCGAGGA GGCGCCGACC GCGCCCGTGC CTGCACCCCG GTCCGGGCCG
CAGGTCCTGG TGCTGTCCGC GAGAACCGAA GAGACACTCG CCCAGTCGCG GGCCGCGCTG
GCCGCCGAAC TGTCCGAGGT CGACGAGATC AGCCTGCCCG ATGCCGCCTA CACGCTGACC
CACCGGCGCA AGGACCCGGT CCGGCTGGCC GCCGTCGTGC ACGATCAGGA GAACGCGGCC
ACCGTGCTGT CGGCCGCCGA GACGGACAAC GTCTTCATCG GCCGGGCCGT CCCCGACCTG
CAGGACTCCG CGGAGCGGGT CGCGTTCCTG TTCCCCGGTC AGGGCGCCCA GCACGTCGGC
ATGGCCCGCG GCCTGTACGA CAACGAGCCG GTGTTCAAGC GGCACTTCGA CGAGTGCGCG
ACCGCGTTCA GCGACGACAT GGGCTATGAC CTGCGTGCCG AGATCTTCGA CGGGGTCGGA
CGCAACCTGG AGCACACCGA CCGGGCCCAG CCGGCGTTGT TCACCGTCGA GTACGCGCTG
GCCAAGCTGG TCCAGTCCTA CGGCGTCGAG CCGGCGATCA TGGCCGGGCA CAGCATCGGC
GAGTACCCGG CTGCCACCAT CGCCGGCGTG TTCGATCTGG ACACCGCGGT CAAGGTGGTG
TCCAAGCGGG CCAAGCTGAT GCACGCCGCC CCGCGCGGCG TGATGGTCGC GGTGCCGCTG
AGCCCAGAGG CGGTCGCCGA ACATCTCACC CCCGACGTCG ACGTCGCGAC GATCAACGAC
CCGGGCAGCT GCGTGGTCGC CGGCAGTGAG GAAGCGATCC GCACCTTCCA GGCGGCTCTG
GCCGAAAAGG GTGTGGCGGC TCGCCGGGTG CGCACGTCGC ACGCGTTCCA CTCCCGGCTG
ATGGACCCGG TCGTCGCCGA GTTCGGCGCG TTCCTGTCCG GGGTGACACT GCGCGAACCG
CAGATCCCGT TGCTGTCCAA CATCACCGGC ACCACGATGA CGGCGGCCGA GGCGACGAAC
CCGTCGACGT GGGCCCGCCA GATCCGGGCC ACCGTCCGTT TCGCCGACGA ACTGGATGCG
CTGCTGGCCG CGCCGGACCG CGTCCTGGTC GAGGTCGGTC CCGGCGGCAC GCTGACGTCG
TCGGCGGGCA GGCACCCGAA GTGGACGGAA CGGCACCGCG CCGTGCGCCT GATGCGTCAC
CAGGCGCAGA ACCGCAACGA CCACGACACG TTCCTGCTCG CATTGGGGCA GCTGTGGGCT
GCCGATGTGG AAGTGGATTT CAACCAGGGT GCCGAGGAGG ACCGCACCCT GATCACGCTG
CCCGGTTACC CGTTCGCCAA ACAGCGGCAC TGGGTCGAGC ACAACGCCAA CGCGGCGTGG
CTGGCCGGGG GAGCGGGTGC CGACGGAACG GCGGCGGCGG CGGGCTCCGC CGGTGTTGCG
CCGGTGGCTG CCGGCGGTAC CTCGACGGTG GAGGCCAAGC TGGCCCGCAT CTGGTCGCAG
TGCCTCGGCC TGTCCGACAT CGACCGCAAC GCCAACTTCT TCGAGATCGG CGGCGACTCG
CTGATCGCGA TCAGCGTCGC GATGACCGCG GGCCACGAGG GACTGGATCT CACCCCGCAG
GATCTCTACG AGAACCAGAC CGTGGCCGCG TTGGCCAAGG TGCTGACCGC CCGGTACGCC
GAGGGCGGCC TGGCCCGTCA GACGCTCGAC GACGCGGTGA ACCCGCCGGT GCCGCCGAAC
GTGGCGTACT TCCTCGAGCA CGGCCTGCGC GACATCGGAC GCTGGCGCAT CCCGGTGATC
CTCGGACTGC GTTCCGACGT CGGCGAGGAC GACGTCCGGG CGGTGCTGAC CGCCGTCACC
GAGGTGCACG ACGCGCTGCG CGTGCACCTG GTCGAGCGGG CCGGCACCTG GGACCAGCAC
ATCGCCGAGC CGGGGGAGTT CACCGAGCTG GTGGCCCGTC AGCTGCCCGA GGGGCTGGCC
GCGGGCAGCC CCCAGGAGCG GGAGGCGGTG CTCGGGTTCC TCGACGAGCA GGTCCGCGAA
CACCAGGTGG TGGTCCCGTT GGCCGCGACG TTCATCCGTG GCGTGACCGG CGGCCCGTCG
TACCTGGCGC TGAGCCTGCA CGGGATCGCC GGTGACGACG TGTCCCGCGA TGTGTTGCTC
ACCGACGTCT TCACCGCGTT CAGTCAGCGG ATGGCGGGCG AGGAGATCGT CCTGGCGCCT
GTTCCGGCGT CCTGGCGGGA GTGGTCGCAG CGGTGCGCGG GTCTGGCCAG CCACCCGGCG
GTGCTGGACA GCCGTGACTA CTGGTTGCAG ACGGCGGGCG CGTCGACGCT GCGAATCGCA
GGCCCGGAGC ACTCCGAACG CCCGGGCGTC GACGACGTGA CCCGGCTGTC CACCGCGCTG
TCGGCCGCCG AGACCGGCGA GATCGACGAT GCGCGGCGCA GGCTGCGCCT GCCCGTCGAG
GAGATCCTGC TGGCCGCGCT CGGCCGGGCC GTGGCGGCGA CCGTCGGTGA GGGCGCAGTG
TCCGTCGACC TGGGCGGTCG CGGCCGCTCG GTGCTCAAGC CGGACGTCGA CCTGCAGCGC
ACGGTCGGCT GGTTCACCAC GATCCACCCG GTCGTGCTGA CCGCGGCGCG GCAGGGCAGC
GCGACGCAGG CCCTCGGTGA CGTGCGCGAG ACGCTGAAGG CTGTTCCGCA CTACGGCATC
GGCTACGGGC TGCTGCGTTA CCTGTACGCG CCGACCGCGC GGGTGCTCGG CGCGAGCCGC
CCCGCCGACA TCCTGTTCTC GCACATCGGG ACCATCCCCG AGGTGCCTGC CGAGCAGCCC
GACGACGCTC CGGTGCGGTT CGACGCCGAC ACGGCCATGC CGATCCGGGA CGCCCTGCCG
GGCCTCGGGC ATGCCGTCGA GCTGCGGGTC TACCGGGCCG CGGGGGTGCT GCATCTGGAT
TGGTGGTACG ACAATCGCCG TCTGGGGCCC ACCGATGTGG AATCCTTTGC CCGGCAGTAC
TCGGAAGCGC TCCTGGATGT CACCCGGGAC GCGCTGGCCG AAGAGGACAC CGACGCGGCG
GGCGACGAGC TGGCTCTGGT CGATCTGTCG TGA
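
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any analysis. The following is a minimal sketch of that clean-up, plus a GC-fraction helper and a stop-codon check. The helper names are hypothetical, and only the first and last lines of the sequence are used as sample input, so the printed GC fraction applies to that line alone and not to the whole gene.

```python
# The three stop codons of the standard bacterial code (translation table 11).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Sample input: the first and last lines of the gene sequence above.
first_line = "GTGACGAGCG GTGCCTCGAA CAACACCGAC GAACTGCCCG ATAATGCGAT AGCCGTCATC"
last_line = "GGCGACGAGC TGGCTCTGGT CGATCTGTCG TGA"

print(len(clean_sequence(first_line)))            # 60 bases on a full line
print(round(gc_fraction(clean_sequence(first_line)), 2))
print(ends_with_stop(clean_sequence(last_line)))  # True: the gene ends in TGA
```

Applying `gc_fraction` to the full cleaned sequence would give a figure comparable to the GC content field in the Gene Information section.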
 
Protein sequence
MTSGASNNTD ELPDNAIAVI GMAGRFPGAG SVSEFWRNLR NGVESIVDLP EDELLANGVT 
ERTLSNRSYV RRAGLMPGID EFDADFFGFT PYAARMLDPQ HRLFLQTVFH AMEDAGYDPK
GLEATVGVFG TSSSSGYLLH NLMSNFDPMM VIGQGASFEM VNLSLQNDKD HLATRAAHQF
DFRGPALSVA TACSSSLVAV HLACQSLLNG ECDIALAGGS SLRIPHHVGY WYEQGAMVSP
TGQCRPFDVR SDGTIFASGV GVVVLKALAD AIDDGDHIHA VIRGSALNND GSTKMTYAAP
NALGQAEVIA EAHAIAGVDA SSITYVETHG TGTPLGDPIE IEGLRQAFEL SEETRSAPCY
LGSVKSNIGH LETAAGIAGL IKAILCLEHK AIPATLHYTS PNPELHLDRG PFRIRSSDGP
WESDGIRRAG VSSFGVGGTN AHIVLEEAPT APVPAPRSGP QVLVLSARTE ETLAQSRAAL
AAELSEVDEI SLPDAAYTLT HRRKDPVRLA AVVHDQENAA TVLSAAETDN VFIGRAVPDL
QDSAERVAFL FPGQGAQHVG MARGLYDNEP VFKRHFDECA TAFSDDMGYD LRAEIFDGVG
RNLEHTDRAQ PALFTVEYAL AKLVQSYGVE PAIMAGHSIG EYPAATIAGV FDLDTAVKVV
SKRAKLMHAA PRGVMVAVPL SPEAVAEHLT PDVDVATIND PGSCVVAGSE EAIRTFQAAL
AEKGVAARRV RTSHAFHSRL MDPVVAEFGA FLSGVTLREP QIPLLSNITG TTMTAAEATN
PSTWARQIRA TVRFADELDA LLAAPDRVLV EVGPGGTLTS SAGRHPKWTE RHRAVRLMRH
QAQNRNDHDT FLLALGQLWA ADVEVDFNQG AEEDRTLITL PGYPFAKQRH WVEHNANAAW
LAGGAGADGT AAAAGSAGVA PVAAGGTSTV EAKLARIWSQ CLGLSDIDRN ANFFEIGGDS
LIAISVAMTA GHEGLDLTPQ DLYENQTVAA LAKVLTARYA EGGLARQTLD DAVNPPVPPN
VAYFLEHGLR DIGRWRIPVI LGLRSDVGED DVRAVLTAVT EVHDALRVHL VERAGTWDQH
IAEPGEFTEL VARQLPEGLA AGSPQEREAV LGFLDEQVRE HQVVVPLAAT FIRGVTGGPS
YLALSLHGIA GDDVSRDVLL TDVFTAFSQR MAGEEIVLAP VPASWREWSQ RCAGLASHPA
VLDSRDYWLQ TAGASTLRIA GPEHSERPGV DDVTRLSTAL SAAETGEIDD ARRRLRLPVE
EILLAALGRA VAATVGEGAV SVDLGGRGRS VLKPDVDLQR TVGWFTTIHP VVLTAARQGS
ATQALGDVRE TLKAVPHYGI GYGLLRYLYA PTARVLGASR PADILFSHIG TIPEVPAEQP
DDAPVRFDAD TAMPIRDALP GLGHAVELRV YRAAGVLHLD WWYDNRRLGP TDVESFARQY
SEALLDVTRD ALAEEDTDAA GDELALVDLS