Gene Mvan_3126 details


Gene Information

Locus tag: Mvan_3126
Symbol:
ID: 4646156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3312389
End bp: 3317134
Gene Length: 4746 bp
Protein Length: 1581 aa
Translation table: 11
GC content: 70%
IMG OID: 639806603
Product: beta-ketoacyl synthase
Protein accession: YP_953934
Protein GI: 120404105
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
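
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the reported gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Mvan_3126.
length_bp = gene_length(3312389, 3317134)
print(length_bp)                  # 4746
print(protein_length(length_bp))  # 1581
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.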


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.228806
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.663439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTG CTTTTGACGA GGCAGCTGTT CGTCGGTGGT TGGTCGACTA CCTGGTCACG 
AACAACGGTT GCAGTCCGGA GCACATCGAA CGCGGCGCCT CCATGCACGA CCTCGGCGTG
GGATCCCGCG ACGCGGTCGT GCTCACCGGC GTGCTGTCGG AGTACCTCGG CCGCGCCGTG
TCGCCCGTGG ACTTCTGGCA GTACCCGACG GTGGACGCGC TGGCCAAGTT CCTGACCGGT
GGCGAAGTCG AACCCGTCGA CCCCGGCCCC GGCGTCCGGC CGGCCGCCAC GAACGAGCCG
ATCGCGGTGA TCGGGCTCGG TCTGCGTCTG CCCGGTGGCG CCGACCTGGA CTCCAACATC
GAAGGGCCCG AAGCCTTCTG GGAGTTCCTC ACCGAGGGCC GCTCGTCGGT GCGGGAGGTC
CCCGAAGACC GCTGGGAGTG GTGCGAGGAC GGCACACCCG AGGGCGCCGC CGCGCTGGCA
GACACCACCA GGTGGGGTTC GTTCCTGCGC GACCTGGACG CCTTCGACGC CGAGTACTTC
GAGATCATCC CGCGCGAGGC CACCCGGATG GATCCGCAGC AGCGGCTGCT GCTCGAGGTC
ACCCACGAGG CCCTGGAGAA CGCGGGCATC GCCGCCGACT CGCTGGCCGA GACGCAGACC
GGCGTGTTCG CCGGCGCGAG CGCCGCCGAC TACGCACAGC TGGGCGCCAC GGACCTGAGC
GGCATCGACG CCTGGTACAG CACAGGTGGA GCACTGAGCA TCATCGCCAA CCGGGTGTCG
TACTACTTCG ACCTGCGCGG CCCGTCGGTG ACCGTGGACA CCGCCTGCTC GTCGTCACTG
GTGGCCATCC ATCTGGCGTG CCAGAGCCTG CGCTCGGGCG ACTCGGAACT GGCGTTGGCG
GCCGGGGTGA ACCTGCTGTT GTCACCGGCC CCGACGAGGA GTTTCGACCG GGCCAAGGCG
ATGTCGCCGA CGGGACAGTG TCACGCTTTC GACGCCGGCG CCGACGGCTT CGTGCGCGGC
GAGGGTTGCG GGGTGGCGGT GCTCAAGCGG CTGTCCGACG CGCAGCGCGA CGGTGACCGG
GTGCTCGCGG TGATCCGCGG GTCCGCGGTG AACCAGGACG GCCGCTCCAA CGGCCTGATG
GCCCCGAATC CGTCGGCGCA GATGGCGGTG CTGCGCGCCG CCTACGCGGC GGCCGGGGTG
GACCCCCGTG AGGTCGACTA CGTCGAGGCG CACGGCACCG GCACGCTGCT GGGCGATCCG
ATCGAGGCGC GGGCGCTGGG CACCGTGCTC GGTAAGGGCC GCGCGGCCGA TGCGCCGCTG
CTGCTCGGTG CGGTCAAGTC CAACCTGGGC CACCTCGAAG CCGCGGCAGG CATCGCAGGG
TTCGCCAAAG CGGTATTGGC GTTGCAGCAC AACATGATTC CGGCCAACCT CGGCTACCAG
AGCCCGAACC CGCACATTCC GTTCGAGAAG CTGCGGTTGA AGGTCGTCGC CGAGCACACC
GATTGGGCCC CCGCCGGGCG GCCCCGGCGT GCGGGCATCT CGTCGTTCGG GTTCGGCGGC
ACCAACGCCC ACGTCGTGAT CGAACAGGCT CCGGTGTTCG CCCCGTCGCC GGTCGAGGAA
CCCGAAGCGG TCACCACGCT GGTGGTGTCG GGCAAGTCGC CCGAGCGCAT CGCGTCGCAG
GCCGCCGCAC TGGCGGAGTG GATGGCCGGC GCCGGATCCG ACGCCTCACT GGTGGAGGTG
GCGCACTCGC TCAACCACCA CCGCGCCCAG CACGCCAAAT TCGCGACCGT GGTGGCCCGC
GACCGTGATC AGGCCATCGC GGGGCTGCAG GCGCTGGCCG CCGGGCAGTC CGCGCCGGGC
GTGGTCGGGG TTGCCGCAGG CAACCCGCAG CCTGGCACGG TGTTCGTGTA TTCGGGTCAG
GGGTCGCAGT GGCCGGGGAT GGCCCGGCAG CTGTTGGTTG ACGAGCCGGC GTTCGCGAAC
GCGTTGGCCG AGATCGAGCC GGTGTTCGTC GAGCAGGTCG GTTTCTCGTT GCGTGACGTC
ATCGCAGGTG GCGAGACCGT CAGTGGTGAT GCGCAGGTTC AGCCGGTGCT GATGGGTCTG
CAGCTGGCGT TGACCGAGCT GTGGCGGTCC TACGATGTGC ATCCGGATGC GGTGATCGGT
CATTCGATGG GTGAGGTGAC CGCCGCAGTG GTGGCCGGGG CGTTGAGTCT GGCCGACGGG
CTGAAGGTCA TTGCCGCGCG GTCGTCGATC ATGTCCCGGC TGGCCGGGCA GGGCGCCGTC
GCGCTGCTCA ACCTCGACGC CGACGCGGCC CGCTCGCTGA TCGCCGACCA TCCGAGCGTC
GAGATCGCCG GATATCTGTC GCCGCGCCAG ACGGTGGTGG CCGGCCTGCC CGAGCAGGTC
GACGCCGTCA TCGCCGCGGT CACCGCACAG AACACGTTCG CCCGCCGGGT CAACATGGTC
GTCGCCTCCC ACACCGCGCT GATGGACCCG GTGCTGCCCG ACATCCGCGC GGCGCTGGCC
GGAATCGAAC CCAACATCCC GACGCTGCCG TTCCTGTCGA CGGTCACCGG GGCCGACAGC
GCACCGGTGC TGGACGCCGA CTACTGGGTG GCCAACGTGC GCCAACCGGT CAAATTCAGC
CAGGCCATCG TCGCGGCGGC AGCCGACAAC GGCACGTTCA TCGAGATCAG CCCGCACTCG
ACGCTGGGTC AGGCGATCGG CAACACGCTT GGTGACGGGC CGGACCATCA CATGCTCGGC
ACCCTGGCGC GCGACACCGA CGACACGGTG ACGTTCCACG CGAACCTGAA CGCCACCCAC
ACGTCGCGGC CCCCGCACAC CGCCCATCAC GGCGAACCCC ACATCGTGCT GCCCACCACC
CCGTGGCACC GCAGCAGACA CTGGGTGGAC GTCCGCCCGC TGCGCCGCGG TGGCGGCGTG
CGTGCCGGTG CGTTGCCCGC CGACAGCGCG GTCCCGCCGG AGTGGTTCTG CGGGCTGACG
TGGCCCGCCA AACCCCTTGT GGCGCAGGAT GGTCCAGTCG ACCCGGGCGC CAGCTGGCTG
GTGGTCGGCG ACGACGCGCT GGCCGCCGAG ATGGGTAAGC TGCTGGGCAG GCCGGTGGCC
ACCGGTGACG GAGCGGATCT GTCCTCGGCC ACGCACGTCC TCTACGCGCC GACCTCCGGT
GACGAGCTGG GTTACGAGCT CTTCGAGGCG GGCCGTGCGA TTGCGACCAC CGCGAGCCGC
GTATCGCAGC CCCCGAAGCT GTTCCTGCTG ACCCGCAATG CCCAGCCGGT CAGCGAGGGG
GACCGGGCCA ACCCCGGGCA GGCGGTGCTG TGGGGCCTGG GCCGCACACT GGCGCTGGAG
CATCCCGACA TCTGGGGCGC CCTGATCGAC ACCGACGAGT CGGTGCCCGC GGTCGTCGCC
GCGCGCTGGG TGCTCGCGGA GGCGCATGCC GGCGACGGTG AAGACCAGCT GGTGTACCGG
GCCGGCATCC GCCGGGTGGC CCGCCTCGTG CACGCGCTGC CGCCCGCGCC GACGGGTTCG
GGGACACTCG ACGCCGACGG CGCGCACCTG GTCATCGGCG CGACGGGCAA CATCGGCCCC
CGACTCGTCG AGCAGCTCGC CGCGATGGGC GCCAAGACCG TGGTGGCGGT GTCCCGTAAC
CCGGGCTCAC GGCTGGACGA ACTGACCGCG CGGCTCGCGG CCTCCGGGAC GACGGTCGTG
ACCGCCGCAG CGGATGCGTC CGACGCGTCA TCGCTGCGAG CGGTGTTCGA CCGCTTCGGG
GCGGACCTGC CCCCGCTGAA GGGCGTCTAT CTGGCGGCGA TGAGCGGCGG TCCGGTCACG
CTGGACGAGA TGACCCACGA CGACGTGGTG GCGATGTTCC GGTCCAAGAT GGATGCCGCG
GCGCTGCTGC ACCAGCTGTC GGCGGGCCAT CCCGTCGAGC AGTTCGTGCT GTTCTCGTCG
ATCTCCGGCG TGCTGGGTTC GCGCTGGCTG GCGCATTATG CGGCGACGAC GACGTTCCTG
GACACCTTCG CGTTCGCCCG CCGGGCGGCA GGGCTGCCGG CCTGCGCGAT CAACTGGGGC
CTGTGGAAGT CGCTGGCCGA CGCGCAGACC GGGTTCGAAC GTCAGGCCAC CCAGGAGTCG
GGTCTGGAGC CGATGGACGA CGCGGTCGCC ATCACCGCGC TGCGCTCGTT CATCGGACCG
CAGGCGCCCG CCAGGGCCAC CGTGGTGGCG GCGGACTGGC CCCGGCTGGC CGCGGCCTAC
CACACCCGCG CGCAACTGCA CATCCTCGAC GACCTGCTGG CCACAGAGGG CACAGCGGCC
ACCGGCCTTA CCCCGACCGG TGACACGAGA TTTCGCCAGG AGCTGAAGGA GTGCGAGCCC
CAGCGGCGTG TGGAGTTGCT CACCGATCAC GTGTTGTCGC AGATCGCCGC CGCAATGGGG
CTGGCTTCGA CGCACACGCT CGACCCGACG GTGGGGTTCT TCCAGTTCGG AATGGATTCG
CTGATGAGCG TAACGTTGCA GCGTTCGCTG AGCGAAAGCC TGGGGGAGGC CCTGCCGGCG
TCGGTGGTGT TCGACTACCC GACCGTGGAA GCGCTCACCG ATTATCTGGC GTCGGTTCTT
CCTGAGATAA TCGAGACCGC CGGTACCGAC AACGACGTCG ATGGTGGCCC CAGCGTCATT
GAGGACGCCT ACGACGACCT CGCCGAGGAC GAGTTGCTGG CGAGACTTTC GGAAAGATTG
AGTTGA
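
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a check for a terminal stop codon. The helper names are hypothetical, and the stop-codon set is the one commonly listed for translation table 11 (TAA, TAG, TGA). Only the first line of the sequence is used as sample input here, so the GC value it prints describes that line, not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First line of the gene sequence, copied from the record.
first_line = "ATGACCTCTG CTTTTGACGA GGCAGCTGTT CGTCGGTGGT TGGTCGACTA CCTGGTCACG "
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(round(gc_fraction(seq), 3))

# Last line of the gene sequence, copied from the record.
print(ends_with_stop(clean_sequence("AGTTGA")))  # True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a comparison with the GC content field of the record.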
 
Protein sequence
MTSAFDEAAV RRWLVDYLVT NNGCSPEHIE RGASMHDLGV GSRDAVVLTG VLSEYLGRAV 
SPVDFWQYPT VDALAKFLTG GEVEPVDPGP GVRPAATNEP IAVIGLGLRL PGGADLDSNI
EGPEAFWEFL TEGRSSVREV PEDRWEWCED GTPEGAAALA DTTRWGSFLR DLDAFDAEYF
EIIPREATRM DPQQRLLLEV THEALENAGI AADSLAETQT GVFAGASAAD YAQLGATDLS
GIDAWYSTGG ALSIIANRVS YYFDLRGPSV TVDTACSSSL VAIHLACQSL RSGDSELALA
AGVNLLLSPA PTRSFDRAKA MSPTGQCHAF DAGADGFVRG EGCGVAVLKR LSDAQRDGDR
VLAVIRGSAV NQDGRSNGLM APNPSAQMAV LRAAYAAAGV DPREVDYVEA HGTGTLLGDP
IEARALGTVL GKGRAADAPL LLGAVKSNLG HLEAAAGIAG FAKAVLALQH NMIPANLGYQ
SPNPHIPFEK LRLKVVAEHT DWAPAGRPRR AGISSFGFGG TNAHVVIEQA PVFAPSPVEE
PEAVTTLVVS GKSPERIASQ AAALAEWMAG AGSDASLVEV AHSLNHHRAQ HAKFATVVAR
DRDQAIAGLQ ALAAGQSAPG VVGVAAGNPQ PGTVFVYSGQ GSQWPGMARQ LLVDEPAFAN
ALAEIEPVFV EQVGFSLRDV IAGGETVSGD AQVQPVLMGL QLALTELWRS YDVHPDAVIG
HSMGEVTAAV VAGALSLADG LKVIAARSSI MSRLAGQGAV ALLNLDADAA RSLIADHPSV
EIAGYLSPRQ TVVAGLPEQV DAVIAAVTAQ NTFARRVNMV VASHTALMDP VLPDIRAALA
GIEPNIPTLP FLSTVTGADS APVLDADYWV ANVRQPVKFS QAIVAAAADN GTFIEISPHS
TLGQAIGNTL GDGPDHHMLG TLARDTDDTV TFHANLNATH TSRPPHTAHH GEPHIVLPTT
PWHRSRHWVD VRPLRRGGGV RAGALPADSA VPPEWFCGLT WPAKPLVAQD GPVDPGASWL
VVGDDALAAE MGKLLGRPVA TGDGADLSSA THVLYAPTSG DELGYELFEA GRAIATTASR
VSQPPKLFLL TRNAQPVSEG DRANPGQAVL WGLGRTLALE HPDIWGALID TDESVPAVVA
ARWVLAEAHA GDGEDQLVYR AGIRRVARLV HALPPAPTGS GTLDADGAHL VIGATGNIGP
RLVEQLAAMG AKTVVAVSRN PGSRLDELTA RLAASGTTVV TAAADASDAS SLRAVFDRFG
ADLPPLKGVY LAAMSGGPVT LDEMTHDDVV AMFRSKMDAA ALLHQLSAGH PVEQFVLFSS
ISGVLGSRWL AHYAATTTFL DTFAFARRAA GLPACAINWG LWKSLADAQT GFERQATQES
GLEPMDDAVA ITALRSFIGP QAPARATVVA ADWPRLAAAY HTRAQLHILD DLLATEGTAA
TGLTPTGDTR FRQELKECEP QRRVELLTDH VLSQIAAAMG LASTHTLDPT VGFFQFGMDS
LMSVTLQRSL SESLGEALPA SVVFDYPTVE ALTDYLASVL PEIIETAGTD NDVDGGPSVI
EDAYDDLAED ELLARLSERL S