Gene Mvan_1000 details


Gene Information

Locus tag: Mvan_1000
Symbol:
ID: 4645785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1029593
End bp: 1037026
Gene Length: 7434 bp
Protein Length: 2477 aa
Translation table: 11
GC content: 70%
IMG OID: 639804501
Product: amino acid adenylation domain-containing protein
Protein accession: YP_951844
Protein GI: 120402015
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
        [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
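
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1029593
END_BP = 1037026


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 7434
print(protein_length(length_bp))  # 2477
```

Both results agree with the Gene Length (7434 bp) and Protein Length (2477 aa) fields in the record.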


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGGC AGTCCGGCTT CGCCGTGGTC GGGTATGCGG TCCGTTTCCC CGGTGCCGCC 
GATGCGCGCG AGTTCTGGGA TGTGCTGGCC GAAGGCCGCG ACGCCGTCTC GGAAGTCCCT
GCCGACCGGT GGGATGTCGA CGAGTTCTTC GACGCCGACC CCGATGCCGC GGGCAAGATG
GTCTCCCGCC GAGCTGGGTT CGTCGACGAC GTCGCGGGTT TCGATGCGCC GTTCTTCGGG
GTGTCGGCCC GGGAAGCGAT GTTCATGGAC CCGCAACACC GCCTGGTGCT GGAGACCGCG
TGGTCGGCGG TGGAGCACGC GGGCATCGCG CCGGAGGCGC TGGCGGGCAC GCGGACCGGC
GTCTTCCTCG GGCTGTCCAC CCACGAGTTC CTCGGCATGC TGATCGCGCA CACCGGCTAC
GAGGACGTCG ACATCTATTC CGGCACCGGC ACTTCGCCCG CCGCCGCGGC CGGGCGCGTC
AGTTTCCGGA TGGGACTGCA GGGGCCGGCG GTGGCCGTGG ACACGGCGTG CAGTTCGTCG
TTGGTGGCGG TGCACCAGGC GTGCCAGGCG CTGAGAGACG GCGACTGCGA CACGGCGCTC
GTCGGTGGGG TGAACGTCAT CCTGACCCCG GTCCCGATGA TCAACCTGAC CCGGGCGCGA
ATGCTGGCCC CGGACGGACG GTGTAAGACC TTCGACGCCG CCGCCGACGG TTACGTGCGC
GGCGAAGGGT GCGGGGTTGT GGTGCTCAAA CGCACCGGGG ACGCGCTGCG CGACGGCGAC
CGGATCCGGG CGGTGATCCG CGGCAGCGCA GTGAACCAGG ACGGCGCCTC CGGCGGCCTG
ACCGTGCCGA ACGGCGTTGC GCAGCAGCAC GTCATCGCCG ACGCGCTGCG CCGGGCCGGC
CTCGGCGGCG GCGAGGTCGA CTATCTGGAG GCGCACGGGA CGGGGACATC CCTGGGCGAT
CCCATCGAGG TCCAGGCCGC GGCGGCCGCG TTCGGTGACG GCCGCGACCC CGACCGTCCG
CTGCTGATCG GCTCGGTGAA AACCAACATC GGCCATCTGG AAGCCGCTTC CGGCATCGCC
GGGCTGATCA AGGTGGTGCT GTCGCTCGAA CACGAGACGT TGCCCGTGCA TCTGCATCTG
CGGCGGCCGT CTCCGCACAT CCCGTGGGAG CGGCTTCCGG TCCGTGTCGT CGCGGAGGCG
ACGCCGTGGC AGCGCAACGA CAGACCGCGC ATTGCCGGGG TGAGTTCGTT CGGATTCTCG
GGCACGAACG CACATGTGCT CCTCGAGGAG GCGCCCGTCG CCTCACCGAC CGAGGAACCC
GACGAGGCGC CCCACCGCAG GTACCACCTG CTGCCGCTGT CGGCGCGCAC GCCGGAAGCG
CTGGTGCGGC TGGCATCTCG CCATCTCGCA TACCTGGAGG ACGATCCGGA CGCGACGATC
GTCGACATCT GCGCGGCGGC CGGGGCGGGG AGGTCGCACT TCGAGCACCG CGCAGCGCTG
GTGGTGGATT CGCGACCGCG GGCCCGACGG TTGTTGCGGG CGATCCACGA AGAGCGTCCG
GCTCCCGGAC TGGTGCGGGG CGTCTGCAGT GACCGACCGA AGACGGCCTG GCTGTTCGGC
GGCCAGGGCA ACCAGTTCCC CGGCATGGCA AGGGAAGTGT TCGACCACGA ACCGGTTTTC
GCGCAGACTC TGCGACGGTG CGGCGAGGTG CTGGACGGTC TGCTGCCGCG GCCGCTGCTG
GAGGTGATCT TCGATACCGG GCCGGAGGCC GAGCACAATC TGCGGAACAC CGCCTTCGCC
CAACCGGCCC TGTTCGCCGT GGAGATGGGG ATGGCACGGT TGTGGCGGTC CTGGGGCGTC
GAGCCGGACG TGGTGCTGGG CCACAGCGTC GGCCAGTACG CGGCGGCCTG TGTCGCCGGG
GTCTTCGGCC TCGACGAGGG CGCCCGGCTG ATCGCCGAAC GCGGCCGGCT CTTCGCAAAT
CTGCCCTCGG GTGGGCGGAT GCTCGCCGTG TTCGCCGATC CAGACCGGGT CGAACGCTGT
ACCGCCGAGC ACTCCCGCCT GTCGGTCGCC GCCTACAACG GCGCCAACAC CGTGCTGTCA
GGCCCCGCAC CCGATCTCGA ACAGGTCCAC GCCGAGCTGA CCGCCGCGGG GTCGCGCTGT
GACTGGCTGG ACACCAGCCA CGCATTCCAC TCCGCTCTCG TGGACCCGGC ACTCGACGAA
TTCGAGTCCT TCGCAGCGCA ATTCGAATAC CGCGCACCGA AGCTGACGCT CGTCTGCAAC
CGCACCGGCA AGGTGCTGAC CCGGCAGACC CGGCTCGATG CGCCGTATTG GCGGCGGCAC
GCCCGCGAGC CGGTCCGATT CGCCGACAGT GCGGCAACTC TGGCCGATCT GGGCTGCGCG
GTGCTCATGG AGCTCGGCCC GCAACCGGTC CTCATCGCGG CGGCGCTGCG CAACTGGCCC
GAGACCGCGC CCGTACCCAC CGCCATCGCC TCGCTGCGCC GCGATGCCGA CGCCCACCGC
TGCTTCACCG ATGCGCTGGG CGTCGCGTAC ACCACCGGTC ACCGGCTCGA CTTCGCCGGT
CACCTCGGAC GACGCCGGAC ACTCGACCTG CCCACCTACC CGTTCGAACA CCGCGCCTAC
TGGTTCCCGA CCACGAAGGT GCACACGCTG CCGGCCGGGG GCGGCGTCGA ATCCCCGGCG
GCCGAGTCGA ACACCGGCGA CTGGCTCACC GGGATGTCCG ACGAGCAGCG GATCGACCGC
ATCATCGAGC TGACCCGCAC CGAGCTCGGC AACGCGTTGC GGGTAGCCCC GGCGGAGATC
GACCCCACCG CCGAATTCAT GACCATCGGT ATGGACTCGA TGATCGCGAT GGAGCTGCGC
GGACGGCTGC AGGCCGCATT GGGCACCCCG GTGCCCGCCT CTTTGTTCTT CGAGAATCCC
ACCGTGTCGA CGCTGGCCGA GGCGCTGCAC GCGCTGTGGC TGGACGCCTC GTCGGATCCG
TCGCGGAGGG AGTCGCCCAT TCCCCGGGTG CCGCACGGCT GCACGCTGCC CTGCGACGTG
CCGCTGTCGC ACGCACAGGA ACAGCTGTGG TTCCTCAATC AGCTGCTTCC GTCGTCGAGT
GCATACAACG TCGCCATCCG AGTGGACATC CGCGGCGCAC TGGATCATGA GGTGCTGCAG
CGCAGCCTCG AGGCCGTCGT CGACCGACAT GAGGCGCTGC GCACGGTGTT CCAGAGCCGC
CACGGCGCCC CGCAGGCGAT ACTCACACCT TCTCAACCGA TCAAGCTGGC CTTCGAGGAG
ATCGGTGACG AGGCCGACAT CGCCGCCGCC GCAGTGCGGG AAGCCAGCGT TCCGTTCGAC
ATCGGCGCCG GCCCGCTGCT CCGGGCGCGA CTGTTCGGGG TCGGGGACCA GCGGCACGTG
CTGGTCGTGA CGATGCACCA CATCGTCACC GACGGTTGGT CATTCCGTGT GCTGCTCGGC
GACCTGGGCC GGACCTACCA GGCGCTGGAG CGCGGGGCAC CGGCACCGCT GGACGACCTG
CCGATCCAGT ACGCGGATTA CGCCCGGTGG CAACGCGAGC AGTTGACCGG TCCCGAGTTC
GACGCCCACA TCGAGTTCTG GAAGGCCGAT CTGGCGGGCG CACCGCCGCT GGAGCTCGAC
ACCGACCGGC CGCGGCCGAA GTCCCCCACC TTCCGCGGCG CGCGGATCCG CTTCGACCTG
GGACGGGAAC GTGCCGACGC ATTGCGCGAT CTGTGCCGGG CCGGCAACGT CACGCTGTCG
GTTCCGCTGC TGGCGGCCTT CGCCACCGTG CTGCAACGCT ATTCGGGGCA ACACGATCTG
GTGATCGGCA CGCTGACCGC CAATCGCGGC AGGCTCGAAA CCGAGAATCT CATCGGGCTC
TTCGTCAACG CACTGCCGAT CAGGATCCGG CTCGACGGTG ACCCCGACAT GACGGAGTTG
ATCGACAGGA TCCGCGGGCG CATGTCGGAG GTACTGGCAC ACCAGGACGT GCCGTTCGAC
CTCATCGTCA CCGCGACCGC CCCGGACCGG GACGCCAGCC GGAACCCGCT GTTCGGAGTG
CAGCTCGTGG TGCAGCCGGC TGCGGGGGCC GCGGAGCTGA GCAGTCTCGG CCTCGACGTT
GCAGAGATCG ACACCCACAC GGCCAAAAGG GATCTCACGC TGACGTTCTT CGACGACGAA
CTGCTCGCCG GACACGTGGA GTACGCCACC GAGCTGTTCG ACGAGGCCCG GATCGAGCGG
CTGATCGCCC ACTTCCGCGA GGTGGTCGAC GCCCTGGTGA GCGATCGCCG CCTGCGGCTG
TCCGAGGTGA CGATGCTGAC CGAATCCGAG CGAGCGCACT ACGCGGTCAC GCGATCGCCC
CTGGCCCCGA CGGCGCGGTC GGTGCCGGAG CTGTTCGAGA TGACGGCGGA CCGTACCCCG
GATGCGGTGG CGGTCAGGGC CCCGGACCGC TCGCTGACCT ACCGCGAACT CGACGCCGCC
GCCAACAGGC TCGCCCGCCG GCTGCGTGCG CTCGGCGTCG GGGCCGGAGC GGCCGTCGGG
CTGCGGGTCG GGCGGAGCGC CGCGATGGCG GTCGGCATGC TCGGCATCCT GAAGGCAGGC
GGCGTGTACG TGCCGGTCGA CCCGACGTAT CCGCAGGACC GGATCGAGCA CATGCTGGGT
GAGGCCGGGG TGGCGCTGCT CCTCGACGAG CGGGACGTCG ACGGCGCCGA GGCCGGGCTG
TGCTCCGCCG AGCGACTGGA AAATCTCGCG GCCGCAGACG ATCTGGCCTA CATCATGTAC
ACCTCCGGTT CCACTGGACG GCCGAAAGGG GTGGCGGTCA CCCACGGCAG CGTCGTCGAG
TACGCCGAAA CCCTGGGCCG CGAGCTCGGT ATCACCGGTG AGGACGTCTA CCTCGAGACC
GCCTCGATCT CGTTCTCGTC GTCGATCCGG CAGATGCTGG TGCCCTTCGC GGTCGGAGCC
GAGGTGGTGA TCGCCACGAC CGAGGAACGC CGCGATCCTG CTGCGCTGCT GCGCCGGATC
GGCGAGTCGG CGGTGACGGT CGCCGACCTC GTCCCGACCG TGGTACGCCG CGTCATCGAC
GTGGTGGCGG CAGCCGATGC CGGGCAGAGG ACCGCTTCGC GCCGGAACCG GTTGCGGCTG
CTGCTCACCG CGAGCGAGCC GCTGCGGGCC GGCGTCGTGC GGGCCTGGCG CGAACAGCTG
GGCGGCGGCG CCTCGTGGAT CAACATGTAC GGGCAGACGG AGACCACCGG CATCGTCAGC
CTGCACCCGG TCGGCGAGCC CGACGGGGAC GCCCAGAGCA TCGTGCCGAT CGGACGGCCG
CGCGCCAATG TCGGGATGTA CGTGCTGGAC CGGCTGATGC GGCCGGTGCC GCCGGGCGTC
GGCGGTGCCC TGTTCATCGC GGGCCCGGCG TTGGCCCGCG AATACGTCGG CGATCCGACG
CTGACCGCGC AGAAATATGT ACCCGCCCCG TGGAATCCCG CCGAGCGGCT GTACGTCAGC
GGCGACATGG TCCGGCTCGG CTGGGACGGG ACGATCGAGT ACCGGGGCCG CGCCGACCGG
CAGGTGAAGA TCCGTGGTCT GCGGGTCGAA CCCGCCGAGA TCGACCGCGT GCTGCTCGAA
CACCCCGGCG TGCGCGAGGC CGTGACCGTC GTGCGCGAGG CGAACGCCGA CGGCGCCGCA
CTGGTCGCCT ATTTCACCAC CGGCGACACC CCCGTCCCGG TCGGGGAGTT GCGCGCTCAC
GCGCGCAGGC AGCTGCCCGA CCACATGGTC CCGTCGGCGT TCACGGCACT CGAGCAATTG
CCGCTGACAC CGAACGGCAA GCTCGACCGG ACAGCGCTGC CGGAGGTCAC CATCACCCGC
GACCAGGAGA TCGAATACGT CCCGCCCCGT TCCGGCGTCG AACAGTCGCT GGCAGAGATC
TGGAGCGACA CACTGCAACT CGAGCAGATC GGGGCAGGGG ACAACTTCTT CGAGCTCGGC
GGTCATTCAC TTCTCGCGGC GCAGGTGCGC TCCCGGATAC ACCAGCTTCT CGGGGTCGAG
TTGCCCCTCG AGGCCCTCTT CGAAGATCAG ACACTGTCCG ACCTGGCCCA CCGGATCGAG
GGCGACACCG GGGTCGACAC GTCGGAGGCC CCGCTGCTGC GGCCGGTCGC GCGCACCGGG
CCGCCGCCCC TCTCCTACGC CCAGGAGCTG ATGTGGCGCA ACGAATGTGA CGACCCCGGC
TCCCCTGCCC ACTGGATCGA CGTGTCGATC CGCATCACCG GGCCCCTCGA CCCGGACCTG
GTGGTGCGTG GCATCCAGGC GGCGACGCAT CGACACGAGT TGCTGCGCAC CGTCTTCCGT
CCGTCCGGCG CGTCGGCCGC GCAGGTGATC CTCGATTCGT ACACCCCCGA GGTGCCGATC
TTCGACGGTG CCGCCGCGCT GCAGCATGAT GTCTGGCCGG ATCAGCCGGA TCTCGCCACG
TGTCCACCGC TGCGGGCCGA GCTGTGCCGG ATCGACGACG GCAACCACAT CCTGCGGTTG
CGGGTGCACC GCATCCTCGC CGACGGATAC ACGATGCGGT TGTTGCTCAG CGAGATCGGC
GGGCTGGTCG CCAGTTCGGC GGGCCTGCCC GACTTCCCAC TGCTCGACGG TGACCTGCAC
TACGCCGACT ATGCGATCTG GGAACGATCC TGGCTGACCG GTGCCGCGCT GCAGCGACGC
ATCGACCACT TCCGGCGCGA GTTCGCTGTC GGCGAGCTCC CGCCGGCCCT GCCGACCGAC
CACCCGCGCA CCGGTCGCTC CAGACGCGCC GCCGGCCAGT TGGACTTCGA GTTTCCGGCC
GAGGTCGCCG CGGCCGCACG GGCGCTGGCC GTTCGCGAGC ATGCGTCGCT CTACACGGTG
CTGCTCGCCG GATTCGCCGC CGCCCTGGGC GGTTACGCCG GGCGGCGGAC CGTGGTACTC
GGCTCGCCGG TGACGCGGCG CAACGATCCC GTCACCCAAC TCATGCTCGG CCCGTTCATG
AACACCGTGC CGCTGCGCAT CGACCTCCCG GAAGGCGGCG GCCTGGCGGC GATGGTGCAG
AACATGAAAA CGACTGTGCT GGGCGCATTG TCACACCAGG ATGCGCCCTC GCAGCACGTG
ATCGCAGCGC TCGCGGCTGA GCACGGGCCG TCGGCGGCGG CCATCGGCGA AACCGTGTTC
CTGATGGACG ACCCGGTGCC GGGCGAGTTC GCCGCAGGCG GATTCCGGCT GACCCGGGTG
CCACCTGAGC GGGTCATCGC CCGGCGCGAG CTGACCGCCG CGATGAGCAC GCGCAACCGG
CAGATCACCG GGACGCTCAC CTACGACAGC ACGCTGTTCG ACCACCCGTC GATCGAGTGC
ATCGTCGCCG GCTTCATCGG CGCGGTCTCC GCGCTGCATG TCGACTGCGC CTGA
 
Protein sequence
MTGQSGFAVV GYAVRFPGAA DAREFWDVLA EGRDAVSEVP ADRWDVDEFF DADPDAAGKM 
VSRRAGFVDD VAGFDAPFFG VSAREAMFMD PQHRLVLETA WSAVEHAGIA PEALAGTRTG
VFLGLSTHEF LGMLIAHTGY EDVDIYSGTG TSPAAAAGRV SFRMGLQGPA VAVDTACSSS
LVAVHQACQA LRDGDCDTAL VGGVNVILTP VPMINLTRAR MLAPDGRCKT FDAAADGYVR
GEGCGVVVLK RTGDALRDGD RIRAVIRGSA VNQDGASGGL TVPNGVAQQH VIADALRRAG
LGGGEVDYLE AHGTGTSLGD PIEVQAAAAA FGDGRDPDRP LLIGSVKTNI GHLEAASGIA
GLIKVVLSLE HETLPVHLHL RRPSPHIPWE RLPVRVVAEA TPWQRNDRPR IAGVSSFGFS
GTNAHVLLEE APVASPTEEP DEAPHRRYHL LPLSARTPEA LVRLASRHLA YLEDDPDATI
VDICAAAGAG RSHFEHRAAL VVDSRPRARR LLRAIHEERP APGLVRGVCS DRPKTAWLFG
GQGNQFPGMA REVFDHEPVF AQTLRRCGEV LDGLLPRPLL EVIFDTGPEA EHNLRNTAFA
QPALFAVEMG MARLWRSWGV EPDVVLGHSV GQYAAACVAG VFGLDEGARL IAERGRLFAN
LPSGGRMLAV FADPDRVERC TAEHSRLSVA AYNGANTVLS GPAPDLEQVH AELTAAGSRC
DWLDTSHAFH SALVDPALDE FESFAAQFEY RAPKLTLVCN RTGKVLTRQT RLDAPYWRRH
AREPVRFADS AATLADLGCA VLMELGPQPV LIAAALRNWP ETAPVPTAIA SLRRDADAHR
CFTDALGVAY TTGHRLDFAG HLGRRRTLDL PTYPFEHRAY WFPTTKVHTL PAGGGVESPA
AESNTGDWLT GMSDEQRIDR IIELTRTELG NALRVAPAEI DPTAEFMTIG MDSMIAMELR
GRLQAALGTP VPASLFFENP TVSTLAEALH ALWLDASSDP SRRESPIPRV PHGCTLPCDV
PLSHAQEQLW FLNQLLPSSS AYNVAIRVDI RGALDHEVLQ RSLEAVVDRH EALRTVFQSR
HGAPQAILTP SQPIKLAFEE IGDEADIAAA AVREASVPFD IGAGPLLRAR LFGVGDQRHV
LVVTMHHIVT DGWSFRVLLG DLGRTYQALE RGAPAPLDDL PIQYADYARW QREQLTGPEF
DAHIEFWKAD LAGAPPLELD TDRPRPKSPT FRGARIRFDL GRERADALRD LCRAGNVTLS
VPLLAAFATV LQRYSGQHDL VIGTLTANRG RLETENLIGL FVNALPIRIR LDGDPDMTEL
IDRIRGRMSE VLAHQDVPFD LIVTATAPDR DASRNPLFGV QLVVQPAAGA AELSSLGLDV
AEIDTHTAKR DLTLTFFDDE LLAGHVEYAT ELFDEARIER LIAHFREVVD ALVSDRRLRL
SEVTMLTESE RAHYAVTRSP LAPTARSVPE LFEMTADRTP DAVAVRAPDR SLTYRELDAA
ANRLARRLRA LGVGAGAAVG LRVGRSAAMA VGMLGILKAG GVYVPVDPTY PQDRIEHMLG
EAGVALLLDE RDVDGAEAGL CSAERLENLA AADDLAYIMY TSGSTGRPKG VAVTHGSVVE
YAETLGRELG ITGEDVYLET ASISFSSSIR QMLVPFAVGA EVVIATTEER RDPAALLRRI
GESAVTVADL VPTVVRRVID VVAAADAGQR TASRRNRLRL LLTASEPLRA GVVRAWREQL
GGGASWINMY GQTETTGIVS LHPVGEPDGD AQSIVPIGRP RANVGMYVLD RLMRPVPPGV
GGALFIAGPA LAREYVGDPT LTAQKYVPAP WNPAERLYVS GDMVRLGWDG TIEYRGRADR
QVKIRGLRVE PAEIDRVLLE HPGVREAVTV VREANADGAA LVAYFTTGDT PVPVGELRAH
ARRQLPDHMV PSAFTALEQL PLTPNGKLDR TALPEVTITR DQEIEYVPPR SGVEQSLAEI
WSDTLQLEQI GAGDNFFELG GHSLLAAQVR SRIHQLLGVE LPLEALFEDQ TLSDLAHRIE
GDTGVDTSEA PLLRPVARTG PPPLSYAQEL MWRNECDDPG SPAHWIDVSI RITGPLDPDL
VVRGIQAATH RHELLRTVFR PSGASAAQVI LDSYTPEVPI FDGAAALQHD VWPDQPDLAT
CPPLRAELCR IDDGNHILRL RVHRILADGY TMRLLLSEIG GLVASSAGLP DFPLLDGDLH
YADYAIWERS WLTGAALQRR IDHFRREFAV GELPPALPTD HPRTGRSRRA AGQLDFEFPA
EVAAAARALA VREHASLYTV LLAGFAAALG GYAGRRTVVL GSPVTRRNDP VTQLMLGPFM
NTVPLRIDLP EGGGLAAMVQ NMKTTVLGAL SHQDAPSQHV IAALAAEHGP SAAAIGETVF
LMDDPVPGEF AAGGFRLTRV PPERVIARRE LTAAMSTRNR QITGTLTYDS TLFDHPSIEC
IVAGFIGAVS ALHVDCA
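
The protein sequence begins with M even though the gene sequence begins with GTG, which otherwise encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is translated as methionine. The sketch below illustrates this on the first line of the gene sequence. It is a self-contained illustration using only the standard library, not the tool that produced this record, and the helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGACCGGGC AGTCCGGCTT CGCCGTGGTC GGGTATGCGG TCCGTTTCCC CGGTGCCGCC"
print(translate_cds(first_line))  # MTGQSGFAVVGYAVRFPGAA
```

The output matches the first 20 residues of the protein sequence above (MTGQSGFAVV GYAVRFPGAA).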