Gene Mvan_3845 details

Gene Information

Locus tag: Mvan_3845
Symbol:
ID: 4649264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4099381
End bp: 4104546
Gene Length: 5166 bp
Protein Length: 1721 aa
Translation table: 11
GC content: 69%
IMG OID: 639807311
Product: amino acid adenylation domain-containing protein
Protein accession: YP_954632
Protein GI: 120404803
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
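
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(4099381, 4104546))  # (5166, 1721)
```

Both results match the Gene Length and Protein Length fields reported for Mvan_3845.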


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.549835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGACA TCGACACCGG GGCCCTGCTC GACGAGCGTC GGCTGGAACT GCTGCGACGC 
CGGATGCAGG AGCGTGGGCT GAGCGCAGGC ACCGACACGG CCGAACCCGC CGGCGACACC
GACGTGTTGA CCGAAGGCCA GCTGCGGATG TGGTTCGTGC ACAACGCCGA TCCCAGCGGC
GCGTTGCTCA ACGTGTGTCT TTCCTACCGC CTGTCCGGCG AGATCGATGT CGCGCGGCTG
CACGACGCGG TCGGCGCGGT GGCCCGGCGT CACCGGGTGT TGCGCACGAC GTACCGCACC
GAGAGCACCG CTGCCGGCGA CAGCGGAATG CCGGTCCCCA CCGTGCACAC CGGCCTGATG
CCCGGCTGGG CCGAGCACGA TCTGTCCGAA CTCTCCGAGC GGGCCCGCGG GCTGCGGCTG
GAGGTGCTCG CGCAGCGCGA GTTCGGCAGG CCGTTCGACC TGGGCGCCGA GTCGCCGCTG
CGGATCACGG TGATCCGCAC CGCCCCAGCC GAATACGTGA TGCTTCTGGT CGCCCATCAC
ATCGCCTGGG ACGACGCTTC GTGGGAGGTG TTCTTCGACG ACCTGACCGG CGCCTACGTC
GGTGAGAAAC TGCGGCCGGC GCGACGTCCG GTCGTCGCGG CCGGCGGTTC CGACGAGGAC
GACGTCGCGT ACTGGCGTGC GGTGATGGCC GATCCGCCGG AGCCGCTGGA GCTGCCAGGC
CCGACCGGGT CGGCCGTCCC GACGAGCTGG CGCTCACAGC GCACCACGCT GCGCCTGGAC
GGCGAGACTG CGCGACGCGT CACGGCCCTG GCCGACGAGG TCGGCGCGAC ACCCTACGCG
GTGCTGCTAG CGGTGTTCGG CGTGCTGATC CACCGTTACA GCCACGTCGA CGACTTCCTG
GTCGCCACCC CGGTGCTGAA TCGCAACGGT GACACCGACG ACGTCATCGG GTATTTCGGC
AACACCGTCG CGATGCGGTT GCGGCCGCAC CCGGCAATGA CGTTCCGTGA CCTGCTGACC
CAGACCCGTG ATACCGCCCT CGGTGCTTTC GCGCACCAGC GGGTCGGGCT CGACCGGATG
GTGCGCGAGC TGAACCCGGA CCGGCGCCAC GGCGCCGAGC GGATGACGCG GGTCAGCTTC
GGGTTCCGCA GCCGCGACCG GTTCGGGTTC ACCCCGCCCG GCGTCACGTG CGAGAGAGCC
GATCTGCGTT CGCATCTCAC GCATCTGCCA CTGGGGATCA TGGTCGAGTT CGACCCCGAC
GAGGTCGTGG TCGAGCTCGA GCATCTGGTG GAGATCATCG AACCCGGACT GGCCCGGCAA
CTGCTCGACC ACTACGCCGT GCTGATCCGC AGTGCCCTCG ACGACCCCGA CACCACACTG
AGCGGGCTGC AGCTGATGGG CGACGACGAC CTCGAATGGC TCCGTGCGGT CTCTGTGGGC
CCGACCTTCG ACACCCCGCC CGCCACCATC ACCGACCTCA TCGAGGCGCA GGTCCGGCGC
AGCCCCGACG GCACCGCCGT CGTCTACGAG GGCCGCCATT ACACCTACCG CGAGATCAAC
GAAGCGGCGA ATCGCGTCGC GCACTGGCTG ATCGGCGAGG ACGTGGGCGC CGAGGACCGG
GTCGCGGTGA TGCTCGACAA GTCTCCCGAA CTGGTGGTGA CCGCGCTGGG TGTGCTCAAG
GCAGGGGCGG TCTATGTCCC GATCGACCCG GCCTACCCAC AGGATCGTCT CGAGTTCATT
CTCGGCGACT GCGATGCGAA AGTGGTTGTC CGAGAGCCGG TCACCGGCCT GGACGGCTAC
CGTGCCGACG ACCCGGGCGA CAACGACAGG CGGCGTCCGG TGGGTCCATA CAACACCGCC
TACCTGATCT ACACGTCGGG CTCGACCGGT CTGCCCAAGG GTGTCCCGGT GCCGCATCGT
CCGGTCGCGG AGTATTTCGT CTGGTTCAAG GGCGACTATC GGGTCGACGC CGGGGACCGG
ATGCTGCAGG TCGCCTCGCC CAGCTTCGAC ATATCGATCG CCGAGGTGTT CGGCACGCTG
GCCTGCGGTG CGCGGCTGGT GATCCACCGC CCCGGTGGCC TCAACGACAT CGGCTACCTG
ACCGCGCTGC TGCGCGACGA GGGCATCACC GCGATGCACT TCGTGCCGTC GCTGCTCGGA
CTGTTCCTGT CGCTTCCCGG TGTGAACCAG TGGCGCACGC TGCAGCGGGT GCCGATCGGC
GGCGAGGCGC TACCCGGCGA GGTGGCCGAC AAGTTCCGCG CCACCTTCGA TGCGCTGCTG
CACAACTTCT ACGGCCCGAC CGAGACCGTG ATCAACGCCA CCCGGTTCAA GGTCGAGGGC
AGGCAGGGCA CCCGGATTGT GCCGATCGGC AAGCCCAAGA TCAACACCGC GATCCACATC
CTCGACGATG CGCTGCAACC CGTGCCGGTC GGCTCCATCG GCGAGATCTA CATCGGCGGA
ACCCATGTCG CACGTGGCTA CCACCACCGG CCGGGGCTGA CCGCCGAACG CTTCGTCGCC
GACCCGTTCA CCCCCGGCGC ACGCATGTAC CGTTCCGGTG ATCTCGCCCG CCGCAACGCC
GACGGCGATA TCGAGTTCGT GGGCCGCGCC GACGAACAGG TCAAGATCCG CGGTTTCCGC
ATCGAACTCG GCGACGTCGC CGCCGCGATC ACCGTCGATC CCAGCGTCGG GCAGGCCGTG
GTCGTCGTCA GCGACCTGCC GAACCTCGGC AAGAGCCTGG TCGCTTACCT GACCCCCGCC
GACGGCGCGG GCGTGGACGT GGACAGGATC AGGACCAGGG TGGCCGCCGC GCTGCCCGAG
TACATGACAC CGGCCGCCTA CGTGGTCGTC GACGAGATCC CGATCACCGC GCACGGCAAG
ATCGACCGCG CCGCGCTGCC AGAGCCGGAG ATCGCGCCCA CCACCGAGTT CCGCGAGCCC
GCGGCGGGCA CCGAAGCCCA TCTTGCGCAG TTGTTCGCCG AACTGCTCGG CCACGAGAAG
GTCGGTGCCG ACGACTCGTT CTTCGATCTC GGCGGGCATT CGCTGCTGGC CACCAAACTC
GTGGCCGAGC TGCGGGCCGG GTTCGGTGTA GACGTCGAGG TGCGCGACAT CTTCGAGAAC
GCGACCATCG CCCGACTGGC TGCCCACCTC GACACCATGG GGCGCGCCAC GATCGGCACC
CGCAGGCCCC GGCTGGTCGC CGCACCCCTC GACGGCCCCG CGCCGCTGTC GTCGTCGCAG
CTGCGTTCGT GGTTCGGCTA CCGCATCGAG GGCCGCAGCC CGGTCAACAA CATCCCGTTC
GCCGCCCGGC TGACCGGCCC CTGCGATGTG GACGCGTTCG TCGCGGCCAT CCGCGACGTC
GTCGAACGGC ATGCCATCCT GCGCACCACC TATCGCGAGA TCGACGGTGT GCCTTATCAG
ATCGTGAACC CGATCTCCGA CGTGCCGGTT CGACGCGCCC GCGGCGAGGG CGAGCAGTGG
CTGCAGGCCG AACTCGACCG CGAACGCAAG CACACCTTCG ATCTCGAGGA GGACTGGCCG
ATCCGGGCCG CGGTGTTCAC CGTCGGAGCC GTCAGTCCGG CTTCAGACGG CGGAGCCGTC
AGTCCGGCTT CAGACGGCGG AGCCGTCGAT CCGGCTTCAG ACCATGTCCT GTCGGTGGTG
ATCCATCACA TCGCGGGCGA CCACTGGTCC GGCGGCGTGC TGTTCACCGA CCTGGTGGCG
GCCTACCGCG CCCGCAAGGC CGGTGAGCGG CCGACCTGGG CGCCGCTGCC GGTGCAGTAC
ACCGATTACG GCGCCTGGCA GGCGGAGCTG CTCAGCGACG ACACCGGAAT CGCCGGGCCG
CAGCGCGAGT ACTGGATCCG TCAGCTCGCC GACGCGCCCG TGGAATCCGG CCTGCCGCTG
GAGTTCTCCC GTCCGCGGCT GCCCAGCGGC AAGGGTGATG CGGTCGAGTT CACCATCGAC
GGTCAGGTCA GGCGTCAGAT CACCGAGCTG TGCCGGGAGC TCGGCATCAC CGAGTTCATG
CTGCTGCAGG CCGCGGTGGC GGTGACGCTG GCCAAGGTCG GGGGTGGCCT GGACATCCCG
CTGGGCACAC CGGTGGCGGG CCGCTCCGAA GCCGAGCTGG AACAGCTCGT CGGCTTCTTC
GTGAACTTCG TAGTGCTGCG CAACGACCTG CGGGGTAACC CGACCCTGCG CGAGGTTCTG
ATCCGGGCCC GCGAGATGGC GCTGTCGGCG TACTCCAATC AGGACGTGCC GTTCGAACAG
GTCGTCGAGG CGGTGAATCC ACCACGCACC CTGGCCCGAA ACCCGCTGTT TCAGGTGGTG
GTGCACGTCC GCGAGCAACT TCCGCAGCGC CAGATGATCG ACACCGATAC CGAGTTCACC
GCGCTGGAAC CGACATTCGA CATCGCCCAG GCGGACCTGT CGCTGAACTT CCTGGCCGAC
GGGACCGGTT ACCGCGGATA CCTGATCTAC CGGCCGGAGC TGTACGCCCG GCGCACCATC
GAGCGGCTCG GCCGGTGGCT GCCCCGGGTG GTCGCGGCGT TCGTCGACAC CCCCGACCGT
GCGCTCGGCG ACGTCGACAT CGTCTCTGCC GAGGAGAGAC AACGTGTTTC ACAGGAGTGG
AGCACCGGAG CCCGGGTCCA GGTGCTCGAC GCCGACCTGG CGCCCGCCCC GGTCGGGGTG
TTCGGCGACC TGTACCTGAG CGGCGGTCCG TTCGCGCAGC GCCACCGAAC TGGGGACCGC
GCCAGGTGGG ACGACGACGG CCGGCTGGAG ATCGCCGCCC GCGCCCCCGG CGCGACCGTC
GTCGAGGTGG CGCCGCCGGC CGTCGAGTTC GAGGAACCGC GGACCGACTC CGAACGTGCG
CTGGCCGCAC TGCTTTCCGA GCTGCTGGCC ACCGAGGAGG TCGGTCGTCA CGACGACTTC
TTCGGACTCG GCGGCGACAG CGTGCTGGCC GTGCAGTTGG CCGCGCGCGC ACGCGATACC
GGGTTGGACC TGACCGCACG GATGGTGTTC GAGCACCCGA GGCTGGCCGA GCTGGCCGCC
GCCCTGGACA GCGGCGCCGG CGCGGCCGCT GAGCAGCCGG ACACCCACCA TGCGCCGATG
TCGGTGTCGG GGCTTTCCGA GGACGAACTC GCCGCGCTGA CCGCATCGTG GGGCACCGGG
GAATGA
 
Protein sequence
MTDIDTGALL DERRLELLRR RMQERGLSAG TDTAEPAGDT DVLTEGQLRM WFVHNADPSG 
ALLNVCLSYR LSGEIDVARL HDAVGAVARR HRVLRTTYRT ESTAAGDSGM PVPTVHTGLM
PGWAEHDLSE LSERARGLRL EVLAQREFGR PFDLGAESPL RITVIRTAPA EYVMLLVAHH
IAWDDASWEV FFDDLTGAYV GEKLRPARRP VVAAGGSDED DVAYWRAVMA DPPEPLELPG
PTGSAVPTSW RSQRTTLRLD GETARRVTAL ADEVGATPYA VLLAVFGVLI HRYSHVDDFL
VATPVLNRNG DTDDVIGYFG NTVAMRLRPH PAMTFRDLLT QTRDTALGAF AHQRVGLDRM
VRELNPDRRH GAERMTRVSF GFRSRDRFGF TPPGVTCERA DLRSHLTHLP LGIMVEFDPD
EVVVELEHLV EIIEPGLARQ LLDHYAVLIR SALDDPDTTL SGLQLMGDDD LEWLRAVSVG
PTFDTPPATI TDLIEAQVRR SPDGTAVVYE GRHYTYREIN EAANRVAHWL IGEDVGAEDR
VAVMLDKSPE LVVTALGVLK AGAVYVPIDP AYPQDRLEFI LGDCDAKVVV REPVTGLDGY
RADDPGDNDR RRPVGPYNTA YLIYTSGSTG LPKGVPVPHR PVAEYFVWFK GDYRVDAGDR
MLQVASPSFD ISIAEVFGTL ACGARLVIHR PGGLNDIGYL TALLRDEGIT AMHFVPSLLG
LFLSLPGVNQ WRTLQRVPIG GEALPGEVAD KFRATFDALL HNFYGPTETV INATRFKVEG
RQGTRIVPIG KPKINTAIHI LDDALQPVPV GSIGEIYIGG THVARGYHHR PGLTAERFVA
DPFTPGARMY RSGDLARRNA DGDIEFVGRA DEQVKIRGFR IELGDVAAAI TVDPSVGQAV
VVVSDLPNLG KSLVAYLTPA DGAGVDVDRI RTRVAAALPE YMTPAAYVVV DEIPITAHGK
IDRAALPEPE IAPTTEFREP AAGTEAHLAQ LFAELLGHEK VGADDSFFDL GGHSLLATKL
VAELRAGFGV DVEVRDIFEN ATIARLAAHL DTMGRATIGT RRPRLVAAPL DGPAPLSSSQ
LRSWFGYRIE GRSPVNNIPF AARLTGPCDV DAFVAAIRDV VERHAILRTT YREIDGVPYQ
IVNPISDVPV RRARGEGEQW LQAELDRERK HTFDLEEDWP IRAAVFTVGA VSPASDGGAV
SPASDGGAVD PASDHVLSVV IHHIAGDHWS GGVLFTDLVA AYRARKAGER PTWAPLPVQY
TDYGAWQAEL LSDDTGIAGP QREYWIRQLA DAPVESGLPL EFSRPRLPSG KGDAVEFTID
GQVRRQITEL CRELGITEFM LLQAAVAVTL AKVGGGLDIP LGTPVAGRSE AELEQLVGFF
VNFVVLRNDL RGNPTLREVL IRAREMALSA YSNQDVPFEQ VVEAVNPPRT LARNPLFQVV
VHVREQLPQR QMIDTDTEFT ALEPTFDIAQ ADLSLNFLAD GTGYRGYLIY RPELYARRTI
ERLGRWLPRV VAAFVDTPDR ALGDVDIVSA EERQRVSQEW STGARVQVLD ADLAPAPVGV
FGDLYLSGGP FAQRHRTGDR ARWDDDGRLE IAARAPGATV VEVAPPAVEF EEPRTDSERA
LAALLSELLA TEEVGRHDDF FGLGGDSVLA VQLAARARDT GLDLTARMVF EHPRLAELAA
ALDSGAGAAA EQPDTHHAPM SVSGLSEDEL AALTASWGTG E
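
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be normalized, checked for GC content, and translated is given below. It assumes the standard bacterial code (translation table 11, as listed in the record), builds the codon table from the usual 64-letter amino acid string in TCAG order, and treats a recognized alternative start codon such as GTG as methionine when it is the first codon. The helper names `normalize`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table for translation table 11 (same amino acid assignments as the
# standard code), built from the 64-letter string in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons recognized under table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def normalize(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a normalized CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = normalize(
    "GTGACTGACA TCGACACCGG GGCCCTGCTC GACGAGCGTC GGCTGGAACT GCTGCGACGC"
)
print(translate(first_line))  # MTDIDTGALLDERRLELLRR
```

The translated fragment agrees with the first 20 residues of the protein sequence. Applying `gc_fraction` to the full normalized gene sequence would give a value to compare against the GC content field.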