Gene Mvan_5386 details

Gene Information

Locus tag: Mvan_5386
Symbol:
ID: 4648065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5762762
End bp: 5766718
Gene Length: 3957 bp
Protein Length: 1318 aa
Translation table: 11
GC content: 72%
IMG OID: 639808861
Product: non-ribosomal peptide synthetase
Protein accession: YP_956163
Protein GI: 120406334
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain; [TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function
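
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5762762
end_bp = 5766718
protein_length_aa = 1318

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp)  # 3957
print(remainder)       # 0
print(coding_codons)   # 1318
```

Under these assumptions the 3957 bp gene holds 1319 codons, and dropping the stop codon gives the listed protein length of 1318 aa.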


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0123058
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCCG CGGATCGAAC CCCAGAGATC CCCAGTCAAT ACCTGCTCTC GGCGTTCGCA 
CCCGAATCGC GCACGCTGAT CGACATCCTC TACGACACCG CGCGCCGTTA TCCCGACGCC
CCGGCGATCG ACGACGGCAC CGTCCAGCTC ACCTACGCCG AGCTGATCTC CGACGTCGAG
GACAGCGTCG CGTGGCTCGG CGCCCGCGGC ATCGGCCGCG GCGACCGGAT CGGCATCCGG
ATGCCGTCGG GAAGCTACGC GCTCTACGTC GCGATCCTGG CGACACTGGC CACGGGCGCG
GCCTACGTGC CCGTCGACGC CGACGACCCG CAGGAGCGCG CCGACCTGGT GTTCACCGAG
GCGGCCGTCG TCGCGGTCAT CACCGAGCAG GGCCTGGTCC GGGGACCGGG TTCGTCGCGG
GGCTGGCGCG CGGCGGCGCC GCTGAGCCGC GACGACGCGT GGATCATCTT CACCTCCGGT
TCGACCGGCA CCCCCAAGGG CGTTGCCGTC ACCCACCGCA ACGCCGCGGC GTTCGTCGAC
GCCGAGGCCC AGATCTTCCT GCAGGACAAC CCGCTCGGCC CGGGTGACCG GGTGCTGGCC
GGCCTGTCGG TGGCGTTCGA CGCCTCATGC GAGGAGATGT GGCTGGCGTG GCGGAACGGC
GCGTGCCTGG TGCCTGCGCC GCGGTCGCTG GTGCGCAGCG GGATGGACCT GGGGCCGTGG
CTGGTGTCCC GTGACATCAC CGTGGTCTCG ACGGTGCCCA CGCTGGCCTC GCTGTGGCCC
GCCGAAGCGC TGGAAGCGGT GCGGCTGCTG ATCTTCGGCG GTGAAGCGTG CCCGCCGGAA
CTTGCCGAAC GCCTGGCGGC CGGTCCCGAC TCGGCCGGCC GCGAGGTGTG GAACACCTAC
GGTCCCACCG AGGCCACCGT CGTCGCCTGT GCGGCCCGCC TCGACGGCAG GAGCGCTGTG
AGCATCGGGC TGCCGCTGCC CGGGTGGGAC CTGGCCGTCG TCGACAAGGA GGGCAGGCCG
GTGGCCCCGG GCGAGGTCGG CGAACTCGTC ATCGGCGGAG TGGGCCTGGC GCGCTACCTC
GACCCCGAGA AGGACGCCGA GAAGTACGCC CCTCTCCCGT ATCCGAGCGA CATCGCGCAT
TGGACTCGCG CCTACCGCAG CGGTGACCTG GTGCGGCTGG AACTCGACGG GCTCTACTTC
GTGGGCCGCG CCGACGACCA GGTCAAGGTC GGCGGCCGCC GCATCGAGCT CGGTGAGGTG
GACACCGCGC TGGTGCACCT GCCCGGCGTC AGCGGCGGCG CGGCGGCGGT GCGCCGCACC
GCGGGCGGAA CCCCGCTGCT GGTGGGCTAC ATCGCGGTGG CGCCGGGGCT GGAGGGATCC
TTCGACGTGC ACGAGGCGCG GGCACGGCTC TCGGAGTCGC TGCCCGCGGC ACTGGTTCCC
CGCCTGGTGG TGGTCGACGA GCTGCCCACC CGCACCTCCG GGAAGGTGGA CCGCGACGCG
CTGCCGTGGC CGGTGGGCGG CGACGACAGC GACGAGGGCG CCGATCTGGG AGGCGGCACG
CTCGGCTGGC TGGCCGGCCT GTGGCGGGAG GTGCTGGCCG CGCCGGTCGA CGGTCCCGAA
GCCGACTTCT ACGCCCTCGG CGGCGGTTCG TTGTCGGCGG CCCAGCTGGT CTCGGCGCTG
CGGCAGCGCT ATCCGGAGGT CACCGTCGCC GACCTCTACG ACCATCCGCG CCTCGGTTCG
CTCGCCGGGT ATCTCGACGA GCTCGCTCCG CCGCCCGCGG TCGAGACCCG GGTGGTGAAG
CCGGTGTCGC GGCTGACACA GGCCGTGCAG GTGGCGCTGA CGGTGCCGCT GGCCATGCTG
ACCGGCATGC AGTGGGTGGT GTGGCTGGCC GTCGCCAACA ACGTCGCCGC CGAGCTGTCG
CTGGTCGACT GGGTGAAGCC GATCGACTGG TGGTGGATAC TCGGCGGCTT CCTTTTGTTC
GTCACCCCGC CCGGCCGCAT GAGCATCGCG GTGTTCGGCG CCCGGGTGCT GGTGGGCAGC
CTGCAGCCCG GCACGTACCG GCGGGGCGGG TCGGTACACC TGCGGGTGTG GCTGGCCGAA
CGGCTGGCCG AGGCCAGCGG GGCGGAGAAC ATGGCAGGCG CGCCGTGGCT CGTCTACTAC
GCCCGGGCGC TCGGCAACAA CGTCGGCAAG GGCGTCGACC TGCATTCGGC ACCCCCGGTC
ACCGGGATGC TGACGCTCGG ACACCGCTGT TCCATCGAGC CCGAGGTCGA CCTGAGCGGG
CACTGGATCG ACGGCGACCT CTTCCACATC GGCGCGATCA CCGTCGGCAA CGACGCCACC
GTCGGTGCGC GGACGACGCT GCTGCCCGGA GCCGTCGTCG GCAAGAACGC CGACGTGGCG
CCCGGTTCGG CGGTGATCGG CAAGGTCAAG AACGGCCAGT ACTGGAAGGG CTCACCGGCG
GTGAAGTCCG GCAAGGCCAA GCATCCGTGG CCGGATCACC GACCGCCGCG AGCACCGGTC
TGGGTCTTCG TTTACGGCGT CACCTCGGTG TTGTTGGGTG CGCTTCCGCT GGCCGCGCTG
GCCGCCGGGT TGGCGGTGAT CGGCTGGGGT GTGCGCGGCA CGCCATCGGT GACCGGGGCG
GTCGTGCCTG CACTGCTGTG GCTGGCCCCC GCCACGGCCG CGGCGCTGGT GGTGTACGCG
CTGTTCACCG TGGTCGGGGT GCGGCTGCTC GCGATCGGCC TCGACGAGGG CTACCACCCC
GTGCGCAGCC GCTCGGGCTG GCAGCTGTGG GCCACCGAAC GGCTGATGGA CGCGGCGCGC
AACTACCTGT TCCCGATCTA CGCGGGCCTG CTCACACCCT GGTGGCTGCG GCTGCTGGGC
GCCAAGGTCG GCAAGGGCAC CGAGATCTCG ACCGCGCTGC TGATCCCCAA GTTCACCGTC
ATCGAGGACG GCGCGTTCCT GGCCGACGAC ACCATGGTGG CGTCCTACGA GCTCGGCGGC
GGGTGGATCC ACGTGGCACG GGCCACCATC GGCAAGCGGG CGTTCCTGGG CAACTCGGGC
ATCACCCAGC CCGGCCGACG GGTGCCCGAC GACGGTCTGG TCGCCGTGCT GTCGGCCACT
CCGTACAAGG CCAAGGCCGG ATCGTCCTGG CTGGGCAGCC CGCCGGTGCG GCTGCGTCGC
AAGCCGACGG CCGCCGACGC GCTGCGGACG TTCCACCCCT CCCGTCGCCT GAAAGTGCTG
CGGGGAACCG TGGAGACGTT CCGGTTCGTG CCCGTCGTGG TGACCTTCGC GATCGGGGTG
GCCGTGCTGT GGTCGGTGCA GTATCTGGCC GTGACCTTCG GCTGGATCTG GGCGGGGCTG
GCCGCCGGGC CGATCCTGCT AACCGCGGGC GCGGTCGCCG GCGGTGTCGC GGCCATCGCG
AAATGGCTTG TGGTGGGCCG GATCACCGCG ATCGAGCACC CGTTGTGGTC GGCGTTCGTG
TGGCGCAACG AGGTGTCGGA CACCTTCGTC GAGACGGTCG CCGCGCCGTG GTTCGCCCGC
GCCGCAACGG GTACGCCGGT GATGAACCTG TGGCTGCGGG CCCTGGGCGC GAAGATCGGA
CGTGGCGTGT GGTGTGAGAC GTACTGGCTT CCCGAGGCCG ACCTGGTGAC GCTGGCCGAC
GGCGCCACCG TGAACCGGGG CTGCGTCGTG CAGACCCATC TGTTCCATGA TCGGATCATG
CGGATGGACA CCGTGGTGTT GGAGGAGGGC GCGACCCTGG GCCCGCACTG CGTGGCACTG
CCCGCGGCAC GCATCGGGGC GGGCGCCACC GTCGGCCCGG CCTCGCTGGT GATGCGCGGC
GACGAGGTGC CCCCGTCGAC CCGGTGGCAG GGCAACCCCA TCGCGCCGTG GCATCCGTCA
CGGAAGAAGC GTTCGGACTC AGCCGACCCC AAGCCCAAGA AGTCCACCGC CGCGTGA
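
The gene sequence is displayed in space-separated groups of ten bases, so the grouping has to be stripped before the sequence is used. The following is a minimal sketch, assuming that display layout; the helper name `clean_sequence` and the codon sets are illustrative choices, and only the first and last displayed lines are used as input here.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used only for display grouping."""
    return "".join(block.split()).upper()

# Common bacterial start codons and the three standard stop codons.
# Assumption: this subset is enough for a quick sanity check.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last displayed lines of the gene sequence above.
first_line = "GTGACAGCCG CGGATCGAAC CCCAGAGATC CCCAGTCAAT ACCTGCTCTC GGCGTTCGCA"
last_line = "CGGAAGAAGC GTTCGGACTC AGCCGACCCC AAGCCCAAGA AGTCCACCGC CGCGTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

start_codon = head[:3]
stop_codon = tail[-3:]

print(start_codon, start_codon in START_CODONS)  # GTG True
print(stop_codon, stop_codon in STOP_CODONS)     # TGA True
```

The GTG start codon matches the record's translation table 11, under which an initiating GTG is read as methionine; this agrees with the leading M of the protein sequence.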
 
Protein sequence
MTAADRTPEI PSQYLLSAFA PESRTLIDIL YDTARRYPDA PAIDDGTVQL TYAELISDVE 
DSVAWLGARG IGRGDRIGIR MPSGSYALYV AILATLATGA AYVPVDADDP QERADLVFTE
AAVVAVITEQ GLVRGPGSSR GWRAAAPLSR DDAWIIFTSG STGTPKGVAV THRNAAAFVD
AEAQIFLQDN PLGPGDRVLA GLSVAFDASC EEMWLAWRNG ACLVPAPRSL VRSGMDLGPW
LVSRDITVVS TVPTLASLWP AEALEAVRLL IFGGEACPPE LAERLAAGPD SAGREVWNTY
GPTEATVVAC AARLDGRSAV SIGLPLPGWD LAVVDKEGRP VAPGEVGELV IGGVGLARYL
DPEKDAEKYA PLPYPSDIAH WTRAYRSGDL VRLELDGLYF VGRADDQVKV GGRRIELGEV
DTALVHLPGV SGGAAAVRRT AGGTPLLVGY IAVAPGLEGS FDVHEARARL SESLPAALVP
RLVVVDELPT RTSGKVDRDA LPWPVGGDDS DEGADLGGGT LGWLAGLWRE VLAAPVDGPE
ADFYALGGGS LSAAQLVSAL RQRYPEVTVA DLYDHPRLGS LAGYLDELAP PPAVETRVVK
PVSRLTQAVQ VALTVPLAML TGMQWVVWLA VANNVAAELS LVDWVKPIDW WWILGGFLLF
VTPPGRMSIA VFGARVLVGS LQPGTYRRGG SVHLRVWLAE RLAEASGAEN MAGAPWLVYY
ARALGNNVGK GVDLHSAPPV TGMLTLGHRC SIEPEVDLSG HWIDGDLFHI GAITVGNDAT
VGARTTLLPG AVVGKNADVA PGSAVIGKVK NGQYWKGSPA VKSGKAKHPW PDHRPPRAPV
WVFVYGVTSV LLGALPLAAL AAGLAVIGWG VRGTPSVTGA VVPALLWLAP ATAAALVVYA
LFTVVGVRLL AIGLDEGYHP VRSRSGWQLW ATERLMDAAR NYLFPIYAGL LTPWWLRLLG
AKVGKGTEIS TALLIPKFTV IEDGAFLADD TMVASYELGG GWIHVARATI GKRAFLGNSG
ITQPGRRVPD DGLVAVLSAT PYKAKAGSSW LGSPPVRLRR KPTAADALRT FHPSRRLKVL
RGTVETFRFV PVVVTFAIGV AVLWSVQYLA VTFGWIWAGL AAGPILLTAG AVAGGVAAIA
KWLVVGRITA IEHPLWSAFV WRNEVSDTFV ETVAAPWFAR AATGTPVMNL WLRALGAKIG
RGVWCETYWL PEADLVTLAD GATVNRGCVV QTHLFHDRIM RMDTVVLEEG ATLGPHCVAL
PAARIGAGAT VGPASLVMRG DEVPPSTRWQ GNPIAPWHPS RKKRSDSADP KPKKSTAA