Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3845 |
Symbol | |
ID | 4649264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 4099381 |
End bp | 4104546 |
Gene Length | 5166 bp |
Protein Length | 1721 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639807311 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_954632 |
Protein GI | 120404803 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.549835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGACA TCGACACCGG GGCCCTGCTC GACGAGCGTC GGCTGGAACT GCTGCGACGC CGGATGCAGG AGCGTGGGCT GAGCGCAGGC ACCGACACGG CCGAACCCGC CGGCGACACC GACGTGTTGA CCGAAGGCCA GCTGCGGATG TGGTTCGTGC ACAACGCCGA TCCCAGCGGC GCGTTGCTCA ACGTGTGTCT TTCCTACCGC CTGTCCGGCG AGATCGATGT CGCGCGGCTG CACGACGCGG TCGGCGCGGT GGCCCGGCGT CACCGGGTGT TGCGCACGAC GTACCGCACC GAGAGCACCG CTGCCGGCGA CAGCGGAATG CCGGTCCCCA CCGTGCACAC CGGCCTGATG CCCGGCTGGG CCGAGCACGA TCTGTCCGAA CTCTCCGAGC GGGCCCGCGG GCTGCGGCTG GAGGTGCTCG CGCAGCGCGA GTTCGGCAGG CCGTTCGACC TGGGCGCCGA GTCGCCGCTG CGGATCACGG TGATCCGCAC CGCCCCAGCC GAATACGTGA TGCTTCTGGT CGCCCATCAC ATCGCCTGGG ACGACGCTTC GTGGGAGGTG TTCTTCGACG ACCTGACCGG CGCCTACGTC GGTGAGAAAC TGCGGCCGGC GCGACGTCCG GTCGTCGCGG CCGGCGGTTC CGACGAGGAC GACGTCGCGT ACTGGCGTGC GGTGATGGCC GATCCGCCGG AGCCGCTGGA GCTGCCAGGC CCGACCGGGT CGGCCGTCCC GACGAGCTGG CGCTCACAGC GCACCACGCT GCGCCTGGAC GGCGAGACTG CGCGACGCGT CACGGCCCTG GCCGACGAGG TCGGCGCGAC ACCCTACGCG GTGCTGCTAG CGGTGTTCGG CGTGCTGATC CACCGTTACA GCCACGTCGA CGACTTCCTG GTCGCCACCC CGGTGCTGAA TCGCAACGGT GACACCGACG ACGTCATCGG GTATTTCGGC AACACCGTCG CGATGCGGTT GCGGCCGCAC CCGGCAATGA CGTTCCGTGA CCTGCTGACC CAGACCCGTG ATACCGCCCT CGGTGCTTTC GCGCACCAGC GGGTCGGGCT CGACCGGATG GTGCGCGAGC TGAACCCGGA CCGGCGCCAC GGCGCCGAGC GGATGACGCG GGTCAGCTTC GGGTTCCGCA GCCGCGACCG GTTCGGGTTC ACCCCGCCCG GCGTCACGTG CGAGAGAGCC GATCTGCGTT CGCATCTCAC GCATCTGCCA CTGGGGATCA TGGTCGAGTT CGACCCCGAC GAGGTCGTGG TCGAGCTCGA GCATCTGGTG GAGATCATCG AACCCGGACT GGCCCGGCAA CTGCTCGACC ACTACGCCGT GCTGATCCGC AGTGCCCTCG ACGACCCCGA CACCACACTG AGCGGGCTGC AGCTGATGGG CGACGACGAC CTCGAATGGC TCCGTGCGGT CTCTGTGGGC CCGACCTTCG ACACCCCGCC CGCCACCATC ACCGACCTCA TCGAGGCGCA GGTCCGGCGC AGCCCCGACG GCACCGCCGT CGTCTACGAG GGCCGCCATT ACACCTACCG CGAGATCAAC GAAGCGGCGA ATCGCGTCGC GCACTGGCTG ATCGGCGAGG ACGTGGGCGC CGAGGACCGG GTCGCGGTGA TGCTCGACAA GTCTCCCGAA CTGGTGGTGA CCGCGCTGGG TGTGCTCAAG GCAGGGGCGG TCTATGTCCC GATCGACCCG GCCTACCCAC AGGATCGTCT CGAGTTCATT CTCGGCGACT GCGATGCGAA AGTGGTTGTC CGAGAGCCGG TCACCGGCCT GGACGGCTAC CGTGCCGACG ACCCGGGCGA CAACGACAGG CGGCGTCCGG TGGGTCCATA CAACACCGCC TACCTGATCT ACACGTCGGG CTCGACCGGT CTGCCCAAGG GTGTCCCGGT GCCGCATCGT CCGGTCGCGG AGTATTTCGT CTGGTTCAAG GGCGACTATC GGGTCGACGC CGGGGACCGG ATGCTGCAGG TCGCCTCGCC CAGCTTCGAC ATATCGATCG CCGAGGTGTT CGGCACGCTG GCCTGCGGTG CGCGGCTGGT GATCCACCGC CCCGGTGGCC TCAACGACAT CGGCTACCTG ACCGCGCTGC TGCGCGACGA GGGCATCACC GCGATGCACT TCGTGCCGTC GCTGCTCGGA CTGTTCCTGT CGCTTCCCGG TGTGAACCAG TGGCGCACGC TGCAGCGGGT GCCGATCGGC GGCGAGGCGC TACCCGGCGA GGTGGCCGAC AAGTTCCGCG CCACCTTCGA TGCGCTGCTG CACAACTTCT ACGGCCCGAC CGAGACCGTG ATCAACGCCA CCCGGTTCAA GGTCGAGGGC AGGCAGGGCA CCCGGATTGT GCCGATCGGC AAGCCCAAGA TCAACACCGC GATCCACATC CTCGACGATG CGCTGCAACC CGTGCCGGTC GGCTCCATCG GCGAGATCTA CATCGGCGGA ACCCATGTCG CACGTGGCTA CCACCACCGG CCGGGGCTGA CCGCCGAACG CTTCGTCGCC GACCCGTTCA CCCCCGGCGC ACGCATGTAC CGTTCCGGTG ATCTCGCCCG CCGCAACGCC GACGGCGATA TCGAGTTCGT GGGCCGCGCC GACGAACAGG TCAAGATCCG CGGTTTCCGC ATCGAACTCG GCGACGTCGC CGCCGCGATC ACCGTCGATC CCAGCGTCGG GCAGGCCGTG GTCGTCGTCA GCGACCTGCC GAACCTCGGC AAGAGCCTGG TCGCTTACCT GACCCCCGCC GACGGCGCGG GCGTGGACGT GGACAGGATC AGGACCAGGG TGGCCGCCGC GCTGCCCGAG TACATGACAC CGGCCGCCTA CGTGGTCGTC GACGAGATCC CGATCACCGC GCACGGCAAG ATCGACCGCG CCGCGCTGCC AGAGCCGGAG ATCGCGCCCA CCACCGAGTT CCGCGAGCCC GCGGCGGGCA CCGAAGCCCA TCTTGCGCAG TTGTTCGCCG AACTGCTCGG CCACGAGAAG GTCGGTGCCG ACGACTCGTT CTTCGATCTC GGCGGGCATT CGCTGCTGGC CACCAAACTC GTGGCCGAGC TGCGGGCCGG GTTCGGTGTA GACGTCGAGG TGCGCGACAT CTTCGAGAAC GCGACCATCG CCCGACTGGC TGCCCACCTC GACACCATGG GGCGCGCCAC GATCGGCACC CGCAGGCCCC GGCTGGTCGC CGCACCCCTC GACGGCCCCG CGCCGCTGTC GTCGTCGCAG CTGCGTTCGT GGTTCGGCTA CCGCATCGAG GGCCGCAGCC CGGTCAACAA CATCCCGTTC GCCGCCCGGC TGACCGGCCC CTGCGATGTG GACGCGTTCG TCGCGGCCAT CCGCGACGTC GTCGAACGGC ATGCCATCCT GCGCACCACC TATCGCGAGA TCGACGGTGT GCCTTATCAG ATCGTGAACC CGATCTCCGA CGTGCCGGTT CGACGCGCCC GCGGCGAGGG CGAGCAGTGG CTGCAGGCCG AACTCGACCG CGAACGCAAG CACACCTTCG ATCTCGAGGA GGACTGGCCG ATCCGGGCCG CGGTGTTCAC CGTCGGAGCC GTCAGTCCGG CTTCAGACGG CGGAGCCGTC AGTCCGGCTT CAGACGGCGG AGCCGTCGAT CCGGCTTCAG ACCATGTCCT GTCGGTGGTG ATCCATCACA TCGCGGGCGA CCACTGGTCC GGCGGCGTGC TGTTCACCGA CCTGGTGGCG GCCTACCGCG CCCGCAAGGC CGGTGAGCGG CCGACCTGGG CGCCGCTGCC GGTGCAGTAC ACCGATTACG GCGCCTGGCA GGCGGAGCTG CTCAGCGACG ACACCGGAAT CGCCGGGCCG CAGCGCGAGT ACTGGATCCG TCAGCTCGCC GACGCGCCCG TGGAATCCGG CCTGCCGCTG GAGTTCTCCC GTCCGCGGCT GCCCAGCGGC AAGGGTGATG CGGTCGAGTT CACCATCGAC GGTCAGGTCA GGCGTCAGAT CACCGAGCTG TGCCGGGAGC TCGGCATCAC CGAGTTCATG CTGCTGCAGG CCGCGGTGGC GGTGACGCTG GCCAAGGTCG GGGGTGGCCT GGACATCCCG CTGGGCACAC CGGTGGCGGG CCGCTCCGAA GCCGAGCTGG AACAGCTCGT CGGCTTCTTC GTGAACTTCG TAGTGCTGCG CAACGACCTG CGGGGTAACC CGACCCTGCG CGAGGTTCTG ATCCGGGCCC GCGAGATGGC GCTGTCGGCG TACTCCAATC AGGACGTGCC GTTCGAACAG GTCGTCGAGG CGGTGAATCC ACCACGCACC CTGGCCCGAA ACCCGCTGTT TCAGGTGGTG GTGCACGTCC GCGAGCAACT TCCGCAGCGC CAGATGATCG ACACCGATAC CGAGTTCACC GCGCTGGAAC CGACATTCGA CATCGCCCAG GCGGACCTGT CGCTGAACTT CCTGGCCGAC GGGACCGGTT ACCGCGGATA CCTGATCTAC CGGCCGGAGC TGTACGCCCG GCGCACCATC GAGCGGCTCG GCCGGTGGCT GCCCCGGGTG GTCGCGGCGT TCGTCGACAC CCCCGACCGT GCGCTCGGCG ACGTCGACAT CGTCTCTGCC GAGGAGAGAC AACGTGTTTC ACAGGAGTGG AGCACCGGAG CCCGGGTCCA GGTGCTCGAC GCCGACCTGG CGCCCGCCCC GGTCGGGGTG TTCGGCGACC TGTACCTGAG CGGCGGTCCG TTCGCGCAGC GCCACCGAAC TGGGGACCGC GCCAGGTGGG ACGACGACGG CCGGCTGGAG ATCGCCGCCC GCGCCCCCGG CGCGACCGTC GTCGAGGTGG CGCCGCCGGC CGTCGAGTTC GAGGAACCGC GGACCGACTC CGAACGTGCG CTGGCCGCAC TGCTTTCCGA GCTGCTGGCC ACCGAGGAGG TCGGTCGTCA CGACGACTTC TTCGGACTCG GCGGCGACAG CGTGCTGGCC GTGCAGTTGG CCGCGCGCGC ACGCGATACC GGGTTGGACC TGACCGCACG GATGGTGTTC GAGCACCCGA GGCTGGCCGA GCTGGCCGCC GCCCTGGACA GCGGCGCCGG CGCGGCCGCT GAGCAGCCGG ACACCCACCA TGCGCCGATG TCGGTGTCGG GGCTTTCCGA GGACGAACTC GCCGCGCTGA CCGCATCGTG GGGCACCGGG GAATGA
|
Protein sequence | MTDIDTGALL DERRLELLRR RMQERGLSAG TDTAEPAGDT DVLTEGQLRM WFVHNADPSG ALLNVCLSYR LSGEIDVARL HDAVGAVARR HRVLRTTYRT ESTAAGDSGM PVPTVHTGLM PGWAEHDLSE LSERARGLRL EVLAQREFGR PFDLGAESPL RITVIRTAPA EYVMLLVAHH IAWDDASWEV FFDDLTGAYV GEKLRPARRP VVAAGGSDED DVAYWRAVMA DPPEPLELPG PTGSAVPTSW RSQRTTLRLD GETARRVTAL ADEVGATPYA VLLAVFGVLI HRYSHVDDFL VATPVLNRNG DTDDVIGYFG NTVAMRLRPH PAMTFRDLLT QTRDTALGAF AHQRVGLDRM VRELNPDRRH GAERMTRVSF GFRSRDRFGF TPPGVTCERA DLRSHLTHLP LGIMVEFDPD EVVVELEHLV EIIEPGLARQ LLDHYAVLIR SALDDPDTTL SGLQLMGDDD LEWLRAVSVG PTFDTPPATI TDLIEAQVRR SPDGTAVVYE GRHYTYREIN EAANRVAHWL IGEDVGAEDR VAVMLDKSPE LVVTALGVLK AGAVYVPIDP AYPQDRLEFI LGDCDAKVVV REPVTGLDGY RADDPGDNDR RRPVGPYNTA YLIYTSGSTG LPKGVPVPHR PVAEYFVWFK GDYRVDAGDR MLQVASPSFD ISIAEVFGTL ACGARLVIHR PGGLNDIGYL TALLRDEGIT AMHFVPSLLG LFLSLPGVNQ WRTLQRVPIG GEALPGEVAD KFRATFDALL HNFYGPTETV INATRFKVEG RQGTRIVPIG KPKINTAIHI LDDALQPVPV GSIGEIYIGG THVARGYHHR PGLTAERFVA DPFTPGARMY RSGDLARRNA DGDIEFVGRA DEQVKIRGFR IELGDVAAAI TVDPSVGQAV VVVSDLPNLG KSLVAYLTPA DGAGVDVDRI RTRVAAALPE YMTPAAYVVV DEIPITAHGK IDRAALPEPE IAPTTEFREP AAGTEAHLAQ LFAELLGHEK VGADDSFFDL GGHSLLATKL VAELRAGFGV DVEVRDIFEN ATIARLAAHL DTMGRATIGT RRPRLVAAPL DGPAPLSSSQ LRSWFGYRIE GRSPVNNIPF AARLTGPCDV DAFVAAIRDV VERHAILRTT YREIDGVPYQ IVNPISDVPV RRARGEGEQW LQAELDRERK HTFDLEEDWP IRAAVFTVGA VSPASDGGAV SPASDGGAVD PASDHVLSVV IHHIAGDHWS GGVLFTDLVA AYRARKAGER PTWAPLPVQY TDYGAWQAEL LSDDTGIAGP QREYWIRQLA DAPVESGLPL EFSRPRLPSG KGDAVEFTID GQVRRQITEL CRELGITEFM LLQAAVAVTL AKVGGGLDIP LGTPVAGRSE AELEQLVGFF VNFVVLRNDL RGNPTLREVL IRAREMALSA YSNQDVPFEQ VVEAVNPPRT LARNPLFQVV VHVREQLPQR QMIDTDTEFT ALEPTFDIAQ ADLSLNFLAD GTGYRGYLIY RPELYARRTI ERLGRWLPRV VAAFVDTPDR ALGDVDIVSA EERQRVSQEW STGARVQVLD ADLAPAPVGV FGDLYLSGGP FAQRHRTGDR ARWDDDGRLE IAARAPGATV VEVAPPAVEF EEPRTDSERA LAALLSELLA TEEVGRHDDF FGLGGDSVLA VQLAARARDT GLDLTARMVF EHPRLAELAA ALDSGAGAAA EQPDTHHAPM SVSGLSEDEL AALTASWGTG E
|
| |