Gene Information |
Locus tag | Namu_3874 |
Symbol | |
ID | 8449493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4263647 |
End bp | 4268644 |
Gene Length | 4998 bp |
Protein Length | 1665 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645042922 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003203158 |
Protein GI | 258654002 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
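The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 4263647
end_bp = 4268644
gene_length_bp = 4998
protein_length_aa = 1665

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that contributes
# three bases to the gene length but no residue to the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span is 4998 bp, which is 1666 codons, or 1665 residues plus a stop codon.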
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0218124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA TGCAGCTGCT GCCCGAACTG TTCGCGGCGC AGGCGGCGCG GACGCCCGAC GCCGTCGCCG TCGAGGATGA TCACCGACGG CTGTCCTACG CCGATCTGGA CTATCGCGCG AATCGGCTGG CGTACATCCT GCGCGCGCAG GGCGTGCAGC CGGAGTCGGT GGTGGGGGTC TGCCTGACCC GGGGAGTCGA CCTGGTGGTC GCGCTGCTGG CCACCTGGAA GGCGGGGGGC GCCTACCTGC CGATCGATCC GACGCACCCG GCCGCGCGCA ACGCCGGGAT GATCACCGAC AGCGCCACCG CGCTGGTCCT GGTCGACCGG CTGACCGAGC AGCTGCTGAC CGGCACCGGG GTGCGGCAGC TGACGCTCGA GCCGGACGCC GGCACCGACC CGCGCACCGA CGGGCGGGCG CCGGCGTCAC TGGTGGGCGG GGCGAACGCG GCCTACGTCG TCTACACCTC AGGATCCACC GGCCGGCCGA AGGGCGTGGT GATCACCCAC GAGGGCATCG CCAACCGGGT CCGGTGGACA GTCGGCCGGC ACGGCCTGGC GGCCACCGAC CGGGTGCTGC AGAAGAGTTC GATCGCCTTC GACGCGGCCG CATGGGAGAT CTTCGCGCCG CTGATCAGCG GCGGCACCGT GGTCCTGCCG GCGCCGGGCG TGGAGCGGTC GGTCGAGGCG ATGGCCACCG TGGTGGCCGA GCGGGCCATC ACCATCCTGC AGGGGGTGCC GTCGGTGCTG CGGCCGCTGG CCGCCGAACC GGCGTGGGCG CGGGCCCACC GGCTGCGCCT GGTGTTCTCG GCCGGTGAGC CGCTGGACTA CGAGCTGGCC CGGCGGCTGG CCACCGCCCC GCGGCGGCCC GAGGTGTGGA ACACCTACGG TCCCACCGAA TGTGCGATCG ACGTGACCGC GTTCCCGGTC CAGGTCGGCG GCACCGGCCC GGTGCCGATC GGCCGGGCGA TCGACGGCAT CGAGCTGCTG GTCCTGGACG AGAATCTGAC CCCGGTCCCG GACGGGGTGA CCGGCGATCT GTATGCCGCC GGTGCCGGGG TGGCTCGCGG CTACAAGGGT CGCCCGGACC TGACCGCGGC CGGTTTCCGG CCGAACCCGT TCGCCGGCGA CGGATCCCGC ATGTACGCCA CCGGTGACCG GGCCCGCCGG CGGCCCGACG GCGTGCTGGA GTTCCTGGGC CGGTCCGACG ACCAGGTCAA GATCAACGGC GTCCGGATCG AACCGGCCGA GGTGGAGGCG GCGCTGGCCG CCCATCCCGC AATCAACCTG GCCGCGGTGG TGGCCCGGCC GATCGGCGAC GCCGGGCTGC GGCTGGTCGC CTTCGTCCAG GCGGACCGGG ACGTATCCAC CGACGAGCTG CGGGCCCACC TGCGCGGCCG GCTCCCGGAG GCGATGGTGC CGGCCGTGGT GCGCCGGCTG GCCGAGCTGC CCCGGACCGG CAGCGGGAAG CTGGACCGCG CCGCGCTGCC CGAGATCGGG CCGGACGAGG CGGCGGAACC GGCGTTCGTC GCCCCGCGCA CTCCGGCCGA GCGCATCGTC GCCCAGGTCT GGTGCGACCT GTTGCAGCTG GACCGGGTCG GCGTGCACGA GGACTTCCTG GCCCTGGGCG GTGAATCGCT GATGCTGACC CGGCTGGCCA GCCGCCTGGC GAAGGCGTGT GGCGGGGCCA TCGACCTGCG CGGGTTGTTC GACGCCGCGA CCGTCGAGGC GCAGGCCCGC CTGCTGCCCC CTGAGCTGCT CGTTGAGCTG TCCGACGAAT CCGCCTCGGA ACCGGGGCCG 
ACCGAGCGCC CGACACAGGC GCCGGCACCG ACGCCAGAGG GGACGCCGGC GGGGATGCCG GTGCTCTCGT CCGGTCAACG CCGCCTGTGG TTCTCCGACC GGGTGCGGCC GGGCGGGCTA GAGTGGGTCG CCCCGATCTT CCTGCGGGTG CCCGCCGAAC TCTCGGCCGA GCAGCTGGCC GGGGCCCTGA CCGAGCTGGA ACGGCGGCAC GAGGTGCTGC GGACCCGGTT CGCCGACCGG GCCGGGGAAC CGGTCGCGGT CACCGGCCCG GCCGGACCGG TCGAACTCCG GGTCGTCGAC GCGGCCACCG AGGACGCCCT GCCCGGGCTG TTCGGCGAGC AGTTCGGCCG CGGCTTCGAT CTGGAGAACG GCCCGATCTG GCGGGCGATG CTGGTCCGGA TCCCGCACCG GCCCGCCGTC GTGCTGCTCA CCGTGCACCA CATCGCCACC GACGGCTGGT CGGCTGTGGT CCTGGAACGG GACCTGACCC GGCTGTGCCG GGCCGCCCTG ACCGGCGAGC CGGCCGATCT GCCGGCGTTG CCGCTGCGGT TCGCCGACTA CGCCGCCTGG CAGCACGAAC GGCTCGGCGC CCCGGCCCTG CGGGACGGGT TGGCCTACTG GCGGGCCCAG CTGCAGGGCA TCGAACCGCT GGAGCTGCCG GCCGACCGGC CGCGGGCGGC CGAGCGCGAC GTCAGCGGAT CCGGGGTGCC GGTTCAGCTG ACCGGGGAGC GGGCGGCGGC CGTGGAGGCC GTGTCCCGGC AGCTCGGCGT GACCCCCTTC GTCACCCTGC TCGCGGCCTT CGCGATGACG CTGGCCCGGC ACACCGGACG GCTCGACTTC GCCGTCGGCA GCCCGGTGGC CGGTCGCACC CGGCCCGAGT TCGAAAACCT GGTCGGCCCG TTCCTCAACC CGATCGCCCT GCGCTGCAAC CTGTCCGCGG ACCCGACCTT CGCCGAGGCG GTCGCCCGGG TGCGGGCCAC CTGGCTGGAC GCGCAGGCCA ACGCGCAGGT CCCGTTCGAG CGGGTGGTGG ACGAGGTGCT GCCCCGCCGG GACCTGTCCC GCACGCCCAT CTACCAGGTC GGGTTCGACC TGCAGGCCGG CGGCCTGGCC ACCACCGGCG CCGCCGATCC GGCCGCCGAC CTGGCCTTCC AGCAGGCCTG GCGGGTGGCC AAGACCGACC TGACGTTCTT CGTCTGGCAC CGCGCCGGCG GCGAGATGAC CGGGGCGCTG GAGTACGCCA CTTCTCTGTT CGAGAGGTCC ACCGTGGCGC AGTTCGCCAC CCGATTGGAA CAGGTGATCA CCGCGGTGAC CGTCAACCCC GACCTGCGGC TGTCCGCGCT GCCCGGTGCC GACGACCGGC TGGGGGCCGG ACCGGCGCCC GGATCCACCG ACGGCACCGG CCGCGCCGCC GGCGTGCACG AGCTGATCGG CGCCCGCGCG GCCCGCTGCC CGCGCGCCGT GGCCGTGCAG GCCGCCGACG GCCGGCTGAC CTACGCGCAG CTCGAGCAGC GCAGCGACGA CTGGGCCCGC GCCCTCGACC GGCTCGGGGT CGGCCCGGGC GACGTGGTGC CGGTGCTGCT CGGCCGGTCC ACCGACCTGC TCGCCGCGCT GCTGGGCGTG TGGAAGGTCG GGGCCGCGTA CCTGCCGTTG GATCCGGCGA TTCCGGCCGG CCGGCTGGCC ACCGTACTGG CGACCGTGCT GGCCACCGCC GCCGCGCCGG TGCTGGTGAC CGACGACCCG GCGCGATCCC AGGCCGGACC GAGCATCCTT GGCCCGCAGC AGATCACGGC CGGCGACGGC CGCTTCCCGG 
CCCGGCCGGC GGTCGCCGAG CAGCCGGCCT ACGTCATCTT CACCTCCGGC TCGACCGGCG CCCCCAAGGG CGTCCGGATC ACCCATGGCA ACCTGGCCCA CTACCTGACC AGCTGGGCCG TGGACCGGCT CGCCGCGGCC GGCACCGGGG GCGCCCCGGT GTTCTCCTCG ATCGCCTTCG ACATGTCGGT CACCGCGTTG TGGGCGCCGC TGCTCTGCGG CCAGCGGGTG CTGCTGCTGC CCGAGGACCT GGAGCTCTCG GAGCTGGGCC GGCAGCTGGT CGCGGCCGGG CCGTTCTCCT TCGTCAAGCT CACCCCGGGG CAGCTCGAGG TGCTGGGCGA TCAGCTCGAC GAGGCCGACG TCGACGCGCT GGCCGCCGTC TACGTGGTCG GGGGCGAGGC GTTCCCGGCC GAACTGGCCC GCCACTGGCT GGCCGTGCTG GGCCCGGACC GGCTGGTCAA CGAGTACGGC CCGACCGAGA TCACGGTGGC CGATGCCGCG CACTGGGTCG CCGAGGTCGG CGCCGGCGCC CGGGTGCCGA TCGGCACCGC CCTGCCCGGC ACCACCGCGG TCCTGCTGGA CGAGCAGCTG CGGCCGGTCG CTGACGGCGC GGTCGGCGAG TTGTTCGTCG GCGGCGCCGG CGTCGCCGAC GGCTACGTCG GCGATCCGGC CCTGACCGCG CAGCGGTTCC TGCCCGCCCC CGACGGGCCG CCCGGAGCCC GGCTCTACCG CACCGGTGAC CTGGTCCGGC GGCTGCCGGA CGGCGGACTG GATGTGCTGG GCCGGGCCGA CCAGCAGGTC AAGATCCGCG GGTACCGGGT GGAGCCGGAC GAGGTGCGGG CCGTGCTGGT GGCCGCGCCC TCCGTCCGCG ACGCCGTCGT GGTCGCCGAC CGGCAGCGGC TGATCGGCTA CGTCGTGCCC GCCACACCCG ACAACCCGCC GGCGGTCGAC GAGTTGCTCG CGGCCTGCCG GGATCGACTG CCCGACTACC TGGTCCCGGC CGTCCTGCTC GAGATCCCCT CGGTGCCATT GACGGCCAAC GGCAAGCTGG ATCGCGACCG GCTGCCGGAC CCGACCGCGG CGTCCGCCGG GCCGCGTCGT CCCCGTACGC CGGTGCAGGA GCGGGTCGCC GCGATCTGGA CCGACCTGCT CGGTGTCGAG GTCGGCATCG ACGACCGGTT CTTCCAGGTG GGCGGGCATT CCATCCTGGT GCTCCGGCTG GTCGCCCGGA TCCAGAGCGA GTTCGACGTC GCCATTCCGG TGGCCGCGGT CTTCACCAAC CCGACCATCG CCGGGCTGGC CGCCGTCATC GAGGACGCGG TGCTCGCCGA CATCGAGGCG CTGTCCGACG ACGAGGTCCG CAGCCGGCTG GCCGAGGAGG TCGCCTGA
|
Protein sequence | MTEMQLLPEL FAAQAARTPD AVAVEDDHRR LSYADLDYRA NRLAYILRAQ GVQPESVVGV CLTRGVDLVV ALLATWKAGG AYLPIDPTHP AARNAGMITD SATALVLVDR LTEQLLTGTG VRQLTLEPDA GTDPRTDGRA PASLVGGANA AYVVYTSGST GRPKGVVITH EGIANRVRWT VGRHGLAATD RVLQKSSIAF DAAAWEIFAP LISGGTVVLP APGVERSVEA MATVVAERAI TILQGVPSVL RPLAAEPAWA RAHRLRLVFS AGEPLDYELA RRLATAPRRP EVWNTYGPTE CAIDVTAFPV QVGGTGPVPI GRAIDGIELL VLDENLTPVP DGVTGDLYAA GAGVARGYKG RPDLTAAGFR PNPFAGDGSR MYATGDRARR RPDGVLEFLG RSDDQVKING VRIEPAEVEA ALAAHPAINL AAVVARPIGD AGLRLVAFVQ ADRDVSTDEL RAHLRGRLPE AMVPAVVRRL AELPRTGSGK LDRAALPEIG PDEAAEPAFV APRTPAERIV AQVWCDLLQL DRVGVHEDFL ALGGESLMLT RLASRLAKAC GGAIDLRGLF DAATVEAQAR LLPPELLVEL SDESASEPGP TERPTQAPAP TPEGTPAGMP VLSSGQRRLW FSDRVRPGGL EWVAPIFLRV PAELSAEQLA GALTELERRH EVLRTRFADR AGEPVAVTGP AGPVELRVVD AATEDALPGL FGEQFGRGFD LENGPIWRAM LVRIPHRPAV VLLTVHHIAT DGWSAVVLER DLTRLCRAAL TGEPADLPAL PLRFADYAAW QHERLGAPAL RDGLAYWRAQ LQGIEPLELP ADRPRAAERD VSGSGVPVQL TGERAAAVEA VSRQLGVTPF VTLLAAFAMT LARHTGRLDF AVGSPVAGRT RPEFENLVGP FLNPIALRCN LSADPTFAEA VARVRATWLD AQANAQVPFE RVVDEVLPRR DLSRTPIYQV GFDLQAGGLA TTGAADPAAD LAFQQAWRVA KTDLTFFVWH RAGGEMTGAL EYATSLFERS TVAQFATRLE QVITAVTVNP DLRLSALPGA DDRLGAGPAP GSTDGTGRAA GVHELIGARA ARCPRAVAVQ AADGRLTYAQ LEQRSDDWAR ALDRLGVGPG DVVPVLLGRS TDLLAALLGV WKVGAAYLPL DPAIPAGRLA TVLATVLATA AAPVLVTDDP ARSQAGPSIL GPQQITAGDG RFPARPAVAE QPAYVIFTSG STGAPKGVRI THGNLAHYLT SWAVDRLAAA GTGGAPVFSS IAFDMSVTAL WAPLLCGQRV LLLPEDLELS ELGRQLVAAG PFSFVKLTPG QLEVLGDQLD EADVDALAAV YVVGGEAFPA ELARHWLAVL GPDRLVNEYG PTEITVADAA HWVAEVGAGA RVPIGTALPG TTAVLLDEQL RPVADGAVGE LFVGGAGVAD GYVGDPALTA QRFLPAPDGP PGARLYRTGD LVRRLPDGGL DVLGRADQQV KIRGYRVEPD EVRAVLVAAP SVRDAVVVAD RQRLIGYVVP ATPDNPPAVD ELLAACRDRL PDYLVPAVLL EIPSVPLTAN GKLDRDRLPD PTAASAGPRR PRTPVQERVA AIWTDLLGVE VGIDDRFFQV GGHSILVLRL VARIQSEFDV AIPVAAVFTN PTIAGLAAVI EDAVLADIEA LSDDEVRSRL AEEVA
|
| |
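The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. It assumes the Gene sequence row is already in coding orientation (it starts with ATG and ends with TGA, so no reverse complement is applied despite the "-" strand). It uses the standard amino acid assignments, which translation table 11 shares with table 1. The helper name `translate` and the two short test fragments are illustrative; the fragments are copied from the start and end of the gene sequence.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino acid assignments (it differs from table 1 only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and line breaks are ignored.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 18 bases of the gene sequence, as formatted in the record.
first_30 = "ATGACCGAGA TGCAGCTGCT GCCCGAACTG"
last_18 = "GCCGAGGAGG TCGCCTGA"

print(translate(first_30))  # MTEMQLLPEL
print(translate(last_18))   # AEEVA*
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above, followed by the stop codon (shown as `*`). To check the whole record, join the wrapped gene sequence lines into one string before calling `translate`.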