Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4417 |
Symbol | |
ID | 5060903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 5003226 |
End bp | 5006540 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640476680 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001161223 |
Protein GI | 145596926 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.543215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG CTGACTCCAC CACACCCGGC AGCGGCGGCA AGCGTGCCCT GCTGGCCCGC CTTCTCACCG AACGGGCCCA GTCGCCCCGC AGCTATCCGG TCTCCTTCGG CCAACAGCGG CTGTGGTTCC TGGACCGTTT CTCCGGGGGC ATCCCGGTCT ACAACATCCC GGTCGCCTTC CGGGTGCACG GGCCACTGGA CGTCGCAGCG CTGCGTACCG CACTGACCAC GCTCGTCAAC CGGCATCCGG CGCTGCGCAC CACGTTCAGC GAATCCGGCG GCGAACCAGT GCAGGTGGTA CGCCCGACCG GCGAGGTGGA TCTCACCGAG ACCGACCTGA CCGGGTTGCC GGCGGAGCGG CGCGAGGCCG AGGCCACCAC ACTCGTCCGA GCGCACTCCG GGCAACCGTT CGACCTGGCC GGAGGCCCGT TGCTGCGGGT AACAGTGATC CGGACCGACA CGGATCGGTA CCACGTGGCG TTCTGCGTGC ACCACATCGT CTCCGATGCC TGGTCGATCG GGGTGCTCTT CAAGGAGTTG GCGACCGCGT ACCCGGCTGC GCTGGCGGGT GCCCCGGCCC ACCTGCCCGA CCTGCCCACG CAGTACGCCG ACTTCTCCAT CTGGCAGCGG GACCGGCTCG CCGGTGACGC GCTGCGTCGA CAGCTCGACC ACTGGGTTTC GCACCTGCGG GGGGCACCGG CCCTGCTCAG CCTTCCCACC GATCGGCCCC GCCCGGCCAA CCGCTCGTAT CGGGGCGCCT GCCACTACGT CACCGTCCCG GCGGCGGCCG TTCGACGTGT CGAGGAGTTC AACCAGGACG CCGGGGTCAC CATGTTCATG ACCCTGCTCG CCGTGTACCA GGCGGTGTTG TCCCGACACA GCGGGCAGGA CGACATCGTG ATCGGCGCCC CTGTTGCCGG CCGGGAGCAT CCCGACCTGG AGCGGCTGGT CGGCTTCTTC GTCAACACAC TTGCCCTGCG GGTGTCGTCC GCTGGCTCCC CATCGTTGCG GCAACTCGTC AACCGGGTAC GGGAGGTCAC CCTCGTCGGT CTCGGCAACG CCGAGGTGCC GTTCGAGAAG GTGGTTGAGG AACTGCAGCC GGAGCGCAGT CTGGCCCACG CACCGATCTT CCAGGCCCAG CTGATCCTGC AGAACGCACC GCAGAACGCC CTTCGTCTCG CCGGCTGCAC GGCCACCTCC CTGCAGGTCG ACAGCGGCAC CGCCAAGTTC GACCTCACCC TCGCCGGTGA GCTGACCGCC GAGGGTGCCC TGCGGTTGGC CTTCGAGTAC GACACCGAAC TGTTCGACGC CGGCACGGTC GACCGGTTGG CCCGGCATCT GTGCACCCTG CTGGACGCGG CGGTCGCCGA GCCGGATCGC CCGCTGACGC GGCTTCCGCT GCTCAGTGGA GTGGAGCGGT GGCGTGCCGT GGTCGAGTGG AATCAGACCG ACCGGGGCAC GTTGCCGGTC GGCACCATCC TGGACCTGCT ACCCACTGAA CCCTCGGAGT CCGGCGCCCC GCCTGCCGTC ACCGGTCCGG ACGGGCACCT GGACCGAGCC GGTCTGCACC GGCGGGCCGG ACAGATCGCC CGGCGGCTAG TCGCCGCCGG TGTCGCCCCG GACACCCCGG TCGGCATCTG CCTGGATCGC GGGGTCGACA TGGTCGCCGC GGTGCTCGGC GTATGGCGGG CCGGGGCCGG TTACCTACCG CTCGACCCCA CCCTGCCCCC CGAGCGGCTG CGCCACCTGC TCGTCGACTC CGGCACCCGG GTCGTGCTGA CCCATCAGGC GGTTGCCGCG CGGCTCGGGC CGGTGCTGGC GGGCTCGGTG ACGGTGCTGC TCGACGATGC CACCGACGCC GCCGGCCCAG ATGAGCCACT TCCGGCGGTC CCGGCGCATC CGGACGGACT GGCATACCTG ATCTACACCT CGGGTTCGAC CGGTCAACCG AAAGGGGTGG CGGTCCCACA CCGCAGTGTG ACCAACCTCG TTGCCTCCTT CCACGACGAC CTGGACCTGA CGTCCGAGGA CCGGTTCGCC GCGGTCACCA CCCTGTCGTT CGACATCTCG GTGTTGGAAC TGCTGGTGCC GCTGCTGCTG GACATCCCGC TGCTGGTCGT GGGTGCCGAC GAGGTCGGCG ACGGGCCGGC CCTGCGTCGT CGGCTCACCG AAGCGGGGAT CACCGCCATG CAGGCCACGC CGGCGACCTG GCGACTGCTG CTGGCATCCG GCGGCGTACC GCCGACGCTG CGGCTGCGCC TCTGCGGCGG TGAGGCGCTA CCCCGGGACC TTGCCGACGC CCTGCAGGCC GACGGCGTAA CCCTGTGGAA CTGTTATGGG CCCACCGAGA CCACCGTCTG GTCCGCGGCG GCCCCCGTGG CGCCTGCCCC GGCCGCGGTG GACCTCGGTT CGCCGATCGC CAACACCCGG ATCTACCTGC TCGACGAGGC ATACCAGCCA GTGCCGGTGG GCGTGGTGGG AGAAATCCAC ATCGGCGGCT CGGGTGTGGT CCGTGGATAC CACGGCCGAC CCGGCCTGAC CGCCGGTCGG TTCGTCCCCG ACCCGTTCGC CGACGAGCCC GGCGCCCGGC TCTACGCCAC TGGTGACCTG GCCCGGCAGC GCGCTGACGG CCGGCTGGAG TTCCTCGGCC GCACCGACCA TCAGGTCAAG GTGCGCGGGT TCCGGATCGA GTTGGGCGAG ATCGAAACCC TGCTACGGGG CCACGATCTG GTCGCGGACG CGGTGGTCGG CACCTGGGTC GGCGGGGACG GCGACACCCG CCTGGTGGCG TACGCCGTGC CGGCGTCCGG CGTTGACCCG GACGCCCTCG CCGGTCAGGT CCGTCCCCAC CTGTCCGGCC GACTGCCGGA GTACATGCTT CCCGCGGCCC TGGTGCCGAT GACCGCGTTG CCTCTCAACG GCAACGGCAA GGTCGACCGG AACGCCCTGC CCACCCCGAG GTGGACCGAC CCGCGGGCGG AGCTGGTCGC CCCCCGCGAC CCCCTCGAGC AGCTACTCGC CGGGATCTGG CAGGAGGTTC TGCACGTCGA GAGGATCGGT GTGCTCGACG ACTTCTTCCG CCTCGGTGGG CATTCACTAC TCGGCGCGCA GGCGTTGAGC CGGATCGGCG CCGTGCTGGA GACGGAGGTA CCGATCCGAA TCCTCTTCGA GGCACCGACG ATCGACGCGA TGGCCCGCGC GTTGCGCTCC ATGGAGGAGG TGGCCGGCCA GACCGACGCC GTCGCCGCCC TTCGGATGGA GGTGGCCGAC CTCTCCGACG ACGAACTACG GGCCATGCTG GGCGGTCAGG AGTGA
|
Protein sequence | MTTADSTTPG SGGKRALLAR LLTERAQSPR SYPVSFGQQR LWFLDRFSGG IPVYNIPVAF RVHGPLDVAA LRTALTTLVN RHPALRTTFS ESGGEPVQVV RPTGEVDLTE TDLTGLPAER REAEATTLVR AHSGQPFDLA GGPLLRVTVI RTDTDRYHVA FCVHHIVSDA WSIGVLFKEL ATAYPAALAG APAHLPDLPT QYADFSIWQR DRLAGDALRR QLDHWVSHLR GAPALLSLPT DRPRPANRSY RGACHYVTVP AAAVRRVEEF NQDAGVTMFM TLLAVYQAVL SRHSGQDDIV IGAPVAGREH PDLERLVGFF VNTLALRVSS AGSPSLRQLV NRVREVTLVG LGNAEVPFEK VVEELQPERS LAHAPIFQAQ LILQNAPQNA LRLAGCTATS LQVDSGTAKF DLTLAGELTA EGALRLAFEY DTELFDAGTV DRLARHLCTL LDAAVAEPDR PLTRLPLLSG VERWRAVVEW NQTDRGTLPV GTILDLLPTE PSESGAPPAV TGPDGHLDRA GLHRRAGQIA RRLVAAGVAP DTPVGICLDR GVDMVAAVLG VWRAGAGYLP LDPTLPPERL RHLLVDSGTR VVLTHQAVAA RLGPVLAGSV TVLLDDATDA AGPDEPLPAV PAHPDGLAYL IYTSGSTGQP KGVAVPHRSV TNLVASFHDD LDLTSEDRFA AVTTLSFDIS VLELLVPLLL DIPLLVVGAD EVGDGPALRR RLTEAGITAM QATPATWRLL LASGGVPPTL RLRLCGGEAL PRDLADALQA DGVTLWNCYG PTETTVWSAA APVAPAPAAV DLGSPIANTR IYLLDEAYQP VPVGVVGEIH IGGSGVVRGY HGRPGLTAGR FVPDPFADEP GARLYATGDL ARQRADGRLE FLGRTDHQVK VRGFRIELGE IETLLRGHDL VADAVVGTWV GGDGDTRLVA YAVPASGVDP DALAGQVRPH LSGRLPEYML PAALVPMTAL PLNGNGKVDR NALPTPRWTD PRAELVAPRD PLEQLLAGIW QEVLHVERIG VLDDFFRLGG HSLLGAQALS RIGAVLETEV PIRILFEAPT IDAMARALRS MEEVAGQTDA VAALRMEVAD LSDDELRAML GGQE
|
| |