Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4891 |
Symbol | |
ID | 5707543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5550435 |
End bp | 5553749 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274286 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001539631 |
Protein GI | 159040378 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0706407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0725779 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG CCGACTCCAC CGAACCCGCC AGCGGCGGCA AACGCGCCCT GCTGGCCAGG CTTCTCACCG AACGGGCCCA GGCGCCCCGC AGCTACCCGG TGTCCTTCGG CCAACAGCGG CTGTGGTTCC TGGACCGATT CACCGGGGGC ATCCCGGTCT ACAACATCCC GGTCGCCTTC CAGGTGCACG GGCCACTGGA CGTCGCGGCG CTGCGTACCG CACTGACCAC GCTCGTCAAC CGGCACGCGG CGCTGCGCAC CACGTTCAGC GAATCCGGCG GCGAACCAGT GCAGGTGGTC CGACCGACCG GCGAGGTGGA CCTCACCGAG ATCGACCTGA CCGGGCTGCC GGTGGATCGG CGTGAGGTCG AGGCCACCCA GCTGGTCCGA GAGCACGCCG GGCAACCGTT CGACCTGGCC GAAGGCCCGC TGCTGCGGGT GGCGGTGATC CGGACCGACG CGGACCGGTT CCACGTGGCG TTCTGCGTAC ACCACATCGT CTTCGACGCC TGGTCGGTCG GGGTGCTCTT CCGGGAGTTG GAGACCGCGT ACGCGGCCGC GTTGGCGGGT ACCCCCGCCA ACCTGCCCGA CCTGCCCACG CAGTACGCCG ATTTCACCGT CTGGCAGCGG GACCGGCTCG CCGGCGACGC GCTGCGCCGA CAGCTCGATC ACTGGGTGGC GCACCTGCGG GGGGCGCCGG CCCTGCTCAG CCTTCCCACC GACCGGCCCC GACCGGCCAA CCGCTCGTAT CGGGGCGCCT GCCACTACGT CACGGTCCCG GCGGCGGCCG TCCGACGGGT CGAGGAGTTC AACCAGGACG CCGGGGTCAC CATGTTCATG ACCCTGCTCG CCGTCTTCCA GGCTGTGCTG TCCCGACACA GTGGGCAGGA CGACATCGTG ATCGGCGCAC CGGTCGCCGG CCGGGAGCAT CCCGACCTGG AGCGGCTGGT TGGCTTCTTC GTCAACACGC TCGCCCTGCG GGTGTCGTCC GCCGGCTCCC CGTCGCTACG GCAACTCGTC GACCGGGTGC GGGAGGTCAC CCTCGCCGGT CTCGGCAATG CCGAGGTGCC GTTCGAGAAG GTGGTCGAGG AACTGCAGCC GGCACGTAGC CTGGCCCACG CACCGATCTT CCAGGCCCAG CTGATCCTGC AGAACGCACC GCACAACGCC TTTCGTCTCA GCGGCTGCAC GACCACCTCG CTGCGGGTCG ACAGCGGCAC GGCCAAGTTC GACCTCACTC TCGCCGGTGA GATGACCGCC GAGGGCGCCC TGCGGTTGGC CTTCGAGTAC GACACCGAAC TGTTCGATGC CGCCACGGTC GACCGGCTGG CCCGGCACCT GTGCACCCTG CTGGACGCGG CGGTCACCGA GCCGGATCGT CCCCTGACGC GGTTCCCGCT GCTCAGCGGA GTGGATCGGT GGCGTGCCGT GGTCGAGTGG AATCAGACCG ACCGGGGTAC GTTGCCGGTC GACACCATCC TGGACCTGCT TCCCACCGAC CCCGCCGAGC CCGGCGCCCC GCCTGCCGTC ACCGGTCCGG ACGGGCACCT GGACCGGGCC GGGCTGCACC GGCGGGCCGG GCAGATCGCC CGGCAACTGC TCGCTGCCGG CGTCGCCCCG GACACCCCGG TGGGGATCTG CCTGGACCGC GGGGTCGACA TGGTCGCCGC GGTGCTCGGC GTGTGGCGGG CCGGGGCCGG CTACCTACCG CTCGACCCCA CCCTGCCCCC CGAGCGGCTG CGCCACCTGC TCGCCGACTC CGGCACCCGG GTCGTGCTGA CCCACCAGGC GGTCGTCGCG CGGCTCGGCC CGGCGCTGGA GGGCTCGGTG ACGATGCTGC TCGACGATGC CACCGATGTC CCCGGCCCGG ACGAGCCACT CCCGGCGGTC CCGGCGCATC CGGACGGGCT GGCGTACCTG ATCTACACCT CGGGTTCGAC CGGCCAACCG AAAGGGGTCG CGGTCCCACA CCGCAGCGTG ACCAACCTCG TTGCCTCCTT CCACGACGAC CTGGACCTGA CGCCCGAGGA CCGGTTCGCC GCGGTCACCA CCCTGTCGTT CGACATCTCG GTGCTGGAAC TGCTGGTGCC GCTGCTGCTG GACGTCCCGC TGCTGGTCGT GGGCGCCGAC GAGGTCGGCG ACGGTCCGGC CCTGCGTCGC CGGCTCACCG AAGCGGGGAT CACCGCCATG CAGGCCACCC CGGCGACGTG GCGGCTGCTG CTGGCGTCCG GTGGTGTACC GCCGACGCTG CGGCTACGCC TCTGCGGCGG CGAGGCCCTA CCCCGGGACC TCGCCGACGC GCTACAGGCC GACGGTGCGG CCCTGTGGAA CTGCTACGGG CCCACCGAAA CCACCGTCTG GTCCGCGGCG ACCCCCGTGG CGCCTGCCCC GGCCGCGGTG GACCTCGGTA AACCGATCGC CAACACCCGG ATCTACCTCC TCGACGAGGT CTACCAGCCG GTGCCGGTGG GCGTGGTGGG AGAAATCCAC ATCGGCGGGT CCGGTGTGGT GCGCGGATAC CACAGTCGAC CTGGCCTGAC CGCCGGTCGG TTCGTCCCCG ACCCGTTCGC CGACCAGCCC GGCGCCCGGC TCTACGCCAC CGGTGACCTG GCCCGGCAGC GCGCTGACGG TCGGCTGGAG TTTCTTGGCC GCACCGACCA TCAGGTCAAG GTGCGTGGGT TCCGGATCGA GTTGGGTGAG ATCGAAGCCC TGCTCCGTGG CCACGATCTG GTCGCGGATG CGGTGGTCGG CACCTGGGCC GGCGGGGACG GCGACACCCG CCTGGTGGCG TACGCCGTGC CGGCGCACGG TGTCGACCCG GACTCCCTCG CCGACCAGGT CCGTGCCGAC CTGGCCGGCC GGCTGCCCGA GTACATGCTT CCCGCTGCCC TGGTGCCGTT GACCGCGCTG CCCCTCAACG ACAACGGCAA GGTCGACCGG AACGCCCTGC CCACCCCCCA GTGGACCGAC CCGCGGGCGG AGCGGGTCGC CCCCCGCGAC CCCCTCGAGC AGCTACTCGC CGGGATCTGG CAGGAGGTAC TGCACGTCGA GGGGATCGGT GTGCACGACG ACTTCTTCCG CCTCGGTGGG CATTCACTCC TCGGTGCGCA GGCGTTGAGC CGGATCGGTG CCGCGCTGGA GACGGAGGTG CCGATCCGGA TCCTCTTCGA GGCTCCGACG ATCGAGGCGA TGGCCCGCGC GCTGCGCTCC ACGGAGGAGG TAGCCGGCCA GACCGACGCC ATCGCCGCCC TCCGGGTGGA GGTGGCCGAC CTCTCCGACG AGGAACTGCG GGCCCTGCTG GGCGGCCAGG AGTGA
|
Protein sequence | MTTADSTEPA SGGKRALLAR LLTERAQAPR SYPVSFGQQR LWFLDRFTGG IPVYNIPVAF QVHGPLDVAA LRTALTTLVN RHAALRTTFS ESGGEPVQVV RPTGEVDLTE IDLTGLPVDR REVEATQLVR EHAGQPFDLA EGPLLRVAVI RTDADRFHVA FCVHHIVFDA WSVGVLFREL ETAYAAALAG TPANLPDLPT QYADFTVWQR DRLAGDALRR QLDHWVAHLR GAPALLSLPT DRPRPANRSY RGACHYVTVP AAAVRRVEEF NQDAGVTMFM TLLAVFQAVL SRHSGQDDIV IGAPVAGREH PDLERLVGFF VNTLALRVSS AGSPSLRQLV DRVREVTLAG LGNAEVPFEK VVEELQPARS LAHAPIFQAQ LILQNAPHNA FRLSGCTTTS LRVDSGTAKF DLTLAGEMTA EGALRLAFEY DTELFDAATV DRLARHLCTL LDAAVTEPDR PLTRFPLLSG VDRWRAVVEW NQTDRGTLPV DTILDLLPTD PAEPGAPPAV TGPDGHLDRA GLHRRAGQIA RQLLAAGVAP DTPVGICLDR GVDMVAAVLG VWRAGAGYLP LDPTLPPERL RHLLADSGTR VVLTHQAVVA RLGPALEGSV TMLLDDATDV PGPDEPLPAV PAHPDGLAYL IYTSGSTGQP KGVAVPHRSV TNLVASFHDD LDLTPEDRFA AVTTLSFDIS VLELLVPLLL DVPLLVVGAD EVGDGPALRR RLTEAGITAM QATPATWRLL LASGGVPPTL RLRLCGGEAL PRDLADALQA DGAALWNCYG PTETTVWSAA TPVAPAPAAV DLGKPIANTR IYLLDEVYQP VPVGVVGEIH IGGSGVVRGY HSRPGLTAGR FVPDPFADQP GARLYATGDL ARQRADGRLE FLGRTDHQVK VRGFRIELGE IEALLRGHDL VADAVVGTWA GGDGDTRLVA YAVPAHGVDP DSLADQVRAD LAGRLPEYML PAALVPLTAL PLNDNGKVDR NALPTPQWTD PRAERVAPRD PLEQLLAGIW QEVLHVEGIG VHDDFFRLGG HSLLGAQALS RIGAALETEV PIRILFEAPT IEAMARALRS TEEVAGQTDA IAALRVEVAD LSDEELRALL GGQE
|
| |