Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2097 |
Symbol | |
ID | 5704676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2415279 |
End bp | 2416616 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271582 |
Product | condensation domain-containing protein |
Protein accession | YP_001536953 |
Protein GI | 159037700 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00926159 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGATA CCGCGTTGCG GGTACCGCTG TCGCTCCAAC AGGACTTCCT CCGCAGGGTG GACCACGGCG ACGACGCCGG GCCGTTCGGA TCCCGCTACA CGATCGTCGG CGGTTGGCGC ATCCGGGGCC CGATCGATGT CGACACGCTT CGGGACGCGC TCGCCGACGT GGTGGCGCGA CACGAGGCGC TGCGTACGTT GCTCATGGTC GACGGTGACG AGGCATGCCA ACAGATCCAA CCGCCCAGTA GCCCGGACCT GATGCTGCGC GACCTGCCGG ACCGTGGGCC CGCCGATCGG GAGCGAATCG CGGAGGACTT CCTCAACGAC GTCGAGTCCG GCCGGTTCGG GATGGACGAG ACGCCGCTGC TGCGGGCCGT ACTTGGCCGC TTCGACAACG ATGACGCGGT GCTCGCGCTG GTCGCGCACC ACACCGCCGC CGACGGTTGG TCGATGCAGG TCATCATGCG GGACCTGGCC AGCTACTACG CCGCGCGCCG GCAGGGTCGC CCCGCCGACC TGCCTCCCGC CCGCCAGTAC CGGGAGTACG TGGCGTGGCA GCAGGCGAAC GCGGACAGTG AGACGGCCGT CGCGGCCCGG CGATACTGGC AGGAAAGGCT GCGCGACGCC CAGGTGTGGC CCGTCCGAAC CGACCTGACG CGGGCGGATG GGCCGTTTGT CACCTCCTGG TACCGCTTCC TGCTGGAGGA CGAACTACGG GCGGCGACGG TGGCACTCGC CGCGGAGACC CGCAGTACCC CGTTCATGGT CCTGATGGCG GCGTACCTGA CCCATCTGCG GGAGCGGACC GGAGAGACCG ATCTGGTGGT ACCGACGTTC ATGCCTGGGC GCAATCCCTC CTGGACCCTG CAGATAGTCG GCTCGTTCTA CAACTTCATC CCACTGCGCA CCGACACGTC GAACTGCACC GACTTTCGTG ACCTCATCGG CCGGGTGCGG ACCACCTGCC TGGACGCATA CCGCCACGAA CTCCCGTTCG CCGACATCAT CGCGCAGGCA CCGGACGTGA TGAACGCGGC GATCGGGCCG GATGCGGCGG CGTGCGTCCT GCAGGTCACC CAGTCGCCGT ACGTCCTACG TGAGGAGCAG GTCGGTGACC TGCGATACAC GGCACTGCGC CGGCGGCTGG TCTCGGCGCC GGTCGGTTCG CAGATCCCTG ATGGAGCACT GCTTGGCCTG GAACTCGATC CCGACGGCGG CATCGTCGGC AGCATCGGGT TCACCACGAA CCTGTTCGTC GAGAGCACCA TCGTCGGCAT GGCCGCTGAC TTCCAGCAGA CACTACGCGA CGTACTCCAC CCTTCGTCTC GGCGCTGA
|
Protein sequence | MIDTALRVPL SLQQDFLRRV DHGDDAGPFG SRYTIVGGWR IRGPIDVDTL RDALADVVAR HEALRTLLMV DGDEACQQIQ PPSSPDLMLR DLPDRGPADR ERIAEDFLND VESGRFGMDE TPLLRAVLGR FDNDDAVLAL VAHHTAADGW SMQVIMRDLA SYYAARRQGR PADLPPARQY REYVAWQQAN ADSETAVAAR RYWQERLRDA QVWPVRTDLT RADGPFVTSW YRFLLEDELR AATVALAAET RSTPFMVLMA AYLTHLRERT GETDLVVPTF MPGRNPSWTL QIVGSFYNFI PLRTDTSNCT DFRDLIGRVR TTCLDAYRHE LPFADIIAQA PDVMNAAIGP DAAACVLQVT QSPYVLREEQ VGDLRYTALR RRLVSAPVGS QIPDGALLGL ELDPDGGIVG SIGFTTNLFV ESTIVGMAAD FQQTLRDVLH PSSRR
|
| |