Gene Sare_2097 details


Gene Information

Locus tag: Sare_2097
Symbol:
ID: 5704676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2415279
End bp: 2416616
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 68%
IMG OID: 641271582
Product: condensation domain-containing protein
Protein accession: YP_001536953
Protein GI: 159037700
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
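
The coordinates and lengths in this record are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes that start and end are 1-based inclusive positions and that the CDS span includes the stop codon (the gene sequence in this record ends in TGA). The function name `cds_lengths` and its parameters are hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp, includes_stop=True):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates. If includes_stop is True,
    the final codon is treated as a stop codon and is not counted
    as an amino acid.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    protein_len = codons - 1 if includes_stop else codons
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(2415279, 2416616)
print(gene_len, protein_len)  # 1338 445
```

The result matches the listed gene length of 1338 bp and protein length of 445 aa.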


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00926159
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCGATA CCGCGTTGCG GGTACCGCTG TCGCTCCAAC AGGACTTCCT CCGCAGGGTG 
GACCACGGCG ACGACGCCGG GCCGTTCGGA TCCCGCTACA CGATCGTCGG CGGTTGGCGC
ATCCGGGGCC CGATCGATGT CGACACGCTT CGGGACGCGC TCGCCGACGT GGTGGCGCGA
CACGAGGCGC TGCGTACGTT GCTCATGGTC GACGGTGACG AGGCATGCCA ACAGATCCAA
CCGCCCAGTA GCCCGGACCT GATGCTGCGC GACCTGCCGG ACCGTGGGCC CGCCGATCGG
GAGCGAATCG CGGAGGACTT CCTCAACGAC GTCGAGTCCG GCCGGTTCGG GATGGACGAG
ACGCCGCTGC TGCGGGCCGT ACTTGGCCGC TTCGACAACG ATGACGCGGT GCTCGCGCTG
GTCGCGCACC ACACCGCCGC CGACGGTTGG TCGATGCAGG TCATCATGCG GGACCTGGCC
AGCTACTACG CCGCGCGCCG GCAGGGTCGC CCCGCCGACC TGCCTCCCGC CCGCCAGTAC
CGGGAGTACG TGGCGTGGCA GCAGGCGAAC GCGGACAGTG AGACGGCCGT CGCGGCCCGG
CGATACTGGC AGGAAAGGCT GCGCGACGCC CAGGTGTGGC CCGTCCGAAC CGACCTGACG
CGGGCGGATG GGCCGTTTGT CACCTCCTGG TACCGCTTCC TGCTGGAGGA CGAACTACGG
GCGGCGACGG TGGCACTCGC CGCGGAGACC CGCAGTACCC CGTTCATGGT CCTGATGGCG
GCGTACCTGA CCCATCTGCG GGAGCGGACC GGAGAGACCG ATCTGGTGGT ACCGACGTTC
ATGCCTGGGC GCAATCCCTC CTGGACCCTG CAGATAGTCG GCTCGTTCTA CAACTTCATC
CCACTGCGCA CCGACACGTC GAACTGCACC GACTTTCGTG ACCTCATCGG CCGGGTGCGG
ACCACCTGCC TGGACGCATA CCGCCACGAA CTCCCGTTCG CCGACATCAT CGCGCAGGCA
CCGGACGTGA TGAACGCGGC GATCGGGCCG GATGCGGCGG CGTGCGTCCT GCAGGTCACC
CAGTCGCCGT ACGTCCTACG TGAGGAGCAG GTCGGTGACC TGCGATACAC GGCACTGCGC
CGGCGGCTGG TCTCGGCGCC GGTCGGTTCG CAGATCCCTG ATGGAGCACT GCTTGGCCTG
GAACTCGATC CCGACGGCGG CATCGTCGGC AGCATCGGGT TCACCACGAA CCTGTTCGTC
GAGAGCACCA TCGTCGGCAT GGCCGCTGAC TTCCAGCAGA CACTACGCGA CGTACTCCAC
CCTTCGTCTC GGCGCTGA
 
Protein sequence
MIDTALRVPL SLQQDFLRRV DHGDDAGPFG SRYTIVGGWR IRGPIDVDTL RDALADVVAR 
HEALRTLLMV DGDEACQQIQ PPSSPDLMLR DLPDRGPADR ERIAEDFLND VESGRFGMDE
TPLLRAVLGR FDNDDAVLAL VAHHTAADGW SMQVIMRDLA SYYAARRQGR PADLPPARQY
REYVAWQQAN ADSETAVAAR RYWQERLRDA QVWPVRTDLT RADGPFVTSW YRFLLEDELR
AATVALAAET RSTPFMVLMA AYLTHLRERT GETDLVVPTF MPGRNPSWTL QIVGSFYNFI
PLRTDTSNCT DFRDLIGRVR TTCLDAYRHE LPFADIIAQA PDVMNAAIGP DAAACVLQVT
QSPYVLREEQ VGDLRYTALR RRLVSAPVGS QIPDGALLGL ELDPDGGIVG SIGFTTNLFV
ESTIVGMAAD FQQTLRDVLH PSSRR
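
Both sequences are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before computing lengths or base composition. The following is a minimal sketch of that cleanup together with a GC-fraction helper. The names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (six blocks of 10 bases).
first_line = "ATGATCGATA CCGCGTTGCG GGTACCGCTG TCGCTCCAAC AGGACTTCCT CCGCAGGGTG"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the same helpers can be used to compare the cleaned length and GC fraction against the Gene Length and GC content fields listed under Gene Information.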