Gene Sare_2079 details

Gene Information

Locus tag: Sare_2079
Symbol:
ID: 5706799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2390083
End bp: 2392095
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 70%
IMG OID: 641271565
Product: condensation domain-containing protein
Protein accession: YP_001536936
Protein GI: 159037683
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
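
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The record does not state either convention, but its numbers are consistent with both.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
gene_len = cds_length(2390083, 2392095)
print(gene_len)                  # 2013
print(protein_length(gene_len))  # 670
```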


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.262269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCGACC CCGCCGATAC CACCAGTCCG ACCGCACTCG TCGATGCCGT GCGCGGTGTC 
TGGGCCGATG TCCTGGACGT CGACGTGACC GCCGTTCCGG GCGATGCCAG CTTCCTCAGC
CTCGGTGGCG ATTCCGTCCT GACCGTCCGG ATGGCGGCGC TGGTCCGACA ACGGCTCGGG
GTGGCCCTGG CCCTGGCCGA CGTTCGCGTC GAACACAGTC CGGCCCAGTT GGCCGCCCTC
ATCCAGGAGC GCGGTACCGC CACGGGTGGC GTGCGCGCGC TACCGCTGGA CCTCAAGCGA
CGGGACGACC CGGAAGCCCC GTTCCCGCTT CTTCCGCTGC AACAGGGCTA CTTCGTCGGC
CAGCAGGATG GATGGGAGCT CTCCTACCGG TCGGCCCACC ACTACGTGGA CATCGGCCTG
GAGGACATCG AAACAGACGA GATCGCTGAG GCGTTGCAGG ACGCGCTGGA GCGGCTCGCC
GAGCACCAGG CGGTGCTGCG CGCGCGGATC CTGCCCGACG GACGGCAGCG GATCCTGCCC
CTGGACGATC CGGAGGCGAT CCCGGTGCTG CGGGTGACCG ACCTCAGCAC GGCTAGCGCG
GACGAGATCG CCGAAAGGCT GGCCGCCATC CGTCGCGAGA TGAGCACCGA CGGCCCCGAC
CCGACCCGAG GATGCGGTCT GGACATGCGG CTGACCCTGC TTCCGGGGGC CAAGGCGCGG
CTGCACTCCT CGACCAGCCT GATGATCGTC GACGGCTGGT CGTCGGGAGT CTTCTACCGC
GACCTGTTTG CCCTGGTCAC CGATCAAAAC GCGATGCTCA CACCGCTGGA CGTCGACTTC
GGTGACTACG CCGTGACCCT GGACGGGCTG TCCGAGACCG AGTCGTGGCG GGCCGACCGT
GACTGGTGGT GGAATCGCCT CGACAGCCTT CCGCTTCCCC CGGCCCTGCC GCTGATCGCC
GACCCGGCCG ACGTCCGACC GGCCCTGATG GGTGCGCGGC AGGCGGTCCT GGACGCGGAC
CGTTGGGCAG CCCTACGGGA ACACTGCAGT CGACATGGCG TCACCCCGTC CGCGGCCATG
TTCGCCGTCT TCTCCACCGC GCTGGCCCGT GCCTGCGGAC ACCGCAGATT CCTGCTCAAC
ACCCTGCAGC TGAACCGCCT CCCGCTGCAC CCGGATGTGC CCCGGCTGGT CGGTGCCTTC
TCCTCCACCA TGCTCGTACC GGTGGAGCTG CCCCAGACGG CGACCTTCTC CGACCTCGCG
GTCCGCGCGC AGCGTGACAT CGGGGAGGCC ATGGCGCACA ACCTGGTGAC CGGGGTGGAG
GTCTCCCGCG AACTCGGACG GCGCCGTGAC ACCCGACGGC CGGTGGCTCC GGTGGTCTTC
CAGAGCACGT TGGGGGTCGA CGCCGCGCTC GGTAGCGAGG TGCCCCGAGC CGCCGGACCG
TTGGGCTCGA TCGACCTGTC CAGCCACTAC CAGCAGCTAC GGACGCCGCA GGTGGCGCTC
GAGGTTCGGC TGTTCGAGCT ACGCAACGAG CTGGTGGTCG TGTTCTCCCT GGTAGAGGAA
ATCTTCGACG CCTCGTACGT GGACCGGATG TTCGGGGAAG TCATGGCCAT GATCGAGTCA
CTGGTCGAGG CGGATCGCTG GTCCGCGCCG GTCGACCTGC CGGGCTTTCT GGACTCGCCC
GAGCAGGGCC CCTCGCTTGC CCGCCTGTCC GTGCCGACCG GGTCGACCGG GACGGTCGAC
GAGCCTGGGC CGCCGCGCGA CGATCTCGAA CGGGCGATCG CGGATCACTG GGCGGTGCTG
CTCGGCTGCG GAGTGCCGGA CCGGGCCGCG AACTTCTTTG CGCTCGGCGG GGACTCGCTG
CTCGCGGTAC GGATGCTCGG AGCCCTCGCC CGGGAGAAGG TCGGGTGGGT GACGCCGCGT
CGGTTCCTTG ACCATCCCAC TGTGGCGGGA CTCGCCAGCG CTGTCCGTGA GCCGGCCGAA
GCCGCCGGCC AGAACGTCGG CATCGGATCG TGA
 
Protein sequence
MTDPADTTSP TALVDAVRGV WADVLDVDVT AVPGDASFLS LGGDSVLTVR MAALVRQRLG 
VALALADVRV EHSPAQLAAL IQERGTATGG VRALPLDLKR RDDPEAPFPL LPLQQGYFVG
QQDGWELSYR SAHHYVDIGL EDIETDEIAE ALQDALERLA EHQAVLRARI LPDGRQRILP
LDDPEAIPVL RVTDLSTASA DEIAERLAAI RREMSTDGPD PTRGCGLDMR LTLLPGAKAR
LHSSTSLMIV DGWSSGVFYR DLFALVTDQN AMLTPLDVDF GDYAVTLDGL SETESWRADR
DWWWNRLDSL PLPPALPLIA DPADVRPALM GARQAVLDAD RWAALREHCS RHGVTPSAAM
FAVFSTALAR ACGHRRFLLN TLQLNRLPLH PDVPRLVGAF SSTMLVPVEL PQTATFSDLA
VRAQRDIGEA MAHNLVTGVE VSRELGRRRD TRRPVAPVVF QSTLGVDAAL GSEVPRAAGP
LGSIDLSSHY QQLRTPQVAL EVRLFELRNE LVVVFSLVEE IFDASYVDRM FGEVMAMIES
LVEADRWSAP VDLPGFLDSP EQGPSLARLS VPTGSTGTVD EPGPPRDDLE RAIADHWAVL
LGCGVPDRAA NFFALGGDSL LAVRMLGALA REKVGWVTPR RFLDHPTVAG LASAVREPAE
AAGQNVGIGS
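
The protein sequence follows from the gene sequence under translation table 11, and the GC content can be recomputed from the gene sequence as well. The following is a minimal sketch, not the database's own method. All names are hypothetical. It assumes the sequence is given 5' to 3' on the coding strand, and it treats only ATG, GTG and TTG as initiator codons read as Met (table 11 lists a few rarer ones that are left out here).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the common bacterial initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a start codon as M and dropping a final stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()  # terminal stop codon is not part of the protein
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACCGACC CCGCCGATAC CACCAGTCCG"))  # MTDPADTTSP
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison with the protein sequence and the GC content listed in the record.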