Gene Sare_0718 details


Gene Information

Locus tag: Sare_0718
Symbol:
ID: 5704523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 797970
End bp: 799640
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 68%
IMG OID: 641270236
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001535628
Protein GI: 159036375
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
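
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 797970
end_bp = 799640
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, both checks agree with the record: 799640 - 797970 + 1 = 1671 bp, and 1671 / 3 = 557 codons, one of which is the stop codon, leaving 556 amino acids.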


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000990893
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGCGCA CCTCGACCAC CACGACGGCA GAGACCACCA CGTCGACGCG AGCCACCGGC 
CGTACGAACC TGCCCGCTTC GCCTTACCCC ATCCACCCGC CGGAGCCACT GGCAACATTG
TCGCCCCGTG ACCGCGCGTT GTTCGTACGC TTCGGTTACG GGCCCACCGA ATCCGTCGCC
CACACTCACA TCCACCACGC CTTCGAGTAC TGGGCCACCG TCACTCCGGA GGCCGTGGCC
GTCGAGGATG GCGACGAGAC CATCACCTAT CGCGAGCTCG ACCAGCGAGC CGATCAGTTG
GCGGCCCGGC TCGCCGCATC GGGTGTCCGT CCCGGCGACC GGGTCGCGTT GTTCGTCCGA
CGGTCCATCC CGATGGTGGT CGGCCTGCTC GCGGCCCTGA AGGCCGGTGC GGCCTACGTT
CCGCAACACG TCGACACGGT GCCCCCGGCC CAGCTGCAGC ACGTCATACA CACCGCCGAC
ACCCGTGTGA TCATGACGCT CGCCGCCACG GCGGACCGTA TCCCGGTGCC GGACGGCCAC
GTCGTGATCA CGCTTGACGA CCTGGGAAAG ACGGAGCCAA CCGATCTCAT CGCCGGCCGT
TTCACCCCGG CGACGCCGCT CCCGCCCGAC AGCCCCTGCT ACGTGCTGTT CACATCCGGC
AGCACCGGCC GACCCAACGG GGTCGTCGTG ACACACCGCA ACATCTGCAA CATCCTGCTG
ACCTCGCCGG GCAACCTCGG CATCCAGCCG GGCTGGAAGG TCGGCCAGAT TCTCAACATC
GCCTTCGACA TGGCCAGCTG GGAGATCCTG GGTGCGCTCA GCCACGGCGC CACCCTGGTC
ATCCGTGGCT CGGACATCGT CGAGACGATG TCCCGCGTCG ACGTCATCAT CGCGACCCCC
ACCGTACTGA GCCGCACTGA CCCAGACCGG TGTCAGCGAG TCAAGGTCGT GGCGGTCGCC
GGTGAACCCT GCCCCCGGGC CCTCGCGGAC GCCTGGTCCG CGGTCTGCGC CTTCTACAAC
GCCTGCGGTC CGACCGAAAC CACGATCGTC AACACGATGA GTCGGCACCG CCCGACCGCG
GAGCGACTCA CCATCGGCCG ACCCACACCC AACAACACCG TGTATGTACT CGACGCTGAC
CTTCGCCCGT GCCCAATCGG CACCGTCGGT GAGATGTGGG CCGGCGGTGA CTGCGTCTCG
GCGGGCTACC TCAGCAATGC CCGGCTCACC GCCGAGCGGT ACGCCCTCGA CCCGTTCCTG
GGGCACGGTC GCCTCATGTT CCGCACCCGT GACCTGGGGC GTTGGACTCC CGACGGCGAG
TTGGAACACT TCGGACGGAC CGACGACCAG GTCAAGGTCC GTGGTTTCCG GGTGGAGCTG
GACTCGGTGT CCGCGATCCT GGAGGCCGTA CCGGGCTGCA CCCGGGCGGC GACCATCAAG
CTCGACGACC GCAGCCTCGT CTCGTTCGTG GCCCCCGCGG AGGTGGACCC GGACCTCGCC
CGGATGGCGG TGTCCGAGGC GCTGCCGTAC TACTGCGTCC CCGAGACCGT GCACACCCTG
CCGGAGCTTC CCACGACCAG CCGAGGAAAG ATCGACAAGC TGGCACTCCG CCGCCTCGTC
ACCCGCCAGG ACCAGCTCGT CACCAGCCAG GAGCAGGGGG CGACGCGATG A
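
The GC content field in this record is presumably the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as a string with the ten-base blocks separated by whitespace. The function name `gc_percent` is hypothetical, and how the source database rounds the value is not stated.

```python
def gc_percent(dna):
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence above, used only as a small example.
first_block = "ATGCAGCGCA"
print(gc_percent(first_block))
```

To check the full gene, pass the complete gene sequence (all lines joined) to `gc_percent` and compare the rounded result with the reported value.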
 
Protein sequence
MQRTSTTTTA ETTTSTRATG RTNLPASPYP IHPPEPLATL SPRDRALFVR FGYGPTESVA 
HTHIHHAFEY WATVTPEAVA VEDGDETITY RELDQRADQL AARLAASGVR PGDRVALFVR
RSIPMVVGLL AALKAGAAYV PQHVDTVPPA QLQHVIHTAD TRVIMTLAAT ADRIPVPDGH
VVITLDDLGK TEPTDLIAGR FTPATPLPPD SPCYVLFTSG STGRPNGVVV THRNICNILL
TSPGNLGIQP GWKVGQILNI AFDMASWEIL GALSHGATLV IRGSDIVETM SRVDVIIATP
TVLSRTDPDR CQRVKVVAVA GEPCPRALAD AWSAVCAFYN ACGPTETTIV NTMSRHRPTA
ERLTIGRPTP NNTVYVLDAD LRPCPIGTVG EMWAGGDCVS AGYLSNARLT AERYALDPFL
GHGRLMFRTR DLGRWTPDGE LEHFGRTDDQ VKVRGFRVEL DSVSAILEAV PGCTRAATIK
LDDRSLVSFV APAEVDPDLA RMAVSEALPY YCVPETVHTL PELPTTSRGK IDKLALRRLV
TRQDQLVTSQ EQGATR
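
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this in plain Python, assuming the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Compact codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[index]
    for index, codon in enumerate(product(BASES, repeat=3))
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 51 bases of the gene sequence listed above.
first_30 = "ATGCAGCGCA CCTCGACCAC CACGACGGCA"
last_51 = "ACCCGCCAGG ACCAGCTCGT CACCAGCCAG GAGCAGGGGG CGACGCGATG A"
print(translate(first_30))  # MQRTSTTTTA
print(translate(last_51))   # TRQDQLVTSQEQGATR
```

Both fragments reproduce the matching ends of the protein sequence above: the first ten residues and the final sixteen, with the terminal TGA read as the stop codon.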