Gene Sare_3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3087 
Symbol 
ID5706822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3501497 
End bp3506215 
Gene Length4719 bp 
Protein Length1572 aa 
Translation table11 
GC content68% 
IMG OID641272523 
Producthypothetical protein 
Protein accessionYP_001537891 
Protein GI159038638 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0122082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGT CGGACGCGAT CCTGGTCGGT GAGGGCTGGA TCTCGGAGCA CTACTTCACC 
ACCGACGCGA CAAAGGAGTC GTTCCGCGCC CGGGTGCTGG AGCGTCGCAA GGAATGGGAC
GCAGAAGCCG AAGAGCGCCG GCCCACCCCG CGCAGCCGGT TCATCGAGGC CCGGCAGGAG
TTGGAAGCCG ACCTTGCCAA GCTCGCCGAA TTGACCGACG TGGAGGCCGA CCTCGGCGCC
ACCGATCCGC AGACCATCGC CGACGCGGTC GCGTCGATCC AGCAGCGCCT CGTCGGCATC
CTGGAACTAC GCGAGCACGG CCTCGTCGTC GGCGGCGACG GGCCGATCCT GCGCGTCTCC
GCCCCCGGCA TCACCGAACG GGCACCACTG GTCGTCGTGT TCGCCCAGCC GGTAGCCACG
GTCGAGGATG TGCTGGCCAA GGACGGCCGG ACGCTGGCCG AGCCGGTGCG CCTGCACGAC
GACAGCGACG AACTCACCTC GGCCGCGCGG CTTGCCTCCG CCCTGTTCGG CGAGGACGAC
GCCCCCGATC TGATCCTCAT CCTGGCGGGC CGCTGGGCGG TGCTGGCCGA ACGGGAACGC
TGGGCCGAAG GCCGCTATCT CGCCATCGAC ATGCAGCTGA TCTGCGAACG CAACGACACC
AGACGCGGCG GTGAGATAGA CCGCGCCCTG ACCTGCCTGG CTGCCGCCTC GATCGCTCCG
GACGCGGACG GCAACCTGTG GTGGAGCGGC GTCCTCGACG AGTCGATCAA ACACACCGTC
GGCGTCTCCA AGGACCTCCG CGAGGGCGTA CGCCTCTCCA TCGAGATCAT CGCCAACGAG
GTGGTGGCCC GCCGCCGCGA CCAGGGCCTC GACCCGCTGC CCGCGGCGGA GGCGCAGCCG
CTGGCCAAGC AGGCGCTGCG CTTCCTCTAC CGCATCCTCT TCCTGCTCTA CGCGGAGGCG
TCGCCCGAGC TGGGCGTACT GCCGGTCGGC GCCCCCGAGT ACGACCGGGG TTACAGCCTG
GACCGGCTGC GGGAACTCGT CCAGGTAGAA CTGGCCACCC CGCGCGCCCG AACCGGCACT
CACCTCTACG AGTCGCTGGG CGTGCTGTTC CGGCTCGTCG ACAAGGGCCA CCAGAGCGTC
GAGCCGGGCG ACGAGGAGGC GCTGTCCGCG CCAGGGCTCA CGTTCAACCC GCTGCGCGCC
GACCTGTTCC GGCCGAGCGC CACCGCGCAC ATCGACGCGG TCGGTCTCGG CAACGCCGCA
CTTCAGGAGG TCCTCACCCA CCTGCTGCTG AGCAAGGAGC GGCGCGGCCG GGACCGAGGC
TTTATCTCGT ATGCGGAACT GGGCATCAAC CAGCTCGGCG CGGTCTACGA GGGGCTGATG
TCGTACACCG GCTTCTTCGC CGACACCGAC CTGTTCGAGG TCGCCAAGGG TGGCGATGGC
AGCAAGGGCT CCTGGGTGGT GCCGGTGGAC CGGGCGGGTG GCATCGCGTC CGGAGACTTT
GTGAAGCTGG CCGACCGGGT GACCGGCGAG CTGCGGCCGG TGGTGCACCG GCAGGGCTCG
TTCGTGTTCC GGCTGGCCGG TCGGGAACGC CAGCAGTCCG CGTCCTACTA CACCCCCGAG
GTGCTGACGA AGTTCACCGT CGGGCAGGCC CTGGAGGAAC TGCTCGACCA GAACGGCGTA
CGCACACCGG CCGCTGACAT CCTCGGCATG ACGATCTGTG AACCGGCGCT CGGCTCGGGA
GCGTTCGCCA TCGAGGCGGT GCGGCAGCTC GCCGAGCAGT ATCTGAAGCG TCGCAAGGAG
GAGCTGGCCG ACGAGGGTAA AACCATCGAC CCCGACGACT ACCCGAAACG CCTGCAAGAG
GTGAAGGCAT ACCTCGCCCT GCACAACGTC TACGGCGTCG ACCTGAACGC CACAGCCGTC
GAGCTCGCCG AGATCACCCT ATGGCTAGAC ACGATGGTCC CGGGCCTGCC GGCGCCCTGG
TTCGGCCTGC ACCTCAAGCG CGGAAACTCT TTGATCGGTG CCCGCCGGGC CGTCTACCGC
CGCAGCCAAA TCGCCGACAA GTCGTGGCTG GGCGCGGTAC CGACGGAGAT GCCGCTGACC
TCGCTTGTCG ACGACATGGC CGCCGGCCGG GTCGGCACCG ACGGCATCCA CCACTTCCTG
CTGCCCGCCG ACGGCTGGGG TTCGGCCGCC GACGCCAAGG AAGCCGCCGC GCTCGCCCCC
GACGCCGCGA AGAAGCTCAA GACCTGGCGG GGAAGCACCA AGACCAAACC CACCAAACAG
ACCATCGACG CGTACGCCGA ACTGGCGCAC CGGGTCGAGT CACTGTGGCA GATCGTCTAC
CGCCGCCTGG ACCTCGCCGA GCAGCAGATC CGCCGACGCA TCCCGGTGTG GGAGGCAGGC
GAGCTGCCCG CCGGTGGAGC GGTGCAGCGC GAGGAGATCG AGAAGGCGCT CAACGACGAG
GACGGCGCCT ACCGGCGGCT ACGCCGGGCG ATGGACGCCT GGACGGGACT GTGGTTCTGG
CCGCTGACCG ACGAGCCCGC CATAGTGGAT GGCCAGGTGA TCGCCCCGCC GAGCGTCGCG
CAGTGGGTGG CCGGCCTGCA GGCGCTGCTG GGCCGCAACC CGGAGCTGCG CAAGCGGAAG
GCGTCCGGGT CGACCCTGAC CGCCGGGATG AGCTGGAGCG AACTCAACGA GGCCGAACAG
GTCGAGATCG GCTTCGCCGG CGCGAAACCC GTCGAGACGG TGCTGCGGGA GCATCCGTGG
CTGGTGGTCT GCGAGCGCAT CGCCGGGCGG CAGGGCTTCT TCCACTGGCA ACTGGACTTC
GCCACGGTGT TCGCCCGGGG CGGCTTCGAC CTCCAGGTCG GCAACCCGCC CTGGGTGCGT
CCCCGCTCGG ACGTGGACGC CCTGCTGGCC GAGGGTGACC CGTGGTGGCA GTTGGCGCTG
AAGCCGACCC AGGCGCAGGT CGCTACCCGT CGGGAGGCGA CCCTGGCACT GCCGGGCATG
ACCGATCTGG TGATCGACGG CACGGCGGAG GTGCTGTCGA TCGCGGCGTT CGTCGGCTCG
GTCGGGCAGT ACCCGCACCT GCAGGGCCTT CAGCCGGACC TGTACCGCTG CTTTATGGAG
GTGATGTGGC GGCACGGGTC AACCCGCGGC ACCATCGGCA TGATCCACCT GAACTCACAC
TTCACCGACG AGAAGGCGGG CCTTCTCCGC ACCGAGCTCT ATCTGCGCCT CCGTCGCCAT
TGGCACTTCG TCAACGAGCT CAAGCTGTTT GAGATTCAGG ATCAAAAGCA CTTCGGCGTC
ACGATCCACG GGTCACGCCA ACATCGAGCG GACTTCACCC AGGCGAATTG GCTCTACCAT
CCCGACACGG CGGTCCGGTC AATGGTGCAC GACGGCTCTG GGCCGGAACC CGGACTGAAG
GACGACGAGG GACGTTGGGA CGTGCGCCCG CACGCCTCGC GGATCACCAC GGTCACTGAC
GAGACGCTGC GCGCGTGGCA CGCCACGATG GAGACCGACG AGGTGCCGAT CCGGCAGACC
CGGATGGTGT ACGCGGTGAA CCAGTCCAGC GCGGCGGTCC TGGAAAAACT GTCCCGCGCC
GGACGAATCG GCGACCTCGG CCTGCGCTTC TCGGCCGGTT GGCACGAGAA GAATGACCGG
ACCAAGGGCT ATTTCGAGTC GGAGTGGGGT ATACCGGACT CCTGGGACGA CGTGATCCTG
CAGGGGCCGC ATCTGTTTGT GTCGACACCC CTCTACAAGG CTCCGAACCC CTCTTTGTTG
CATCACCAAG ACTGGACGTC GACGGACTTG GAGGCGTTGG CGGCCCACGC GATCCCGGCC
ACCGCCTACC AGCCCCGAGG CGACCGCTAC GACTACGACT GCGCCTACAC CGAATGGGGC
GACGAAGACC ACCCCGACCC CGCACGCGAC CACTACCGCA TCGCCTGGCG TCGCATGGCG
GCTAACCAGG GGGAACGCAC CCTGATCCCG GCAATCATTC CGCCTGGTGC CGCTCACGTC
GATGGTGTCA TCTCTGCCGC TGATCCTGGT CGGATGTCGC CGACCCCCGT CCTACAGGCT
GTCCTCGGCT CGCTTGTGTC TGACTTCATG ACGAGGGTCG CGCCAAAGGG CGACATCCGA
GCACCTGCGA TCACTCGGCT GCCGTGGGTC GCCGACGAAG GCATGACCCA TGACGCGCTC
GCCGTTCGCG CGCTCAGGCT TAACTGCATC ACCGAGTCCT ACGCCGCGCT ATGGGCGGAA
GCCTGCACCT CGGCATTCAA CGACGATGAC TGGACCGGTG GATTCGAGCA CGCGCGGCGT
ACTCCCCTCG GGCGGATCGG GCCGGAGTGG ACGCCGGAGA CGCCGCTGCG GATCGCCGCC
GACCGTCGCC AGGCGCTCGT GGAGATCGAC GCGCTTGTTG CGCTGGCGCT CGGGCTGACC
GCCGACGAGT TGTGTTCGAT CTACCGGACC CAGTTCGCGG TGCTGCGCGG ATACGACCGC
AACGTCTACC TCTACGACGC CAACGGTCGG CTGGTGCCCA ACTCGGTGCT CACCGTCTGG
CGCAGAAAGG GCGATCGGAT CACCGCCGAG GAGCGCACCG CCACCAACCA GGCCGGCAAC
ACCTACACCT ATGAGCTGCC GTTCGTCACC CTGGACCGGG AGGCGGACAT GCGCCAGGCG
TACGCGGTCT TCGCGCAGCG CCTGCGGGAG CGTTCGTGA
 
Protein sequence
MSVSDAILVG EGWISEHYFT TDATKESFRA RVLERRKEWD AEAEERRPTP RSRFIEARQE 
LEADLAKLAE LTDVEADLGA TDPQTIADAV ASIQQRLVGI LELREHGLVV GGDGPILRVS
APGITERAPL VVVFAQPVAT VEDVLAKDGR TLAEPVRLHD DSDELTSAAR LASALFGEDD
APDLILILAG RWAVLAERER WAEGRYLAID MQLICERNDT RRGGEIDRAL TCLAAASIAP
DADGNLWWSG VLDESIKHTV GVSKDLREGV RLSIEIIANE VVARRRDQGL DPLPAAEAQP
LAKQALRFLY RILFLLYAEA SPELGVLPVG APEYDRGYSL DRLRELVQVE LATPRARTGT
HLYESLGVLF RLVDKGHQSV EPGDEEALSA PGLTFNPLRA DLFRPSATAH IDAVGLGNAA
LQEVLTHLLL SKERRGRDRG FISYAELGIN QLGAVYEGLM SYTGFFADTD LFEVAKGGDG
SKGSWVVPVD RAGGIASGDF VKLADRVTGE LRPVVHRQGS FVFRLAGRER QQSASYYTPE
VLTKFTVGQA LEELLDQNGV RTPAADILGM TICEPALGSG AFAIEAVRQL AEQYLKRRKE
ELADEGKTID PDDYPKRLQE VKAYLALHNV YGVDLNATAV ELAEITLWLD TMVPGLPAPW
FGLHLKRGNS LIGARRAVYR RSQIADKSWL GAVPTEMPLT SLVDDMAAGR VGTDGIHHFL
LPADGWGSAA DAKEAAALAP DAAKKLKTWR GSTKTKPTKQ TIDAYAELAH RVESLWQIVY
RRLDLAEQQI RRRIPVWEAG ELPAGGAVQR EEIEKALNDE DGAYRRLRRA MDAWTGLWFW
PLTDEPAIVD GQVIAPPSVA QWVAGLQALL GRNPELRKRK ASGSTLTAGM SWSELNEAEQ
VEIGFAGAKP VETVLREHPW LVVCERIAGR QGFFHWQLDF ATVFARGGFD LQVGNPPWVR
PRSDVDALLA EGDPWWQLAL KPTQAQVATR REATLALPGM TDLVIDGTAE VLSIAAFVGS
VGQYPHLQGL QPDLYRCFME VMWRHGSTRG TIGMIHLNSH FTDEKAGLLR TELYLRLRRH
WHFVNELKLF EIQDQKHFGV TIHGSRQHRA DFTQANWLYH PDTAVRSMVH DGSGPEPGLK
DDEGRWDVRP HASRITTVTD ETLRAWHATM ETDEVPIRQT RMVYAVNQSS AAVLEKLSRA
GRIGDLGLRF SAGWHEKNDR TKGYFESEWG IPDSWDDVIL QGPHLFVSTP LYKAPNPSLL
HHQDWTSTDL EALAAHAIPA TAYQPRGDRY DYDCAYTEWG DEDHPDPARD HYRIAWRRMA
ANQGERTLIP AIIPPGAAHV DGVISAADPG RMSPTPVLQA VLGSLVSDFM TRVAPKGDIR
APAITRLPWV ADEGMTHDAL AVRALRLNCI TESYAALWAE ACTSAFNDDD WTGGFEHARR
TPLGRIGPEW TPETPLRIAA DRRQALVEID ALVALALGLT ADELCSIYRT QFAVLRGYDR
NVYLYDANGR LVPNSVLTVW RRKGDRITAE ERTATNQAGN TYTYELPFVT LDREADMRQA
YAVFAQRLRE RS