Gene Sare_2330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2330 
Symbol 
ID5704254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2680051 
End bp2683260 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content67% 
IMG OID641271808 
ProductVioB - polyketide synthase 
Protein accessionYP_001537179 
Protein GI159037926 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0866221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0021675 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGTCT TCGACCTACC ACGGCTGCAC TTCCGTGGAT CGGCGACGAC GCACCTGCCC 
ACCGGGCCAC GCAACGGCCT GGTGGACCTG GCGACGAACA CGGCGCTCAC CGAGGACGGC
AGGGCCTTTC CCGTAGCGAG TCCGGCTCAC GCCTACCACG ACTACCTCGA CCAGGTGGGA
CCACGCTTCG ACCTCGCCGG GCGTCCCTGC GCCGACGGAC CCTTCAGCGT GGCCAAGGGC
AGGGACTTCG CCGGCAACGG ACACTTTTCG GTGGACGCGC GGATCGTCAG CGTCGAGGTC
CGCACGAGCG AGATCAACAC CGTGGACCCG GTGGTCGGAC GCACCGTTGA CATGTGGGGG
CACTACAACG AGTACCTCGG CACGACGGTG AACCGGGCCC GGGTCTTCGA TGTCGACCCG
GCCTCCAACC GGACCACGAC GCTGATGGTC GGGCAGTTTG GCTTCGGGCG CGACGGGCGG
TCCCACGACG TCGGCTACCT CGCCACCGGC CGCGTCCACG GGTTCGTGCC GCCACGGTGG
CACAACGCGG ACCACGCCGT CGACATCGAC GACCACTGGC AGGCCAACGA CCTGCGCCGG
TCCGTCGTGC ACCAGTTCGT GGTGACGGCG GACGAACTGA CCTGGCTCGA CGAGCCGGCA
GCCTCCCCGG CCGTACGTCT GCTCAGGGAC ACCGACGCAA GCGGCCTGGT GGTGCAGTTC
TCGCTCAGCC GCATGTCCGT CCCGTCCGCG CCCGACCAGC CGAGCCGGTG GCAGCTCAAC
GGGACGATCG CCCCCTGGTA CCCGCACGAA CCGCGCACCT ACCCGGCGGG TCGCCTACTC
GTCCCGGACA GCCGAGGTCC GAGCCGCACC GACGGCCGGC TGCACAACCT CTCGGTCGAG
CTGACCGACA CCCACGCCAT CCTCAATATG ATCACGGCGG TGCCCACCGT AGGGACCGGG
CCGGTCGATG TCGGAGATCT CGAACTGCGT ACCGCCGGCG ACGGTCGACT GGTGGCCCGC
CTGCCCCGGG AGGCGTATCT CGGGCAGGAG TACCTCCTGG CCTGCGGCCT GGTGACCGTT
CCCATCGAGA TGTCCGCCGA GGCCGCCTCC GAGGAGCCGT TGAGCCTGGT TTCCCGTAGA
CCCGGCGGAC CGGTGCGACA GCTACGGGAA CGAGAGGTCA ACGTACAGGT CGACGAATCT
GCGCTCATCC TCGAGCACCC ACGAGACGCC GAGGACGCGC ACCACGACGT TGAGGTTCCC
GTGCGGGCCT TCGTCCGGGG CCGCCCGGGT GCCGTCCACG AGATCGCGGT ACGGCAGTTC
TTCAATCCCC GGGCTCTGCC AGGTGAGGCT GCGGCCCGTT CCCCCAAGGC CCGCTGCTCC
GACATCGACG TGCTCAGGCT GCGTCCGGGA CGGCTGGACG AGGCCGGAGG CTGGTCGAGC
GCCTGCGTTC TGGGTACCGA CCAGGCGGGG TGTGGATGGT TCACCATGCG GGGTGCGACG
GCCGGTACGG CCCTGGTCCT GCTGTCCACC GACGCCGACG ACCTGCCCTG CGATCCGGAG
GCACCCGGAT CCGCCACGCT GGCCTACGAC CACGACGACG TGCTCGGATA CTGGCCTGGC
GCAGGCTACC TGTCGATCCG TGTCCTACCC GACGACTGGC GGCTGGCCGG GCTCGAGCAG
AAGGACGTGA CGTTCGAGCT GGTCTACCGG GAGGTTTTTG CCTTCTACGA GCATCTGTTC
TCCTTCATGA AGGCAGAGGT CTTCAGCCTG GCGGACCGGT GCAAGGTGGA GACGTACGCA
AAGTTGATCT GGCAGATGTG CGATCCCCGC AACAAGGCCA AGACCTACTA CATGCCGCCG
ACGCGAGACC TGTCCGAGCC GAAGGCCCGG CTCCTCCTAA AGTACCTACG CGCGCAGCAG
GTACCGGATG CGGTCCTCCT GACGGCGCCG GCCACGAATC GCCGTAGCCG CAGGATAGCC
ACACGTGATC AACTCGTCCG CGTGCTGCGT GAGGCGGCAA AGGTCGAGTT GGCCGTCATG
CTCCAGTACC TGTACGCAGC CTATTCCGTG CCCGCCTACG GTGTCGGGCT GGAGTACGTG
CGGCAGGGTG GGTGGACGCG CGAGCAGTTG CGGCTGGCCT GTGGTGACGG TGGTGAAACC
CTCGACGAGG GTGTTCGCAG CATGCTGCTG AACATCGCCC GCGAGGAAAT GATCCACTTT
CTTCTGGTCA ACAACATCCT CGTCGCGATG GGTGAGCCCT TCCACGTGCC GTGGATTGAC
TTCGCCACGA TCAACCACGA GCTCCCAGTA CCGCTGGACT TCTGCCTCGA AGGCATGGGA
ATCGGTAGCG TGGAGCGTTT CACCATGATC GAACGGCCCG AAAGCCACGT ACCCGACGTG
CTGGGAACTG GTGGCGCGAC GGCCCAGGAC GACGGACACA CCTACGCCTC GCTGAGCGAC
CTGTACTCCG CAATCCGGGA GGGCCTGCAG CACATCCCCG GCCTCTTTCT GGTCGACAAG
GGTCGGGGAG GCGGTGAACA CCACCTCTTC CTGCGGGAGT CAATCAACAG TCGCCACCCC
GACTACCAGC TTGAGGTCGA CGATCTGTCG AGCGCCCTGT TCGCGATCGA CGTCATCACC
GAGCAAGGGG AGGGCGGAGT ACTCACCGCC GGACCTGACG AGGTGTCGCA CTACACCTCG
TTCCTGCGGA TCGGCGAACT CCTCCGTGGC GCAGTGGCCC CGGGCGGGGA TCCGTGGCAC
CCGGCCTATC CCGTGCTGCG CAACCCGACG CTTCGGCATG GCGAACGGGC CATGGAGACA
GTCACCGATC CAGATGCCCG AACGGTCATG GCGCTGTTCA ACCGCGCGTA CTTCATGGCA
CTTCAGCTCA TGGCGCAACA CTTCGGTGAA CGCCCCGACG GGAGCCTGCG GCGGTCCGAC
CTGATGAACG GGGCGATCGA CATGATGACG GGTCTGATGC GCCCGCTTGC TGAACTGCTG
GTGACCATGC CGTCGGGACG GCGTGGCAGG ACCGCTGGAC CGTCGTTCGA ACTGGTGGAG
CAGCCCACCC CGGTGTCCCG CCCCGAGGTG GCCCGGCGGG GCATCGCCCT GCGTCTCGAC
GACCTCGCGG CAGAGTGCGG CAAGTCCGCC CTGGTGCCGA CCCGGGTGGG CGAGATGAGC
GCGTTCTGGG CCGACCACTT CCGGCCGTGA
 
Protein sequence
MSVFDLPRLH FRGSATTHLP TGPRNGLVDL ATNTALTEDG RAFPVASPAH AYHDYLDQVG 
PRFDLAGRPC ADGPFSVAKG RDFAGNGHFS VDARIVSVEV RTSEINTVDP VVGRTVDMWG
HYNEYLGTTV NRARVFDVDP ASNRTTTLMV GQFGFGRDGR SHDVGYLATG RVHGFVPPRW
HNADHAVDID DHWQANDLRR SVVHQFVVTA DELTWLDEPA ASPAVRLLRD TDASGLVVQF
SLSRMSVPSA PDQPSRWQLN GTIAPWYPHE PRTYPAGRLL VPDSRGPSRT DGRLHNLSVE
LTDTHAILNM ITAVPTVGTG PVDVGDLELR TAGDGRLVAR LPREAYLGQE YLLACGLVTV
PIEMSAEAAS EEPLSLVSRR PGGPVRQLRE REVNVQVDES ALILEHPRDA EDAHHDVEVP
VRAFVRGRPG AVHEIAVRQF FNPRALPGEA AARSPKARCS DIDVLRLRPG RLDEAGGWSS
ACVLGTDQAG CGWFTMRGAT AGTALVLLST DADDLPCDPE APGSATLAYD HDDVLGYWPG
AGYLSIRVLP DDWRLAGLEQ KDVTFELVYR EVFAFYEHLF SFMKAEVFSL ADRCKVETYA
KLIWQMCDPR NKAKTYYMPP TRDLSEPKAR LLLKYLRAQQ VPDAVLLTAP ATNRRSRRIA
TRDQLVRVLR EAAKVELAVM LQYLYAAYSV PAYGVGLEYV RQGGWTREQL RLACGDGGET
LDEGVRSMLL NIAREEMIHF LLVNNILVAM GEPFHVPWID FATINHELPV PLDFCLEGMG
IGSVERFTMI ERPESHVPDV LGTGGATAQD DGHTYASLSD LYSAIREGLQ HIPGLFLVDK
GRGGGEHHLF LRESINSRHP DYQLEVDDLS SALFAIDVIT EQGEGGVLTA GPDEVSHYTS
FLRIGELLRG AVAPGGDPWH PAYPVLRNPT LRHGERAMET VTDPDARTVM ALFNRAYFMA
LQLMAQHFGE RPDGSLRRSD LMNGAIDMMT GLMRPLAELL VTMPSGRRGR TAGPSFELVE
QPTPVSRPEV ARRGIALRLD DLAAECGKSA LVPTRVGEMS AFWADHFRP