Gene Sare_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2962 
Symbol 
ID5707792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3366380 
End bp3370450 
Gene Length4071 bp 
Protein Length1356 aa 
Translation table11 
GC content72% 
IMG OID641272411 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001537779 
Protein GI159038526 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.639198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00173398 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGACG ACAGTTTCGA AGGCCGGGTC GCCCAGCTCT CCGAACGCCG GCGGTTGCTG 
CTCGAACGGT TGCGGCAACA GCGGCAGGGA CCGGGCCGAC CGGCCGCGAT CCGGCCCCGG
CCGAGCGGCG TCGACCGGGT GCCCCTTTCG CAGGCGCAGG AACAGCTGTG GTTCCTCGCC
CAACTCGCCC CAGACGAACC CACGTACAAC CTGGTGCAGG CCTCGTACCT CGTCGGGCCG
CTCGACCTCG TCGCGCTCCG GCGGGCGCTC GACGAGGTGG TGCGCCGGCA TGAGGCACTG
CGCACCGTCA TCGAGTCCAC CGATGACACG GCGTACCAGG TGGTGTGCTC GCCGGGACCT
GCCGCGCTGG AGATCGACGA CGTCAGCGTG CTGCCGGCGA GCGAACGGCG GCACGCGGCA
CTGCTGCTGT TGCAGGATCA CGAGGTGCAC CGTCCGTTCG ACCTGGCCAG CGGGCCGCTG
TTCCGGGCTC GGCTGGTGCG GCTCGGTCCG ACCGAGCACG CGCTGGCCCT CGCCGTGCAC
CACAGCACCG CCGACGGTTG GTCCATGGGT CTGATCATCC GCGAGCTGAG CACGCTGTAC
GCGGCGTACC GATCCGGGAC CGCGGCGGAC CTACCGGCGC CGGCGCTGCA GTTTGCGGAC
TTCGCCGCAT GGCAGCGGCG GCAGCTGCGG GCGGACGCCA TGGAGCGGCA CCTTCGGTAC
TGGCAGGACC GGTTGGTCGA TCTGCCCACC CTGGACCTGC CGACCGACCG GTCCCGACCG
GCTGCCACGT CGTTCCGCGG CGCCCTGCTG GAGCAGCCAA TCGATCGCAC GCTGCACACG
GCCGTGCAGG CGATGGCCAA GGAGACCGGC GCCAGTTCCT TCATGGTCCT CGTCGCGGCG
TTCGGAGCGA CCCTCGCCCG CTACACCGGC CAGGAGGACC TGGCCATCGG CACCGCCTTC
AGTGGCCGTG GCCGGCCGGA ACTGGAGAAG GTGGTGGGTT TCTTCGCCAA CATGGGCGTG
CTGCGGATCG ACGCCTCAGG CAACCCGACC TTCGCCACCC TGGTCGACCG GGCACGGGAC
ACCTGCCTCG GGGCGTGGGA GCACCAGGAT GCGCCGTTCG AGCGGGTCGT GCAGCGGGTC
GCGCCGCTGC GCGACCCCAG CCGCAATCCG TTGTTCCAGG TGGCGGTGCA GATGCTGACC
TCGTCCACCT GGGGTGGTCC GGGCCTGCCG GGCACCGACA GTTTCCCGGT GGACCTGCGG
CTGGAACGAT CCCGGTTCGA CCTGACGGTC AGTCTCGTCG ACCACGGTGA CCGGTACTCG
ATACTCGCCG AGTACTCCAC CGACCTGTTC GGGCGGCAGC GCATCCAGCG ACTGTTGGTC
CACTTCGAAC GGGTGTTGGC GGCTGGGCTC GCCGATCCGA CCCGTCGATT GTCCGAGTTC
CCCCTGCTCA CCGAGGAGGA ACGACGGGAG GTCCTGGCGT TCGGCGTCGG TGAGCGGCGG
CCGGTGCCGC GGACCACCGC GCTGCGGATG TTCGCCGAGC AGGCGCGGCG CCGCCGCGAC
ACCGTCGCCG TCCGGCACGA CGGCCTGGAC CTGAGCTACC GGGCGCTGGA CGACCGGGCG
CGGCGGCTGG CCGGCCGCCT GCGCGCCGCC GGGGTACGCC CGAAGGACCC GGTGCCGGTG
CTGCTGGACC GGGGGTTCGA CGAGGTGGTC GCGCCGCTGG CCATCTGGTA CGCCGGCGCG
GTCCACGTCC CCCTGGACAC GGCCGCCCCA CCGAACCGAC TGCGCCGGAT CATCACGAAC
ACCGGCGCAC GCCTCGCCGT CACCCGGACC GAGTACGCGG CGCGGATGCC CACGGACGGG
CCTTGGCGGG TGCTCCACCT GGACGACCGC GATCCGGAGG TCGACGCGCC GCGCACCGTT
GACGACCTGG CGGCGTCGTC GACGGGGCTC GACGACGTCG CCTACATACT GCACACCTCG
GGCTCGACCG GCGACCCGAA GGGGGTGCAG ATCGACCACG CCGGGTTGGT CAACTACCTG
GACTGGATGG TCGGCGAGTG GCGGTGTGGA CCCGGTGACC GGATCCTGCA CGCGGGGGCG
CCGATCTTCG ACCTGGCGGC CGGGGAGACG CTTGCCGCCC TGACCTCCGG CGCGACCCTG
GTGGTGATCG GCAAGGAGCA GCTGCTCTCG CCGGACGGGC TGGTCGAGGT GCTGTCCCGG
GAACAGATCA CCCACCTGCT CCTCACCCCG ACCGGGCTCA GCCTGGCCGA CGCCGACCCC
GACCGCCTAC CCGACCTGCG TGAGGTCTTC GTTGCTGGTG AGGTGTGCTC CGCCGAACTG
GCGGTTCGGT GGTCCCGGCC CGGCCGGTGC CGCCTGGCCA ACCTGTACGG TCCGACCGAG
ATCACCATCG CCAACACGGC CTACGACTGC ACCGGCTGGT CGTCGGCGGA GCCACCGCCC
ATCGGCAGGT CGCTTCCCAA CCGCCACCTC TACCTGCTCG ACCGGTGGGG CCAGCCGGTG
CCGGCGGGGG TGCCCGGCGA GATCGTCGTC GGGGGCGTCG GGGTCAGCCG CGGCTACCTC
AACGAGCCGG AGCTGACCGC CCGCACGTTC ACCGACGATC CCTTCGCCCC GGGCGCTCGG
GTGTACCGGA CCGGCGACCG AGGGGTGTGG ACGGACGACG GGCTGCTTCG CTTCGTCGAC
CGGCTCGACG GGCAGGTGAA GCTGCGCGGG CTGCGGATCG AGCTGGCGGA AGTCGAGACC
ACACTGGCCC GGCACGAGGA CGTCGACCAG GTGGCCGCGA CGGTCGTCCG AGACGGTTCG
GGGACGCAAC GGCTCGTCGC CTACGTGGTG CCCGTCGCCG ACCAGATCGA CGCCGCAGCG
CTCCGCGCGT ACGCGGCCGA GGAGTTGCCC GCGCACATGG TGCCGGGTCA GGTCCTGCAC
CTGTCGGCAC TGCCGCTGAC CGGCTCCGGC AAGATCGACC GCCGGGCGTT GCCCCCGCCC
GCCCCGGACG GGGCGGAGGT CGAGGATCGG GCGATGCCGG CCGACCCGGC CGAACGGCAG
GTGGCGGCGG TGTTCGCCGA GGTTCTCGGC GTGCCCTCGG TCGCGGTGGA CCGATCCTTC
TTCGACCTGG GTGGCCACTC GTTGCAGGCG GCGTACGTCC TGGCGCGGAT CGCCCGGCAG
ACCGGCGTCA CGATCGGCCT CAAGCAGTTC TATGCGGATC CGACCGTCCG GACCCTGGCC
GGGCTGGTCG GTCGGGGTTC TGCCGCCGAC GCCGGCCGGT CGCCGCTGGT CACCCTCAAG
GCGGAGGGCT CCCGGCCCCG GCTGTACTGT CTGCACGCCG TGTCCGGCTC GCCCTACTGG
TACCTGCCGC TGAGCCGGGC CCTGCACCCC GAGCAGCCGT TGGACGGCTT CGAGGCACCC
GGTCTGGAGG GCGACGCCGA GCCGGTGGAG GACCTGACTG CCCTCGCGGC CCGGTACGTC
GATGCCCTCC GCGAGCGGCA GCCCGCCGGC CCGTACCTGT TGGCCGGCTG GTCAATGGGC
GGTTTCCTGT CGTTCGAGAT GGCCCGTCAG CTCGCGGCCG TGGGTGAGTC CCCGGCGCTG
GTGGCGATGA TCGACTCCAA CGAGCCCGGT CCGCTGCCGC TGCCCAGCGA GCAGGAGGTG
ATGGAGACCT TCGTCAGCGA CCTCGGCGGC CTCGCTGGGA CGGCCCCGCC GGTGCTACCG
GCCGAGATCG CCCGGGCGGC ATCCACCGAC CCCGCGGTAC TCACCGCCTT CCTGGTAGAG
CACGGGATGG TTCCCGCCGA CGTACGCGCC GACTTCGTGT CCCACCGGTA CCGAGTGTTC
CGCGCCAACA TGCGGGCGGT CTACGGCTAC CGGCCCGGGC CCTACGCCGG CCGGGTCGTG
ATGGTCCAGG CAGCGGAGGA ACCGAGCCGG GCGGCATGGG CCCGTCACGC AGGTGCGATG
GAGACGGTCA CCCTTCCGGG CAACCACTAC TCGCTCTGGT CCGCCGCGCA CCTGCCGGGA
CTCGCCGCGA TGATCGACGC CCGGGTCGGG GAGGCGATGG CCGGCGACTG A
 
Protein sequence
MPDDSFEGRV AQLSERRRLL LERLRQQRQG PGRPAAIRPR PSGVDRVPLS QAQEQLWFLA 
QLAPDEPTYN LVQASYLVGP LDLVALRRAL DEVVRRHEAL RTVIESTDDT AYQVVCSPGP
AALEIDDVSV LPASERRHAA LLLLQDHEVH RPFDLASGPL FRARLVRLGP TEHALALAVH
HSTADGWSMG LIIRELSTLY AAYRSGTAAD LPAPALQFAD FAAWQRRQLR ADAMERHLRY
WQDRLVDLPT LDLPTDRSRP AATSFRGALL EQPIDRTLHT AVQAMAKETG ASSFMVLVAA
FGATLARYTG QEDLAIGTAF SGRGRPELEK VVGFFANMGV LRIDASGNPT FATLVDRARD
TCLGAWEHQD APFERVVQRV APLRDPSRNP LFQVAVQMLT SSTWGGPGLP GTDSFPVDLR
LERSRFDLTV SLVDHGDRYS ILAEYSTDLF GRQRIQRLLV HFERVLAAGL ADPTRRLSEF
PLLTEEERRE VLAFGVGERR PVPRTTALRM FAEQARRRRD TVAVRHDGLD LSYRALDDRA
RRLAGRLRAA GVRPKDPVPV LLDRGFDEVV APLAIWYAGA VHVPLDTAAP PNRLRRIITN
TGARLAVTRT EYAARMPTDG PWRVLHLDDR DPEVDAPRTV DDLAASSTGL DDVAYILHTS
GSTGDPKGVQ IDHAGLVNYL DWMVGEWRCG PGDRILHAGA PIFDLAAGET LAALTSGATL
VVIGKEQLLS PDGLVEVLSR EQITHLLLTP TGLSLADADP DRLPDLREVF VAGEVCSAEL
AVRWSRPGRC RLANLYGPTE ITIANTAYDC TGWSSAEPPP IGRSLPNRHL YLLDRWGQPV
PAGVPGEIVV GGVGVSRGYL NEPELTARTF TDDPFAPGAR VYRTGDRGVW TDDGLLRFVD
RLDGQVKLRG LRIELAEVET TLARHEDVDQ VAATVVRDGS GTQRLVAYVV PVADQIDAAA
LRAYAAEELP AHMVPGQVLH LSALPLTGSG KIDRRALPPP APDGAEVEDR AMPADPAERQ
VAAVFAEVLG VPSVAVDRSF FDLGGHSLQA AYVLARIARQ TGVTIGLKQF YADPTVRTLA
GLVGRGSAAD AGRSPLVTLK AEGSRPRLYC LHAVSGSPYW YLPLSRALHP EQPLDGFEAP
GLEGDAEPVE DLTALAARYV DALRERQPAG PYLLAGWSMG GFLSFEMARQ LAAVGESPAL
VAMIDSNEPG PLPLPSEQEV METFVSDLGG LAGTAPPVLP AEIARAASTD PAVLTAFLVE
HGMVPADVRA DFVSHRYRVF RANMRAVYGY RPGPYAGRVV MVQAAEEPSR AAWARHAGAM
ETVTLPGNHY SLWSAAHLPG LAAMIDARVG EAMAGD