Gene Sare_2071 details

Gene Information

Locus tag: Sare_2071
Symbol:
ID: 5703282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2371675
End bp: 2375655
Gene Length: 3981 bp
Protein Length: 1326 aa
Translation table: 11
GC content: 71%
IMG OID: 641271557
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001536928
Protein GI: 159037675
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
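
The coordinate and length fields in this record are mutually consistent, which a short calculation can confirm. The Python sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2371675, 2375655)
print(length, protein_length(length))  # 3981 1326
```

The result matches the reported gene length (3981 bp) and protein length (1326 aa).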


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.564639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.531105
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTGC CCCTGATGCC CGAAGCGGTC CTCGACGACG TACTCGACTG CATCGCCGAG 
GTCCTCGGCA CGGATCCGGG GCTGATCGAT GTGGAAGCAC CACTGACAGC CCTGGGTCTG
GAGTCGTTCA CCGCCGTACG GCTCCGCCGG CGGATCCGGG AGCGCACCGG ACACGACCTT
CCGCTGACCG CGTTCCTCGA CAACGCCACC GCGAGGCACG TCACCCGGCA GCTGTCGGGA
GCCCCTGACG AACCGGCCGC TGAATCCGCT GGGCAGCGCA CCGCTGGGCA GCCGGGCGGC
CGGGCGGCCT TCCCCTTGAC TCCGGTCCAG GAGTCCTACC TTGTCGGGCG GGAGTCAGGC
CTGGTCCTCG GCGGGGTGGC CACCTTCTAC TACCACGAGT ACGACCGGAT CAGCGACGAC
GCCGCCACCG ACCTCCAGCG ACTCGACCTG GCCTGGAACC AGTTGGTCGA CCACCACCCG
ATGCTACGGA TGGTGGTCGA CCAGCGTGGC CGTGGGAAGA TCCTGCCCAC AGCCGGTCCA
TACCGCATCG GCGTCACCGA CCTGCGAGGC GCCTCACCCG CACAGGTGGA CGAGTCGTTG
GCCACGCTCC GGCACGAACG CTCCCACCAG GTACGCCCAA CCGGGCAGTG GCCGTTGTTC
GACCTGCACG CGGCGTTCCT ACCCGATGGT CGTACCCGCC TCTACATCGG CTTCGACGTG
CTGATCACCG ACATGGCCGG ATGGATGCTG CTGATGCGGC AGTGGGGACA ACTCGTCGCC
GACCCGACGA CATCCCTGCC CGAGCCTCCC GCCGAGTTCG CCGACCTGCT GCACGCCCGG
GAGACGGATC TGGAGTGGAC CCAGCGGCGG GAACGGGACC GCGCCTACTG GGCAGCCCGC
GTCGCCGAGC TTCCTCCAGC GCCACGCCTA CCGGTGACCC GTGCCGCCGA GTCCACCGTG
CCGCCCCGCT TCGCTCGGCA CGCCGGCCAG CTCGACGCGC AGGCCTGGCG GACACTCCGT
ACCCGGTGTA CCGAGCACGG GGTGACAGCG ACCGCCGCGT TGCTAGCCGC GTTCGCGGTG
ATCCTCGAGC GGTGGGGCGC CGGCGAGCAG GTCTGCCTGA ACACCACCCT CTTCGAGCGT
CCGGAGAAGC CCGAGGGTAT CGACCTGGTG GTGGGCGACT TCACCACCAC CGCGTTGGTC
GGTACCCCGA GAATCGACCC GGCCTCGTGG AACGGATTCG CTGGTTACGC ATCGGAGCTC
AACCGTCGCT TCTGGGAGGA CCTGGACCAC CGGTCGGTCT CCGGAGTTGA TGTGCTGCGG
GGGCTCAGCG ACAGCTCCGG CGCCCCGCCC TACCCGGTCG TCTTCACCAG TGGGGTAGGA
CTCGCCGGTG ACGGCACCGC GGCTCCGGCG AGCTGGCTTG GCGCGGAGGT CTTCGGCGTC
TCGCAGACCC CCCAAGTGCT GCTCGACCAC ATCGTGTGGG ACGAGGATGG CGTGCTCCGA
ATCGCCTGGG ACGGTGTCGT TGACGCCTTT CCCGACGGCT ACCTGCGCAG CATGCTCGAC
GCGTACGTCC GGCTGCTGCA CCGCCTCACC GAGGCCACCG CGTGGAAGGA CCCGAGGCTC
GCCTGGGACC CCTTCGCCCT CCCCGTGGAA CCGTTGGACG TCGACCCGTT CCCTGATGCC
GGCCCGCTGC TGCACGACCC GGCGACCAGC ATCGCCCGCC GTATGCCGGA GAAGCCGGCC
CTGTACACGG GGGGCGCCGT TACCTCGCAC GGCCGGTTGG CCGAGGGCGT CGCCGCGACC
ACTGCCGCTC TTGCCGCCGC CGGTGTCGGT ACCGGCGACC TGGTGGCGGT CGCCTGCGAG
AAGGGGCTGG CCCAGGTCGT CGCGGTCCTG GCGGTCAACG CCGCCGGCGC GGGCTACCTG
CCGGTCGAGC CGTCCTGGCC GGATGCCCGG GTGGCCACGA TCTGTGTCCG TGCCGGAGTG
CGGCACGCCC TGGTGGGCCG GGGCGTTCGG ACCACGTGGC CCGAGGGTGT GTTGACGCAC
CGCCTGACCG CGGCCGGCCG GCCCGGGGGC CGATCAGGAA AGACTGTCTC CGAACCAACA
CCACCGCCAT CGCGACCCGA CCCGGATGAC ACCGCTTACG TCATCTTCAC CTCCGGCTCC
ACCGGGCAGC CGAAGGGCGT CGAGATCCAG CACCGCGCCG CCCGCACCAC CATCGACGAC
ATCGTCGACC GCTTCGGCGT CCACGCCGAC GACCGGGTGC TGGCGCTGTC CGCGCTCAGT
TTCGATCTCT CCGTCTTCGA CATCTACGGC GTTCTCGGGG CGGGCGGGGC GCTGGTCCTG
CCCGACCCGG CCCGACAGCG CGATCCGCAA CACTGGCTGG AACTGGCCGA GCGGCATGGC
GTCACGGTGT GGAACACCGC TCCCGCGCTG TTGGAGATGC TCGTCGAGTA CGCCGAAATC
GAACCGGGGG CCGCGACCCG GGCGCTGCGC GCCCTACGGC TGGTGATGCT CTCCGGCGAC
TGGATCCCGT TGACCCTGCC CGAACGCCTG CGTCGGCTCG CCCCACAGGC CCAGCTGATG
AGTCTGGGCG GTGCGACCGA GGCGTCCATC TGGTCGATCA CCTACCCGGT GGTGGACGTC
GCCCCGGGGT GGCGGAGTAT CCCCTACGGT CGGGCGTTGC GGGCCCAGTC CTTCCACATT
CTCGACCCGG ATGGCCGGCC ATGCCCGGTG GGTGAGCCGG GGGAACTGTT CATCGGCGGC
GATGGGCTCG CCCGGGGATA CATTGGCGAT CCGGAGCAGA CGGCGCATCG CTTCGCCCGG
CACCCGCTGT TGGGTGAGCG GCTGTACCGC ACCGGAGACC TGGGACGCTG GCGGACCGAC
GGGAACATCG AGTTCCTGGG ACGCGTCGAC CGGCAGGTCA AGATCCGCGG ACATCGGATC
GAACTCGGTG AGATCGAGGC CGCGCTCGGT CGGCACCCGG CCCTGCGGCA GTGCATCGTG
GCCGCGGTGC GTGGTGCCGA CGAGCGCCCC CGCTTGGCCG CCTACGTGGT GCCGAAGGCG
GGAGATGCCG TCCCGACCGC CGACGAATTG GCCGGGACGC TGCGCGAACG GCTGCCCGAC
TACATGGTTC CGAGCAAGTT CCTTGTGCTC GACTCGCTGC CGGTCACCCC AAACGGCAAG
ATCGACTACG CGGCACTACC CAATCCGTAC CAGGCGGGGG ACGCCGACGT GCAGCCACGT
CACGCGACGC TCTCGCCGCC CGCGTCGCCG GTAGTTCCCC CGGTCACGTC CGCCCCACGC
TTCGTCGACT GGGCCGGTAC GGCGGTGGCC GAGGCGGAGG CACTCGGCCT CGAGGTCGCA
CTCGTCGTCC GCCCGGGGCG AATGTCACCG GCGCAGGCCC TCGTCGCGGC GACGCGCTGG
CTCGACCGGG TCCACTCGGC AGGCGCCACC GCACTGGTCG AGCGGATCCC CGCTGACGGC
CTCATCGAAC TCGCTCAGCA GACCACCGAT CTACCGCAGG TCGACGTCCT TCCTCCAGCT
CCGCTGAGCA TGCCGGCCCC ACAGAGTGTG TCGGTCGCCG AACCGGCAGC AGCCGCGCTG
AACCGGACCG GGCCGGAGCC TGCAGGTCAC CCGGTCGCCG ATGGTCGTAC CGCTGATCCG
GCCGCCACCG ACGCGGTCAT CGCGGTGCTC GCAGACCTGA CGGGGGAGCC GGTTCGGGCG
GACAGCACCT TCGCCGCACT GGGCGTGACC TCGCTCACGC TGGTGCTGAC GCACCGCAGG
CTGCGGGAGA GCATCGCGCC CCAACTCGCG CTCGCGGACA TGTTCGCCCA TGCCACCGTG
GCCTCCCTCG CCGCGCACCT CACCGCCCTG GCCGCTCCGA GGAACACCCG AGCACCCGGG
CCGACGGCCG CCCCGGTGCC GTCCGCCCGA CGCTCGTCCC GGCTGGCCGC CCGTGTCCGG
GCCCAGGAGG TGGACCGGTG A
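
The record reports a GC content of 71% for this gene. How the database derived that figure is not stated. The Python sketch below shows one plausible way to recompute it from a sequence laid out in whitespace-separated blocks like the one above. The helper name `gc_fraction` is hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases in seq; whitespace is ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence, blocks of ten bases as displayed on the page.
first_line = "ATGAACCTGC CCCTGATGCC CGAAGCGGTC CTCGACGACG TACTCGACTG CATCGCCGAG"
print(f"GC fraction of the first 60 bp: {gc_fraction(first_line):.0%}")
```

Passing the full 3981 bp sequence instead of `first_line` gives the gene-wide value to compare against the reported figure.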
 
Protein sequence
MNLPLMPEAV LDDVLDCIAE VLGTDPGLID VEAPLTALGL ESFTAVRLRR RIRERTGHDL 
PLTAFLDNAT ARHVTRQLSG APDEPAAESA GQRTAGQPGG RAAFPLTPVQ ESYLVGRESG
LVLGGVATFY YHEYDRISDD AATDLQRLDL AWNQLVDHHP MLRMVVDQRG RGKILPTAGP
YRIGVTDLRG ASPAQVDESL ATLRHERSHQ VRPTGQWPLF DLHAAFLPDG RTRLYIGFDV
LITDMAGWML LMRQWGQLVA DPTTSLPEPP AEFADLLHAR ETDLEWTQRR ERDRAYWAAR
VAELPPAPRL PVTRAAESTV PPRFARHAGQ LDAQAWRTLR TRCTEHGVTA TAALLAAFAV
ILERWGAGEQ VCLNTTLFER PEKPEGIDLV VGDFTTTALV GTPRIDPASW NGFAGYASEL
NRRFWEDLDH RSVSGVDVLR GLSDSSGAPP YPVVFTSGVG LAGDGTAAPA SWLGAEVFGV
SQTPQVLLDH IVWDEDGVLR IAWDGVVDAF PDGYLRSMLD AYVRLLHRLT EATAWKDPRL
AWDPFALPVE PLDVDPFPDA GPLLHDPATS IARRMPEKPA LYTGGAVTSH GRLAEGVAAT
TAALAAAGVG TGDLVAVACE KGLAQVVAVL AVNAAGAGYL PVEPSWPDAR VATICVRAGV
RHALVGRGVR TTWPEGVLTH RLTAAGRPGG RSGKTVSEPT PPPSRPDPDD TAYVIFTSGS
TGQPKGVEIQ HRAARTTIDD IVDRFGVHAD DRVLALSALS FDLSVFDIYG VLGAGGALVL
PDPARQRDPQ HWLELAERHG VTVWNTAPAL LEMLVEYAEI EPGAATRALR ALRLVMLSGD
WIPLTLPERL RRLAPQAQLM SLGGATEASI WSITYPVVDV APGWRSIPYG RALRAQSFHI
LDPDGRPCPV GEPGELFIGG DGLARGYIGD PEQTAHRFAR HPLLGERLYR TGDLGRWRTD
GNIEFLGRVD RQVKIRGHRI ELGEIEAALG RHPALRQCIV AAVRGADERP RLAAYVVPKA
GDAVPTADEL AGTLRERLPD YMVPSKFLVL DSLPVTPNGK IDYAALPNPY QAGDADVQPR
HATLSPPASP VVPPVTSAPR FVDWAGTAVA EAEALGLEVA LVVRPGRMSP AQALVAATRW
LDRVHSAGAT ALVERIPADG LIELAQQTTD LPQVDVLPPA PLSMPAPQSV SVAEPAAAAL
NRTGPEPAGH PVADGRTADP AATDAVIAVL ADLTGEPVRA DSTFAALGVT SLTLVLTHRR
LRESIAPQLA LADMFAHATV ASLAAHLTAL AAPRNTRAPG PTAAPVPSAR RSSRLAARVR
AQEVDR
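
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. The Python sketch below is a minimal illustration of that relationship, not the database's own method. It builds the codon table from the NCBI base ordering (T, C, A, G), in which the codon-to-amino-acid assignments for table 11 are the same as in the standard code. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
print(translate("ATGAACCTGC CCCTGATGCC CGAAGCGGTC CTCGACGACG TACTCGACTG CATCGCCGAG"))  # MNLPLMPEAVLDDVLDCIAE
print(translate("GCCCAGGAGG TGGACCGGTG A"))  # AQEVDR
```

Both outputs agree with the start and end of the protein sequence shown above. The final codon TGA is the stop codon and is not represented in the 1326-residue protein.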