Gene Information |
Locus tag | Sare_2962 |
Symbol | |
ID | 5707792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3366380 |
End bp | 3370450 |
Gene Length | 4071 bp |
Protein Length | 1356 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272411 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001537779 |
Protein GI | 159038526 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins; [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
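The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Sare_2962.
START_BP = 3366380
END_BP = 3370450


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 4071, matching "Gene Length"
print(length_aa)  # 1356, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields reported in the table.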
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.639198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00173398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCGACG ACAGTTTCGA AGGCCGGGTC GCCCAGCTCT CCGAACGCCG GCGGTTGCTG CTCGAACGGT TGCGGCAACA GCGGCAGGGA CCGGGCCGAC CGGCCGCGAT CCGGCCCCGG CCGAGCGGCG TCGACCGGGT GCCCCTTTCG CAGGCGCAGG AACAGCTGTG GTTCCTCGCC CAACTCGCCC CAGACGAACC CACGTACAAC CTGGTGCAGG CCTCGTACCT CGTCGGGCCG CTCGACCTCG TCGCGCTCCG GCGGGCGCTC GACGAGGTGG TGCGCCGGCA TGAGGCACTG CGCACCGTCA TCGAGTCCAC CGATGACACG GCGTACCAGG TGGTGTGCTC GCCGGGACCT GCCGCGCTGG AGATCGACGA CGTCAGCGTG CTGCCGGCGA GCGAACGGCG GCACGCGGCA CTGCTGCTGT TGCAGGATCA CGAGGTGCAC CGTCCGTTCG ACCTGGCCAG CGGGCCGCTG TTCCGGGCTC GGCTGGTGCG GCTCGGTCCG ACCGAGCACG CGCTGGCCCT CGCCGTGCAC CACAGCACCG CCGACGGTTG GTCCATGGGT CTGATCATCC GCGAGCTGAG CACGCTGTAC GCGGCGTACC GATCCGGGAC CGCGGCGGAC CTACCGGCGC CGGCGCTGCA GTTTGCGGAC TTCGCCGCAT GGCAGCGGCG GCAGCTGCGG GCGGACGCCA TGGAGCGGCA CCTTCGGTAC TGGCAGGACC GGTTGGTCGA TCTGCCCACC CTGGACCTGC CGACCGACCG GTCCCGACCG GCTGCCACGT CGTTCCGCGG CGCCCTGCTG GAGCAGCCAA TCGATCGCAC GCTGCACACG GCCGTGCAGG CGATGGCCAA GGAGACCGGC GCCAGTTCCT TCATGGTCCT CGTCGCGGCG TTCGGAGCGA CCCTCGCCCG CTACACCGGC CAGGAGGACC TGGCCATCGG CACCGCCTTC AGTGGCCGTG GCCGGCCGGA ACTGGAGAAG GTGGTGGGTT TCTTCGCCAA CATGGGCGTG CTGCGGATCG ACGCCTCAGG CAACCCGACC TTCGCCACCC TGGTCGACCG GGCACGGGAC ACCTGCCTCG GGGCGTGGGA GCACCAGGAT GCGCCGTTCG AGCGGGTCGT GCAGCGGGTC GCGCCGCTGC GCGACCCCAG CCGCAATCCG TTGTTCCAGG TGGCGGTGCA GATGCTGACC TCGTCCACCT GGGGTGGTCC GGGCCTGCCG GGCACCGACA GTTTCCCGGT GGACCTGCGG CTGGAACGAT CCCGGTTCGA CCTGACGGTC AGTCTCGTCG ACCACGGTGA CCGGTACTCG ATACTCGCCG AGTACTCCAC CGACCTGTTC GGGCGGCAGC GCATCCAGCG ACTGTTGGTC CACTTCGAAC GGGTGTTGGC GGCTGGGCTC GCCGATCCGA CCCGTCGATT GTCCGAGTTC CCCCTGCTCA CCGAGGAGGA ACGACGGGAG GTCCTGGCGT TCGGCGTCGG TGAGCGGCGG CCGGTGCCGC GGACCACCGC GCTGCGGATG TTCGCCGAGC AGGCGCGGCG CCGCCGCGAC ACCGTCGCCG TCCGGCACGA CGGCCTGGAC CTGAGCTACC GGGCGCTGGA CGACCGGGCG CGGCGGCTGG CCGGCCGCCT GCGCGCCGCC GGGGTACGCC CGAAGGACCC GGTGCCGGTG CTGCTGGACC GGGGGTTCGA CGAGGTGGTC GCGCCGCTGG CCATCTGGTA CGCCGGCGCG GTCCACGTCC CCCTGGACAC GGCCGCCCCA CCGAACCGAC TGCGCCGGAT CATCACGAAC 
ACCGGCGCAC GCCTCGCCGT CACCCGGACC GAGTACGCGG CGCGGATGCC CACGGACGGG CCTTGGCGGG TGCTCCACCT GGACGACCGC GATCCGGAGG TCGACGCGCC GCGCACCGTT GACGACCTGG CGGCGTCGTC GACGGGGCTC GACGACGTCG CCTACATACT GCACACCTCG GGCTCGACCG GCGACCCGAA GGGGGTGCAG ATCGACCACG CCGGGTTGGT CAACTACCTG GACTGGATGG TCGGCGAGTG GCGGTGTGGA CCCGGTGACC GGATCCTGCA CGCGGGGGCG CCGATCTTCG ACCTGGCGGC CGGGGAGACG CTTGCCGCCC TGACCTCCGG CGCGACCCTG GTGGTGATCG GCAAGGAGCA GCTGCTCTCG CCGGACGGGC TGGTCGAGGT GCTGTCCCGG GAACAGATCA CCCACCTGCT CCTCACCCCG ACCGGGCTCA GCCTGGCCGA CGCCGACCCC GACCGCCTAC CCGACCTGCG TGAGGTCTTC GTTGCTGGTG AGGTGTGCTC CGCCGAACTG GCGGTTCGGT GGTCCCGGCC CGGCCGGTGC CGCCTGGCCA ACCTGTACGG TCCGACCGAG ATCACCATCG CCAACACGGC CTACGACTGC ACCGGCTGGT CGTCGGCGGA GCCACCGCCC ATCGGCAGGT CGCTTCCCAA CCGCCACCTC TACCTGCTCG ACCGGTGGGG CCAGCCGGTG CCGGCGGGGG TGCCCGGCGA GATCGTCGTC GGGGGCGTCG GGGTCAGCCG CGGCTACCTC AACGAGCCGG AGCTGACCGC CCGCACGTTC ACCGACGATC CCTTCGCCCC GGGCGCTCGG GTGTACCGGA CCGGCGACCG AGGGGTGTGG ACGGACGACG GGCTGCTTCG CTTCGTCGAC CGGCTCGACG GGCAGGTGAA GCTGCGCGGG CTGCGGATCG AGCTGGCGGA AGTCGAGACC ACACTGGCCC GGCACGAGGA CGTCGACCAG GTGGCCGCGA CGGTCGTCCG AGACGGTTCG GGGACGCAAC GGCTCGTCGC CTACGTGGTG CCCGTCGCCG ACCAGATCGA CGCCGCAGCG CTCCGCGCGT ACGCGGCCGA GGAGTTGCCC GCGCACATGG TGCCGGGTCA GGTCCTGCAC CTGTCGGCAC TGCCGCTGAC CGGCTCCGGC AAGATCGACC GCCGGGCGTT GCCCCCGCCC GCCCCGGACG GGGCGGAGGT CGAGGATCGG GCGATGCCGG CCGACCCGGC CGAACGGCAG GTGGCGGCGG TGTTCGCCGA GGTTCTCGGC GTGCCCTCGG TCGCGGTGGA CCGATCCTTC TTCGACCTGG GTGGCCACTC GTTGCAGGCG GCGTACGTCC TGGCGCGGAT CGCCCGGCAG ACCGGCGTCA CGATCGGCCT CAAGCAGTTC TATGCGGATC CGACCGTCCG GACCCTGGCC GGGCTGGTCG GTCGGGGTTC TGCCGCCGAC GCCGGCCGGT CGCCGCTGGT CACCCTCAAG GCGGAGGGCT CCCGGCCCCG GCTGTACTGT CTGCACGCCG TGTCCGGCTC GCCCTACTGG TACCTGCCGC TGAGCCGGGC CCTGCACCCC GAGCAGCCGT TGGACGGCTT CGAGGCACCC GGTCTGGAGG GCGACGCCGA GCCGGTGGAG GACCTGACTG CCCTCGCGGC CCGGTACGTC GATGCCCTCC GCGAGCGGCA GCCCGCCGGC CCGTACCTGT TGGCCGGCTG GTCAATGGGC GGTTTCCTGT CGTTCGAGAT GGCCCGTCAG CTCGCGGCCG TGGGTGAGTC CCCGGCGCTG GTGGCGATGA 
TCGACTCCAA CGAGCCCGGT CCGCTGCCGC TGCCCAGCGA GCAGGAGGTG ATGGAGACCT TCGTCAGCGA CCTCGGCGGC CTCGCTGGGA CGGCCCCGCC GGTGCTACCG GCCGAGATCG CCCGGGCGGC ATCCACCGAC CCCGCGGTAC TCACCGCCTT CCTGGTAGAG CACGGGATGG TTCCCGCCGA CGTACGCGCC GACTTCGTGT CCCACCGGTA CCGAGTGTTC CGCGCCAACA TGCGGGCGGT CTACGGCTAC CGGCCCGGGC CCTACGCCGG CCGGGTCGTG ATGGTCCAGG CAGCGGAGGA ACCGAGCCGG GCGGCATGGG CCCGTCACGC AGGTGCGATG GAGACGGTCA CCCTTCCGGG CAACCACTAC TCGCTCTGGT CCGCCGCGCA CCTGCCGGGA CTCGCCGCGA TGATCGACGC CCGGGTCGGG GAGGCGATGG CCGGCGACTG A
|
Protein sequence | MPDDSFEGRV AQLSERRRLL LERLRQQRQG PGRPAAIRPR PSGVDRVPLS QAQEQLWFLA QLAPDEPTYN LVQASYLVGP LDLVALRRAL DEVVRRHEAL RTVIESTDDT AYQVVCSPGP AALEIDDVSV LPASERRHAA LLLLQDHEVH RPFDLASGPL FRARLVRLGP TEHALALAVH HSTADGWSMG LIIRELSTLY AAYRSGTAAD LPAPALQFAD FAAWQRRQLR ADAMERHLRY WQDRLVDLPT LDLPTDRSRP AATSFRGALL EQPIDRTLHT AVQAMAKETG ASSFMVLVAA FGATLARYTG QEDLAIGTAF SGRGRPELEK VVGFFANMGV LRIDASGNPT FATLVDRARD TCLGAWEHQD APFERVVQRV APLRDPSRNP LFQVAVQMLT SSTWGGPGLP GTDSFPVDLR LERSRFDLTV SLVDHGDRYS ILAEYSTDLF GRQRIQRLLV HFERVLAAGL ADPTRRLSEF PLLTEEERRE VLAFGVGERR PVPRTTALRM FAEQARRRRD TVAVRHDGLD LSYRALDDRA RRLAGRLRAA GVRPKDPVPV LLDRGFDEVV APLAIWYAGA VHVPLDTAAP PNRLRRIITN TGARLAVTRT EYAARMPTDG PWRVLHLDDR DPEVDAPRTV DDLAASSTGL DDVAYILHTS GSTGDPKGVQ IDHAGLVNYL DWMVGEWRCG PGDRILHAGA PIFDLAAGET LAALTSGATL VVIGKEQLLS PDGLVEVLSR EQITHLLLTP TGLSLADADP DRLPDLREVF VAGEVCSAEL AVRWSRPGRC RLANLYGPTE ITIANTAYDC TGWSSAEPPP IGRSLPNRHL YLLDRWGQPV PAGVPGEIVV GGVGVSRGYL NEPELTARTF TDDPFAPGAR VYRTGDRGVW TDDGLLRFVD RLDGQVKLRG LRIELAEVET TLARHEDVDQ VAATVVRDGS GTQRLVAYVV PVADQIDAAA LRAYAAEELP AHMVPGQVLH LSALPLTGSG KIDRRALPPP APDGAEVEDR AMPADPAERQ VAAVFAEVLG VPSVAVDRSF FDLGGHSLQA AYVLARIARQ TGVTIGLKQF YADPTVRTLA GLVGRGSAAD AGRSPLVTLK AEGSRPRLYC LHAVSGSPYW YLPLSRALHP EQPLDGFEAP GLEGDAEPVE DLTALAARYV DALRERQPAG PYLLAGWSMG GFLSFEMARQ LAAVGESPAL VAMIDSNEPG PLPLPSEQEV METFVSDLGG LAGTAPPVLP AEIARAASTD PAVLTAFLVE HGMVPADVRA DFVSHRYRVF RANMRAVYGY RPGPYAGRVV MVQAAEEPSR AAWARHAGAM ETVTLPGNHY SLWSAAHLPG LAAMIDARVG EAMAGD
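The gene and protein sequences above are laid out in space-separated blocks of ten characters. As a rough sketch of how such a record could be processed, the code below strips the block spacing, computes a GC fraction, and translates codons until the first stop. It uses only the first three blocks of the gene sequence as input, so the GC value it produces is for that prefix and not the 72% reported for the whole gene. The helper names are hypothetical. The codon table is the standard codon-to-amino-acid assignment, which translation table 11 shares for elongation; alternative start codons permitted by table 11 are not handled here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGCCCGACG ACAGTTTCGA AGGCCGGGTC")
print(translate(prefix))  # MPDDSFEGRV
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the protein sequence in the record, which is a quick sanity check that the two sequences are in the same frame.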