Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2071 |
Symbol | |
ID | 5703282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2371675 |
End bp | 2375655 |
Gene Length | 3981 bp |
Protein Length | 1326 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271557 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001536928 |
Protein GI | 159037675 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.564639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.531105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC CCCTGATGCC CGAAGCGGTC CTCGACGACG TACTCGACTG CATCGCCGAG GTCCTCGGCA CGGATCCGGG GCTGATCGAT GTGGAAGCAC CACTGACAGC CCTGGGTCTG GAGTCGTTCA CCGCCGTACG GCTCCGCCGG CGGATCCGGG AGCGCACCGG ACACGACCTT CCGCTGACCG CGTTCCTCGA CAACGCCACC GCGAGGCACG TCACCCGGCA GCTGTCGGGA GCCCCTGACG AACCGGCCGC TGAATCCGCT GGGCAGCGCA CCGCTGGGCA GCCGGGCGGC CGGGCGGCCT TCCCCTTGAC TCCGGTCCAG GAGTCCTACC TTGTCGGGCG GGAGTCAGGC CTGGTCCTCG GCGGGGTGGC CACCTTCTAC TACCACGAGT ACGACCGGAT CAGCGACGAC GCCGCCACCG ACCTCCAGCG ACTCGACCTG GCCTGGAACC AGTTGGTCGA CCACCACCCG ATGCTACGGA TGGTGGTCGA CCAGCGTGGC CGTGGGAAGA TCCTGCCCAC AGCCGGTCCA TACCGCATCG GCGTCACCGA CCTGCGAGGC GCCTCACCCG CACAGGTGGA CGAGTCGTTG GCCACGCTCC GGCACGAACG CTCCCACCAG GTACGCCCAA CCGGGCAGTG GCCGTTGTTC GACCTGCACG CGGCGTTCCT ACCCGATGGT CGTACCCGCC TCTACATCGG CTTCGACGTG CTGATCACCG ACATGGCCGG ATGGATGCTG CTGATGCGGC AGTGGGGACA ACTCGTCGCC GACCCGACGA CATCCCTGCC CGAGCCTCCC GCCGAGTTCG CCGACCTGCT GCACGCCCGG GAGACGGATC TGGAGTGGAC CCAGCGGCGG GAACGGGACC GCGCCTACTG GGCAGCCCGC GTCGCCGAGC TTCCTCCAGC GCCACGCCTA CCGGTGACCC GTGCCGCCGA GTCCACCGTG CCGCCCCGCT TCGCTCGGCA CGCCGGCCAG CTCGACGCGC AGGCCTGGCG GACACTCCGT ACCCGGTGTA CCGAGCACGG GGTGACAGCG ACCGCCGCGT TGCTAGCCGC GTTCGCGGTG ATCCTCGAGC GGTGGGGCGC CGGCGAGCAG GTCTGCCTGA ACACCACCCT CTTCGAGCGT CCGGAGAAGC CCGAGGGTAT CGACCTGGTG GTGGGCGACT TCACCACCAC CGCGTTGGTC GGTACCCCGA GAATCGACCC GGCCTCGTGG AACGGATTCG CTGGTTACGC ATCGGAGCTC AACCGTCGCT TCTGGGAGGA CCTGGACCAC CGGTCGGTCT CCGGAGTTGA TGTGCTGCGG GGGCTCAGCG ACAGCTCCGG CGCCCCGCCC TACCCGGTCG TCTTCACCAG TGGGGTAGGA CTCGCCGGTG ACGGCACCGC GGCTCCGGCG AGCTGGCTTG GCGCGGAGGT CTTCGGCGTC TCGCAGACCC CCCAAGTGCT GCTCGACCAC ATCGTGTGGG ACGAGGATGG CGTGCTCCGA ATCGCCTGGG ACGGTGTCGT TGACGCCTTT CCCGACGGCT ACCTGCGCAG CATGCTCGAC GCGTACGTCC GGCTGCTGCA CCGCCTCACC GAGGCCACCG CGTGGAAGGA CCCGAGGCTC GCCTGGGACC CCTTCGCCCT CCCCGTGGAA CCGTTGGACG TCGACCCGTT CCCTGATGCC GGCCCGCTGC TGCACGACCC GGCGACCAGC ATCGCCCGCC GTATGCCGGA GAAGCCGGCC CTGTACACGG GGGGCGCCGT TACCTCGCAC GGCCGGTTGG CCGAGGGCGT CGCCGCGACC ACTGCCGCTC TTGCCGCCGC CGGTGTCGGT ACCGGCGACC TGGTGGCGGT CGCCTGCGAG AAGGGGCTGG CCCAGGTCGT CGCGGTCCTG GCGGTCAACG CCGCCGGCGC GGGCTACCTG CCGGTCGAGC CGTCCTGGCC GGATGCCCGG GTGGCCACGA TCTGTGTCCG TGCCGGAGTG CGGCACGCCC TGGTGGGCCG GGGCGTTCGG ACCACGTGGC CCGAGGGTGT GTTGACGCAC CGCCTGACCG CGGCCGGCCG GCCCGGGGGC CGATCAGGAA AGACTGTCTC CGAACCAACA CCACCGCCAT CGCGACCCGA CCCGGATGAC ACCGCTTACG TCATCTTCAC CTCCGGCTCC ACCGGGCAGC CGAAGGGCGT CGAGATCCAG CACCGCGCCG CCCGCACCAC CATCGACGAC ATCGTCGACC GCTTCGGCGT CCACGCCGAC GACCGGGTGC TGGCGCTGTC CGCGCTCAGT TTCGATCTCT CCGTCTTCGA CATCTACGGC GTTCTCGGGG CGGGCGGGGC GCTGGTCCTG CCCGACCCGG CCCGACAGCG CGATCCGCAA CACTGGCTGG AACTGGCCGA GCGGCATGGC GTCACGGTGT GGAACACCGC TCCCGCGCTG TTGGAGATGC TCGTCGAGTA CGCCGAAATC GAACCGGGGG CCGCGACCCG GGCGCTGCGC GCCCTACGGC TGGTGATGCT CTCCGGCGAC TGGATCCCGT TGACCCTGCC CGAACGCCTG CGTCGGCTCG CCCCACAGGC CCAGCTGATG AGTCTGGGCG GTGCGACCGA GGCGTCCATC TGGTCGATCA CCTACCCGGT GGTGGACGTC GCCCCGGGGT GGCGGAGTAT CCCCTACGGT CGGGCGTTGC GGGCCCAGTC CTTCCACATT CTCGACCCGG ATGGCCGGCC ATGCCCGGTG GGTGAGCCGG GGGAACTGTT CATCGGCGGC GATGGGCTCG CCCGGGGATA CATTGGCGAT CCGGAGCAGA CGGCGCATCG CTTCGCCCGG CACCCGCTGT TGGGTGAGCG GCTGTACCGC ACCGGAGACC TGGGACGCTG GCGGACCGAC GGGAACATCG AGTTCCTGGG ACGCGTCGAC CGGCAGGTCA AGATCCGCGG ACATCGGATC GAACTCGGTG AGATCGAGGC CGCGCTCGGT CGGCACCCGG CCCTGCGGCA GTGCATCGTG GCCGCGGTGC GTGGTGCCGA CGAGCGCCCC CGCTTGGCCG CCTACGTGGT GCCGAAGGCG GGAGATGCCG TCCCGACCGC CGACGAATTG GCCGGGACGC TGCGCGAACG GCTGCCCGAC TACATGGTTC CGAGCAAGTT CCTTGTGCTC GACTCGCTGC CGGTCACCCC AAACGGCAAG ATCGACTACG CGGCACTACC CAATCCGTAC CAGGCGGGGG ACGCCGACGT GCAGCCACGT CACGCGACGC TCTCGCCGCC CGCGTCGCCG GTAGTTCCCC CGGTCACGTC CGCCCCACGC TTCGTCGACT GGGCCGGTAC GGCGGTGGCC GAGGCGGAGG CACTCGGCCT CGAGGTCGCA CTCGTCGTCC GCCCGGGGCG AATGTCACCG GCGCAGGCCC TCGTCGCGGC GACGCGCTGG CTCGACCGGG TCCACTCGGC AGGCGCCACC GCACTGGTCG AGCGGATCCC CGCTGACGGC CTCATCGAAC TCGCTCAGCA GACCACCGAT CTACCGCAGG TCGACGTCCT TCCTCCAGCT CCGCTGAGCA TGCCGGCCCC ACAGAGTGTG TCGGTCGCCG AACCGGCAGC AGCCGCGCTG AACCGGACCG GGCCGGAGCC TGCAGGTCAC CCGGTCGCCG ATGGTCGTAC CGCTGATCCG GCCGCCACCG ACGCGGTCAT CGCGGTGCTC GCAGACCTGA CGGGGGAGCC GGTTCGGGCG GACAGCACCT TCGCCGCACT GGGCGTGACC TCGCTCACGC TGGTGCTGAC GCACCGCAGG CTGCGGGAGA GCATCGCGCC CCAACTCGCG CTCGCGGACA TGTTCGCCCA TGCCACCGTG GCCTCCCTCG CCGCGCACCT CACCGCCCTG GCCGCTCCGA GGAACACCCG AGCACCCGGG CCGACGGCCG CCCCGGTGCC GTCCGCCCGA CGCTCGTCCC GGCTGGCCGC CCGTGTCCGG GCCCAGGAGG TGGACCGGTG A
|
Protein sequence | MNLPLMPEAV LDDVLDCIAE VLGTDPGLID VEAPLTALGL ESFTAVRLRR RIRERTGHDL PLTAFLDNAT ARHVTRQLSG APDEPAAESA GQRTAGQPGG RAAFPLTPVQ ESYLVGRESG LVLGGVATFY YHEYDRISDD AATDLQRLDL AWNQLVDHHP MLRMVVDQRG RGKILPTAGP YRIGVTDLRG ASPAQVDESL ATLRHERSHQ VRPTGQWPLF DLHAAFLPDG RTRLYIGFDV LITDMAGWML LMRQWGQLVA DPTTSLPEPP AEFADLLHAR ETDLEWTQRR ERDRAYWAAR VAELPPAPRL PVTRAAESTV PPRFARHAGQ LDAQAWRTLR TRCTEHGVTA TAALLAAFAV ILERWGAGEQ VCLNTTLFER PEKPEGIDLV VGDFTTTALV GTPRIDPASW NGFAGYASEL NRRFWEDLDH RSVSGVDVLR GLSDSSGAPP YPVVFTSGVG LAGDGTAAPA SWLGAEVFGV SQTPQVLLDH IVWDEDGVLR IAWDGVVDAF PDGYLRSMLD AYVRLLHRLT EATAWKDPRL AWDPFALPVE PLDVDPFPDA GPLLHDPATS IARRMPEKPA LYTGGAVTSH GRLAEGVAAT TAALAAAGVG TGDLVAVACE KGLAQVVAVL AVNAAGAGYL PVEPSWPDAR VATICVRAGV RHALVGRGVR TTWPEGVLTH RLTAAGRPGG RSGKTVSEPT PPPSRPDPDD TAYVIFTSGS TGQPKGVEIQ HRAARTTIDD IVDRFGVHAD DRVLALSALS FDLSVFDIYG VLGAGGALVL PDPARQRDPQ HWLELAERHG VTVWNTAPAL LEMLVEYAEI EPGAATRALR ALRLVMLSGD WIPLTLPERL RRLAPQAQLM SLGGATEASI WSITYPVVDV APGWRSIPYG RALRAQSFHI LDPDGRPCPV GEPGELFIGG DGLARGYIGD PEQTAHRFAR HPLLGERLYR TGDLGRWRTD GNIEFLGRVD RQVKIRGHRI ELGEIEAALG RHPALRQCIV AAVRGADERP RLAAYVVPKA GDAVPTADEL AGTLRERLPD YMVPSKFLVL DSLPVTPNGK IDYAALPNPY QAGDADVQPR HATLSPPASP VVPPVTSAPR FVDWAGTAVA EAEALGLEVA LVVRPGRMSP AQALVAATRW LDRVHSAGAT ALVERIPADG LIELAQQTTD LPQVDVLPPA PLSMPAPQSV SVAEPAAAAL NRTGPEPAGH PVADGRTADP AATDAVIAVL ADLTGEPVRA DSTFAALGVT SLTLVLTHRR LRESIAPQLA LADMFAHATV ASLAAHLTAL AAPRNTRAPG PTAAPVPSAR RSSRLAARVR AQEVDR
|
| |