Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4895 |
Symbol | |
ID | 5707547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5559095 |
End bp | 5562202 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641274290 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001539635 |
Protein GI | 159040382 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00652278 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGGGCGA CGCCCCTCGA CACCGCTATC GACCCGGACC CGACGCCCAC CACGGCGGTC GGCGCCACCG TGCCGGACCT CGTCGCCGCC GTGGCGCGAC GGCAGCCGGA CGCGGTCGCG GTTGCCGGCA GTGACGGGGC CACGCTCACC TACCGGGAGC TGACCGAGCA GGCCGAGGCC CTGGCCCACC GGCTCGTCAC CTGGGGCGTC CACCCGGACG AGCCGGTGGC GGTGGCGCTG CCCCGCTCGG TGGAGCTGGT CGTCACCCTT CTCGCCGTAC TCAAGGCCGG CGGTGGGTAT CTGCCTCTCG ACCCGGCTGA TCCGCCAGCG CGGACGCGGC AGCTGCTCGC CGTGGCCGGG GATCCGCCGG TGCTGTCCAC AGGCGAGGTG CCTGGCGCCA CGCGGCTGTT CCGTCTGGAC CAGCCCGGCC CCACCGCGCC CACCGGTGCC GTGCCGCGGC GACTCCACCC CGCCGGTCTC GCATACGTCA ACTTCACCTC CGGCTCCACC GGTACGCCGA AGGGGGTTGC CGTTGCCCAC TCTGCCGTGG TCCGCCTGAT TCACCAGCCT GGCTACCTGC GACTGGGGCC GACCGAGACG GTTCTCCAAC TCGCTCCGGC CGCCTTCGAC GCGGCGACCC TGGAGATCTG GGGTGCCCTG GCCACCGGGG CCCGGCTCGT GCTGGCCCCA CCCGGCGCGC TGGACCTCGC CGACCTGGCC CGGCTGCTCC GCCGGGAACG GATCACAGTC CTCTGGCTGA CCGCCGGGCT CTTTCACCAG CTGGTGGAGT TCGACCCGGA CTGCCTCGCC GGGGTCGGGC AACTCCTGGC CGGCGGGGAC GTGCTCGGCC CGGACGCGGT CCGCCGGGCA CTGCGGGCGC GTGACGGAGC GGTTCTGATC AACGGGTACG GCCCGACCGA GAACACCACC TTCACCTGCG TGCACCCGAT GACCGACCCG GCGGCCGTAC CGGACCCGGT GCCGATCGGT CGGCCGGTGC CAGGGAGCAC CGTGTACGTC CTCGACCCGG CGGGACGACA CGTGCCCGTC GGGGTTCCCG GAGAGCTGTA CACCGGCGGT GCCGGGGTCG CCCGGGGATA CCTCGGGCGT CCGGGGGCCA CGGCGGCGGT ATTCCTCCCC GACCCGTTCG ACCCCCGCCC GGGATCCCGG ATGTACCGCA CCGGGGACCG AGTCCGCTGG CGGCCGGACG GCACCCTCGA CTTCCTCGGC CGGATCGACG AACAGGTGAA GATCCGCGGG TTCCGGGTCG AGCCGGGCGA GGTCGCGGCG GTGCTGCGGG CCCACCCCGC CGTCGGGGAC ACTGCCGTCC TGGTCGACGG GGAGGGAGAA CGGCGTCGGC TGCTGGCATA CCTGACGCCC CGGCCGGGCG CGTCGGCACC GACCCCGCAG GAACTGGCCG GCTACGCAGC CGACCGGCTC CCGGCCCACC TCCGACCGGC GGCCTTCCTG ATCCTGTCCA CTCTGCCCCT GACCCGTAGC GGCAAGATCG ACCGGCGGGC ACTGCCCCGA CCGGAACCGC CCGCAGCTCG GCCCGTCACC GCCGTGACCG ATCCGATCCA GGTGCGACTG GCGGCCCTCT GGGCCGAGCT GCTCGGTAGC GCACCCTCCA CCCCGGACGA CGACTTCTTC GCGCTGGGCG GGAACTCGCT GCTCGCCACC CGGCTGACCT TCGTCGTCGC CGACCGGTTC GGGGTCGACC TGCCGGTACG TGTCGTCTAC GAGCACCCAA CGCTCGCCCG GCTCGCCGGC GTCCTCGGTG AGCACCCCGA AGCGCGGCCG GCGAGCACCG GGGTGACCCG TCGAGACCGG GCTGGCTACC GGTCGCCTGC GACCACGGAC GCCATGGCCC GCCAACCGGC CCGTTCGGGC GACGAACCCG GCTCACGGTG GCACTGGGCC CCCACCCGGG CCCCGGCCGC AGACCCGGGT CGCTCCGAAC CCGACGGCAC CCTCTCCGCG GACCTCGCCG AGAGCCTCCG GCCGGCGCTC AGCCTGCTGT TGGACAGCGT CAGCTGGTTC ACCTCCGCCG CGTTGGCGCT GGTTCGGCGG ACCGCCGTGG AGCGCTACCG CGACCTGGCG GCCCGCACCG GATCGCCGAC GGTCGCCTTC GTCGACTTCT GGCACGGGAA CAGCGACCTG TTCGTGGATC CGCCGGTGAA GCTGCTCACC CCACTCGTGC GGGGGTTGCA GGACCGCTGG GCGCGGATCC TGCCCGACGC GGTCGGCCAC CGGGTCACCG CCACCTCGGC CGAGCTGGGG GACCGGGTGG CCGAGGCGTT TGCCGCACCC CGACCCGGCT GGATCGGCGC CCACCAGCAC AGCCTGGACG TATTTCTGGC CGCCGACGGG CCGGAGGCGG TCGCCCGGGG AGACATCCAG GGGGTGATCG GCGGGGTGCA CCCGGGGTTG ACCACCCTGC GCTCGGTGCT CCTCGTCACC GGTCGGGATG CCCGGCTGGT CTTTGGTCGC GACAACTGTG GCCCCGAGCC GGCCACGACG CTGCCGGTCG GCGACTGTGT TCTCGGCCCG GTCGACGGCG TGTTGACCGT CAGCAGCCGG GGCGGACGCG ACCGGCTACC GCTGACCGAG GTCCTCAAAG AGCCACTGCT GCGGCATCTG GCGCGGCGTT TCGACATCCG CCGCCCAGCC GAGCATCAGC CCCGGATCAC CATCGACCGA GTGGTGCTGG CCCGCGAGAG CTGGCGTTTC ACCGTCACCG GGCTGGGCTT CGCCACGCTG ACCGGCCAGG GCGACCGGTT CCGTCGCGTC CAGCGGTGGC AGCACGCACA CGGGCTGCCC CGGCATCTGT TCGGGTGGAC CCCGATGGAG GAGAGGCCGT TCTCCCTCGA CCTCACCAGC CCCGCCTCGG TTGACGTCCT GGCCGGTGCC CTGCGGCGCA CCGCCGACCA CGACCCGGCC GCCACGCTGC GTTTCAGTGA GCCTCTGCCA GGACCGGAGC ACGCCTGGCT CACCGATGGG CAGGGCCGGC AGCGCACGGT GCCGCTGCGC CTCGTCGCCG TCGACACTCG CACCCCGGAC GCCGACCGGC GGACGGACCG ACGACACCGC AGTCAGGCGG AGGTATGA
|
Protein sequence | MRATPLDTAI DPDPTPTTAV GATVPDLVAA VARRQPDAVA VAGSDGATLT YRELTEQAEA LAHRLVTWGV HPDEPVAVAL PRSVELVVTL LAVLKAGGGY LPLDPADPPA RTRQLLAVAG DPPVLSTGEV PGATRLFRLD QPGPTAPTGA VPRRLHPAGL AYVNFTSGST GTPKGVAVAH SAVVRLIHQP GYLRLGPTET VLQLAPAAFD AATLEIWGAL ATGARLVLAP PGALDLADLA RLLRRERITV LWLTAGLFHQ LVEFDPDCLA GVGQLLAGGD VLGPDAVRRA LRARDGAVLI NGYGPTENTT FTCVHPMTDP AAVPDPVPIG RPVPGSTVYV LDPAGRHVPV GVPGELYTGG AGVARGYLGR PGATAAVFLP DPFDPRPGSR MYRTGDRVRW RPDGTLDFLG RIDEQVKIRG FRVEPGEVAA VLRAHPAVGD TAVLVDGEGE RRRLLAYLTP RPGASAPTPQ ELAGYAADRL PAHLRPAAFL ILSTLPLTRS GKIDRRALPR PEPPAARPVT AVTDPIQVRL AALWAELLGS APSTPDDDFF ALGGNSLLAT RLTFVVADRF GVDLPVRVVY EHPTLARLAG VLGEHPEARP ASTGVTRRDR AGYRSPATTD AMARQPARSG DEPGSRWHWA PTRAPAADPG RSEPDGTLSA DLAESLRPAL SLLLDSVSWF TSAALALVRR TAVERYRDLA ARTGSPTVAF VDFWHGNSDL FVDPPVKLLT PLVRGLQDRW ARILPDAVGH RVTATSAELG DRVAEAFAAP RPGWIGAHQH SLDVFLAADG PEAVARGDIQ GVIGGVHPGL TTLRSVLLVT GRDARLVFGR DNCGPEPATT LPVGDCVLGP VDGVLTVSSR GGRDRLPLTE VLKEPLLRHL ARRFDIRRPA EHQPRITIDR VVLARESWRF TVTGLGFATL TGQGDRFRRV QRWQHAHGLP RHLFGWTPME ERPFSLDLTS PASVDVLAGA LRRTADHDPA ATLRFSEPLP GPEHAWLTDG QGRQRTVPLR LVAVDTRTPD ADRRTDRRHR SQAEV
|
| |