Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4933 |
Symbol | |
ID | 5707080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5599143 |
End bp | 5600810 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274329 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001539671 |
Protein GI | 159040418 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.63836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCGA CCGCGGTTCA CCCGACCCGT ACGGGAACCG TTCCCTGGCC CGCCGAGGTA GCCACGCGCT ACCTCGCCGA GGGTTACTGG GCGGGCCGAC CCCTGGGGGC GTACCTCACC GCTGCCGCTC GCGCGAACCC GGCCGCGATC GCACTGGTCG ACGGCGATCT GCGGCTGAGC TACCGGGAGC TGATGTCCCG CGCCGACGGC GCTGCGGCCC GGCTCGTCGA GCGTGGCGTC AGCGGCGACG ACCGGGTCGT GGTGCAGTTG CCGAACTGCT GGGAGCACAT CGTGCTCACC GTGGCCTGCC TGCGACTCGG CGCCGTCCCG GTGTGGGCTC TGCCGGAGCA CCGACTGCGC GAAATCACCG GCGTCGCCGC CCGCGCCGAG GCCCGGGTAC TGGTAGTACC CGCCCGGCAT CGCGAGTTCG ACCACAGGGC GATGGCGCAC GAGGTGGCCG CCACCGTACC CAGCATCGAG CACGTCCTGG TGACCGGATC CGCAGATCCA GGTGAGGATC TGGGCAGGCT GTGTGAGCCG GCAGCAGACC CCGCCGCGCT GTCCGCACGC TTCGACGCGG CTGCGCCGGA TGCTACCGCG GTGGCGACGT TCCTGCTCTC CGGCGGCACC ACCGGCACGC CCAAACTCGT GCCCCGAACG CACAACGACC TCGCCTACAT GGTCGGCGAG GCAGCCCGAC TGTGCGAGTT CGGTCCGGAC ACGGCGTATC TCGCGGCTCT TCCGCTCGGC CACGGATTCC CGTACACCGG TCCGGGTGTT CTGGGCGCGC TGATGTCCGG CGGCCGGGTG GTCATCGCCG CCTCCCCCGC CCCCGGGCCG GCGTTGGCGA CGATCGAACG CGAGCGGGTC ACCGCGACGT CGATCGTCCC GGCGATCGCG CTTCGTTGGT TGGCGCACCA CGCGGCCCAC CCCGGCCGGG ACCTGGGTTC CCTGCGTCTG GTGCAGATAG GGGCGGCACG TCTGGAGCCC GACGCCGCGG CCCGGATCGA GCCCGAGCTG GGGGGACGGC TGCAGCAGGT GTTCGGGATG GGGGAGGGTC TGCTCTGCCT GACCCGCCTG GACGATCCGC CGGCAGTCGT GCACCACACC CAGGGCCGGC CGATCAGCCC CGCCGACGAG GTTCTCATCG TCGACGACGA GGACCAGCCG GTGCGGCCGG GGGAGGCGGG GGCGCTACTC ACCCGCGGCC CGTACACCCT TCGCGGCTAC TACCGCTCGC CCGAGATAGA CGCGGCGTCC TTCCTGGCTG ATGGTTGGTA CCGAACCGGC GACATCGTCC GCCAGACGCC GGACGGGAAC CTGGTGGTCA CCGGCCGCGA GAAAGATCTG ATCAACCGTG GTGGTGAGAA GGTCAGCGCC GTCGAGGTCG AGGGTTTCGC GCTCGCTCTC GACGGGGTCA CCCAGGCAGC CGCCATGGCG ATGTCGGACG CCGAACTCGG TGAACGCGTA TGCCTGTTCG TCGTCCCCGC GGGTGGGGCG CGGGTGGACC TGGCAGACGT GCGTGCCTCG ATGCTCGACC GCGGCGTCGC GGCGTTCAAG CTGCCGGACC GACTGGTCAG CGTGGACGCG CTGCCGATGA CACCACTCGG CAAAATCGAC AAAAAGGCAT TGCGGGACCA GATCCCGACG CACCTGGACA CTGCCTGA
|
Protein sequence | MPPTAVHPTR TGTVPWPAEV ATRYLAEGYW AGRPLGAYLT AAARANPAAI ALVDGDLRLS YRELMSRADG AAARLVERGV SGDDRVVVQL PNCWEHIVLT VACLRLGAVP VWALPEHRLR EITGVAARAE ARVLVVPARH REFDHRAMAH EVAATVPSIE HVLVTGSADP GEDLGRLCEP AADPAALSAR FDAAAPDATA VATFLLSGGT TGTPKLVPRT HNDLAYMVGE AARLCEFGPD TAYLAALPLG HGFPYTGPGV LGALMSGGRV VIAASPAPGP ALATIERERV TATSIVPAIA LRWLAHHAAH PGRDLGSLRL VQIGAARLEP DAAARIEPEL GGRLQQVFGM GEGLLCLTRL DDPPAVVHHT QGRPISPADE VLIVDDEDQP VRPGEAGALL TRGPYTLRGY YRSPEIDAAS FLADGWYRTG DIVRQTPDGN LVVTGREKDL INRGGEKVSA VEVEGFALAL DGVTQAAAMA MSDAELGERV CLFVVPAGGA RVDLADVRAS MLDRGVAAFK LPDRLVSVDA LPMTPLGKID KKALRDQIPT HLDTA
|
| |