Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2665 |
Symbol | |
ID | 5706976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3039557 |
End bp | 3041122 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272123 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001537493 |
Protein GI | 159038240 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.535194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000509999 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCGACC AGCCGCAGCC GGGTTATGTG CACCGTGCGT TGGCGCTCTT CGCGGAGTTC GGCGACCGGG CGGCGATCAT CGGGAATGGA CGCACACTCA CCTACACCGA CGTGTGCGAT GACGTGCGCG GGTTCGCTAC CACGCTGCTC CGTCGCGGAA TCCGACCTGG CACCGCCGTA CTGGTGTCGC TGGGCAACCC GGTGGAGGCA CCATTGCTAC AGCTCGCACT CCACCTGATC GGCTGCCGTA CGATGTGGAT CGCGCCGGTG ACCTCCCGTC GGGAGATCGA CGAGTTCGTC CAGCTCGCCC GGCCCGACGC CCTACTCTAC GACGCCCGCG ACCCGGCCAA CGTCGGTGCG GAGTTGGCCC AGGGTCTGCG CGACCGGCCG GTGCTCCGCC TCGGCGTGGA CCTGACGCCG GCGCGGGACG TCAGCGACCT ACCCGCACGG GTACCAGCAG CAGAATCGTT TCTCCAGACC TCCGGCACCA CCGGCACTCC CAAGCTGGTG CACCACCGGG ATAGCTTCTA CACCCAGGTC CTCGCCCTGG CCGCCGACTT CCGCGGGGCC GGATTCCCGT TGCTGCGGCA CCTGTCGTAC TCGCCGATGT GGCTGGCCAG CGGCCAGATC ACCACACTGT TCAACCTGTT CACCGGAGGG GTCTTGTTCC CCCGGGAGGG GTGGGAGGCG GCGGAGTTCA TCGACACCGT GCCGGCCGAA CGGATCACCT CCACCTTCCT GACCCCGCCG ATGCTCTACG AGGTGCTCGA TCATCCCGCC CTGCCGGGTG CCGACTTCTC GTCGATGTTC ATGTTCAACG TGGGCGCCGG GCCTGCCGCA CCCGCCCGGC TGCGCCAGGC GATCACGCGG TTCGGTCCGG TGCTGCGCAT CGTGTACGGG CTCAGCGAGG TGGTGGTGCT CACCGCACAG CCGGGCCTGA CCGAGGACCC GGAGCACCCG GAGCGGCTGC GCTCCTGCGG AAAGCCGTAC GGCGACGTGC GGATCGAGAT CCGTGGCGCG GACGGTGCGG TGCTACCGAC GGGTTCGGAC GGCGAGGTGT GGGTCCACAC CGCACTGCGC TTCGCCGGCT ACCACGGCCG CCCCGACCTG ACCGCGGACA CGCTGGTGGA CGGTTGGGTG CGTACCCGCG ACATCGGCCA CCTCGATGCG GACGGCTACC TGTACCTGGT CGACCGGTTC CAAGACCGGA TCCTCACCCG CAGGCGTAGC TGGCCGATCT ACTCCCGGCC GATCGAGGAC GCCCTGGCCG GGCACCCCGA CGTTCGGGCG GCGGCGGTTG TCGGCGTGCC CGACGAGGTG GCCGGTGAGT TGCCGTACGC CTACGTGGTG CCTGCTCCCG GCGCCACGGT GAGCAGCGCC GAGCTGATCG ACCTGGTGAC CACAACGCTC AGCGACACCT GGGCACCGGG CGCAGTGGAG TTCGTCGACG CGCTGCCGCT GAATCGTGCT AACAAAGTGG ACAAACGTGC GCTTCGCGCT CGGTATGCGG CCGAGCACCC GTCAACCGTC GAGCACCCTG AGGCATCGAT CGGCCGTCGC ACGTGA
|
Protein sequence | MTDQPQPGYV HRALALFAEF GDRAAIIGNG RTLTYTDVCD DVRGFATTLL RRGIRPGTAV LVSLGNPVEA PLLQLALHLI GCRTMWIAPV TSRREIDEFV QLARPDALLY DARDPANVGA ELAQGLRDRP VLRLGVDLTP ARDVSDLPAR VPAAESFLQT SGTTGTPKLV HHRDSFYTQV LALAADFRGA GFPLLRHLSY SPMWLASGQI TTLFNLFTGG VLFPREGWEA AEFIDTVPAE RITSTFLTPP MLYEVLDHPA LPGADFSSMF MFNVGAGPAA PARLRQAITR FGPVLRIVYG LSEVVVLTAQ PGLTEDPEHP ERLRSCGKPY GDVRIEIRGA DGAVLPTGSD GEVWVHTALR FAGYHGRPDL TADTLVDGWV RTRDIGHLDA DGYLYLVDRF QDRILTRRRS WPIYSRPIED ALAGHPDVRA AAVVGVPDEV AGELPYAYVV PAPGATVSSA ELIDLVTTTL SDTWAPGAVE FVDALPLNRA NKVDKRALRA RYAAEHPSTV EHPEASIGRR T
|
| |