Gene Sare_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4358 
Symbol 
ID5706439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4924087 
End bp4925736 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID641273780 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_001539130 
Protein GI159039877 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.124731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGCA CGATGATGGA CGCCCCCCTC CAGGTATCCC GGATCCTTGG CCACGGCTCC 
ACCGTGCACA GCACGGCCGA GGTGGTCACC TGGACCGGTG CCGAGCCCCG CCGGATGACC
TACGCCGACG TGGGGCGCTT GTCCGCCCAG TTGGCGCATG CGCTGCGCGA CGAGTGCGGC
GTTACCGGTG ACGAGCGGGT CGCCACCTTC CTGTGGAACA ACACCGAGCA TCTGGTGGCG
TACTTCGCGG TGCCGAGCAT GGGCGCGGTG CTGCACACAC TCAACATCCG GCTCCTCCCG
GACCAGGTGG CATACATCGC CAACCACGCC GAGGACCGGG TGATGCTGGT CGACACGACA
CTGATCCCCC TGCTGGCGAA GGCCATCGGC GACATGACCA CCGTCCGGCA CGTGGTGGTC
GTCGGCAACG GTGACCCCGC CCCGCTGGTC GCGGCGGCCG GTGACCGGAT CTCCGTGCAT
CACTGGGACA CCCTGCTGGC CGGTAGACCG GACACCTACG ACTGGCCGGA CGTGGACGAA
CGGTCCGCCG CCGCGCTCTG CTACACGTCC GGCACCACCG GTAACCCCAA GGGGGTGGCC
TACTCGCACC GCTCGATCTA CCTGCACTCG CTTCAGGTCT GTATGCCGGA GTCGTTCAGT
CTCGGGCCGC GGGACCGGGT GTTGGCGATC GTGCCGATGT TCCATGCCAT GTCCTGGGGC
CTGCCCTACG CGGCATTCCT CTCCGGCGGA TCGCTGGTCC TGCCGGACCG GTTCCTCCAG
GCCGCCCCGA TCGCCGAGAT GATCGCCGCC GAGCGACCCA CCGTCGCCGG TGCCGTCCCC
ACCATCTGGA CCGATCTGCT CGCGCACCTG GACAGCCACG ACGTCGACAC CGCCTCCCTG
GGGGAGGTGA TCGTCGGCGG GTCGGCCTGT CCGCCGGCAC TGATGCACGC GTTCGAGGAG
CGGCACAACA TCCGGATCAT CCACGCGTGG GGCATGACCG AGACCTCTCC GCTCGGTTCG
GTGGCCCGCC CGCCGGTCGG CGTCGACCGC GAGCAGGCGT GGCGGTACCG CTACACGCAG
GGGCGCGTCC CCGCCGGGGT GGAGGCTCGG ATCGTCGGCC CGGAGGGCGT GCCGCTGGCC
GCCGACGGGA CGTCCGTGGG TGAGCTGGAG GTCCGTGGGC CCTGGGTGAC CGGGCGGTAC
GTCGGCGACG AGGCCCCGGA CGAGGACACG TTCCGGGACG GCTGGCTACG TACGGGTGAT
GTCGGCACCC TCTCCCCGGA CGGCTACCTG ACGCTGACCG ACCGCGCCAA GGATGTGATC
AAGTCCGGCG GGGAGTGGAT CTCGTCGGTG GAGTTGGAGA ATGCCCTGAT GGCACACCCG
GACGTGGTCG AAGCCTGCGT GGTCGGCGTA CCGGACCAGC GTTGGGGCGA GCGGCCACTG
GCCACTGTGG TGCTCCGGGA GGGCGCGACG GTGGGAGCCG AGCAACTGCG GGAATTCCTC
GCCGGTTCGG TGGCCCGCTG GCAGCTGCCC GAGCGCTGGG CGGTCATCGA CGCCGTGCCG
AGGACCAGCG TGGGCAAGTT CGACAAGAAG GCGGTCCGGT CCCGGTACGC GGAGGGGGAA
CTTGCCGTTC GAGAGCTGAC CGCCCCTTAG
 
Protein sequence
MRSTMMDAPL QVSRILGHGS TVHSTAEVVT WTGAEPRRMT YADVGRLSAQ LAHALRDECG 
VTGDERVATF LWNNTEHLVA YFAVPSMGAV LHTLNIRLLP DQVAYIANHA EDRVMLVDTT
LIPLLAKAIG DMTTVRHVVV VGNGDPAPLV AAAGDRISVH HWDTLLAGRP DTYDWPDVDE
RSAAALCYTS GTTGNPKGVA YSHRSIYLHS LQVCMPESFS LGPRDRVLAI VPMFHAMSWG
LPYAAFLSGG SLVLPDRFLQ AAPIAEMIAA ERPTVAGAVP TIWTDLLAHL DSHDVDTASL
GEVIVGGSAC PPALMHAFEE RHNIRIIHAW GMTETSPLGS VARPPVGVDR EQAWRYRYTQ
GRVPAGVEAR IVGPEGVPLA ADGTSVGELE VRGPWVTGRY VGDEAPDEDT FRDGWLRTGD
VGTLSPDGYL TLTDRAKDVI KSGGEWISSV ELENALMAHP DVVEACVVGV PDQRWGERPL
ATVVLREGAT VGAEQLREFL AGSVARWQLP ERWAVIDAVP RTSVGKFDKK AVRSRYAEGE
LAVRELTAP