Gene Sare_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1040 
Symbol 
ID5706539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1164548 
End bp1166341 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content67% 
IMG OID641270556 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001535940 
Protein GI159036687 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.270253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.341952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGAGT CGACATCGCC ACCGTTGTAC GAGGTGCCGG AGAAGGGCGG GCTCGCTGAC 
CTCATCGCGA GTAACGCCGC CGAGGATCCG GATGCGCCGG CATTCCACCG TAAAGTCGAC
GGCCGGTGGG AGCCGGTCAG CGCCGCGGAG TTCTACACCG AGGTCACCGA CCTGGCCCGA
GGGTTGATCG CCAAGGGCGT CGAGCGCGGG GACCGCGTGG GTCTGCTCTC CGGCAACCGG
TACGAGTGGT CGCTCGTGGA CTTCGCCCTG TGGGTGATCG GCGCGGCGCC GGTGCCGATC
TATCTGACCT CCTCGACCGA GCAGATCGAG TGGATTCTCG GCGACTCTGG TGCGGTAGCG
GTTGTGGTGG AGACCGAGGC GCACGAGAAC GTGATCCAGG GAATGCGGGG CAAACTACCC
GCGCTGCGAC ACGTCTGGCA GATCGACGGT GGTGCGGTGG CGCACCTGCG AGCCGCGGGG
GCCGACGTCG ACCCCGCCGC CGTCGAAGAG CGTCGGACGG CCGTGCTCCC GGAGGACACG
GCCACGATCA TCTACACCTC CGGCACCACC GGGATGCCCA AGGGTTGCGT GCTCAGCCAT
GCCAATCTGT TCGCCGAGGC GGGTAACGCC GTAGCTCTGC TGCGCGCCAT GTTCGGTCCG
CTCGGCGACA TCCCGGCATC GACACTGCTG TTCCTGCCTC TTGCCCACGT CTTCGGCAGG
ATGGTGGAGG TCGGTGCCAT GGTGGCGCGG ACGCCCATCG CGCACTGCTC CGACGTCAAG
CAGGTGCCCG CGGAACTGAT CAGCTACAAG CCGACATTCC TGCTCTCGGT TCCCTACGTG
CTGGAGAAGG CGTACAACAC CGCCCGGCGC AAGGCTTACG AAGCTGGCAA GGGCAAAGTG
TTCGACACCG CGGCGGCGAC GGCAATCGCC TACTCGGAGG CGCAGCGGCC CGGACTGGGC
CTGCGCATGC GACACGCCCT GTTCGAGCGA CTCGTGTACC GCAGGGTCCG GGCCGCCTTC
GGCGGGAACC TGCGTTTCGC GATCTCCGGA GGTGCGGCCC TGGGCGAGCG CCTCACGCAC
TTCTACCGCG GCTGTGGCAT CACCGTGTTC GAGGGTTACG GACTCACCGA AACCAGTGCC
GCGGTCACGG TCAACTCGCT GGATTCCTTC CGGCCGGGAA CCGTGGGCAG GGTCCTGCCG
AGCGTACGGA TGAGGATCGA CGACGACGGT GAGGTGCAGT GCACCGGTGG CCCGGTGTTC
GCGGGCTACT GGAACAACGA TGAGGCCAAT GCCGAGTCGT TCACCGAGGA CGGCTGGTTC
CGGACCGGTG ACATCGGGGA GTTCGACGAG TTCGGGCACC TGCGGATCAC CGGCCGTAAG
AAGGAGATCC TGGTGACCAG CGGCGGTAAG AACGTGTCGC CCGCCGTGAT CGAGGACCGC
ATCGCCGCCG CTCCGCTGGT CGCCCAGGCA CTCGTGGTGG GTGATGGGCA GAAGTACATC
GCCGCACTCA TCACCGTCGA CTCGGAGTAC CTGGAGCACT GGAAGACCGG CGCGGGCAAA
CCCGCCGACG CGGCGGTCTC CGACCTCATC GACGATCCCG ACCTTCTGCA CGAGCTGCAG
CGCGCCGTGG ACGAGGGCAA CGCGGCGGTG TCGACGGCCG AGGCGGTACG TCGGTTCCGT
GTTCTTCCCA AGGAGTTCAC TGTGGAATCG GGCCATCTGA CGCCATCGCT GAAGTTGCGT
CGCAGTGTCA TCATGGCGGA CTTCGCCGAC GAGGTAGCTG AGCTGTACAG CTAG
 
Protein sequence
MRESTSPPLY EVPEKGGLAD LIASNAAEDP DAPAFHRKVD GRWEPVSAAE FYTEVTDLAR 
GLIAKGVERG DRVGLLSGNR YEWSLVDFAL WVIGAAPVPI YLTSSTEQIE WILGDSGAVA
VVVETEAHEN VIQGMRGKLP ALRHVWQIDG GAVAHLRAAG ADVDPAAVEE RRTAVLPEDT
ATIIYTSGTT GMPKGCVLSH ANLFAEAGNA VALLRAMFGP LGDIPASTLL FLPLAHVFGR
MVEVGAMVAR TPIAHCSDVK QVPAELISYK PTFLLSVPYV LEKAYNTARR KAYEAGKGKV
FDTAAATAIA YSEAQRPGLG LRMRHALFER LVYRRVRAAF GGNLRFAISG GAALGERLTH
FYRGCGITVF EGYGLTETSA AVTVNSLDSF RPGTVGRVLP SVRMRIDDDG EVQCTGGPVF
AGYWNNDEAN AESFTEDGWF RTGDIGEFDE FGHLRITGRK KEILVTSGGK NVSPAVIEDR
IAAAPLVAQA LVVGDGQKYI AALITVDSEY LEHWKTGAGK PADAAVSDLI DDPDLLHELQ
RAVDEGNAAV STAEAVRRFR VLPKEFTVES GHLTPSLKLR RSVIMADFAD EVAELYS