Gene Sare_2221 details

Gene Information

Locus tag: Sare_2221
Symbol:
ID: 5703902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2557709
End bp: 2559397
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 72%
IMG OID: 641271701
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_001537072
Protein GI: 159037819
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
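
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(2557709, 2559397)
print(length, protein_length_aa(length))  # prints: 1689 562
```

Both results agree with the Gene Length and Protein Length values listed in the record.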


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.202713
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTGA GCGCCGAGCA ACCCTGGCTG CGCAGCTACG CACCGGGCGT GCCGGCGACC 
GTCACCCCGA CCGACGAGTC GCTGGTCGAC CTGCTGCGTG CGGCAGTCCG CAGGTTCGGC
AGCCGGACCG CGCTGGACTT CTTCGGCGCC ACCACCACCT ACGTTGAGCT GGCGGCGCAG
GTGGACCGGG CGGCGGAGGC GCTGCGCCGC CTCGGCGTCG GCCGGGGCGA CCGGGTGGCA
CTGGTCCTGC CGAACTGCCC GCAACACGTG GTGGCCTTCT ACGCGGTGCT CCGCCTCGGC
GCGGTCGTGG TCGAGCACAA CCCGCTCTAC ACCGAGCAGG AACTCGCCCA CCAGCTCGCC
GACCACGGCG CCCGGGTCGC CGTGGTCTGG GACAGGGTAG CCCCACTGGT GCACCGCACC
GCTGGGACCA CCAAGGTCGA AACGGTCGTC GCGGTCGATC TCAGCGCGGC CCTGCCCCGG
CTGAAACGCT GGGCACTCCG GCTACCGCTG CCCCGAGCAC GCACCGCCCG TGCGGCGATG
ACCGCGCCCG CACCGGACGC GCTGGCCTGG GAACACCTCG TGGCCGGCAG TGAGCCCCTG
GCCGCCGACC ACCCTGCCCC GGAGCCGGAG GACACGGCGC TGTTGCAGTA CACGGGAGGG
ACCACCGGCA CCCCGAAGGG AGCGATCCTC ACCCACCGCA ACCTTCGCGT CAACGCGGCG
CAGGGCCGCG CCTGGATGCC GGGCCTCCGC GACGGCGCCG AGACGGTGTA CGCCGTACTG
CCGCTGTTCC ACGCGTACGG GCTGACGCTG TGCCTGACCT TCGCGGTGAG CATCGGCGCG
GCCCTGGTGC TGCTGCCCCG CTTCGACGTG GACGAGACGC TCACAGCGGT ACGCCGCCGC
CCGCCCACGT TCCTGCCGGC GGTGCCGCCG ATCTACGAAC GACTCGCCGT CGCCGCTCGC
GCACGAAGGG TCGACCTGAC CTCGATCCGA TACGCCATCT CCGGCGCGAT GACGCTGCCA
CCGGCCACCG TGCGACTGTG GGAGTCGGTG ACCGGCGGGC TGCTGGTCGA GGGGTACGGG
ATGACCGAGA CCTCCCCGGT GGCGTTGGGG AACCCGGTGT CGGCAGCCCG GCAACCCGGC
ACGGTCGGGG TGCCGTTCCC CGCCACCAAC GTGCGCATCG TCGACCCGGA CGACCCGACC
CGGGACCGCG CCCCTGGCGA GGCCGGCGAG TTGTTGATCA GCGGCCCGCA GGTGTTCGCC
GGATACTGGC ACCGACCGGA GGAGACGGCG GCGGTGCTGC TGCCGGGCGG GTGGCTGCGG
ACCGGGGACA TCGTCGAGAT GAACTCGGAC GGGTTCGTAC GGATCGTTGA CCGGATCAAA
GAGTTGATCA TTACCGGCGG GTTCAACGTC TACCCGTCGG AGGTGGAGGA GGCGCTACGA
CAGGTTCCCG GGGTTCGCGA CGCCGCGGCG GTCGGCCTAC CCGGCGCCGG CGGGGCCGAG
GAGGTTGTCG CCGCGGTGGT GCTGCACCCT GACTGCGCCA CCGACGCGGC AGGCATCCGA
GCCGCCTGCC GGCAGCACCT GACCGCGTAC AAGGTGCCAC GCCGAATAGT CGTGGTCGAC
GACCTGCCCC GCTCGCAGCT CGGCAAGGTG CTGCGCCGAG AGGTTCGCGA CCGGCTGCTC
GCCGCCTGA
 
Protein sequence
MDLSAEQPWL RSYAPGVPAT VTPTDESLVD LLRAAVRRFG SRTALDFFGA TTTYVELAAQ 
VDRAAEALRR LGVGRGDRVA LVLPNCPQHV VAFYAVLRLG AVVVEHNPLY TEQELAHQLA
DHGARVAVVW DRVAPLVHRT AGTTKVETVV AVDLSAALPR LKRWALRLPL PRARTARAAM
TAPAPDALAW EHLVAGSEPL AADHPAPEPE DTALLQYTGG TTGTPKGAIL THRNLRVNAA
QGRAWMPGLR DGAETVYAVL PLFHAYGLTL CLTFAVSIGA ALVLLPRFDV DETLTAVRRR
PPTFLPAVPP IYERLAVAAR ARRVDLTSIR YAISGAMTLP PATVRLWESV TGGLLVEGYG
MTETSPVALG NPVSAARQPG TVGVPFPATN VRIVDPDDPT RDRAPGEAGE LLISGPQVFA
GYWHRPEETA AVLLPGGWLR TGDIVEMNSD GFVRIVDRIK ELIITGGFNV YPSEVEEALR
QVPGVRDAAA VGLPGAGGAE EVVAAVVLHP DCATDAAGIR AACRQHLTAY KVPRRIVVVD
DLPRSQLGKV LRREVRDRLL AA
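
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is given in coding orientation, strips the spacing used in the listing, and uses the codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in which codons are permitted as starts). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGGACCTGA GCGCCGAGCA ACCCTGGCTG"))  # prints: MDLSAEQPWL
print(translate("GCCGCCTGA"))                          # prints: AA
```

The two fragments reproduce the first ten and last two residues of the listed protein. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.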