Gene Sare_3509 details


Gene Information

Locus tag: Sare_3509
Symbol:
ID: 5703318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4047294
End bp: 4049093
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 69%
IMG OID: 641272936
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001538302
Protein GI: 159039049
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
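
The coordinates and lengths in this table are mutually consistent, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4047294
END_BP = 4049093


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1800 599
```

Both results match the Gene Length and Protein Length fields of the record.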


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.889293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0246515
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGAGT TCTCCGTCCC ACCGATCGTC ACCGTCGGCG ACTCGGCCAA TCTAACCGAT 
CCGGTCTGGG ACAACGCCGA GGCCGCCCCG GACGTGGTGC AGTTCATCCG CGAGGGTGCC
GACGGCGCCC GGGTCGAGGT GACCTGCCAT CAGTTCCGTG ACGAGGTGAC GGCGGTGGCA
CGCGGCCTGG TCGCAGCCGG TGTGCAGCCC GGCGACCGGG TCGGCCTGAT GAGCCGAACG
CGGTACGAGT GGACGCTGTT CGACTACGCC ATCTGGGCCG CCGGCGCGAT CACGGTGCCG
ATCTACGAGA CCTCCAGCGC CGAACAGGCC GCCTGGATCC TGTCCGACTC GGGGGCGGTC
GCCATCCTGG TCGAGACGAG CGCCCACGCC ACGCTGGTTG CCGACGTGCG CGACCGGGTA
CCCGATCTGG CGCACGTGTG GCAGATCGAC CTCGGCGCGA TGGACGAGCT GATCGCCACC
GGCGAGTCGG TGGATCCGAC CGAGATCGAG CGGCGACGCG CGGCCGTTCG GGCCGACGAC
ATCGCCACCA TCGTCTACAC CAGCGGCACC ACTGGCCGCC CCAAGGGCTG CATGCTGACC
CACCGCAGCA TGTACGCCGA TGTCGCCAAC GCCGTGCCGG TGCTGCCGAA CCTGTTCGGC
CCAGGTGCCT CCACGCTGCT GTTCCTCCCC CTCGCGCACG TCTTCGCCCG GCTGATCCAG
GTCGGCGTGG TCCAGGCCCG CGCCACGATG GCCCACTGCG CGGACACCAA GGACCTGATC
GCCAGGCTGC AGGCCGTCCG CCCCACGTTC GTGCTCTCCG TCCCCCGGGT GTTCGAGAAG
GTCTACAACT CCGCGAAGCA GAAGGCCGAA GCCGACGGCA AGGGCCGGAT CTTCGCCCGC
GCCGAGGCGG TCGCCATCGC GTACAGCGAG GCCCTGGAGA CCCGGACCGG GCCGGGCCTG
GCGCTCCGTG TGCAGCACGC CCTCTTTGAT CGCCTGGTCT ACCGCAAGCT GCGGGCCGCA
CTTGGCGGGC GGTGCCGCGA CGCGATCTCG GGCGGCGCGC CGCTCGGCGC GCGGCTCGGG
CACTTCTTCC GCGGCGTGGG GGTGACCATC TATGAAGGGT ACGGGCTGAC CGAGACCTCT
CCCGCTGCCT GCGCCAACCG GCCCGGTGCG ATCCGAATCG GCAGCGTCGG ACGCCCACTG
CCCGGCGTGA ACATCCGGAT CGACGACGAT GACGAGATCC TCATCGCCGG TGAACTGGTC
TTCACGGGCT ACTGGCGCAA CGAGGCCGCC AGTGCGGAGG TACTCACCCC TGACGGCTGG
TTCCGCACCG GCGATTTGGG CCAGCTCGAC AGTGACGGCT ACCTGAACAT CACCGGCCGA
AAGAAGGAGA TCATCGTGAC CGCGGGCGGC AAGAACGTCG CCCCGGCGGT CCTCGAGGAC
CAGGTCCGGG CGCATCCTCT GGTCAGCCAG TGCGTGGTGG TCGGTGACCG ACAGCCCTTT
GTCGCCGCGC TGGTCACCGT GGACGAGGAG GCGCTGCCGG CGTGGCTGGA GAACGCCGGC
CTACCCGCGG CCACCCCAAT CGAGGAGCTC TACCAGCATG AAGGGCTGCG CTCCGAGATC
CAGACCGCGA TCGACACCGC CAACCGCGCC GTGTCCAGGG CCGAGGCCAT CAAGGTCTTC
CGGATCCTCC CCCGGGACTT CACGGAGGCG ACCGGTGAGC TGACTCCTTC ACTCAAGGTC
AAACGACAAA TCGTGCACAA ATCGTACGCC ACGGAGATCG CCGATATCTA CCGGAGCTGA
 
Protein sequence
MREFSVPPIV TVGDSANLTD PVWDNAEAAP DVVQFIREGA DGARVEVTCH QFRDEVTAVA 
RGLVAAGVQP GDRVGLMSRT RYEWTLFDYA IWAAGAITVP IYETSSAEQA AWILSDSGAV
AILVETSAHA TLVADVRDRV PDLAHVWQID LGAMDELIAT GESVDPTEIE RRRAAVRADD
IATIVYTSGT TGRPKGCMLT HRSMYADVAN AVPVLPNLFG PGASTLLFLP LAHVFARLIQ
VGVVQARATM AHCADTKDLI ARLQAVRPTF VLSVPRVFEK VYNSAKQKAE ADGKGRIFAR
AEAVAIAYSE ALETRTGPGL ALRVQHALFD RLVYRKLRAA LGGRCRDAIS GGAPLGARLG
HFFRGVGVTI YEGYGLTETS PAACANRPGA IRIGSVGRPL PGVNIRIDDD DEILIAGELV
FTGYWRNEAA SAEVLTPDGW FRTGDLGQLD SDGYLNITGR KKEIIVTAGG KNVAPAVLED
QVRAHPLVSQ CVVVGDRQPF VAALVTVDEE ALPAWLENAG LPAATPIEEL YQHEGLRSEI
QTAIDTANRA VSRAEAIKVF RILPRDFTEA TGELTPSLKV KRQIVHKSYA TEIADIYRS
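
As a rough illustration of how the two listings relate, the sketch below translates a gene sequence into protein and computes GC content. It is a minimal example under stated assumptions, not the database's own pipeline: the codon table is built from the standard amino acid ordering (which translation table 11 shares for internal codons), only ATG, GTG and TTG are treated as initiators read as M (a simplification of table 11), and the helper names `translate_cds` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listing.
first_blocks = "GTGCGCGAGT TCTCCGTCCC ACCGATCGTC"
print(translate_cds(first_blocks))  # MREFSVPPIV
```

The printed peptide matches the first ten residues of the protein sequence, including the GTG start codon being read as M. Passing the full gene sequence to the same functions would allow the remaining residues and the GC content field to be compared in the same way.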