Gene Sare_4933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4933 
Symbol 
ID5707080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5599143 
End bp5600810 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content71% 
IMG OID641274329 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001539671 
Protein GI159040418 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.63836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGA CCGCGGTTCA CCCGACCCGT ACGGGAACCG TTCCCTGGCC CGCCGAGGTA 
GCCACGCGCT ACCTCGCCGA GGGTTACTGG GCGGGCCGAC CCCTGGGGGC GTACCTCACC
GCTGCCGCTC GCGCGAACCC GGCCGCGATC GCACTGGTCG ACGGCGATCT GCGGCTGAGC
TACCGGGAGC TGATGTCCCG CGCCGACGGC GCTGCGGCCC GGCTCGTCGA GCGTGGCGTC
AGCGGCGACG ACCGGGTCGT GGTGCAGTTG CCGAACTGCT GGGAGCACAT CGTGCTCACC
GTGGCCTGCC TGCGACTCGG CGCCGTCCCG GTGTGGGCTC TGCCGGAGCA CCGACTGCGC
GAAATCACCG GCGTCGCCGC CCGCGCCGAG GCCCGGGTAC TGGTAGTACC CGCCCGGCAT
CGCGAGTTCG ACCACAGGGC GATGGCGCAC GAGGTGGCCG CCACCGTACC CAGCATCGAG
CACGTCCTGG TGACCGGATC CGCAGATCCA GGTGAGGATC TGGGCAGGCT GTGTGAGCCG
GCAGCAGACC CCGCCGCGCT GTCCGCACGC TTCGACGCGG CTGCGCCGGA TGCTACCGCG
GTGGCGACGT TCCTGCTCTC CGGCGGCACC ACCGGCACGC CCAAACTCGT GCCCCGAACG
CACAACGACC TCGCCTACAT GGTCGGCGAG GCAGCCCGAC TGTGCGAGTT CGGTCCGGAC
ACGGCGTATC TCGCGGCTCT TCCGCTCGGC CACGGATTCC CGTACACCGG TCCGGGTGTT
CTGGGCGCGC TGATGTCCGG CGGCCGGGTG GTCATCGCCG CCTCCCCCGC CCCCGGGCCG
GCGTTGGCGA CGATCGAACG CGAGCGGGTC ACCGCGACGT CGATCGTCCC GGCGATCGCG
CTTCGTTGGT TGGCGCACCA CGCGGCCCAC CCCGGCCGGG ACCTGGGTTC CCTGCGTCTG
GTGCAGATAG GGGCGGCACG TCTGGAGCCC GACGCCGCGG CCCGGATCGA GCCCGAGCTG
GGGGGACGGC TGCAGCAGGT GTTCGGGATG GGGGAGGGTC TGCTCTGCCT GACCCGCCTG
GACGATCCGC CGGCAGTCGT GCACCACACC CAGGGCCGGC CGATCAGCCC CGCCGACGAG
GTTCTCATCG TCGACGACGA GGACCAGCCG GTGCGGCCGG GGGAGGCGGG GGCGCTACTC
ACCCGCGGCC CGTACACCCT TCGCGGCTAC TACCGCTCGC CCGAGATAGA CGCGGCGTCC
TTCCTGGCTG ATGGTTGGTA CCGAACCGGC GACATCGTCC GCCAGACGCC GGACGGGAAC
CTGGTGGTCA CCGGCCGCGA GAAAGATCTG ATCAACCGTG GTGGTGAGAA GGTCAGCGCC
GTCGAGGTCG AGGGTTTCGC GCTCGCTCTC GACGGGGTCA CCCAGGCAGC CGCCATGGCG
ATGTCGGACG CCGAACTCGG TGAACGCGTA TGCCTGTTCG TCGTCCCCGC GGGTGGGGCG
CGGGTGGACC TGGCAGACGT GCGTGCCTCG ATGCTCGACC GCGGCGTCGC GGCGTTCAAG
CTGCCGGACC GACTGGTCAG CGTGGACGCG CTGCCGATGA CACCACTCGG CAAAATCGAC
AAAAAGGCAT TGCGGGACCA GATCCCGACG CACCTGGACA CTGCCTGA
 
Protein sequence
MPPTAVHPTR TGTVPWPAEV ATRYLAEGYW AGRPLGAYLT AAARANPAAI ALVDGDLRLS 
YRELMSRADG AAARLVERGV SGDDRVVVQL PNCWEHIVLT VACLRLGAVP VWALPEHRLR
EITGVAARAE ARVLVVPARH REFDHRAMAH EVAATVPSIE HVLVTGSADP GEDLGRLCEP
AADPAALSAR FDAAAPDATA VATFLLSGGT TGTPKLVPRT HNDLAYMVGE AARLCEFGPD
TAYLAALPLG HGFPYTGPGV LGALMSGGRV VIAASPAPGP ALATIERERV TATSIVPAIA
LRWLAHHAAH PGRDLGSLRL VQIGAARLEP DAAARIEPEL GGRLQQVFGM GEGLLCLTRL
DDPPAVVHHT QGRPISPADE VLIVDDEDQP VRPGEAGALL TRGPYTLRGY YRSPEIDAAS
FLADGWYRTG DIVRQTPDGN LVVTGREKDL INRGGEKVSA VEVEGFALAL DGVTQAAAMA
MSDAELGERV CLFVVPAGGA RVDLADVRAS MLDRGVAAFK LPDRLVSVDA LPMTPLGKID
KKALRDQIPT HLDTA