Gene Sare_4895 details


Gene Information

Locus tag: Sare_4895
Symbol:
ID: 5707547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5559095
End bp: 5562202
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 74%
IMG OID: 641274290
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001539635
Protein GI: 159040382
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
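
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5559095
end_bp = 5562202
reported_gene_length = 3108      # bp
reported_protein_length = 1035   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 3108 bp and 1035 aa.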


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00652278
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGGCGA CGCCCCTCGA CACCGCTATC GACCCGGACC CGACGCCCAC CACGGCGGTC 
GGCGCCACCG TGCCGGACCT CGTCGCCGCC GTGGCGCGAC GGCAGCCGGA CGCGGTCGCG
GTTGCCGGCA GTGACGGGGC CACGCTCACC TACCGGGAGC TGACCGAGCA GGCCGAGGCC
CTGGCCCACC GGCTCGTCAC CTGGGGCGTC CACCCGGACG AGCCGGTGGC GGTGGCGCTG
CCCCGCTCGG TGGAGCTGGT CGTCACCCTT CTCGCCGTAC TCAAGGCCGG CGGTGGGTAT
CTGCCTCTCG ACCCGGCTGA TCCGCCAGCG CGGACGCGGC AGCTGCTCGC CGTGGCCGGG
GATCCGCCGG TGCTGTCCAC AGGCGAGGTG CCTGGCGCCA CGCGGCTGTT CCGTCTGGAC
CAGCCCGGCC CCACCGCGCC CACCGGTGCC GTGCCGCGGC GACTCCACCC CGCCGGTCTC
GCATACGTCA ACTTCACCTC CGGCTCCACC GGTACGCCGA AGGGGGTTGC CGTTGCCCAC
TCTGCCGTGG TCCGCCTGAT TCACCAGCCT GGCTACCTGC GACTGGGGCC GACCGAGACG
GTTCTCCAAC TCGCTCCGGC CGCCTTCGAC GCGGCGACCC TGGAGATCTG GGGTGCCCTG
GCCACCGGGG CCCGGCTCGT GCTGGCCCCA CCCGGCGCGC TGGACCTCGC CGACCTGGCC
CGGCTGCTCC GCCGGGAACG GATCACAGTC CTCTGGCTGA CCGCCGGGCT CTTTCACCAG
CTGGTGGAGT TCGACCCGGA CTGCCTCGCC GGGGTCGGGC AACTCCTGGC CGGCGGGGAC
GTGCTCGGCC CGGACGCGGT CCGCCGGGCA CTGCGGGCGC GTGACGGAGC GGTTCTGATC
AACGGGTACG GCCCGACCGA GAACACCACC TTCACCTGCG TGCACCCGAT GACCGACCCG
GCGGCCGTAC CGGACCCGGT GCCGATCGGT CGGCCGGTGC CAGGGAGCAC CGTGTACGTC
CTCGACCCGG CGGGACGACA CGTGCCCGTC GGGGTTCCCG GAGAGCTGTA CACCGGCGGT
GCCGGGGTCG CCCGGGGATA CCTCGGGCGT CCGGGGGCCA CGGCGGCGGT ATTCCTCCCC
GACCCGTTCG ACCCCCGCCC GGGATCCCGG ATGTACCGCA CCGGGGACCG AGTCCGCTGG
CGGCCGGACG GCACCCTCGA CTTCCTCGGC CGGATCGACG AACAGGTGAA GATCCGCGGG
TTCCGGGTCG AGCCGGGCGA GGTCGCGGCG GTGCTGCGGG CCCACCCCGC CGTCGGGGAC
ACTGCCGTCC TGGTCGACGG GGAGGGAGAA CGGCGTCGGC TGCTGGCATA CCTGACGCCC
CGGCCGGGCG CGTCGGCACC GACCCCGCAG GAACTGGCCG GCTACGCAGC CGACCGGCTC
CCGGCCCACC TCCGACCGGC GGCCTTCCTG ATCCTGTCCA CTCTGCCCCT GACCCGTAGC
GGCAAGATCG ACCGGCGGGC ACTGCCCCGA CCGGAACCGC CCGCAGCTCG GCCCGTCACC
GCCGTGACCG ATCCGATCCA GGTGCGACTG GCGGCCCTCT GGGCCGAGCT GCTCGGTAGC
GCACCCTCCA CCCCGGACGA CGACTTCTTC GCGCTGGGCG GGAACTCGCT GCTCGCCACC
CGGCTGACCT TCGTCGTCGC CGACCGGTTC GGGGTCGACC TGCCGGTACG TGTCGTCTAC
GAGCACCCAA CGCTCGCCCG GCTCGCCGGC GTCCTCGGTG AGCACCCCGA AGCGCGGCCG
GCGAGCACCG GGGTGACCCG TCGAGACCGG GCTGGCTACC GGTCGCCTGC GACCACGGAC
GCCATGGCCC GCCAACCGGC CCGTTCGGGC GACGAACCCG GCTCACGGTG GCACTGGGCC
CCCACCCGGG CCCCGGCCGC AGACCCGGGT CGCTCCGAAC CCGACGGCAC CCTCTCCGCG
GACCTCGCCG AGAGCCTCCG GCCGGCGCTC AGCCTGCTGT TGGACAGCGT CAGCTGGTTC
ACCTCCGCCG CGTTGGCGCT GGTTCGGCGG ACCGCCGTGG AGCGCTACCG CGACCTGGCG
GCCCGCACCG GATCGCCGAC GGTCGCCTTC GTCGACTTCT GGCACGGGAA CAGCGACCTG
TTCGTGGATC CGCCGGTGAA GCTGCTCACC CCACTCGTGC GGGGGTTGCA GGACCGCTGG
GCGCGGATCC TGCCCGACGC GGTCGGCCAC CGGGTCACCG CCACCTCGGC CGAGCTGGGG
GACCGGGTGG CCGAGGCGTT TGCCGCACCC CGACCCGGCT GGATCGGCGC CCACCAGCAC
AGCCTGGACG TATTTCTGGC CGCCGACGGG CCGGAGGCGG TCGCCCGGGG AGACATCCAG
GGGGTGATCG GCGGGGTGCA CCCGGGGTTG ACCACCCTGC GCTCGGTGCT CCTCGTCACC
GGTCGGGATG CCCGGCTGGT CTTTGGTCGC GACAACTGTG GCCCCGAGCC GGCCACGACG
CTGCCGGTCG GCGACTGTGT TCTCGGCCCG GTCGACGGCG TGTTGACCGT CAGCAGCCGG
GGCGGACGCG ACCGGCTACC GCTGACCGAG GTCCTCAAAG AGCCACTGCT GCGGCATCTG
GCGCGGCGTT TCGACATCCG CCGCCCAGCC GAGCATCAGC CCCGGATCAC CATCGACCGA
GTGGTGCTGG CCCGCGAGAG CTGGCGTTTC ACCGTCACCG GGCTGGGCTT CGCCACGCTG
ACCGGCCAGG GCGACCGGTT CCGTCGCGTC CAGCGGTGGC AGCACGCACA CGGGCTGCCC
CGGCATCTGT TCGGGTGGAC CCCGATGGAG GAGAGGCCGT TCTCCCTCGA CCTCACCAGC
CCCGCCTCGG TTGACGTCCT GGCCGGTGCC CTGCGGCGCA CCGCCGACCA CGACCCGGCC
GCCACGCTGC GTTTCAGTGA GCCTCTGCCA GGACCGGAGC ACGCCTGGCT CACCGATGGG
CAGGGCCGGC AGCGCACGGT GCCGCTGCGC CTCGTCGCCG TCGACACTCG CACCCCGGAC
GCCGACCGGC GGACGGACCG ACGACACCGC AGTCAGGCGG AGGTATGA
 
Protein sequence
MRATPLDTAI DPDPTPTTAV GATVPDLVAA VARRQPDAVA VAGSDGATLT YRELTEQAEA 
LAHRLVTWGV HPDEPVAVAL PRSVELVVTL LAVLKAGGGY LPLDPADPPA RTRQLLAVAG
DPPVLSTGEV PGATRLFRLD QPGPTAPTGA VPRRLHPAGL AYVNFTSGST GTPKGVAVAH
SAVVRLIHQP GYLRLGPTET VLQLAPAAFD AATLEIWGAL ATGARLVLAP PGALDLADLA
RLLRRERITV LWLTAGLFHQ LVEFDPDCLA GVGQLLAGGD VLGPDAVRRA LRARDGAVLI
NGYGPTENTT FTCVHPMTDP AAVPDPVPIG RPVPGSTVYV LDPAGRHVPV GVPGELYTGG
AGVARGYLGR PGATAAVFLP DPFDPRPGSR MYRTGDRVRW RPDGTLDFLG RIDEQVKIRG
FRVEPGEVAA VLRAHPAVGD TAVLVDGEGE RRRLLAYLTP RPGASAPTPQ ELAGYAADRL
PAHLRPAAFL ILSTLPLTRS GKIDRRALPR PEPPAARPVT AVTDPIQVRL AALWAELLGS
APSTPDDDFF ALGGNSLLAT RLTFVVADRF GVDLPVRVVY EHPTLARLAG VLGEHPEARP
ASTGVTRRDR AGYRSPATTD AMARQPARSG DEPGSRWHWA PTRAPAADPG RSEPDGTLSA
DLAESLRPAL SLLLDSVSWF TSAALALVRR TAVERYRDLA ARTGSPTVAF VDFWHGNSDL
FVDPPVKLLT PLVRGLQDRW ARILPDAVGH RVTATSAELG DRVAEAFAAP RPGWIGAHQH
SLDVFLAADG PEAVARGDIQ GVIGGVHPGL TTLRSVLLVT GRDARLVFGR DNCGPEPATT
LPVGDCVLGP VDGVLTVSSR GGRDRLPLTE VLKEPLLRHL ARRFDIRRPA EHQPRITIDR
VVLARESWRF TVTGLGFATL TGQGDRFRRV QRWQHAHGLP RHLFGWTPME ERPFSLDLTS
PASVDVLAGA LRRTADHDPA ATLRFSEPLP GPEHAWLTDG QGRQRTVPLR LVAVDTRTPD
ADRRTDRRHR SQAEV
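
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a hedged illustration of that relationship rather than the method used to produce this record: it builds the codon table from the conventional TCAG ordering, treats an alternative start codon such as GTG as methionine at the first position, and checks only the first and last blocks of the gene sequence. The function name `translate_cds` and the constants are hypothetical.

```python
from itertools import product

# Codon table in conventional TCAG order (amino acid assignments for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons of table 11; at position 0 these are read as methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3     # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                     # initiator codon is read as Met
        if aa == "*":
            break                        # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 48 bases of the gene sequence, copied from the record.
head = "GTGCGGGCGA CGCCCCTCGA CACCGCTATC GACCCGGACC CGACGCCCAC CACGGCGGTC"
tail = "GCCGACCGGC GGACGGACCG ACGACACCGC AGTCAGGCGG AGGTATGA"

print(translate_cds(head))
print(translate_cds(tail))
```

The tail fragment is in frame because the 3060 bases before it are a multiple of three. Translating the full sequence the same way would be expected to give the complete 1035-residue protein, although only the two end fragments are checked here.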