Gene Strop_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1023 
Symbol 
ID5057469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1153075 
End bp1154973 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content67% 
IMG OID640473292 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001157875 
Protein GI145593578 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG CATCGACACT GCACGGCGGC TTTGTCGCCC ACGCAGCCGC GAATCCGGAC 
ACGCTGGCCG TAGCGTCCGA CGCCGGCGTG ATGACCTACG GTCGGCTTGA CGAGACCTCG
GCGGCGCTGG CCGAGCGGCT GTCTGCCTTG GGTGCTGGCC CGGGTGTTCC GATCGGGGTC
TGTATCGAAC GCACGCCGGA CCTGCTCGTC GCTATCCTCG GCGTGCTGCG CGCGGGCGCC
TGCTATCTGC CGCTCGATCC TCAATATTCA GCGCGCCACC TCGGCTTCAT GGTGGCCGAC
AGCGGGACCC GCCTGGTCGT TACCACACGA TCCTCTCGGG ACGCGTGCCC GGACGGCTGC
ACCGCGCTCG TCCTGGAGGA ATCCGAGGCG ATAGCCGACC CGCCGCCAGT GGCCGCGGTT
CCGGACGATT CTGCCTACGT CATATATACC TCCGGCTCGA CCGGCACGCC CAAGGGGGTG
CCGATCCGGC ACAGCAGCTG CGCGGCGATG CTTGCCGAGG CGGACCGAAT TTTCGAGGGC
TGTGACATGA GCGGTATCGC CGCCGTCACC TCGGTCTGCT TCGACCTGTC AGTGCTGGAG
ATCTTCTCCG CCCTCAGCCG TGGCCGGACG CTCGTCCTGG TGAATAGTGC CAGCCACCTT
CCGGAGAGCT CCCATGTCGA ACGGGTGACG CACGTCAGCA CGGTCCCGTC CGCAATGACC
AGCCTGCTTG ACGCGCAAGC CGTTCCGGCC GGCCTGCGGA ACGTGGTGCT CGGCGGCGAA
CCCGTACGTC GGAGCCTGGT CGACCGGATC TACCGCGAGA CCAACGTCGA CTTCGTCTTC
AACGGATACG GCCCGACGGA AGGCACGGTC TTCTGTACCT TCAAGCCCGT ATCCCGCGAC
GAGGCCGGCG AGCCGTCGAT CGGTACGCCA TCCCTGACCG CTCGCGTCTA TGTGCTCGAC
GAGAAGCTGC GGCCGTCGGC CGTCGGCGAG TCGGGTGAGC TGTACCTCGG CGGTGCCGGA
CTTACCTGGG GCTACCTCAA CCGGCCCGGG CTGACTGCGG AACGGTTCGT ACCTGATCCG
CAGGTGGCGG GTGAACGCAT GTATCGCACC GGCGACATCG CTCGGCTCAA CGAAGCAGGT
GAAATCGAGT TCGTGGGACG CTCCGACCTT CAGGTGAAGG TCCGCGGGTA CCGCATCGAG
CTAGAAGAGG TCGAGGCACG ACTGACCGAA TGCCCCGAGG TGCGGACGGC TGCGGCCGTC
GTCCGTGAGC AGACGCCGGG TACGAGAGCC CTGACCGCGT ACGCGGTTCC GGCGAGTGGA
GCACCCGACG GCGACGGGCC CTGGCTCGAC GCCGACCTGC AGGCAACGAT CAAGCAACAG
CTCGGTGCGC TGTTGCCCGG TTACATGGTT CCCGAAACGA TCGTCTTCCT GCCCGCGCTC
CCGCTGTCGC CAGTTGGGAA GCTGGACCGC ACGGCGCTAC CGGCACCACC CGTTGTCGAT
GTGCTGCCCT CGGGGGACTC CGCCACCACC GACACCGAAC AAGCGCTTGC CGAGATCTGG
GGTGCACTGC TGGACCGGAC TCCGCAGTCC ATCGGCATCC GCGACACATT CTACGACCTC
GGCGGCAACT CTTTGTTGTT GGTGCGGCTC GCGAAGCGAA TGGGTCAGCG CTTTCACCGC
AAGGTCGGCG TGGCGGACCT GTTCCGGTTC CGCGACATCG GCTCGCTCGC CAAGTGGCTG
GACGACGAGA GCGGAAAGAG TCCTGAGGAC ATCGAGCAGG CACGACGCCG TGCCAGCACC
AGACGCTCGG TGGTGCGCGG CCACAGCAGA TCACCGAGCA CTCGAACCGA CCCGACCGTC
AAGAACACGC CCGCATCAAA TGGAGGCCCA CATCCATGA
 
Protein sequence
MTAASTLHGG FVAHAAANPD TLAVASDAGV MTYGRLDETS AALAERLSAL GAGPGVPIGV 
CIERTPDLLV AILGVLRAGA CYLPLDPQYS ARHLGFMVAD SGTRLVVTTR SSRDACPDGC
TALVLEESEA IADPPPVAAV PDDSAYVIYT SGSTGTPKGV PIRHSSCAAM LAEADRIFEG
CDMSGIAAVT SVCFDLSVLE IFSALSRGRT LVLVNSASHL PESSHVERVT HVSTVPSAMT
SLLDAQAVPA GLRNVVLGGE PVRRSLVDRI YRETNVDFVF NGYGPTEGTV FCTFKPVSRD
EAGEPSIGTP SLTARVYVLD EKLRPSAVGE SGELYLGGAG LTWGYLNRPG LTAERFVPDP
QVAGERMYRT GDIARLNEAG EIEFVGRSDL QVKVRGYRIE LEEVEARLTE CPEVRTAAAV
VREQTPGTRA LTAYAVPASG APDGDGPWLD ADLQATIKQQ LGALLPGYMV PETIVFLPAL
PLSPVGKLDR TALPAPPVVD VLPSGDSATT DTEQALAEIW GALLDRTPQS IGIRDTFYDL
GGNSLLLVRL AKRMGQRFHR KVGVADLFRF RDIGSLAKWL DDESGKSPED IEQARRRAST
RRSVVRGHSR SPSTRTDPTV KNTPASNGGP HP