Gene Strop_4417 details


Gene Information

Locus tag: Strop_4417
Symbol:
ID: 5060903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 5003226
End bp: 5006540
Gene Length: 3315 bp
Protein Length: 1104 aa
Translation table: 11
GC content: 71%
IMG OID: 640476680
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001161223
Protein GI: 145596926
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
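
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5003226, 5006540)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths agree with the listed 3315 bp and 1104 aa.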


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.543215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCG CTGACTCCAC CACACCCGGC AGCGGCGGCA AGCGTGCCCT GCTGGCCCGC 
CTTCTCACCG AACGGGCCCA GTCGCCCCGC AGCTATCCGG TCTCCTTCGG CCAACAGCGG
CTGTGGTTCC TGGACCGTTT CTCCGGGGGC ATCCCGGTCT ACAACATCCC GGTCGCCTTC
CGGGTGCACG GGCCACTGGA CGTCGCAGCG CTGCGTACCG CACTGACCAC GCTCGTCAAC
CGGCATCCGG CGCTGCGCAC CACGTTCAGC GAATCCGGCG GCGAACCAGT GCAGGTGGTA
CGCCCGACCG GCGAGGTGGA TCTCACCGAG ACCGACCTGA CCGGGTTGCC GGCGGAGCGG
CGCGAGGCCG AGGCCACCAC ACTCGTCCGA GCGCACTCCG GGCAACCGTT CGACCTGGCC
GGAGGCCCGT TGCTGCGGGT AACAGTGATC CGGACCGACA CGGATCGGTA CCACGTGGCG
TTCTGCGTGC ACCACATCGT CTCCGATGCC TGGTCGATCG GGGTGCTCTT CAAGGAGTTG
GCGACCGCGT ACCCGGCTGC GCTGGCGGGT GCCCCGGCCC ACCTGCCCGA CCTGCCCACG
CAGTACGCCG ACTTCTCCAT CTGGCAGCGG GACCGGCTCG CCGGTGACGC GCTGCGTCGA
CAGCTCGACC ACTGGGTTTC GCACCTGCGG GGGGCACCGG CCCTGCTCAG CCTTCCCACC
GATCGGCCCC GCCCGGCCAA CCGCTCGTAT CGGGGCGCCT GCCACTACGT CACCGTCCCG
GCGGCGGCCG TTCGACGTGT CGAGGAGTTC AACCAGGACG CCGGGGTCAC CATGTTCATG
ACCCTGCTCG CCGTGTACCA GGCGGTGTTG TCCCGACACA GCGGGCAGGA CGACATCGTG
ATCGGCGCCC CTGTTGCCGG CCGGGAGCAT CCCGACCTGG AGCGGCTGGT CGGCTTCTTC
GTCAACACAC TTGCCCTGCG GGTGTCGTCC GCTGGCTCCC CATCGTTGCG GCAACTCGTC
AACCGGGTAC GGGAGGTCAC CCTCGTCGGT CTCGGCAACG CCGAGGTGCC GTTCGAGAAG
GTGGTTGAGG AACTGCAGCC GGAGCGCAGT CTGGCCCACG CACCGATCTT CCAGGCCCAG
CTGATCCTGC AGAACGCACC GCAGAACGCC CTTCGTCTCG CCGGCTGCAC GGCCACCTCC
CTGCAGGTCG ACAGCGGCAC CGCCAAGTTC GACCTCACCC TCGCCGGTGA GCTGACCGCC
GAGGGTGCCC TGCGGTTGGC CTTCGAGTAC GACACCGAAC TGTTCGACGC CGGCACGGTC
GACCGGTTGG CCCGGCATCT GTGCACCCTG CTGGACGCGG CGGTCGCCGA GCCGGATCGC
CCGCTGACGC GGCTTCCGCT GCTCAGTGGA GTGGAGCGGT GGCGTGCCGT GGTCGAGTGG
AATCAGACCG ACCGGGGCAC GTTGCCGGTC GGCACCATCC TGGACCTGCT ACCCACTGAA
CCCTCGGAGT CCGGCGCCCC GCCTGCCGTC ACCGGTCCGG ACGGGCACCT GGACCGAGCC
GGTCTGCACC GGCGGGCCGG ACAGATCGCC CGGCGGCTAG TCGCCGCCGG TGTCGCCCCG
GACACCCCGG TCGGCATCTG CCTGGATCGC GGGGTCGACA TGGTCGCCGC GGTGCTCGGC
GTATGGCGGG CCGGGGCCGG TTACCTACCG CTCGACCCCA CCCTGCCCCC CGAGCGGCTG
CGCCACCTGC TCGTCGACTC CGGCACCCGG GTCGTGCTGA CCCATCAGGC GGTTGCCGCG
CGGCTCGGGC CGGTGCTGGC GGGCTCGGTG ACGGTGCTGC TCGACGATGC CACCGACGCC
GCCGGCCCAG ATGAGCCACT TCCGGCGGTC CCGGCGCATC CGGACGGACT GGCATACCTG
ATCTACACCT CGGGTTCGAC CGGTCAACCG AAAGGGGTGG CGGTCCCACA CCGCAGTGTG
ACCAACCTCG TTGCCTCCTT CCACGACGAC CTGGACCTGA CGTCCGAGGA CCGGTTCGCC
GCGGTCACCA CCCTGTCGTT CGACATCTCG GTGTTGGAAC TGCTGGTGCC GCTGCTGCTG
GACATCCCGC TGCTGGTCGT GGGTGCCGAC GAGGTCGGCG ACGGGCCGGC CCTGCGTCGT
CGGCTCACCG AAGCGGGGAT CACCGCCATG CAGGCCACGC CGGCGACCTG GCGACTGCTG
CTGGCATCCG GCGGCGTACC GCCGACGCTG CGGCTGCGCC TCTGCGGCGG TGAGGCGCTA
CCCCGGGACC TTGCCGACGC CCTGCAGGCC GACGGCGTAA CCCTGTGGAA CTGTTATGGG
CCCACCGAGA CCACCGTCTG GTCCGCGGCG GCCCCCGTGG CGCCTGCCCC GGCCGCGGTG
GACCTCGGTT CGCCGATCGC CAACACCCGG ATCTACCTGC TCGACGAGGC ATACCAGCCA
GTGCCGGTGG GCGTGGTGGG AGAAATCCAC ATCGGCGGCT CGGGTGTGGT CCGTGGATAC
CACGGCCGAC CCGGCCTGAC CGCCGGTCGG TTCGTCCCCG ACCCGTTCGC CGACGAGCCC
GGCGCCCGGC TCTACGCCAC TGGTGACCTG GCCCGGCAGC GCGCTGACGG CCGGCTGGAG
TTCCTCGGCC GCACCGACCA TCAGGTCAAG GTGCGCGGGT TCCGGATCGA GTTGGGCGAG
ATCGAAACCC TGCTACGGGG CCACGATCTG GTCGCGGACG CGGTGGTCGG CACCTGGGTC
GGCGGGGACG GCGACACCCG CCTGGTGGCG TACGCCGTGC CGGCGTCCGG CGTTGACCCG
GACGCCCTCG CCGGTCAGGT CCGTCCCCAC CTGTCCGGCC GACTGCCGGA GTACATGCTT
CCCGCGGCCC TGGTGCCGAT GACCGCGTTG CCTCTCAACG GCAACGGCAA GGTCGACCGG
AACGCCCTGC CCACCCCGAG GTGGACCGAC CCGCGGGCGG AGCTGGTCGC CCCCCGCGAC
CCCCTCGAGC AGCTACTCGC CGGGATCTGG CAGGAGGTTC TGCACGTCGA GAGGATCGGT
GTGCTCGACG ACTTCTTCCG CCTCGGTGGG CATTCACTAC TCGGCGCGCA GGCGTTGAGC
CGGATCGGCG CCGTGCTGGA GACGGAGGTA CCGATCCGAA TCCTCTTCGA GGCACCGACG
ATCGACGCGA TGGCCCGCGC GTTGCGCTCC ATGGAGGAGG TGGCCGGCCA GACCGACGCC
GTCGCCGCCC TTCGGATGGA GGTGGCCGAC CTCTCCGACG ACGAACTACG GGCCATGCTG
GGCGGTCAGG AGTGA
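
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any calculation. As a rough sketch of how the GC content figure could be recomputed from the sequence, assuming GC content means the fraction of G and C bases over the full sequence length (the helper names are hypothetical):

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used only as a short example.
first_line = "ATGACGACCG CTGACTCCAC CACACCCGGC AGCGGCGGCA AGCGTGCCCT GCTGGCCCGC"
print(f"{gc_fraction(first_line):.2%}")
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content listed in the gene information.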
 
Protein sequence
MTTADSTTPG SGGKRALLAR LLTERAQSPR SYPVSFGQQR LWFLDRFSGG IPVYNIPVAF 
RVHGPLDVAA LRTALTTLVN RHPALRTTFS ESGGEPVQVV RPTGEVDLTE TDLTGLPAER
REAEATTLVR AHSGQPFDLA GGPLLRVTVI RTDTDRYHVA FCVHHIVSDA WSIGVLFKEL
ATAYPAALAG APAHLPDLPT QYADFSIWQR DRLAGDALRR QLDHWVSHLR GAPALLSLPT
DRPRPANRSY RGACHYVTVP AAAVRRVEEF NQDAGVTMFM TLLAVYQAVL SRHSGQDDIV
IGAPVAGREH PDLERLVGFF VNTLALRVSS AGSPSLRQLV NRVREVTLVG LGNAEVPFEK
VVEELQPERS LAHAPIFQAQ LILQNAPQNA LRLAGCTATS LQVDSGTAKF DLTLAGELTA
EGALRLAFEY DTELFDAGTV DRLARHLCTL LDAAVAEPDR PLTRLPLLSG VERWRAVVEW
NQTDRGTLPV GTILDLLPTE PSESGAPPAV TGPDGHLDRA GLHRRAGQIA RRLVAAGVAP
DTPVGICLDR GVDMVAAVLG VWRAGAGYLP LDPTLPPERL RHLLVDSGTR VVLTHQAVAA
RLGPVLAGSV TVLLDDATDA AGPDEPLPAV PAHPDGLAYL IYTSGSTGQP KGVAVPHRSV
TNLVASFHDD LDLTSEDRFA AVTTLSFDIS VLELLVPLLL DIPLLVVGAD EVGDGPALRR
RLTEAGITAM QATPATWRLL LASGGVPPTL RLRLCGGEAL PRDLADALQA DGVTLWNCYG
PTETTVWSAA APVAPAPAAV DLGSPIANTR IYLLDEAYQP VPVGVVGEIH IGGSGVVRGY
HGRPGLTAGR FVPDPFADEP GARLYATGDL ARQRADGRLE FLGRTDHQVK VRGFRIELGE
IETLLRGHDL VADAVVGTWV GGDGDTRLVA YAVPASGVDP DALAGQVRPH LSGRLPEYML
PAALVPMTAL PLNGNGKVDR NALPTPRWTD PRAELVAPRD PLEQLLAGIW QEVLHVERIG
VLDDFFRLGG HSLLGAQALS RIGAVLETEV PIRILFEAPT IDAMARALRS MEEVAGQTDA
VAALRMEVAD LSDDELRAML GGQE