Gene Svir_20860 details

Gene Information

Locus tag: Svir_20860
Symbol:
ID: 8387410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharomonospora viridis DSM 43017
Kingdom: Bacteria
Replicon accession: NC_013159
Strand:
Start bp: 2228424
End bp: 2230079
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 70%
IMG OID: 644976144
Product: peptide arylation enzyme
Protein accession: YP_003133926
Protein GI: 257056094
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
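
The start, end, gene length, and protein length fields are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2228424, 2230079)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```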


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.651422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0115329
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGG GGCACGGTCT CGGACCGGAC CACGTCGCCT GGCCGCCGGA GGTGGCCGCC 
CACTACCGCG CGGCCGGATA CTGGCGGGGA CAGACGTTCG GCGCGCTACT GACGTCGTTG
ACGGCCGCCT ACGCCGATCG CACGGCGGTG GTGGGGGAGC GGAACGGAAC GGTGTCGCGG
CTGAGCTACG CCGAGCTCGA CGACGCCGCG CACCGGATCG CCGCGGGACT GGTCGACCTG
GGGATCAAGG CGGGCGACCG GGTGATCGTG CAGTTGCCGA ACATCCCGGA GTTCGTGTCG
GTGATCTTCG GTGTGTGGCG TGCCGGGGCG TGGCCGGTGT TCACGTTGCC CGCACACCGG
CACAGCGAAC TGCGGCACTT CGCCACACAG AGCGAGGCGG CGGCCATCAT CACCGTCGAC
CACCACAACC GTCATGACCA CGCCACGATG GCCAAGGCCG TCGCGGCCGA AGTGGACTCG
GTGCGACACG TGCTCGTGGT CGGCTCGCAG GAGTTCGCCG ACCTCGCCGC GACCCCGCCG
CGGGAGTTGC CCGACCCCGA TCCGGGGTCG GTGGCGTTCC TCCAACTGTC GGGCGGCAGC
ACCGGACTGC CGAAGCTGAT CCCCCGCACG CACGACGACT ACCTGTACAG CGTGCGGGAG
AGCGCTCGGA TCTGTGCGTT GCGCCCGGAG AGCGTGTACC TGGCGGTCCT GCCCGCCGCG
CACAACTTCC CGCTCAGTTC GCCCGGGGTG TTGGGGGCCC TGCACGCGGG GGCGACCACC
GTCATGTCGC CGAGCCCGGA CGCGGGCACG GCGTTCCGGC TGATCGAGTC GGAGAAGGTC
ACGATCACCG GTGTCGTACC GCCGCTGGCC GCGGCATGGG TGCAGGCCGC CGCGCACACC
GACCGTGATC TGTCCAGTTT GCAGGTCATC CTCGTCGGGG GAGCGAAGTG CAGTCGCGAA
CTCGCCGAGC GGATCGGTCC CGCGTTGGGC TGTCAGCTGC AGCAGGTCTT CGGCATGGCC
GAGGGATTGG TCTGCTACAC GCGGCTCGAC GACCCCGAAC AGATCGTGCT CAGCACCCAG
GGACGGCCGA TCTCCCCCGA CGACGAGATC CGGATCGTCG ATCCGGACGA TCCCGACGCG
TCCGCTGAGG ACGAGGTGGC GCCGGGAGAG GTGGGCGCCT TGTTGACGCG CGGCCCGTAC
ACCATCCGCG GTTACTACCG CGCTGCCGAA CACAACGCCA CGGCGTTCAC CCCGGACGGT
TTCTACCGGA CGGGGGACCT CGTACGGCGG CATCCGTCCG GGCACCTGGA GGTGGTGGGG
CGGGTCAAGG AGCAGATCAA CCGAGGCGGC GAGAAGGTGG CGGCCGAGGA GGTCGAGAAC
CACTTGATGG CTCATCCGGG GGTGCTCGAC GCCGCGGTCG TCGCGGTGCC GGACGAGTAC
CTGGGGGAGC GGACCTGCGC CTACGTCATC CCCACCGAGG GGTCCGAGTT GACGGGCGCG
GAACTGCGCC GCTTCGTCCG GGAACGCGGG GTCGCCGCGT TCAAGGTGCC GGACCTGGTC
ATGGTGGTGG ACTCGTTCCC GGTGACCGGG GTCGGCAAGA CGAGCAAGCG TGAACTCCGA
GCGGCGCTCG CGGCGCTCGC CAAGAACGGG GGATAG
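
The GC content reported in the gene information can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as text with its 10-base grouping and line breaks intact. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence; paste the full sequence to check the whole gene.
first_line = clean_sequence(
    "ATGAGTGAGG GGCACGGTCT CGGACCGGAC CACGTCGCCT GGCCGCCGGA GGTGGCCGCC"
)
print(len(first_line))          # 60
print(gc_fraction(first_line))  # 0.75 for these first 60 bases only
```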
 
Protein sequence
MSEGHGLGPD HVAWPPEVAA HYRAAGYWRG QTFGALLTSL TAAYADRTAV VGERNGTVSR 
LSYAELDDAA HRIAAGLVDL GIKAGDRVIV QLPNIPEFVS VIFGVWRAGA WPVFTLPAHR
HSELRHFATQ SEAAAIITVD HHNRHDHATM AKAVAAEVDS VRHVLVVGSQ EFADLAATPP
RELPDPDPGS VAFLQLSGGS TGLPKLIPRT HDDYLYSVRE SARICALRPE SVYLAVLPAA
HNFPLSSPGV LGALHAGATT VMSPSPDAGT AFRLIESEKV TITGVVPPLA AAWVQAAAHT
DRDLSSLQVI LVGGAKCSRE LAERIGPALG CQLQQVFGMA EGLVCYTRLD DPEQIVLSTQ
GRPISPDDEI RIVDPDDPDA SAEDEVAPGE VGALLTRGPY TIRGYYRAAE HNATAFTPDG
FYRTGDLVRR HPSGHLEVVG RVKEQINRGG EKVAAEEVEN HLMAHPGVLD AAVVAVPDEY
LGERTCAYVI PTEGSELTGA ELRRFVRERG VAAFKVPDLV MVVDSFPVTG VGKTSKRELR
AALAALAKNG G
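
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand, assuming the usual amino acid assignments of the standard genetic code, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. All names are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final line of the gene sequence.
print(translate("ATGAGTGAGG GGCACGGTCT CGGACCGGAC"))         # MSEGHGLGPD
print(translate("GCGGCGCTCG CGGCGCTCGC CAAGAACGGG GGATAG"))  # AALAALAKNGG*
```

The two outputs match the start and end of the protein sequence listed above, with the trailing `*` standing for the TAG stop codon.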