Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3345 |
Symbol | |
ID | 8666633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3664248 |
End bp | 3667427 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | non-ribosomal peptide synthetase |
Protein accession | YP_003339027 |
Protein GI | 271964831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTGA CGTCCGAGAT CGACTCCCCA CCCGTCCCGG CCTCGCTCGC CCAGCAGGGA ATCTGGTTCA ACGAACGGCT GGGCGGTGCC GGGACCGTCT ACACGATGCC CTTCTCCGTC ACGTTCGACG GCCCGCTCGA CGTGCCCGCC TTGACCGCCG CCTGCCGGGC GCTGATCGAG CGGCACCCGA TCCTGGCGAG CACGGTCCGG GAACGGCAGG GCGTGCCGTA CGTCGTGCCC GCCGCCACCC CGCCCGCGCC GGTGGTCGCC GAGGTCACGG CCGCCCGGCG TGACGACCTG ATGAGGGCGG AGATCCTGCG CCCCTTCGAC CTGGCCGCCG GTCCGCTCGT GCGGATGACC CTCTACGTCG AGGAGGCGGG CCGGGCGACG CTGCTGGTCG TGGCGCACCA CCTCGTCTTC GACGGCGAGT CCACCTCGGT GTTCCTGCGG GACCTGGCCG AGCTCTACCG GGCCGGGGTG ACCGGCACCC CCGCCGACCT GCCCGCGCTG GACCACGACG GGCTCGCCGA GCGGGCCGCG GCCAGGGTCG AGGCCGGCCT CTCTTTCGCC CGGGAGTTCT GGAGCTCGCG GTGGCGCCCG CCCGCCGAGG TGATCCTGCC CGGCCTGGCC GGACCGGTGC CCGCGGTGGA CGAGGGGGCG GCCGTGGAGT TCGCGCTGCC GCCGGAGTCC CGGGAGGCGC TGGCCCGGCT GGCGGAGGAG ATCGGGGCCG GCAGGTTCGA GATCGTGCTC GCCTCGCTGC ACGTGCTGCT CCACCGGTAC GGCAACGCCG AGCCGACGGT CGCGGTCGAT CTGGGCACCC GCTCGCCGGA GACCCGCGAC CACCTGGGCG CCTTCGTCAA CGAGCTGCCC GTCACCGCCG GGCTCCGGCC CGAATGGGGC TTCCGCCGGT TCGTGGCCGA CCAGCGCTTC GGGTACGGGC TCCGCTCCGA CCTGCGCGGC CTCTTCCGGG CCCGGGAGGT ACCGCTGTCG CGCGCGGTCA GCGGGGTCAG GCCCGGTGTC GCGCTCGCCC CGATCTCGCT CGGCTACCGC AGGCGCGAGG CCGCGCCCGC CTTCCACGGG GTCGACGCCC ACGTGGAGTG GGTGCTGTTC AACCACACCG TCCGCGGCGC CATGCGCGTG CACATCGTGG ACGGGCCCGG CCGCTTCGGT GTCGTCCTCC AGTACAACCC GCAGATCATG GCCCGCGAGG ACGCCGAGCG GGTGGCCGCC CACTGGCGCG CGCTGCTCGA CGCGGTGGCC GCCGACCCCG ACATGCCGCT GGCCGAGCTG CCGATGCTCG ACGCGGAGGA GACCGGGCGG CTGCTGTCGC GGTGGAACGA CACCGCCGCC GGCCATCCGC CGCTCACCCT TCCCGAGCTG GTGGCCGCCC AGGCCGGCCG TACCCCGGAC GCGACCGCGG CCGTGTGCGG CGCGCAGACG ATGACCTACG CCGAGCTCGG CGCGGCCGTG GACGACCTGG CCCGGCGGCT GCGCGGCGCC GGCGTGGGAC GGGGGACGCT GGTCGCGGTC TGCGCCGAGC GCTCGCTTGC CACCCTGGTC GGCCTGCTCG CCGTGGCGCG CGCCGGCGGG GCCTACCTCC CGCTCGACCC GGACCATCCC GCCGAGCGCC TGCGCCTGGT CCTGGAGGAC TCCGGGGCCG CCCTGATCCT GGCCGGCTCC GGCCGGCACG ACCGCCTGGC CGGCTCCGGC GTGGCGGTCA TCTCGCTCGA CGCCCCCGGC CCGCGGTCCG GGGAAGGCGA TCGGGGGCCG GGGGAGGGCG GTTCCGGCCT GGAGCGGTCC GGGGAAGGCG GGTCCGGCGC GGGAGGTCGG GGGGAGGGCG GGCTCGCCTG GCCGGAGCTC GGCGACCTCG CCTACGTCAT CTACACCTCC GGCTCCACCG GCCGCCCCAA GGGCGTCGAG ATCCCGCACC GGGCGCTGAC CAACCTGCTG CTGGCCATGC GTGACCGGCT CGGTTCCCAG CCGGGGGACG GCTGGCTCGC CCACACCTCG CTGTCGTTCG ACATCTCGGC GCTGGAGCTC TACCTGCCGC TGGTGACCGG GGGACGGGTC GTCATCGCCC CGGACGCCGC GGCCAGGGAC GGGCACGAGC TCGTACGGCT GGCCGCCGAG GGGGTGAGCC ACGTGCAGGC GACCCCGTCC GGCTGGCGGA TGCTGCTCGA CGCCGGGTTC GACCTGCCCC GCGTGACCGC CCTGGCCGGC GGTGAGGCTC TCCCCGCCCC CCTGGCCCGC GAGATCCTCA GCCGGGCCGG CCGTCTGATC AACGTCTACG GCCCCACCGA GACGACGATC TGGTCGATGA GCGCGGAGAT CGCCGAGCCC GTCACGACCG TGCCGATCGG CGTTCCGCTG GCCAACACCC GGGTGCACGT GCTGGACGAG CGGCTCGGGC TGCTGCCGCT GGGCGTACCC GGCGAGCTGT GCATCGCCGG TGACGGCGTC GCCGACGGCT ACCACAACCG CCCCGAGCTG ACCGCCGAGC GGTTCGCCGG CGACCCGTTC GGGCCCGGCC GGCTGTACCG CACCGGTGAC CGGGTGGTCA GAAGGGCCGA CGGGCAGATC GAGTTCATCG GACGCCTCGA CGACCAGATC AAGCTCCGCG GGCACCGGAT CGAACCGGGG GAGATCGAGT CCAGGCTGCT CGAACACCCC GGCATTCCCC GCGCGGCCGT GGTCGCCCGG GAGGACGACA AGGGGGAGCG GCGGATCGTC GCCTACCTGG AGTGCGGGCA GGTCCCCGGC GACGTGCGCG AGCACTGCGC CGGGACGCTG CCCTCCTACA TGATCCCGGC CGACTTCGTC GGGCTGCCCC GGCTGCCGCT GACCCCCAAC GGCAAGCTCG ACCGGTCCGC GCTGCCCGCG CCCGGCCCCC GGGAGAGCGA GGCAGGCCCT CACGAGAGCG CGGCCGGCGT GGCCGGGCCG CACTCCTACA GCGGGGTGGC GGCCGAGCTC CACGAGATCT GGTGCGACGT GCTGGGGCTT GAGGCCGTCG GGCCGCAGGA GGACCTGTTC GAGCTGGGCG GCCACTCGCT GACCATCACC CAGATCGCCT CCCGGGTCCG CCTGCGCCTG GGCGCGGACC TGCCGCTCCA CATCTACTAC GACGAGCCCA CGATCAGCGC CGTCGCCGCC GCCGTCGAGC GCCTGAACGG GAAGAACTGA
|
Protein sequence | MSVTSEIDSP PVPASLAQQG IWFNERLGGA GTVYTMPFSV TFDGPLDVPA LTAACRALIE RHPILASTVR ERQGVPYVVP AATPPAPVVA EVTAARRDDL MRAEILRPFD LAAGPLVRMT LYVEEAGRAT LLVVAHHLVF DGESTSVFLR DLAELYRAGV TGTPADLPAL DHDGLAERAA ARVEAGLSFA REFWSSRWRP PAEVILPGLA GPVPAVDEGA AVEFALPPES REALARLAEE IGAGRFEIVL ASLHVLLHRY GNAEPTVAVD LGTRSPETRD HLGAFVNELP VTAGLRPEWG FRRFVADQRF GYGLRSDLRG LFRAREVPLS RAVSGVRPGV ALAPISLGYR RREAAPAFHG VDAHVEWVLF NHTVRGAMRV HIVDGPGRFG VVLQYNPQIM AREDAERVAA HWRALLDAVA ADPDMPLAEL PMLDAEETGR LLSRWNDTAA GHPPLTLPEL VAAQAGRTPD ATAAVCGAQT MTYAELGAAV DDLARRLRGA GVGRGTLVAV CAERSLATLV GLLAVARAGG AYLPLDPDHP AERLRLVLED SGAALILAGS GRHDRLAGSG VAVISLDAPG PRSGEGDRGP GEGGSGLERS GEGGSGAGGR GEGGLAWPEL GDLAYVIYTS GSTGRPKGVE IPHRALTNLL LAMRDRLGSQ PGDGWLAHTS LSFDISALEL YLPLVTGGRV VIAPDAAARD GHELVRLAAE GVSHVQATPS GWRMLLDAGF DLPRVTALAG GEALPAPLAR EILSRAGRLI NVYGPTETTI WSMSAEIAEP VTTVPIGVPL ANTRVHVLDE RLGLLPLGVP GELCIAGDGV ADGYHNRPEL TAERFAGDPF GPGRLYRTGD RVVRRADGQI EFIGRLDDQI KLRGHRIEPG EIESRLLEHP GIPRAAVVAR EDDKGERRIV AYLECGQVPG DVREHCAGTL PSYMIPADFV GLPRLPLTPN GKLDRSALPA PGPRESEAGP HESAAGVAGP HSYSGVAAEL HEIWCDVLGL EAVGPQEDLF ELGGHSLTIT QIASRVRLRL GADLPLHIYY DEPTISAVAA AVERLNGKN
|
| |