Gene Information |
Locus tag | Sros_6834 |
Symbol | |
ID | 8670144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 7521734 |
End bp | 7524895 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | non-ribosomal peptide synthetase |
Protein accession | YP_003342283 |
Protein GI | 271968087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
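The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 7521734
END_BP = 7524895
GENE_LENGTH_BP = 3162
PROTEIN_LENGTH_AA = 1053


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the record if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length are consistent with the start and end coordinates.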
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.445692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.613796 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCGC CGCTGTCTCC GGAGCAGGAG CGGCTCTGGC TCCTGCAACG CCTCGATCCG GACAACGCCT CCTACACCAT GTATCTCGTG CGGGCCCTGC ACGGTCCGCT CGACCAGGCC GCGCTCACCT GGGCGCTCAC CGACGTCCTC GCCCGCCACG AGAGCCTGCG CACCAGGTTC GCCGAGGAGG ACGGCGTGCC CTGGGCGGTG GTGGAGCCAG GGGCGCCCGG GATCGAGTGG CTGAGCCTCC GCAGCCGGCA GGAGGCCGCC GACCTGGTGT CCCGGCGGGC CAACGCGCCG TTCGACCTGG AGGCCGGGCC GCCCCTGCAG GTCGCGGTGA TCCGGCTCGC CGACGACGAG CACCTGCTCT GCCTCACGAT GCACCACATC ATCGCGGACG GGTGGTCGCT CAACGTCATC CTCGACGACC TCGCCGAGTG CTACACCGCC CACCTGCACG GTGTCGAGCC CCGGCTGCGC CCGCTGCCCG TCCAGGCCGG CGACTACGGG CGGTGGCAGC GCCGCCGGGC CCAGCGGGCC GTGCCGTACT GGACGGAGAG GCTCGCCGAC CCGCCGGCGT CCGAACTGCC CTTCCGCCAC CGGCGGGGGC GGAGCGGGGA GGCCGCGACC CACCGCGCCG GGCTGCCGCC GGCGACCGCC CGGAGGCTGG AACGGCTGGC CGGGGAGAAC CGCACCACCC TGTTCGCCGT CCTGACGGCG GCCTACCAGA CCCTGCTGTT CCGGCACACC GGGCAGGAAG ACGTCCTGGT CGGCAGCGTG GTCGCGGGCC GGGACCGGGT CGAGCTGGAG CCGATGGTCG GCTACGTGGC CCAGACGGTG ATCCTGCGCG GGGATCTCGG CGGGGACCCG TCGTTCACCG ACCTGGTCGC CCGCACCCGG GGCGAGGTCC TGGGCGCGCT GGGCAACTCC GCGGTCCCGT TCGAGAAGCT CGGCCACCCC GCCGACTCGC TGCTGCCGTC CATGTTCATC CTGCACAACC AGGACGCCGG CCCCCGGCGG TCCTTCGGCG GGCTGACCGT CACCGATGTC GATGCCGGGT TCCGGCAGGT GAAGGTGGAC CTGCTCGTCG AGGCGTGGTC GGACGGCAAC GGGCTCGCGC TGTCCTTCCT CTACGACAGC GGCCTGTTCG AGGCCGGGGC GATCGGCCGC CTGGCCGACC GGTTCGCGGT GCTGCTGGAG TCCGTCGCGG CCGAGCCGGG CACCCCGATC TCGGCCCTGC CGATCTGGAC CGAGGCCGAC CTGGCCGACA TCCGCGCCCT CGCCACCGGC CCCGCCCTCG CCACCGGCTC GCCCTCCGTG CCCGAGCCGC CCCTCGCCGC CAGCGTTCCC CTCGACCCCG CGCCGGGGTC CGGCGCCGCG CCGTACCTCG TCCCGGAGAT GATCGTCGAG GCCGTACGGC GGGCGCCCGG CGCCGTCGCG GTGATCTGCG GCGAGGAGAC GATCACCTAC GGCGAGCTGC TCGCCCGGGC GGACGCGCTC GCCGGCGCTC TCCGCGACGG CGGGGTGGGC CCCGGTGACG TCGTGGGCGT CTGCCTGCCC CGCTCGATCG AGGCGATCGC CGCGCTGCTC GCCGTGTGGC GGTCCGGCGC CGCCTACCTG CCGTTCGATC CGGACGTGCC CGACGAGCGG CTGGCCTTCT CGCTGTCGGA CAGCTCGGCC ACCCACGTGA TCACCAGGAG GAGGCTCCCC GACGGCCTGA CCGCCGTCGA CCCCGCGAGC GGCGGCGGTC CCGACGGTCT CGCGGGCCGT ACGGCCGCCG CCCCTGCGGA GCCCGGCCGT 
GCGGCCGCCT ACGTCATCAC CACCTCCGGC TCCACCGGCG TGCCCAAGGG CGTGCTGGTC GAGCACGGCG CGGTCGCGGC GCGGGTCCGG TGGATGCGCG CGGACTACGG CCTGACCTCG GCCGACCACG TCGTCCAGTT CGCCTCGCTC AGCTTCGACG CCCACGTGGA GGAGGTCTTC CCCACGCTGG CCGCCGGGGC GACGCTGGTG CTGCTCCCCG ACGGCGCGGC GAGCCTGCCC GACCTGCTGG CCTCGCCCCC GGGCGGGCGG GTCACCGTGC TCGACCTGCC GACCGCCTAC TGGCACGCGC TGGTCGAGGA GCTGGAGGAG GTCGTCTGGC CGCCCTCGCT CCGCCTGGTG ATCCTCGGCG GCGAGCAGGT CTCCGCGGCG GCGGTCGAGC GGTGGCGCGG CCGGTTCGGC GACGGCGTCC GGCTCGTCAA CACCTACGGC CCGACCGAGG CGGCGGTGAT CGCCACCGCC GCCGATCTCG GGGCGGAGGC GGCTCTGGAA CATCCCCCGA TCGGCCGCCC GATCGGCGCG ACCACGGTCC ACGTGCTCGA CGGGCGGGGC GAGCCCCTGC CTCCCGGGGC GACCGGGGAG CTCGTCATCG GCGGGGCCGG GGTGGCCCGC GGCTACCTCG GCAGGCCCGC TCTCACCGCG ACCGCCTTCG TCCCCGATCC CGCCGGAGAG CCGGGGGCAC GCCGCTACCG GACGGGGGAC CGGGTGAGGT GGCGCGCCGA CGGGCGGCTG GAGTTCCTGG GACGGCTGGA CGGCCAGCTC AAGATAAGGG GCTTCCGGAT CGAGCCCGGC GAGGTGGAGA GCCACCTGCT CGCCCATCCC GGAGTGGGCC AGGCCTTCGT CACCGGGCGG GGCGGGGAAC TCCTCGCCTA CGTCACCGGC ACCGTCGACC CCGCCGACCT CCGTGCCCAC CTGGAACGGA CGCTTCCGCG CCAGCTCGTC CCGACCGCCT GGGTCCGGCT GGAGGCGCTG CCCCTCACCG GCGGAGGCAA GGTCGACCGG GCGGCGCTCC CCGAGCCGGT GACCGCCCCG GCGGCCGAGC GGGTCCTGCC GCGCACCGAC GCGGAGAGGC TGGTGGCGGG GATCTGGGAC GAGCTGCTCG GCGCCGGCCC GTACGGCGTC CTCGACGACT TCTTCGCGCT GGGCGGCCAC TCCCTGCTGG CGACCCGGGT GGCCGCCCGG ATCCGGCGGG CCACCGGCGT CGAGGTCCCG ATCCGGACGA TCTTCGCCCG GAGCACCGTC GCGGCCCTCG CCGAGGCGGT GGAGGACCTG CTGATCGAGG AACTGGCCGG TCTGACCGAA GAGGAGGCCA TGGACCTGCT CGCGGGCACC GACTCTCCCT GA
|
Protein sequence | MRAPLSPEQE RLWLLQRLDP DNASYTMYLV RALHGPLDQA ALTWALTDVL ARHESLRTRF AEEDGVPWAV VEPGAPGIEW LSLRSRQEAA DLVSRRANAP FDLEAGPPLQ VAVIRLADDE HLLCLTMHHI IADGWSLNVI LDDLAECYTA HLHGVEPRLR PLPVQAGDYG RWQRRRAQRA VPYWTERLAD PPASELPFRH RRGRSGEAAT HRAGLPPATA RRLERLAGEN RTTLFAVLTA AYQTLLFRHT GQEDVLVGSV VAGRDRVELE PMVGYVAQTV ILRGDLGGDP SFTDLVARTR GEVLGALGNS AVPFEKLGHP ADSLLPSMFI LHNQDAGPRR SFGGLTVTDV DAGFRQVKVD LLVEAWSDGN GLALSFLYDS GLFEAGAIGR LADRFAVLLE SVAAEPGTPI SALPIWTEAD LADIRALATG PALATGSPSV PEPPLAASVP LDPAPGSGAA PYLVPEMIVE AVRRAPGAVA VICGEETITY GELLARADAL AGALRDGGVG PGDVVGVCLP RSIEAIAALL AVWRSGAAYL PFDPDVPDER LAFSLSDSSA THVITRRRLP DGLTAVDPAS GGGPDGLAGR TAAAPAEPGR AAAYVITTSG STGVPKGVLV EHGAVAARVR WMRADYGLTS ADHVVQFASL SFDAHVEEVF PTLAAGATLV LLPDGAASLP DLLASPPGGR VTVLDLPTAY WHALVEELEE VVWPPSLRLV ILGGEQVSAA AVERWRGRFG DGVRLVNTYG PTEAAVIATA ADLGAEAALE HPPIGRPIGA TTVHVLDGRG EPLPPGATGE LVIGGAGVAR GYLGRPALTA TAFVPDPAGE PGARRYRTGD RVRWRADGRL EFLGRLDGQL KIRGFRIEPG EVESHLLAHP GVGQAFVTGR GGELLAYVTG TVDPADLRAH LERTLPRQLV PTAWVRLEAL PLTGGGKVDR AALPEPVTAP AAERVLPRTD AERLVAGIWD ELLGAGPYGV LDDFFALGGH SLLATRVAAR IRRATGVEVP IRTIFARSTV AALAEAVEDL LIEELAGLTE EEAMDLLAGT DSP
|
| |
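The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the strand is listed as "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence shown above.
    print(translate("ATGAGGGCGC CGCTGTCTCC GGAGCAGGAG"))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field; neither result is asserted here.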