Gene Sros_6834 details

Gene Information

Locus tag: Sros_6834
Symbol:
ID: 8670144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7521734
End bp: 7524895
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: non-ribosomal peptide synthetase
Protein accession: YP_003342283
Protein GI: 271968087
COG category:
COG ID:
TIGRFAM ID:
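
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7521734
END_BP = 7524895


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "3162 1053"
```

Under these assumptions the computed values agree with the listed Gene Length (3162 bp) and Protein Length (1053 aa).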


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.445692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.613796
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCGC CGCTGTCTCC GGAGCAGGAG CGGCTCTGGC TCCTGCAACG CCTCGATCCG 
GACAACGCCT CCTACACCAT GTATCTCGTG CGGGCCCTGC ACGGTCCGCT CGACCAGGCC
GCGCTCACCT GGGCGCTCAC CGACGTCCTC GCCCGCCACG AGAGCCTGCG CACCAGGTTC
GCCGAGGAGG ACGGCGTGCC CTGGGCGGTG GTGGAGCCAG GGGCGCCCGG GATCGAGTGG
CTGAGCCTCC GCAGCCGGCA GGAGGCCGCC GACCTGGTGT CCCGGCGGGC CAACGCGCCG
TTCGACCTGG AGGCCGGGCC GCCCCTGCAG GTCGCGGTGA TCCGGCTCGC CGACGACGAG
CACCTGCTCT GCCTCACGAT GCACCACATC ATCGCGGACG GGTGGTCGCT CAACGTCATC
CTCGACGACC TCGCCGAGTG CTACACCGCC CACCTGCACG GTGTCGAGCC CCGGCTGCGC
CCGCTGCCCG TCCAGGCCGG CGACTACGGG CGGTGGCAGC GCCGCCGGGC CCAGCGGGCC
GTGCCGTACT GGACGGAGAG GCTCGCCGAC CCGCCGGCGT CCGAACTGCC CTTCCGCCAC
CGGCGGGGGC GGAGCGGGGA GGCCGCGACC CACCGCGCCG GGCTGCCGCC GGCGACCGCC
CGGAGGCTGG AACGGCTGGC CGGGGAGAAC CGCACCACCC TGTTCGCCGT CCTGACGGCG
GCCTACCAGA CCCTGCTGTT CCGGCACACC GGGCAGGAAG ACGTCCTGGT CGGCAGCGTG
GTCGCGGGCC GGGACCGGGT CGAGCTGGAG CCGATGGTCG GCTACGTGGC CCAGACGGTG
ATCCTGCGCG GGGATCTCGG CGGGGACCCG TCGTTCACCG ACCTGGTCGC CCGCACCCGG
GGCGAGGTCC TGGGCGCGCT GGGCAACTCC GCGGTCCCGT TCGAGAAGCT CGGCCACCCC
GCCGACTCGC TGCTGCCGTC CATGTTCATC CTGCACAACC AGGACGCCGG CCCCCGGCGG
TCCTTCGGCG GGCTGACCGT CACCGATGTC GATGCCGGGT TCCGGCAGGT GAAGGTGGAC
CTGCTCGTCG AGGCGTGGTC GGACGGCAAC GGGCTCGCGC TGTCCTTCCT CTACGACAGC
GGCCTGTTCG AGGCCGGGGC GATCGGCCGC CTGGCCGACC GGTTCGCGGT GCTGCTGGAG
TCCGTCGCGG CCGAGCCGGG CACCCCGATC TCGGCCCTGC CGATCTGGAC CGAGGCCGAC
CTGGCCGACA TCCGCGCCCT CGCCACCGGC CCCGCCCTCG CCACCGGCTC GCCCTCCGTG
CCCGAGCCGC CCCTCGCCGC CAGCGTTCCC CTCGACCCCG CGCCGGGGTC CGGCGCCGCG
CCGTACCTCG TCCCGGAGAT GATCGTCGAG GCCGTACGGC GGGCGCCCGG CGCCGTCGCG
GTGATCTGCG GCGAGGAGAC GATCACCTAC GGCGAGCTGC TCGCCCGGGC GGACGCGCTC
GCCGGCGCTC TCCGCGACGG CGGGGTGGGC CCCGGTGACG TCGTGGGCGT CTGCCTGCCC
CGCTCGATCG AGGCGATCGC CGCGCTGCTC GCCGTGTGGC GGTCCGGCGC CGCCTACCTG
CCGTTCGATC CGGACGTGCC CGACGAGCGG CTGGCCTTCT CGCTGTCGGA CAGCTCGGCC
ACCCACGTGA TCACCAGGAG GAGGCTCCCC GACGGCCTGA CCGCCGTCGA CCCCGCGAGC
GGCGGCGGTC CCGACGGTCT CGCGGGCCGT ACGGCCGCCG CCCCTGCGGA GCCCGGCCGT
GCGGCCGCCT ACGTCATCAC CACCTCCGGC TCCACCGGCG TGCCCAAGGG CGTGCTGGTC
GAGCACGGCG CGGTCGCGGC GCGGGTCCGG TGGATGCGCG CGGACTACGG CCTGACCTCG
GCCGACCACG TCGTCCAGTT CGCCTCGCTC AGCTTCGACG CCCACGTGGA GGAGGTCTTC
CCCACGCTGG CCGCCGGGGC GACGCTGGTG CTGCTCCCCG ACGGCGCGGC GAGCCTGCCC
GACCTGCTGG CCTCGCCCCC GGGCGGGCGG GTCACCGTGC TCGACCTGCC GACCGCCTAC
TGGCACGCGC TGGTCGAGGA GCTGGAGGAG GTCGTCTGGC CGCCCTCGCT CCGCCTGGTG
ATCCTCGGCG GCGAGCAGGT CTCCGCGGCG GCGGTCGAGC GGTGGCGCGG CCGGTTCGGC
GACGGCGTCC GGCTCGTCAA CACCTACGGC CCGACCGAGG CGGCGGTGAT CGCCACCGCC
GCCGATCTCG GGGCGGAGGC GGCTCTGGAA CATCCCCCGA TCGGCCGCCC GATCGGCGCG
ACCACGGTCC ACGTGCTCGA CGGGCGGGGC GAGCCCCTGC CTCCCGGGGC GACCGGGGAG
CTCGTCATCG GCGGGGCCGG GGTGGCCCGC GGCTACCTCG GCAGGCCCGC TCTCACCGCG
ACCGCCTTCG TCCCCGATCC CGCCGGAGAG CCGGGGGCAC GCCGCTACCG GACGGGGGAC
CGGGTGAGGT GGCGCGCCGA CGGGCGGCTG GAGTTCCTGG GACGGCTGGA CGGCCAGCTC
AAGATAAGGG GCTTCCGGAT CGAGCCCGGC GAGGTGGAGA GCCACCTGCT CGCCCATCCC
GGAGTGGGCC AGGCCTTCGT CACCGGGCGG GGCGGGGAAC TCCTCGCCTA CGTCACCGGC
ACCGTCGACC CCGCCGACCT CCGTGCCCAC CTGGAACGGA CGCTTCCGCG CCAGCTCGTC
CCGACCGCCT GGGTCCGGCT GGAGGCGCTG CCCCTCACCG GCGGAGGCAA GGTCGACCGG
GCGGCGCTCC CCGAGCCGGT GACCGCCCCG GCGGCCGAGC GGGTCCTGCC GCGCACCGAC
GCGGAGAGGC TGGTGGCGGG GATCTGGGAC GAGCTGCTCG GCGCCGGCCC GTACGGCGTC
CTCGACGACT TCTTCGCGCT GGGCGGCCAC TCCCTGCTGG CGACCCGGGT GGCCGCCCGG
ATCCGGCGGG CCACCGGCGT CGAGGTCCCG ATCCGGACGA TCTTCGCCCG GAGCACCGTC
GCGGCCCTCG CCGAGGCGGT GGAGGACCTG CTGATCGAGG AACTGGCCGG TCTGACCGAA
GAGGAGGCCA TGGACCTGCT CGCGGGCACC GACTCTCCCT GA
 
Protein sequence
MRAPLSPEQE RLWLLQRLDP DNASYTMYLV RALHGPLDQA ALTWALTDVL ARHESLRTRF 
AEEDGVPWAV VEPGAPGIEW LSLRSRQEAA DLVSRRANAP FDLEAGPPLQ VAVIRLADDE
HLLCLTMHHI IADGWSLNVI LDDLAECYTA HLHGVEPRLR PLPVQAGDYG RWQRRRAQRA
VPYWTERLAD PPASELPFRH RRGRSGEAAT HRAGLPPATA RRLERLAGEN RTTLFAVLTA
AYQTLLFRHT GQEDVLVGSV VAGRDRVELE PMVGYVAQTV ILRGDLGGDP SFTDLVARTR
GEVLGALGNS AVPFEKLGHP ADSLLPSMFI LHNQDAGPRR SFGGLTVTDV DAGFRQVKVD
LLVEAWSDGN GLALSFLYDS GLFEAGAIGR LADRFAVLLE SVAAEPGTPI SALPIWTEAD
LADIRALATG PALATGSPSV PEPPLAASVP LDPAPGSGAA PYLVPEMIVE AVRRAPGAVA
VICGEETITY GELLARADAL AGALRDGGVG PGDVVGVCLP RSIEAIAALL AVWRSGAAYL
PFDPDVPDER LAFSLSDSSA THVITRRRLP DGLTAVDPAS GGGPDGLAGR TAAAPAEPGR
AAAYVITTSG STGVPKGVLV EHGAVAARVR WMRADYGLTS ADHVVQFASL SFDAHVEEVF
PTLAAGATLV LLPDGAASLP DLLASPPGGR VTVLDLPTAY WHALVEELEE VVWPPSLRLV
ILGGEQVSAA AVERWRGRFG DGVRLVNTYG PTEAAVIATA ADLGAEAALE HPPIGRPIGA
TTVHVLDGRG EPLPPGATGE LVIGGAGVAR GYLGRPALTA TAFVPDPAGE PGARRYRTGD
RVRWRADGRL EFLGRLDGQL KIRGFRIEPG EVESHLLAHP GVGQAFVTGR GGELLAYVTG
TVDPADLRAH LERTLPRQLV PTAWVRLEAL PLTGGGKVDR AALPEPVTAP AAERVLPRTD
AERLVAGIWD ELLGAGPYGV LDDFFALGGH SLLATRVAAR IRRATGVEVP IRTIFARSTV
AALAEAVEDL LIEELAGLTE EEAMDLLAGT DSP