Gene Sros_5582 details

Gene Information

Locus tag: Sros_5582
Symbol:
ID: 8668876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6103488
End bp: 6108452
Gene Length: 4965 bp
Protein Length: 1654 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: putative nonribosomal peptide synthetase (NPRS)
Protein accession: YP_003341077
Protein GI: 271966881
COG category:
COG ID:
TIGRFAM ID:
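
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Consistency check for the record's coordinates and lengths.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the gene length includes the terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


start_bp, end_bp = 6103488, 6108452  # values from the record
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (4965 bp) and Protein Length (1654 aa) fields.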


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.344509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGGC TTTCTTCCCA CAACAACCTG AATGAGGCTT TCGAGGCGCG GGTGGCGGCG 
GCCCCGGACG CCGAGGCGGT GATCTGCGGC GAGGTCCGCC TCTCGGCCAC CGAACTGAAC
GCGCGAGCCA ACGCACTGGC ACGACGGCTG ACCGCCGCCG GAGTCGGCCC CGAATCACCC
GTCGTGGTAC TGATGCAACG CTCCGCCGAC GTGGTCGTCA CCCTGCTCGC CGTGATCAAA
GCCGGCGGAA CCTACATCCC CCTGCACCCC GCACTGCCCC CCGCCCGGAT GCGCTGGATC
GCCGAACAGA CACACGCCCA GGTCCTGATC ACCGACGACA CCTACGCCGG ACACGAACTG
ACCCACCACC TCCCCACCAT GACGACCGGA CCACAGAGCG ACCCCCGCAA CCTGGACCTG
GAGATCACCC CCGACCGGCT GGCCTACACC ATGTTCACCT CCGGATCCAC CGGAGTCCCC
AAGGGCGTCG CGGTGACCCA CCGCAACGTC ATGGCCATGG CGGCCGACCG GCGCTGGCGG
GACCACCACC GCGTGCTGTC GCACTCCCCC TTCGCCTTCG ACGCCTCCAC CTACGAGCTG
TGGGTGCCGC TGCTCAACGG CGGGACCGTG ATCGTGGCCC CGCCCGGACC GGTGGACGCC
TCCGTGCTGC GCCGAGCGCT GGAGGAGGAA CGGGTCGACG CGGTGTTCCT GACGACCGGC
CTGTTCAACC TGCTGGTCCA GGAGGCGCCC GCCGTACTGG CCGCCGTCCC GCACCTGTGG
ACCGGCGGCG AGGCCGCCTC CCCCGCCGCG GTGGAGCGCG TCCTGCGGGA GGGCGGCGAC
GTCCACAACG GCTACGGCCC CACCGAGACC ACGACGTTCG CGCTGTCGCA CCGGGTGGCC
GTCCCCTCCC CCGGCGCCGT CCCCATCGGC GGCCCGCTGG ACGCGATCCG CGCCCATGTC
CTGGACGACA GGTTGCGGCC GGCACGGGAG GGGGAGCTGT ACCTCGCGGG CGACCAATTG
GCCCGCGGCT ATCTCGGCCG GTCGAGCCTG ACGGCCGAGC GCTTCGTCGC CGACCCCCAC
GGCGAGCCCG GATCGCGCAT GTACCGCACG GGCGACCTGG TACGGCGTCG CGACGACGGG
CACCTGGAGT TCGTCGGCCG GGCCGACGGC CAGGTGAAGA TCCGCGGCTT CCGCATCGAG
CTCGGCGAGA TCGAGACCTT CCTCGCCCGG CACCCCGAGG TGGCCCAGGT CGCCGTCATC
GCCAGGGAGG ACCGTCCGGG GGACAGACGC CTGGCCGCCT ACATCGTGCC CCACGACCCG
CCCGCCGCCG GCGCCGGGAC CGGGCTCGTC GAGCGGGTCC GGCGGGCCGC CGGGGAGACG
CTGCCCGCCT ACATGGTCCC CTCCGCCTTC GTCGTCATGG ACGCGCTGCC GCTCAACCAC
AACGGCAAGG TCGACCGGCC CGCCCTGCCC GCGCCCGAGA CGGCGGCCTC CGGCACGGCC
CCGCGCACGC CGGTCGAGGA GATCCTGTGC GAGCTGTTCG CGCAGGTGCT CGCCGTGCCG
GGGATCGGGA TCGACGACGA CTTCTTCCTG ATGGGCGGGC ATTCGCTGCT GGCCATGCGG
GTGGTGACCG GGGTCAGGTC GGCGCTCTCC GTCGAGCTGC AGGTGCGGGC GGTCTTCGAG
GCGCCCACCG TCGCCGCGCT GGCGGTCATG GTCGAGGAGG CGAGGGCCGC CGGCGCCGGC
GGGGTGCGGC CGCCGCTCGT CCGCGCGGAG CGTTCGGCGC CGCCGTCGCT CTCCTTCGCC
CAGCACCGGC TCTGGATCGT CGACCAGCTC GCGGAGCCGG GCGGCCTCTA CACCATCCCG
CTGGTCCTGC GCCTGTCCGG GCGGCTGGAC AGGGCGGCGC TGGAGGCCGC GATCGGGGAC
GTGGCATGGC GGCAGGAGAC GCTGCGGACG GTCTTCCCCG CGCGGGACGG CCGGCCGTCG
CCGCGGGTGC TGGAGGCCGT ACCCGGGCTG ACGGTCGTGG AGACCGGCGA GGCGGAACTC
TGGGCGGCGG TGGAGGAGGA GGCGCTGCGC CCCTTCGACC TCTCGGCCGA GCCGCCGGTG
CGGGCGCTGC TGTTCGCTCT GGGATCCGGC GCTCCCGTCG AGCCGGACGA GCACGTGCTG
GTGCTGCTGT TCCACCACAT CGCCATGGAC GGGTGGTCGC TCGCCCCGCT CAGGCAGGAC
CTGGCCCTGG CCTACACGGC CCGCACGCGC GGTGAGGCCC CCTCGTGGGA GCCGCTGCCC
GTGCGCTACT CGGACTACGC CCGGTGGCAG CGCGACCTGC TCGGCGACGC CGCGGACCCG
GACAGCCCCG CGAGCGTGCA GACCGCCTAC TGGCGGGAGG CCCTGGCGGG GGCGCCCGAC
GAGCTGGCGC TGCCCGCCGA CCGGCCTCGC CCGCCGGTGA CCGGGCATCG GGGCGCGTCG
GTGCCCCTGC GGCTCTCGCC CGACCTGCAC GGGCGGCTGC TCGCCCTGGC CAAGGCCAAC
CGGTCCACGC TGTTCATGGT CGTGCAGGCC GGACTGGCGG CGCTGCTGAC CCGGCTGGGG
GCCGGGACCG ACGTGCCGAT CGGCGCTCCC GTGGCCGGGC GTACCGACGA GGCGCTGGAC
CGGCTGGTCG GCTTCTTCGC CAACACGCTG GTGCTGCGCA CCGACACCTC GGGCGACCCG
GCCTTCCGCG AGCTGCTGGC CAGGGTCAGG GAGACGGACC TGGCGGCGTA CGCGCACCAG
GACGTGCCGT TCGACCAGGT GGTGGAGGCG GTCAACCCTC CGAGGATCCC CGGCTGCCCC
CCGCTCTTCC AGGTGATGCT CGCTCTGAAC AACACCCCTG AGGACCAGGC CGAGCTGCCG
GGCCTGACGA CGACGCCGGA TCCCACCTAC TCGCTGTACG GCTTCGGCGG CGCGAAGTGC
GACCTGGCGT TCGGCCTCAA CGAGAACCTC TCCGCCGGCG CCCCCGCGGG CGTCGACGGG
GTGGTGCAGT ACGCGACCGA CCTGTTCGAC CACGGCACGG TCGAGTCGAT CACCGCTCGG
CTGGTGAGGC TGCTGGAGGC GGTGGCGGCG GACCCGGACG TCCGCGTGGG CGCGGTGGAG
CTGCTGGCGC CGGACGAACG CTACACGCTG CTGGAGAAAT GGAACGACAC GGCGGTCCGG
ACGCCCGGGG GCAGCCTGGC CCGGCTGTTC GAGGCGCGGG TGGCGGCGGC TCCGGACGCC
GAGGCGGTGG TCTGCGGCGA GGTCCGCCTC TCGGCCGCCG AACTCAACGC GCGAGCCAAC
GCACTGGCAC GACGACTGAA CGCCGCCGGA GTCGGCCCCG AATCACCCGT CGTGGTACTG
ATGCAACGCT CCGCCGACGT GGTCGTCACC CTGCTCGCCG TGATCAAAGC CGGCGGAACC
TACATCCCCC TGCACCCCGG ACTGCCACCC GCCCGGATGC GCTGGATCGC CGAACAGACA
CACGCCCAGG TCCTGATCAC CGACGACACC TACGCCGGAC ACGAACTGAC CCACCACCTC
CCCACCATGA CGACCGGACC ACAGAGCGAC CCCCGCAACC TGGACCTGGA GATCACCCCC
GACCGGCTGG CCTACACCAT GTTCACCTCC GGATCCACCG GAGTCCCCAA GGGCGTCGCG
GTCACCCACC GCAACGTCGC CTCCTTCGCC GCCGACCGGC TCTGGCACGG CTCCGGGCAT
CGCAGGGTCC TGTTCCACAC GGCGTCCTCC TTCGACGTCT CCATGTACGA GCTGTGGGTG
CCGCTGCTCA ACGGCGGGAC CGTGGTCGTG GCCCCGCCCG GACCGGTGGA CGCCTCCGTG
GTGCGCTGGG CGCTGGAGGA GGAACGGGTC GACGCGGTGT TCCTGACGAC CGGCCTGTTC
AACGTGCTGG CCGAGGACTC GGCCGCCCTG CTCGCGAAGG TCCCGGAGCT GTGGATCGCC
GGTGAGGCCG CCTCCCCCGC CGCGGTGGAG CGCGTCCTGC GGGAGGGCGG CGACGTCCAC
AACGGCTACG GCCCCACCGA GACCACGATC TACGTCACCG CCCACCACGT GACCTCTCCC
GGAGCCGTCG TCCCGATCGG CCGCCCGCTC GACAACACCC GCGCCTACGT CCTGGACGGC
AGGCTCCGCC CGGTGCCCGC GGGCGTGCCC GGCGAGCTGT ACATCGCCGG CGACCACCTG
GCCCGCGGCT ATCTCGGCCG CCCGGACCTG ACCGCCGAGC GCTTCGTCGC CGACCCCCAC
GGCGAGCCCG GATCGCGCAT GTACCGCACC GGCGACCTCG TCCGCTGGCT CCCCGACGGC
CTCCTCGACT ACCTGGGCCG CGTCGACGAC CAGGTCAAGA TCCGCGGCAT CCGCATCGAG
CTGGGCGAGA TCGAGGCCGT CCTCGCCCGC CACCCCGCCA CGGCCCAGGT CGTGGTGCTG
GTCAGGGAGG ACCAGCCGGG CGACAAACGC CTGGTCGCCT ACCTGGTCCC GTACCGGCGG
CCGGACGGCT CCTCCCCGCC GGCGTCCCCT CCGGCGGGCG CGGAGCTCTC CGGGCAGGTA
CGGCGGTTCG CCGAGGAGGC GCTGCCCGCC TACATGGTCC CTTCGGCCTT CGTCGTGCTG
GACGGGCTGC CGCTCAACCA CAACGGCAAG ATCGACCGGC GCGCGTTGCC GGTGCCCGCG
TGGCAGGAGC CGGCCGCCGC GAAGCAGCCG CCCCGGACGG AGGCGGAGGC GCTGGTCGCG
GAGATCTGGG CCGAGGTGCT CGGACTGGAC GGGATCGGCG TGCATGACGA CTTCTTCGCA
CTCGGCGGCA ACTCGCTGCT GGCCATCCGA GTCGTCTCCA GGATCAGGGC CGCCGTGGAT
CTGGAGATCC CGGTCGACGC GGTCTTCACC AACCCCACGG TCGAGCGGCT CGCCGACGCT
GTCGAGGCGC TGCTGATCGC GGACATCGAA GGACGGAGCC CGTAG
 
Protein sequence
MNRLSSHNNL NEAFEARVAA APDAEAVICG EVRLSATELN ARANALARRL TAAGVGPESP 
VVVLMQRSAD VVVTLLAVIK AGGTYIPLHP ALPPARMRWI AEQTHAQVLI TDDTYAGHEL
THHLPTMTTG PQSDPRNLDL EITPDRLAYT MFTSGSTGVP KGVAVTHRNV MAMAADRRWR
DHHRVLSHSP FAFDASTYEL WVPLLNGGTV IVAPPGPVDA SVLRRALEEE RVDAVFLTTG
LFNLLVQEAP AVLAAVPHLW TGGEAASPAA VERVLREGGD VHNGYGPTET TTFALSHRVA
VPSPGAVPIG GPLDAIRAHV LDDRLRPARE GELYLAGDQL ARGYLGRSSL TAERFVADPH
GEPGSRMYRT GDLVRRRDDG HLEFVGRADG QVKIRGFRIE LGEIETFLAR HPEVAQVAVI
AREDRPGDRR LAAYIVPHDP PAAGAGTGLV ERVRRAAGET LPAYMVPSAF VVMDALPLNH
NGKVDRPALP APETAASGTA PRTPVEEILC ELFAQVLAVP GIGIDDDFFL MGGHSLLAMR
VVTGVRSALS VELQVRAVFE APTVAALAVM VEEARAAGAG GVRPPLVRAE RSAPPSLSFA
QHRLWIVDQL AEPGGLYTIP LVLRLSGRLD RAALEAAIGD VAWRQETLRT VFPARDGRPS
PRVLEAVPGL TVVETGEAEL WAAVEEEALR PFDLSAEPPV RALLFALGSG APVEPDEHVL
VLLFHHIAMD GWSLAPLRQD LALAYTARTR GEAPSWEPLP VRYSDYARWQ RDLLGDAADP
DSPASVQTAY WREALAGAPD ELALPADRPR PPVTGHRGAS VPLRLSPDLH GRLLALAKAN
RSTLFMVVQA GLAALLTRLG AGTDVPIGAP VAGRTDEALD RLVGFFANTL VLRTDTSGDP
AFRELLARVR ETDLAAYAHQ DVPFDQVVEA VNPPRIPGCP PLFQVMLALN NTPEDQAELP
GLTTTPDPTY SLYGFGGAKC DLAFGLNENL SAGAPAGVDG VVQYATDLFD HGTVESITAR
LVRLLEAVAA DPDVRVGAVE LLAPDERYTL LEKWNDTAVR TPGGSLARLF EARVAAAPDA
EAVVCGEVRL SAAELNARAN ALARRLNAAG VGPESPVVVL MQRSADVVVT LLAVIKAGGT
YIPLHPGLPP ARMRWIAEQT HAQVLITDDT YAGHELTHHL PTMTTGPQSD PRNLDLEITP
DRLAYTMFTS GSTGVPKGVA VTHRNVASFA ADRLWHGSGH RRVLFHTASS FDVSMYELWV
PLLNGGTVVV APPGPVDASV VRWALEEERV DAVFLTTGLF NVLAEDSAAL LAKVPELWIA
GEAASPAAVE RVLREGGDVH NGYGPTETTI YVTAHHVTSP GAVVPIGRPL DNTRAYVLDG
RLRPVPAGVP GELYIAGDHL ARGYLGRPDL TAERFVADPH GEPGSRMYRT GDLVRWLPDG
LLDYLGRVDD QVKIRGIRIE LGEIEAVLAR HPATAQVVVL VREDQPGDKR LVAYLVPYRR
PDGSSPPASP PAGAELSGQV RRFAEEALPA YMVPSAFVVL DGLPLNHNGK IDRRALPVPA
WQEPAAAKQP PRTEAEALVA EIWAEVLGLD GIGVHDDFFA LGGNSLLAIR VVSRIRAAVD
LEIPVDAVFT NPTVERLADA VEALLIADIE GRSP
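
The protein sequence follows from the gene sequence via translation table 11, the bacterial code, in which alternative start codons such as GTG are read as methionine at the first position. The sketch below is a minimal, dependency-free illustration of that step; the helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the code assumes an unspliced CDS written 5' to 3' on the coding strand, in blocks of ten bases as shown above.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code and differs in
# which codons may act as a start.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # any valid start codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, with its display spacing.
first_line = "GTGAACCGGC TTTCTTCCCA CAACAACCTG AATGAGGCTT TCGAGGCGCG GGTGGCGGCG"
print(translate_cds(first_line))  # MNRLSSHNNLNEAFEARVAA
```

The 20 residues printed match the start of the protein sequence listed above (MNRLSSHNNL NEAFEARVAA). Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.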