Gene Sros_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3345 
Symbol 
ID8666633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3664248 
End bp3667427 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content75% 
IMG OID 
Productnon-ribosomal peptide synthetase 
Protein accessionYP_003339027 
Protein GI271964831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTGA CGTCCGAGAT CGACTCCCCA CCCGTCCCGG CCTCGCTCGC CCAGCAGGGA 
ATCTGGTTCA ACGAACGGCT GGGCGGTGCC GGGACCGTCT ACACGATGCC CTTCTCCGTC
ACGTTCGACG GCCCGCTCGA CGTGCCCGCC TTGACCGCCG CCTGCCGGGC GCTGATCGAG
CGGCACCCGA TCCTGGCGAG CACGGTCCGG GAACGGCAGG GCGTGCCGTA CGTCGTGCCC
GCCGCCACCC CGCCCGCGCC GGTGGTCGCC GAGGTCACGG CCGCCCGGCG TGACGACCTG
ATGAGGGCGG AGATCCTGCG CCCCTTCGAC CTGGCCGCCG GTCCGCTCGT GCGGATGACC
CTCTACGTCG AGGAGGCGGG CCGGGCGACG CTGCTGGTCG TGGCGCACCA CCTCGTCTTC
GACGGCGAGT CCACCTCGGT GTTCCTGCGG GACCTGGCCG AGCTCTACCG GGCCGGGGTG
ACCGGCACCC CCGCCGACCT GCCCGCGCTG GACCACGACG GGCTCGCCGA GCGGGCCGCG
GCCAGGGTCG AGGCCGGCCT CTCTTTCGCC CGGGAGTTCT GGAGCTCGCG GTGGCGCCCG
CCCGCCGAGG TGATCCTGCC CGGCCTGGCC GGACCGGTGC CCGCGGTGGA CGAGGGGGCG
GCCGTGGAGT TCGCGCTGCC GCCGGAGTCC CGGGAGGCGC TGGCCCGGCT GGCGGAGGAG
ATCGGGGCCG GCAGGTTCGA GATCGTGCTC GCCTCGCTGC ACGTGCTGCT CCACCGGTAC
GGCAACGCCG AGCCGACGGT CGCGGTCGAT CTGGGCACCC GCTCGCCGGA GACCCGCGAC
CACCTGGGCG CCTTCGTCAA CGAGCTGCCC GTCACCGCCG GGCTCCGGCC CGAATGGGGC
TTCCGCCGGT TCGTGGCCGA CCAGCGCTTC GGGTACGGGC TCCGCTCCGA CCTGCGCGGC
CTCTTCCGGG CCCGGGAGGT ACCGCTGTCG CGCGCGGTCA GCGGGGTCAG GCCCGGTGTC
GCGCTCGCCC CGATCTCGCT CGGCTACCGC AGGCGCGAGG CCGCGCCCGC CTTCCACGGG
GTCGACGCCC ACGTGGAGTG GGTGCTGTTC AACCACACCG TCCGCGGCGC CATGCGCGTG
CACATCGTGG ACGGGCCCGG CCGCTTCGGT GTCGTCCTCC AGTACAACCC GCAGATCATG
GCCCGCGAGG ACGCCGAGCG GGTGGCCGCC CACTGGCGCG CGCTGCTCGA CGCGGTGGCC
GCCGACCCCG ACATGCCGCT GGCCGAGCTG CCGATGCTCG ACGCGGAGGA GACCGGGCGG
CTGCTGTCGC GGTGGAACGA CACCGCCGCC GGCCATCCGC CGCTCACCCT TCCCGAGCTG
GTGGCCGCCC AGGCCGGCCG TACCCCGGAC GCGACCGCGG CCGTGTGCGG CGCGCAGACG
ATGACCTACG CCGAGCTCGG CGCGGCCGTG GACGACCTGG CCCGGCGGCT GCGCGGCGCC
GGCGTGGGAC GGGGGACGCT GGTCGCGGTC TGCGCCGAGC GCTCGCTTGC CACCCTGGTC
GGCCTGCTCG CCGTGGCGCG CGCCGGCGGG GCCTACCTCC CGCTCGACCC GGACCATCCC
GCCGAGCGCC TGCGCCTGGT CCTGGAGGAC TCCGGGGCCG CCCTGATCCT GGCCGGCTCC
GGCCGGCACG ACCGCCTGGC CGGCTCCGGC GTGGCGGTCA TCTCGCTCGA CGCCCCCGGC
CCGCGGTCCG GGGAAGGCGA TCGGGGGCCG GGGGAGGGCG GTTCCGGCCT GGAGCGGTCC
GGGGAAGGCG GGTCCGGCGC GGGAGGTCGG GGGGAGGGCG GGCTCGCCTG GCCGGAGCTC
GGCGACCTCG CCTACGTCAT CTACACCTCC GGCTCCACCG GCCGCCCCAA GGGCGTCGAG
ATCCCGCACC GGGCGCTGAC CAACCTGCTG CTGGCCATGC GTGACCGGCT CGGTTCCCAG
CCGGGGGACG GCTGGCTCGC CCACACCTCG CTGTCGTTCG ACATCTCGGC GCTGGAGCTC
TACCTGCCGC TGGTGACCGG GGGACGGGTC GTCATCGCCC CGGACGCCGC GGCCAGGGAC
GGGCACGAGC TCGTACGGCT GGCCGCCGAG GGGGTGAGCC ACGTGCAGGC GACCCCGTCC
GGCTGGCGGA TGCTGCTCGA CGCCGGGTTC GACCTGCCCC GCGTGACCGC CCTGGCCGGC
GGTGAGGCTC TCCCCGCCCC CCTGGCCCGC GAGATCCTCA GCCGGGCCGG CCGTCTGATC
AACGTCTACG GCCCCACCGA GACGACGATC TGGTCGATGA GCGCGGAGAT CGCCGAGCCC
GTCACGACCG TGCCGATCGG CGTTCCGCTG GCCAACACCC GGGTGCACGT GCTGGACGAG
CGGCTCGGGC TGCTGCCGCT GGGCGTACCC GGCGAGCTGT GCATCGCCGG TGACGGCGTC
GCCGACGGCT ACCACAACCG CCCCGAGCTG ACCGCCGAGC GGTTCGCCGG CGACCCGTTC
GGGCCCGGCC GGCTGTACCG CACCGGTGAC CGGGTGGTCA GAAGGGCCGA CGGGCAGATC
GAGTTCATCG GACGCCTCGA CGACCAGATC AAGCTCCGCG GGCACCGGAT CGAACCGGGG
GAGATCGAGT CCAGGCTGCT CGAACACCCC GGCATTCCCC GCGCGGCCGT GGTCGCCCGG
GAGGACGACA AGGGGGAGCG GCGGATCGTC GCCTACCTGG AGTGCGGGCA GGTCCCCGGC
GACGTGCGCG AGCACTGCGC CGGGACGCTG CCCTCCTACA TGATCCCGGC CGACTTCGTC
GGGCTGCCCC GGCTGCCGCT GACCCCCAAC GGCAAGCTCG ACCGGTCCGC GCTGCCCGCG
CCCGGCCCCC GGGAGAGCGA GGCAGGCCCT CACGAGAGCG CGGCCGGCGT GGCCGGGCCG
CACTCCTACA GCGGGGTGGC GGCCGAGCTC CACGAGATCT GGTGCGACGT GCTGGGGCTT
GAGGCCGTCG GGCCGCAGGA GGACCTGTTC GAGCTGGGCG GCCACTCGCT GACCATCACC
CAGATCGCCT CCCGGGTCCG CCTGCGCCTG GGCGCGGACC TGCCGCTCCA CATCTACTAC
GACGAGCCCA CGATCAGCGC CGTCGCCGCC GCCGTCGAGC GCCTGAACGG GAAGAACTGA
 
Protein sequence
MSVTSEIDSP PVPASLAQQG IWFNERLGGA GTVYTMPFSV TFDGPLDVPA LTAACRALIE 
RHPILASTVR ERQGVPYVVP AATPPAPVVA EVTAARRDDL MRAEILRPFD LAAGPLVRMT
LYVEEAGRAT LLVVAHHLVF DGESTSVFLR DLAELYRAGV TGTPADLPAL DHDGLAERAA
ARVEAGLSFA REFWSSRWRP PAEVILPGLA GPVPAVDEGA AVEFALPPES REALARLAEE
IGAGRFEIVL ASLHVLLHRY GNAEPTVAVD LGTRSPETRD HLGAFVNELP VTAGLRPEWG
FRRFVADQRF GYGLRSDLRG LFRAREVPLS RAVSGVRPGV ALAPISLGYR RREAAPAFHG
VDAHVEWVLF NHTVRGAMRV HIVDGPGRFG VVLQYNPQIM AREDAERVAA HWRALLDAVA
ADPDMPLAEL PMLDAEETGR LLSRWNDTAA GHPPLTLPEL VAAQAGRTPD ATAAVCGAQT
MTYAELGAAV DDLARRLRGA GVGRGTLVAV CAERSLATLV GLLAVARAGG AYLPLDPDHP
AERLRLVLED SGAALILAGS GRHDRLAGSG VAVISLDAPG PRSGEGDRGP GEGGSGLERS
GEGGSGAGGR GEGGLAWPEL GDLAYVIYTS GSTGRPKGVE IPHRALTNLL LAMRDRLGSQ
PGDGWLAHTS LSFDISALEL YLPLVTGGRV VIAPDAAARD GHELVRLAAE GVSHVQATPS
GWRMLLDAGF DLPRVTALAG GEALPAPLAR EILSRAGRLI NVYGPTETTI WSMSAEIAEP
VTTVPIGVPL ANTRVHVLDE RLGLLPLGVP GELCIAGDGV ADGYHNRPEL TAERFAGDPF
GPGRLYRTGD RVVRRADGQI EFIGRLDDQI KLRGHRIEPG EIESRLLEHP GIPRAAVVAR
EDDKGERRIV AYLECGQVPG DVREHCAGTL PSYMIPADFV GLPRLPLTPN GKLDRSALPA
PGPRESEAGP HESAAGVAGP HSYSGVAAEL HEIWCDVLGL EAVGPQEDLF ELGGHSLTIT
QIASRVRLRL GADLPLHIYY DEPTISAVAA AVERLNGKN