Gene Sros_4121 details


Gene Information

Locus tag: Sros_4121
Symbol:
ID: 8667415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4587439
End bp: 4592190
Gene Length: 4752 bp
Protein Length: 1583 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: amino acid adenylation domain protein
Protein accession: YP_003339770
Protein GI: 271965574
COG category:
COG ID:
TIGRFAM ID:
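
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4587439
end_bp = 4592190

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 4752
print(protein_length)  # 1583
```

Both results agree with the Gene Length (4752 bp) and Protein Length (1583 aa) fields.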


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.183322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.106605
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTTG ATGTGACGCG TGGCGATCTG GCGGGCCGCC TTCCCGCAGC GGCGGCCGGG 
AGGTTCGACG CACCGGGTTG CGTCCACGAA GCGATAGAGG CCCAGAGCCG GCGCACCCCC
GGACGCACCG CCGTCGTGGC GGGCACCAGG CGACTCAGCT ACGCCGAATT GGACCGCAGG
GCCAACGACA TCGCACGTGC CCTGGCCGGA CGGGGCGTCG GACGCGGCCA TCTGGTCGGA
GTCTGCCTGG AGCGCGACGA GTGGCTGGTT CCCGCGCTGG TGGGGGTGTG GAAGGCGGGG
GCGGCCTACG TGCCGCTGGA CCCCGCGTAC CCGGCCGAGC GGCTGCGGTT CATGGCCGAG
GACGCCGCCG TGACCGCGGT GCTCACCTCG GCCGCGCTGC GCCGGACCGC CGTCCTGACC
GGGGCCGAGG CCGTCCTCGT CGACGACGTG GTGCCGGGCG AGGGGGCGCC CCCCGTGGCC
GGGGACCCCG GCGACGCGGC GTACGTCATC TACACCTCCG GCTCCACCGG GACGCCCAAG
GGCGTGGTGG TGGAGCACCA CAACACCCTG AACCTGCTGC GCTGGGAGGC GTCGGCGTAC
ACCGCCGAGG AACTGAGCGG CATGCTGGCC GGCTCCTCGG TCTGCTTCGA CGCGTCGATC
AGCCAGCTCT TCTCGCCGTT GATCGCCGGC GGGACGGTGA TCATGGCCGA CAACCTGCTG
GCGCTGCCGA GCCTGCCGGC CCGGGAGGAG GTGACCACGG TCTACGGGGT CCCCTCCGCG
CTGTCGGTGC TGCTGCGTGA GCCGCTGCCG AGCGGGGTGC GCGCGGTGTT CTCCGGCGGC
GAGCCGCTGA CCGGCGCCCT GCTGCGGCGG ATCTACGCCA ACCCCGGGGT GCGCCGGGTG
CTGAACCTGT ACGGCCCGAC CGAGTGCACC ACCGCCTGCG CCGGCGTCGA GGTGGGCCGG
GACTACGAGG GTGAGCCGCC CCTGGGCGGA CCGATCGCCG GCGCGGTGTT CTCGGTGCGG
GACGCGGCCG GAGACCCCGT CCCGGACGGA GAGACGGGCG AGTTGTGGAT CGGCGGGCCC
GGGGTGACCC GCGGCTACCT GGGACGTGAG GCGCCCGCGT TCCGCACCGA TCCGGACGGC
GGCCGGGTCT ACCGCAGCGG GGATCTGGTC CGCTGGGTGG ACGGCGAGCT GCGCTTCGCC
GGCCGGGCCG ACGACCAGGT CAAGATCCGC GGCTACCGGG TGGAGCTGGG CGAGGTGGAG
GCCGCGCTGG CCCGCCACCC CGCGGTACGC CGGGCCACCG TGCTGGCCGC CACCGACGAC
GACGGGATCG CCTACCTGGC CGGTCACGTG GCGGCCTCCG AGGTGAGCGA GCCGGAGCTG
CGGGACTGGC TGTGCGCCCG GCTGCCCGAT CACCTGGTGC CGACCCGGAT CGGGGTGGCC
GAGGAGCTTC CGCTCGCCCC CAACGGCAAG GTGGACCGTG CGGCCCTGCC CAGGCTCGGC
GCCTTCCGGT CCGCCGGTGC GGCACCGGTC GCGCCCCGCA CCGACGACGA ACGCCTGGTG
GCCGAGGTGA TCGCCGGCGT GCTGGGCCTG CCGCAGGTCG GCGTACACGA TCGTTTCAGC
GACCTGGGCG GCCATTCGCT GGCCGCGGCC AGAGTGGTCA CCGAGCTGTC CCGCCGGCTG
GGGCACACCG TGCCCCTGGC GGCCTTCCTG ACCTCCCCCA CCGCGGCCGG CCTGGCCATC
CGGCTGCGCC AGGCCGGACC GGCGCCGGTG CGGCGGAGCG GCCGGACCCG GCATCCGCTC
ACCGACGCGC AGCGCCAGTT CTGGACCCTG AGCCAGCTCC ACCCGCGCAG CCCGGTCACC
ACGCTGGGGA TCCGGCTGCG CGTGCGCGAC CTGCGCGGAG CCGCGCCGCT GAGGTCGGCC
CTGGACGCGG TCGTACGCCG GCACGAGGTG CTGCGCAGCA CGGTCTCCGT CGACGAGGAC
GGCGTGCCGT ACGCCGTGGT GCACCCGCCG GCCGCCGTAC CCCTGGTCGA GCACGCCGTG
GGAGAGGACC CGGAGAAGGT GGCCAGGGCC GCGGCGGCGC ACGTGTTCGA CCTGACGAGC
GAGGTGCCGC TGCTACGGGC CGGCCTGTGC TGGGTGGGCG ACGCGGAGGC CGAGCTGGTG
GTCGTGGTGG ACCACATCGC CTTCGACGGC GGCTCGGTCG GGGTGCTGAT GGACGAGCTG
GCCGCCGAGC TGGCCGGCGA TCCTGTCGTC GGGCCGGCGG TCCAGGTCGG CGATGTGGCC
GTGCAGCAGC GGGAACAGCC GGACCTGCCG CGGCTGCGCG AGTTCTGGCG GGCCGAGCTG
GCCGGCGCGC CGGTGGCGGA CCCGGCGCCC GCGGACCTGA CGGCGGACAG GGTGATCCGG
CCGCTGCCCG AGGAGTTCGT GACGGGTGTG AACGCGCTGG CCCGCGAGTG CGGCGCCACC
CCGTTCGCGG TGTACCTGGC CGCTCTGGCC CTGGCCGGCG GCGAGCCGGA CACGCTGGTC
GGGGTGGTCG CGGCGCGGCG GGCCCGGCCG GAGCTGACCG AGGTCATCGG CCCGCTGGTG
GACACGGTGC CGGTACGGCT GCGGCCGACC GGCGCGCTGA CCTTCCGGAA CCTGGTCCGG
CAGGCCGCCG CCGCGACCAC GCGGGCACTG GTCCACCAGG AGATCCCGGC CGCCGACCTG
CCCCGCGCCC CGGTGCTGCT GGCCATGCAG CACGCCGAGG TTCCGGTGTG CCTGGGCGAC
CTGGAGCTGC TCACCGAGCT GGGCTCGGGC TCCTCGGTCC ACGAGCTCAG CGTGCTCGTC
AACCGGACCG TGTCCGGCAC CGAGCTGCAG CTGGAGTACG GCACCGCCCA CCTCGATCCG
GTGCGGGCCG AGGCGTACCT GGACCGGCTG GTGTGGCTGC TGCGGTGCGC GCTGACCGAC
CCGGACCGGC CGCTGTCGGC GTTCGAGCTG GTCACCCCGG ACGAGCGCGC CGCCCTGCTG
GCCGCCGCGG CCGGGCCGGA GCTGCCGGAG ATCGCGGAGA CGGTGCCGCG GGCGCTGGCG
GTCCGCACCG CCGGGACGGC CGTGATCGGC CCGGACGGCA CCTCGCTCGG CTACGCCGAG
CTGAACGACT GGTCCGGCCG GGTGGCGGCG GCGCTGCTGG AACACGGGGT GGCGCCGGGC
GAGGTGGTCG GGGTCTGCCT GCCCCGAGAC CACCTGATGC CGGCGGCCCT GCTGGCGGTG
TGGCGGGCCG GTGCGGCGTA CCTGCCGCTG GATCCGGAGC ACCCGGCCGA GCGGCTGCGC
CGGCTGGCCG AGGACGCCGG GGTGCGCGTG GTGCTCACCA GGGGCGGTGC GGCCGCCGTT
CCCGGCGTCA CCACGCTGGA CGCCGACGAC CTGGCCGCTG TCCACAGCGA GCCGGCGGAG
CTGCCGGAGG TGCGGGCCGG GGATCTGGCC TACGTGCTCC ACACCTCCGG CTCCACCGGG
ACGCCCAAGG GCGTGGAGGT CACCCACGGC AACCTGGCCG CCTATGTGGC CGGCCTGGTG
TCCGAGCTGC GCCTGGGCCC CCGGGACGTC CAGCCGGTCG TGGCGCCCCT GACCTTCGAC
ACCTCCGCGT CCGAGCTGTG GAGCCTGCTG AGCGTCGGCG GCACCTGCGT GGTGGTGGAC
CGCGCCACGG CCGTGGACGG GCACGCGCTG GCCGAGCGGA TCGCGACCAG CAAGGCCACC
GTGGTCGACC TGGTCCCGAC CACGTACCGG ATGCTGCTGG CCGCGGACTG GGCCGGTGAC
CCGGCACTGC TGGCCATCAG CGGCGGGGAG ACCCTTGATT CGGCCCTGGC CGGGCAGATC
CTGCCCCGGG TCCGCGAGCT GTGGAACACC TACGGCCCCA CCGAGGCCAC TGTCGCCTCG
ATCCAGCACC GGCTCGACGC GCACGGGGCG GGAGCGGTGC CGATCGGGCT GCCGATGCCC
GGGGAGCGGG CCTACGTGGT CGACTCCGAG CTGCGGCTGG CGCCGCCGGG CGCGGTGGGC
GAGCTTCTGC TGGGCGGCGC CGGTGTGGCC AGGGGCTACC GGGGCCGACC CGACCTGACC
GCGACCGCGT TCGTCGACGA CCCGTTCGTC CCCGGCGGCC GGTGTTACCG GACCGGGGAC
CTGGTGCGGT GGCGGCCGGA CGGGACGCTG GAGTTCCACG GCCGCCGGGA CCACCAGGTG
AAGGTGCGCG GCTACCGGAT CGAGCTGGGC GAGATCGAGA CGGTGCTGAA CGAGGTGGCC
TACGGGGCGG TGACCGTGTC CGGCTCGGGC GCGCAGGCAC ACCTGATCGG CTACGTGACG
CCGGAGACGG CCGACCTGGC CGCGGTCGAG CGGCACGTCC GGTCGCGGCT GCCCGGCCAC
ATGGTGCCCC GGCGGTGGGT CGCGCTGCCC ACCCTGCCCA CCCTGTCCAG CGGCAAGGTG
GATCGGGAAG CCCTGCCCGA GCCGGTCGAC GACCCGAAGG CCGGACGGGT GGCGCCCGCC
ACCGATCCCG AACGCCTGGT GGCCACCGTC TGGGCCGCCG TACTGGAACG CTCCGCCGTC
TGGGCCGATG ACGACTTCTT CGCCCTGGGC GGCCATTCCT TCGCCGCGAC CCGCGTGGTC
GGCCGGCTGC GGGAGACGTT GGATCTGGCC GTGCCGGTGC GGTTGCTGTT CGAGCGACCG
GTACTCGCCG ACTTCGCGGC CGGACTGGAA GAGCTGTTGA TCGCCAAACT GATGGATGGG
AACGACGCAT GA
 
Protein sequence
MTVDVTRGDL AGRLPAAAAG RFDAPGCVHE AIEAQSRRTP GRTAVVAGTR RLSYAELDRR 
ANDIARALAG RGVGRGHLVG VCLERDEWLV PALVGVWKAG AAYVPLDPAY PAERLRFMAE
DAAVTAVLTS AALRRTAVLT GAEAVLVDDV VPGEGAPPVA GDPGDAAYVI YTSGSTGTPK
GVVVEHHNTL NLLRWEASAY TAEELSGMLA GSSVCFDASI SQLFSPLIAG GTVIMADNLL
ALPSLPAREE VTTVYGVPSA LSVLLREPLP SGVRAVFSGG EPLTGALLRR IYANPGVRRV
LNLYGPTECT TACAGVEVGR DYEGEPPLGG PIAGAVFSVR DAAGDPVPDG ETGELWIGGP
GVTRGYLGRE APAFRTDPDG GRVYRSGDLV RWVDGELRFA GRADDQVKIR GYRVELGEVE
AALARHPAVR RATVLAATDD DGIAYLAGHV AASEVSEPEL RDWLCARLPD HLVPTRIGVA
EELPLAPNGK VDRAALPRLG AFRSAGAAPV APRTDDERLV AEVIAGVLGL PQVGVHDRFS
DLGGHSLAAA RVVTELSRRL GHTVPLAAFL TSPTAAGLAI RLRQAGPAPV RRSGRTRHPL
TDAQRQFWTL SQLHPRSPVT TLGIRLRVRD LRGAAPLRSA LDAVVRRHEV LRSTVSVDED
GVPYAVVHPP AAVPLVEHAV GEDPEKVARA AAAHVFDLTS EVPLLRAGLC WVGDAEAELV
VVVDHIAFDG GSVGVLMDEL AAELAGDPVV GPAVQVGDVA VQQREQPDLP RLREFWRAEL
AGAPVADPAP ADLTADRVIR PLPEEFVTGV NALARECGAT PFAVYLAALA LAGGEPDTLV
GVVAARRARP ELTEVIGPLV DTVPVRLRPT GALTFRNLVR QAAAATTRAL VHQEIPAADL
PRAPVLLAMQ HAEVPVCLGD LELLTELGSG SSVHELSVLV NRTVSGTELQ LEYGTAHLDP
VRAEAYLDRL VWLLRCALTD PDRPLSAFEL VTPDERAALL AAAAGPELPE IAETVPRALA
VRTAGTAVIG PDGTSLGYAE LNDWSGRVAA ALLEHGVAPG EVVGVCLPRD HLMPAALLAV
WRAGAAYLPL DPEHPAERLR RLAEDAGVRV VLTRGGAAAV PGVTTLDADD LAAVHSEPAE
LPEVRAGDLA YVLHTSGSTG TPKGVEVTHG NLAAYVAGLV SELRLGPRDV QPVVAPLTFD
TSASELWSLL SVGGTCVVVD RATAVDGHAL AERIATSKAT VVDLVPTTYR MLLAADWAGD
PALLAISGGE TLDSALAGQI LPRVRELWNT YGPTEATVAS IQHRLDAHGA GAVPIGLPMP
GERAYVVDSE LRLAPPGAVG ELLLGGAGVA RGYRGRPDLT ATAFVDDPFV PGGRCYRTGD
LVRWRPDGTL EFHGRRDHQV KVRGYRIELG EIETVLNEVA YGAVTVSGSG AQAHLIGYVT
PETADLAAVE RHVRSRLPGH MVPRRWVALP TLPTLSSGKV DREALPEPVD DPKAGRVAPA
TDPERLVATV WAAVLERSAV WADDDFFALG GHSFAATRVV GRLRETLDLA VPVRLLFERP
VLADFAAGLE ELLIAKLMDG NDA
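
The sequences above are listed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "GTGACCGTTG ATGTGACGCG TGGCGATCTG GCGGGCCGCC TTCCCGCAGC GGCGGCCGGG"
last_line = "AACGACGCAT GA"

dna_start = clean_sequence(first_line)
dna_end = clean_sequence(last_line)

print(len(dna_start))               # 60
print(dna_start[:3], dna_end[-3:])  # GTG TGA
```

The gene begins with GTG, which is one of the alternative initiation codons in translation table 11 and is read as methionine at the start of a protein, consistent with the leading M in the protein sequence. It ends with the stop codon TGA. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.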