Gene Sros_3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3791 
Symbol 
ID8667081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4222349 
End bp4225417 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content71% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003339454 
Protein GI271965258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.608105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTTTC GCGTTCTGGG CCCTCTGGAG GTAGTCACCG GCGACCAACG GCTCGACCTG 
GGAGGCATCA GGCAGCAGAC CGTGCTCGCC GCCCTCCTCC TCGACGCCAA CCGAGCGGTC
ACCACCGGCC GGCTCATGGA GGCGATCTAC GGAGACGATC CCCCCACCAC CTCGCGGGCC
CAGGTGCAGA TCTGCATCTC CGCGCTCCGG CGCCTCTTCG CCGCCCACAG CAGCGCCGAC
GTCATCTCCA CCCAGTCCCA GGGATACGCG ATCCAGGCCG ACAGCAGCCA GATCGACTCG
CACCGCTTCG AGAAACTCGT CCTGCAGGCC CGGCGCGCCC GCGAGGACCG CAACCTCGAC
GAGGCCATAA AGCACTACCG CAAGGCGCTC GCGCTCTGGC GCGGTCCCGC CCTGGACGGC
ATCGAGAGCA GGCTCGTGCA GTCGGCGGCG AGCAGGCTCG CCGAACACCG GATCACGGCG
AACGAGGACT GCGTCCAGCT CGAACTGGAC CTGGGCAGGC ACCACGAACT GGTCGGCGAG
CTCACCGAGC TCGTGGAGGA GTATCCGCTG CGCGAGCGGC TCCGCGGCCA TCTCATGCTG
GCGCTGTACC GCTCGGGACG CCAGGCGGAG GCGCTGCAGG TCTACCGGCT CGCCCGGCAG
ACCATGATCG ACGAACTGGG GATCGAGCCC AACGAGCGGC TCCAGCAGCT CGAATACGCC
ATCCTCACCT CCGACGAGAG CCTCGACCTG CCGGCGCAGC CCGCCAAGGT CGTCGAGGAG
CCGACGGCCC GCGTCCCGAA CTCGCCGAGC ATGCTCCCCA CCGACATCGC CGACTTCACC
GGCCGCACCA AGCAGATCGA CGACATCCGC CAGCGGCTGA CGCTCGCCGT CGACGACCGG
TCCCGCTTCG CCGTACCGAT CATCGCGATC GTCGGGAAGG CGGGGATCGG GAAGACCACC
GTCGCGGTGC ACTCCGCGCA CAGCGTCGCC GAGCACTTCC CCGACGGGCA GCTCTACGCC
GACCTGCACG GAGGCGTCTC TCGCCCGACC AGCCCGATGC AGGTGCTCGA ACGGTTCCTG
CGCGTGCTCG GCGTGCCCGG CACCGCGCTC CCCGACGGCC TGGAGGAGCG GGCGGAGATG
TACCGCTCCC TGCTCGCCGA CCGCAGGATG CTGATCGTGC TGGACGACGC GGGCAACGAG
AGCCAGGTCC TGCCGCTGCT TCCGGGCAAT CCCGCCTCCG CCGTGATCAT CACCAGCCGT
AGCCGGCTCG CCGGGCTGGC CGGCGCGATC CACGTCGACG TGGACGTTTT CGATTCGAGC
CAGTCGATGG ACCTGCTGTC CCGGATAGCG GGTGTGGAGC GGGTGCAGTC CGAGGCGGAG
TCCGCCGCGG CGCTGGCCGA GCTCTGCGGG CAGCTCCCCC TGGCACTCCG CATCGCCGGC
GCGCGGCTCC TGGCGCGCCC CCACTGGAGC ATCGAGCAGC TCGTGGGGCG GCTGGAGGAC
GAGACCCGCC GGCTGGACGA GCTCAAACAC GGCGACATGG GGATCAGGGC CAGCATCTCG
CTGACCTATG ACGGCACCGG CGACGACGCC CGGCGGCTCT TCCGCCGCCT GGCGATCCTG
GACTCCCAGA TCTTCTCCGC CTGGATCAGC GCGGCCCTCC TCGACATGCC CTTCGCCGAC
GCGCAGGACC TGCTGGACGA CCTGGCCGAC GCGCAGCTCG TCGAGACCAC CGGAGTCGGG
CGTGGCGTGC ACACGCAGTA CAGGTTCCAC GACCTCATCC GGGTGTTCGC CCGGGAGCGT
CTCGCCGCGG AGGAGTCCGC CCCCGAGCGG GGCGCGGCGC TGGCCCGCGT GCTCGGCGGC
CTGCTCTTCC TCGCGGAGGC GGCCCGCCGC CGGGAGTACG GCCCCGACAT CCTGATCCAC
AGTGACGCCT CCCTCTGGTC GCTGCCCAGG GATCTGGTCG ACCAGCTCAT CGCGGTGCCG
CTCGCCTGGT TCGAGCGCGA GCGCATGATC CTGGTCTCCG GCATCCGGCA GGCGGCGCAG
GCCGGCCTCG TCGAGCTCTG CTGGAGCCTC ACGATCAACG CGGTGACGTT CTTCGAGGCG
CGGGTCTACC TCGACGACTG GCGGGAGACC CACGACATCG CGCTGGCGGC CACCCGGCAC
GCCCGGGACA AGCGCGGCCA GGCGGCGATA CTCCACTCGA TGGGCTCGCT GGCCATCACC
GAGCAGCGAT TCGACGACGC GCAGCGCGAA TTCGAAGCGG CGGTCAGGCT GTTCCGGGAG
GTCAGCGACG ATCGCGGCGT CGCCATGGCC ATCCGCAACA TCGGGTTCCT CGACCGGATG
AACGGCCGCT TCGACGAGGC GGCGGCGCAC TACGAATGGG CGCTGGAGAT CTTCCGCACG
ATCGGGGACC AGGTCGCCGC CGCCTACGCG CTCCACAACC TGGCCCAGCT CAGGCTGGAG
TTCGACGACC TCGAAGGCGC CAAGCGGCTG CTGTCGGAGG CGCTGCAGCT CAGCGGGAAC
GGCGGCAGCC GAAGGGTGCG GGCCCAGGTC CTGCACCGGA TGGGCCACGT CCACCTCCAG
TCGGACGAGC CCGCCCTCGC CGCACGCGTC TTCGACGAGG CGCTGACCGT CGTCAGGGAC
ATCGGCGACC CCACCGGAGA GGCGTACGCG CTGCACGGGC TGGGCATCGC ACGGCTCCGG
CAGGGCATGC TCGCCGAGGC GGAGGGGGCG CTGCGCCACG CCCTGATGCT GGCCAGCACA
TCCAGCCAGC GGCTCGCGGA GGCGCGGGTG CTGGTCGGGC TGGGTGAGCT GACCATCGCG
TCGGGCAATC CGGCACAGGC CGTGCCCTAT TTCCAGCAGG CCCTCACCCT GTTCCGCCGG
ATACAGGTTC CGGTGCACGA GGCCCGCACC CTCATCATGC TCGGCGACGC GCACCTGGCC
GCCGGAGACA GTTCCGCGGC CCACAACGCG CTGGCCGAGG CCCACGCCCT GGCCGAAAAG
CTGGATCCCC CGGCGGCCGA GCAGGTGCGC GAACAGCTCG CCGAAAGGGC GCGCGGCCGG
GCGGAGTGA
 
Protein sequence
MEFRVLGPLE VVTGDQRLDL GGIRQQTVLA ALLLDANRAV TTGRLMEAIY GDDPPTTSRA 
QVQICISALR RLFAAHSSAD VISTQSQGYA IQADSSQIDS HRFEKLVLQA RRAREDRNLD
EAIKHYRKAL ALWRGPALDG IESRLVQSAA SRLAEHRITA NEDCVQLELD LGRHHELVGE
LTELVEEYPL RERLRGHLML ALYRSGRQAE ALQVYRLARQ TMIDELGIEP NERLQQLEYA
ILTSDESLDL PAQPAKVVEE PTARVPNSPS MLPTDIADFT GRTKQIDDIR QRLTLAVDDR
SRFAVPIIAI VGKAGIGKTT VAVHSAHSVA EHFPDGQLYA DLHGGVSRPT SPMQVLERFL
RVLGVPGTAL PDGLEERAEM YRSLLADRRM LIVLDDAGNE SQVLPLLPGN PASAVIITSR
SRLAGLAGAI HVDVDVFDSS QSMDLLSRIA GVERVQSEAE SAAALAELCG QLPLALRIAG
ARLLARPHWS IEQLVGRLED ETRRLDELKH GDMGIRASIS LTYDGTGDDA RRLFRRLAIL
DSQIFSAWIS AALLDMPFAD AQDLLDDLAD AQLVETTGVG RGVHTQYRFH DLIRVFARER
LAAEESAPER GAALARVLGG LLFLAEAARR REYGPDILIH SDASLWSLPR DLVDQLIAVP
LAWFERERMI LVSGIRQAAQ AGLVELCWSL TINAVTFFEA RVYLDDWRET HDIALAATRH
ARDKRGQAAI LHSMGSLAIT EQRFDDAQRE FEAAVRLFRE VSDDRGVAMA IRNIGFLDRM
NGRFDEAAAH YEWALEIFRT IGDQVAAAYA LHNLAQLRLE FDDLEGAKRL LSEALQLSGN
GGSRRVRAQV LHRMGHVHLQ SDEPALAARV FDEALTVVRD IGDPTGEAYA LHGLGIARLR
QGMLAEAEGA LRHALMLAST SSQRLAEARV LVGLGELTIA SGNPAQAVPY FQQALTLFRR
IQVPVHEART LIMLGDAHLA AGDSSAAHNA LAEAHALAEK LDPPAAEQVR EQLAERARGR
AE