Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3791 |
Symbol | |
ID | 8667081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4222349 |
End bp | 4225417 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339454 |
Protein GI | 271965258 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.17753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.608105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTTTC GCGTTCTGGG CCCTCTGGAG GTAGTCACCG GCGACCAACG GCTCGACCTG GGAGGCATCA GGCAGCAGAC CGTGCTCGCC GCCCTCCTCC TCGACGCCAA CCGAGCGGTC ACCACCGGCC GGCTCATGGA GGCGATCTAC GGAGACGATC CCCCCACCAC CTCGCGGGCC CAGGTGCAGA TCTGCATCTC CGCGCTCCGG CGCCTCTTCG CCGCCCACAG CAGCGCCGAC GTCATCTCCA CCCAGTCCCA GGGATACGCG ATCCAGGCCG ACAGCAGCCA GATCGACTCG CACCGCTTCG AGAAACTCGT CCTGCAGGCC CGGCGCGCCC GCGAGGACCG CAACCTCGAC GAGGCCATAA AGCACTACCG CAAGGCGCTC GCGCTCTGGC GCGGTCCCGC CCTGGACGGC ATCGAGAGCA GGCTCGTGCA GTCGGCGGCG AGCAGGCTCG CCGAACACCG GATCACGGCG AACGAGGACT GCGTCCAGCT CGAACTGGAC CTGGGCAGGC ACCACGAACT GGTCGGCGAG CTCACCGAGC TCGTGGAGGA GTATCCGCTG CGCGAGCGGC TCCGCGGCCA TCTCATGCTG GCGCTGTACC GCTCGGGACG CCAGGCGGAG GCGCTGCAGG TCTACCGGCT CGCCCGGCAG ACCATGATCG ACGAACTGGG GATCGAGCCC AACGAGCGGC TCCAGCAGCT CGAATACGCC ATCCTCACCT CCGACGAGAG CCTCGACCTG CCGGCGCAGC CCGCCAAGGT CGTCGAGGAG CCGACGGCCC GCGTCCCGAA CTCGCCGAGC ATGCTCCCCA CCGACATCGC CGACTTCACC GGCCGCACCA AGCAGATCGA CGACATCCGC CAGCGGCTGA CGCTCGCCGT CGACGACCGG TCCCGCTTCG CCGTACCGAT CATCGCGATC GTCGGGAAGG CGGGGATCGG GAAGACCACC GTCGCGGTGC ACTCCGCGCA CAGCGTCGCC GAGCACTTCC CCGACGGGCA GCTCTACGCC GACCTGCACG GAGGCGTCTC TCGCCCGACC AGCCCGATGC AGGTGCTCGA ACGGTTCCTG CGCGTGCTCG GCGTGCCCGG CACCGCGCTC CCCGACGGCC TGGAGGAGCG GGCGGAGATG TACCGCTCCC TGCTCGCCGA CCGCAGGATG CTGATCGTGC TGGACGACGC GGGCAACGAG AGCCAGGTCC TGCCGCTGCT TCCGGGCAAT CCCGCCTCCG CCGTGATCAT CACCAGCCGT AGCCGGCTCG CCGGGCTGGC CGGCGCGATC CACGTCGACG TGGACGTTTT CGATTCGAGC CAGTCGATGG ACCTGCTGTC CCGGATAGCG GGTGTGGAGC GGGTGCAGTC CGAGGCGGAG TCCGCCGCGG CGCTGGCCGA GCTCTGCGGG CAGCTCCCCC TGGCACTCCG CATCGCCGGC GCGCGGCTCC TGGCGCGCCC CCACTGGAGC ATCGAGCAGC TCGTGGGGCG GCTGGAGGAC GAGACCCGCC GGCTGGACGA GCTCAAACAC GGCGACATGG GGATCAGGGC CAGCATCTCG CTGACCTATG ACGGCACCGG CGACGACGCC CGGCGGCTCT TCCGCCGCCT GGCGATCCTG GACTCCCAGA TCTTCTCCGC CTGGATCAGC GCGGCCCTCC TCGACATGCC CTTCGCCGAC GCGCAGGACC TGCTGGACGA CCTGGCCGAC GCGCAGCTCG TCGAGACCAC CGGAGTCGGG CGTGGCGTGC ACACGCAGTA CAGGTTCCAC GACCTCATCC GGGTGTTCGC CCGGGAGCGT CTCGCCGCGG AGGAGTCCGC CCCCGAGCGG GGCGCGGCGC TGGCCCGCGT GCTCGGCGGC CTGCTCTTCC TCGCGGAGGC GGCCCGCCGC CGGGAGTACG GCCCCGACAT CCTGATCCAC AGTGACGCCT CCCTCTGGTC GCTGCCCAGG GATCTGGTCG ACCAGCTCAT CGCGGTGCCG CTCGCCTGGT TCGAGCGCGA GCGCATGATC CTGGTCTCCG GCATCCGGCA GGCGGCGCAG GCCGGCCTCG TCGAGCTCTG CTGGAGCCTC ACGATCAACG CGGTGACGTT CTTCGAGGCG CGGGTCTACC TCGACGACTG GCGGGAGACC CACGACATCG CGCTGGCGGC CACCCGGCAC GCCCGGGACA AGCGCGGCCA GGCGGCGATA CTCCACTCGA TGGGCTCGCT GGCCATCACC GAGCAGCGAT TCGACGACGC GCAGCGCGAA TTCGAAGCGG CGGTCAGGCT GTTCCGGGAG GTCAGCGACG ATCGCGGCGT CGCCATGGCC ATCCGCAACA TCGGGTTCCT CGACCGGATG AACGGCCGCT TCGACGAGGC GGCGGCGCAC TACGAATGGG CGCTGGAGAT CTTCCGCACG ATCGGGGACC AGGTCGCCGC CGCCTACGCG CTCCACAACC TGGCCCAGCT CAGGCTGGAG TTCGACGACC TCGAAGGCGC CAAGCGGCTG CTGTCGGAGG CGCTGCAGCT CAGCGGGAAC GGCGGCAGCC GAAGGGTGCG GGCCCAGGTC CTGCACCGGA TGGGCCACGT CCACCTCCAG TCGGACGAGC CCGCCCTCGC CGCACGCGTC TTCGACGAGG CGCTGACCGT CGTCAGGGAC ATCGGCGACC CCACCGGAGA GGCGTACGCG CTGCACGGGC TGGGCATCGC ACGGCTCCGG CAGGGCATGC TCGCCGAGGC GGAGGGGGCG CTGCGCCACG CCCTGATGCT GGCCAGCACA TCCAGCCAGC GGCTCGCGGA GGCGCGGGTG CTGGTCGGGC TGGGTGAGCT GACCATCGCG TCGGGCAATC CGGCACAGGC CGTGCCCTAT TTCCAGCAGG CCCTCACCCT GTTCCGCCGG ATACAGGTTC CGGTGCACGA GGCCCGCACC CTCATCATGC TCGGCGACGC GCACCTGGCC GCCGGAGACA GTTCCGCGGC CCACAACGCG CTGGCCGAGG CCCACGCCCT GGCCGAAAAG CTGGATCCCC CGGCGGCCGA GCAGGTGCGC GAACAGCTCG CCGAAAGGGC GCGCGGCCGG GCGGAGTGA
|
Protein sequence | MEFRVLGPLE VVTGDQRLDL GGIRQQTVLA ALLLDANRAV TTGRLMEAIY GDDPPTTSRA QVQICISALR RLFAAHSSAD VISTQSQGYA IQADSSQIDS HRFEKLVLQA RRAREDRNLD EAIKHYRKAL ALWRGPALDG IESRLVQSAA SRLAEHRITA NEDCVQLELD LGRHHELVGE LTELVEEYPL RERLRGHLML ALYRSGRQAE ALQVYRLARQ TMIDELGIEP NERLQQLEYA ILTSDESLDL PAQPAKVVEE PTARVPNSPS MLPTDIADFT GRTKQIDDIR QRLTLAVDDR SRFAVPIIAI VGKAGIGKTT VAVHSAHSVA EHFPDGQLYA DLHGGVSRPT SPMQVLERFL RVLGVPGTAL PDGLEERAEM YRSLLADRRM LIVLDDAGNE SQVLPLLPGN PASAVIITSR SRLAGLAGAI HVDVDVFDSS QSMDLLSRIA GVERVQSEAE SAAALAELCG QLPLALRIAG ARLLARPHWS IEQLVGRLED ETRRLDELKH GDMGIRASIS LTYDGTGDDA RRLFRRLAIL DSQIFSAWIS AALLDMPFAD AQDLLDDLAD AQLVETTGVG RGVHTQYRFH DLIRVFARER LAAEESAPER GAALARVLGG LLFLAEAARR REYGPDILIH SDASLWSLPR DLVDQLIAVP LAWFERERMI LVSGIRQAAQ AGLVELCWSL TINAVTFFEA RVYLDDWRET HDIALAATRH ARDKRGQAAI LHSMGSLAIT EQRFDDAQRE FEAAVRLFRE VSDDRGVAMA IRNIGFLDRM NGRFDEAAAH YEWALEIFRT IGDQVAAAYA LHNLAQLRLE FDDLEGAKRL LSEALQLSGN GGSRRVRAQV LHRMGHVHLQ SDEPALAARV FDEALTVVRD IGDPTGEAYA LHGLGIARLR QGMLAEAEGA LRHALMLAST SSQRLAEARV LVGLGELTIA SGNPAQAVPY FQQALTLFRR IQVPVHEART LIMLGDAHLA AGDSSAAHNA LAEAHALAEK LDPPAAEQVR EQLAERARGR AE
|
| |