Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3657 |
Symbol | |
ID | 8666945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4052565 |
End bp | 4055657 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339329 |
Protein GI | 271965133 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATCA CCATGGGGGC CCGCGAGGTG GAGTTCAGAA TCCTGGGCCC GTTCGAGGTC GTGGTGGACG GGCGGGCGGT GCGGCTGAGC GGGCAGCGTG CCCGGGCGGT TCTGGCCATG CTCGTGGTGC ACCACGGGCA GATGTTCACC ATCGAACGCC TGGCCGCGAT CTGGGGCGCC GACCCGCCGG CCACCGTGCG CAACCAGGTG ATGATCGCGA TGGCCACGGT ACGCCGGGCG CTGCGGGAGG CCGGTGTCGA GCCGCAGCTC ATCGAGACCA TGGGTTCCGG CTACCGGCTG CGTGGCGGTC TCACCGACGC CCAGCGAGCC GAACAGGACA TCGAGCGAGC CCGGCAGGCG GCGAGGGAAG GGCGACCGGC CGAGGCCAGC GACCTTTTGA GCCGGGCACT GGCGTGGTGG CGAGGTCCGG TGCTCGCCGA TCTGGAGCTG CCGGAGGCGG AGGCCTCCGT GAGCCGGTGG GAGGAGCTGC GGCTGGTCGC CCTCGAAGAG CGGGCCGAGC TGGAGCTGAC GCTGGGCCGG CACCGCGACC TGACCGGCGA GCTGGCGGCG CTTCTCGCCG AGCAGCCGCT ACGGGAACGC CTGCGCGGGA TGCTCATGCT GGCCCTCTAC CGGTCGGGGC GGCGCTCCGA CGCGCTGGAG ATCTACCGCA CCGGACGGAT CCTGCTGGCC GAGGAGCTCG GCCTCGACCC GGGGCCCGAG CTGCGCCGGC TGGAGGCGGC CATCCTCGCC GACGATCCCG GCCTGGACAT CCCGCCACAG CAGCAGGCGC GGCACACCCC GGCCCCCGCC GAGCTGCCAC CCGACGTCAT GGGCTTCGTG GGGCGCGAAC GCGATCTGGC CGCGTTGCGG CGCTACGCCG TTCCCGACAG TTCGACCATG GCCGTCGTGA CAGCCGTCAC CGGGGCGGCC GGGGTGGGAA AGACCGCCCT CGCCCTGCGT TTCGGCCACC GGACGGCCGA CGAATTCCCG GACGGGCAGC TCTACATCGA CCTGCGGGGG CACTCGCTGC GTCCGCCGAT GGCGCCGCTG GAGGCGCTCA CCCGGATGCT GGGATCACTG GGCGTGCCGG CCGAGCAGAT CCCCGGCGAC GAGGAGAGGG CCGCGGGCCT GTACCGGTCA CACCTGTCGG GCAGGCGGAT ACTCGTACTG CTCGACAACG CCCACACAGC CGACCAGGTA CGACCGCTGT TACCGGGCGC TCCGGGCTGC CTCACCCTGG TCACCAGCCG GGACGCGCTC GCCGGGCTGG CGGCCTCCCA TGGCGCCCGG CGGCTCTCCC TGGGCATGCT CGGCCACGCC GAAAGCCTGA GACTGCTGGA ATCGGTGATC GGCGCCGAGC GCCTGGCGGC CGAGCGGCGA ACGGCGGAGG AGATCGTCCG GCTCTGCGCC CACCTGCCGC TCGCCCTGCG CGTGGCCGCC GCCACCCTGG CCACCCATCC GCACTGGTCG CTGGCCGGCT ACGGCACGGC TCTGGCCGCC GGGCGGCTGG ACATGCTCCA GATCGACGGC GACATGGCGG TGCGCGCCGC GTTCAGCCTG TCCTATGCGC GGTTACCGCC CCCCGCCCGG CGGCTGTTCC GGCTGCTCGG ACTGGTCCCC GGACCCGACG TCACGGCGCC GGCCGCCGCG GCCCTGGCGG GCATCGACGT GACGCAAGCC GAGCGGCTGC TGGACCGGCT CGCCGCCGCT CACCTTCTCA CAGAGCATCA GCCGCACCGC CACACCTTCC ACGACCTGCT GCGCCAGTAC GCCGGAGAAC TCGCCCGGCA GGAAGACGAC TCGGAGAGCC GCCATGAGGC CGCCGAACGC CTCGGCGCCT GGTATCTGGC CAGAGCCGCC TGGGCGGCCG AGATCGCCTA TCCGTCGATC ACCCGGCTGC CCGCCCTCGA AGCGGTCGTT CGGCAGCCCG CCGTACAGGC GCCGGACGAG CACGCCGGCC GCTCGCCGGA CGGGCGCGGC GCGGCCGCGG ACTGGTTGCG GGACGAGCGT GCGAACCTGA TCGCGATCGC GGTCCACGCG GCCGAGCACG GACCGCGGCA CCACGCGTGG CTGATCGCCG ATGCCCTGCG CGGCCACCTG TTCCAGCACA TCGACATCGC CGACTGCGTG GCCGTCGCCG AGGCGGCCCT GCGCGCCGCC ACGGCCGAGG GCGAGCCGTC CGGCCTGGCG ACGGCGCAGC TCTGCGTCGG CGCCGCCGCC CAGCTACGCT CCGACTACGG TCAGGCACGC GCCGCCTACG CCGCCGCCGC GCGCTACAGC GAACAGGCCG GATGGCCTCA AGGCGTCTCC GCCGCCCACA ACAACGCCGC CTCCGCCTGC CACGATCAGG GCGAGCTCCA GCCCGCCGTC GACCACCTCG ATGTCGCCCT GCGGATCAAC CGCGACATCG GCAATGTCTA CGGCGAGACG AACGCCCTGA GCAATCTCGG CACCATGCAC CTCGAACTCG GCGCCCTCGG CGAGGCGGAA CGATACTTGC GCCATGCCGT GGCACTGCAC CGGACGCTGC GTGGCAGCCC GCTGAGCTCC TCGTTGAACG AGCTCGCAAC CGTCCGGCGC CTCCTCGGGC ACCTCGACGA GGCCTTGGCC CTGGCCACCG AGGCGCTCGC ACACGATCGC GCAAGCGGCC TGCCCGTGCC CGAGGTCAAA AGCCTCGCCA CCCTGGCGGA GATCCACCGC GACGCGGGCC GTCTCGGCCC TGCCCTCGAC CACGCGATCG CGGCGCACGA CCTGGCCGAG AGCACCCATC ATGTCTACGC CCTGTGCACC GCGGCGAACG TCCTCGGCAC CGTCCGTACG CTGGGCGAAC GCTACGGCGA GGCGGCGGAC GCGCATCAGC GGGCCCTCGA ACTCGCGACT GAGGCGGGCA TGCGCTACAT GCGAGTCCGG GCCCTGCTCG GGCTCGCCCG CGCGCACCTC GGTACAGGCC GGCGCGACCT GGCGCTCTCC CTCGCCGACG AGGCGCTGGA GCTGGCCCGC CCGACGAACT ACCGCCTCCA GGAGGGCGCC GCGCTGGCCG TGGTGGCGGA GATCGGCCGA GCGTGGGAAC AACGTTCCGC CCGGCCCTCG TGA
|
Protein sequence | MVITMGAREV EFRILGPFEV VVDGRAVRLS GQRARAVLAM LVVHHGQMFT IERLAAIWGA DPPATVRNQV MIAMATVRRA LREAGVEPQL IETMGSGYRL RGGLTDAQRA EQDIERARQA AREGRPAEAS DLLSRALAWW RGPVLADLEL PEAEASVSRW EELRLVALEE RAELELTLGR HRDLTGELAA LLAEQPLRER LRGMLMLALY RSGRRSDALE IYRTGRILLA EELGLDPGPE LRRLEAAILA DDPGLDIPPQ QQARHTPAPA ELPPDVMGFV GRERDLAALR RYAVPDSSTM AVVTAVTGAA GVGKTALALR FGHRTADEFP DGQLYIDLRG HSLRPPMAPL EALTRMLGSL GVPAEQIPGD EERAAGLYRS HLSGRRILVL LDNAHTADQV RPLLPGAPGC LTLVTSRDAL AGLAASHGAR RLSLGMLGHA ESLRLLESVI GAERLAAERR TAEEIVRLCA HLPLALRVAA ATLATHPHWS LAGYGTALAA GRLDMLQIDG DMAVRAAFSL SYARLPPPAR RLFRLLGLVP GPDVTAPAAA ALAGIDVTQA ERLLDRLAAA HLLTEHQPHR HTFHDLLRQY AGELARQEDD SESRHEAAER LGAWYLARAA WAAEIAYPSI TRLPALEAVV RQPAVQAPDE HAGRSPDGRG AAADWLRDER ANLIAIAVHA AEHGPRHHAW LIADALRGHL FQHIDIADCV AVAEAALRAA TAEGEPSGLA TAQLCVGAAA QLRSDYGQAR AAYAAAARYS EQAGWPQGVS AAHNNAASAC HDQGELQPAV DHLDVALRIN RDIGNVYGET NALSNLGTMH LELGALGEAE RYLRHAVALH RTLRGSPLSS SLNELATVRR LLGHLDEALA LATEALAHDR ASGLPVPEVK SLATLAEIHR DAGRLGPALD HAIAAHDLAE STHHVYALCT AANVLGTVRT LGERYGEAAD AHQRALELAT EAGMRYMRVR ALLGLARAHL GTGRRDLALS LADEALELAR PTNYRLQEGA ALAVVAEIGR AWEQRSARPS
|
| |