Gene Sros_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3657 
Symbol 
ID8666945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4052565 
End bp4055657 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content73% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003339329 
Protein GI271965133 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGATCA CCATGGGGGC CCGCGAGGTG GAGTTCAGAA TCCTGGGCCC GTTCGAGGTC 
GTGGTGGACG GGCGGGCGGT GCGGCTGAGC GGGCAGCGTG CCCGGGCGGT TCTGGCCATG
CTCGTGGTGC ACCACGGGCA GATGTTCACC ATCGAACGCC TGGCCGCGAT CTGGGGCGCC
GACCCGCCGG CCACCGTGCG CAACCAGGTG ATGATCGCGA TGGCCACGGT ACGCCGGGCG
CTGCGGGAGG CCGGTGTCGA GCCGCAGCTC ATCGAGACCA TGGGTTCCGG CTACCGGCTG
CGTGGCGGTC TCACCGACGC CCAGCGAGCC GAACAGGACA TCGAGCGAGC CCGGCAGGCG
GCGAGGGAAG GGCGACCGGC CGAGGCCAGC GACCTTTTGA GCCGGGCACT GGCGTGGTGG
CGAGGTCCGG TGCTCGCCGA TCTGGAGCTG CCGGAGGCGG AGGCCTCCGT GAGCCGGTGG
GAGGAGCTGC GGCTGGTCGC CCTCGAAGAG CGGGCCGAGC TGGAGCTGAC GCTGGGCCGG
CACCGCGACC TGACCGGCGA GCTGGCGGCG CTTCTCGCCG AGCAGCCGCT ACGGGAACGC
CTGCGCGGGA TGCTCATGCT GGCCCTCTAC CGGTCGGGGC GGCGCTCCGA CGCGCTGGAG
ATCTACCGCA CCGGACGGAT CCTGCTGGCC GAGGAGCTCG GCCTCGACCC GGGGCCCGAG
CTGCGCCGGC TGGAGGCGGC CATCCTCGCC GACGATCCCG GCCTGGACAT CCCGCCACAG
CAGCAGGCGC GGCACACCCC GGCCCCCGCC GAGCTGCCAC CCGACGTCAT GGGCTTCGTG
GGGCGCGAAC GCGATCTGGC CGCGTTGCGG CGCTACGCCG TTCCCGACAG TTCGACCATG
GCCGTCGTGA CAGCCGTCAC CGGGGCGGCC GGGGTGGGAA AGACCGCCCT CGCCCTGCGT
TTCGGCCACC GGACGGCCGA CGAATTCCCG GACGGGCAGC TCTACATCGA CCTGCGGGGG
CACTCGCTGC GTCCGCCGAT GGCGCCGCTG GAGGCGCTCA CCCGGATGCT GGGATCACTG
GGCGTGCCGG CCGAGCAGAT CCCCGGCGAC GAGGAGAGGG CCGCGGGCCT GTACCGGTCA
CACCTGTCGG GCAGGCGGAT ACTCGTACTG CTCGACAACG CCCACACAGC CGACCAGGTA
CGACCGCTGT TACCGGGCGC TCCGGGCTGC CTCACCCTGG TCACCAGCCG GGACGCGCTC
GCCGGGCTGG CGGCCTCCCA TGGCGCCCGG CGGCTCTCCC TGGGCATGCT CGGCCACGCC
GAAAGCCTGA GACTGCTGGA ATCGGTGATC GGCGCCGAGC GCCTGGCGGC CGAGCGGCGA
ACGGCGGAGG AGATCGTCCG GCTCTGCGCC CACCTGCCGC TCGCCCTGCG CGTGGCCGCC
GCCACCCTGG CCACCCATCC GCACTGGTCG CTGGCCGGCT ACGGCACGGC TCTGGCCGCC
GGGCGGCTGG ACATGCTCCA GATCGACGGC GACATGGCGG TGCGCGCCGC GTTCAGCCTG
TCCTATGCGC GGTTACCGCC CCCCGCCCGG CGGCTGTTCC GGCTGCTCGG ACTGGTCCCC
GGACCCGACG TCACGGCGCC GGCCGCCGCG GCCCTGGCGG GCATCGACGT GACGCAAGCC
GAGCGGCTGC TGGACCGGCT CGCCGCCGCT CACCTTCTCA CAGAGCATCA GCCGCACCGC
CACACCTTCC ACGACCTGCT GCGCCAGTAC GCCGGAGAAC TCGCCCGGCA GGAAGACGAC
TCGGAGAGCC GCCATGAGGC CGCCGAACGC CTCGGCGCCT GGTATCTGGC CAGAGCCGCC
TGGGCGGCCG AGATCGCCTA TCCGTCGATC ACCCGGCTGC CCGCCCTCGA AGCGGTCGTT
CGGCAGCCCG CCGTACAGGC GCCGGACGAG CACGCCGGCC GCTCGCCGGA CGGGCGCGGC
GCGGCCGCGG ACTGGTTGCG GGACGAGCGT GCGAACCTGA TCGCGATCGC GGTCCACGCG
GCCGAGCACG GACCGCGGCA CCACGCGTGG CTGATCGCCG ATGCCCTGCG CGGCCACCTG
TTCCAGCACA TCGACATCGC CGACTGCGTG GCCGTCGCCG AGGCGGCCCT GCGCGCCGCC
ACGGCCGAGG GCGAGCCGTC CGGCCTGGCG ACGGCGCAGC TCTGCGTCGG CGCCGCCGCC
CAGCTACGCT CCGACTACGG TCAGGCACGC GCCGCCTACG CCGCCGCCGC GCGCTACAGC
GAACAGGCCG GATGGCCTCA AGGCGTCTCC GCCGCCCACA ACAACGCCGC CTCCGCCTGC
CACGATCAGG GCGAGCTCCA GCCCGCCGTC GACCACCTCG ATGTCGCCCT GCGGATCAAC
CGCGACATCG GCAATGTCTA CGGCGAGACG AACGCCCTGA GCAATCTCGG CACCATGCAC
CTCGAACTCG GCGCCCTCGG CGAGGCGGAA CGATACTTGC GCCATGCCGT GGCACTGCAC
CGGACGCTGC GTGGCAGCCC GCTGAGCTCC TCGTTGAACG AGCTCGCAAC CGTCCGGCGC
CTCCTCGGGC ACCTCGACGA GGCCTTGGCC CTGGCCACCG AGGCGCTCGC ACACGATCGC
GCAAGCGGCC TGCCCGTGCC CGAGGTCAAA AGCCTCGCCA CCCTGGCGGA GATCCACCGC
GACGCGGGCC GTCTCGGCCC TGCCCTCGAC CACGCGATCG CGGCGCACGA CCTGGCCGAG
AGCACCCATC ATGTCTACGC CCTGTGCACC GCGGCGAACG TCCTCGGCAC CGTCCGTACG
CTGGGCGAAC GCTACGGCGA GGCGGCGGAC GCGCATCAGC GGGCCCTCGA ACTCGCGACT
GAGGCGGGCA TGCGCTACAT GCGAGTCCGG GCCCTGCTCG GGCTCGCCCG CGCGCACCTC
GGTACAGGCC GGCGCGACCT GGCGCTCTCC CTCGCCGACG AGGCGCTGGA GCTGGCCCGC
CCGACGAACT ACCGCCTCCA GGAGGGCGCC GCGCTGGCCG TGGTGGCGGA GATCGGCCGA
GCGTGGGAAC AACGTTCCGC CCGGCCCTCG TGA
 
Protein sequence
MVITMGAREV EFRILGPFEV VVDGRAVRLS GQRARAVLAM LVVHHGQMFT IERLAAIWGA 
DPPATVRNQV MIAMATVRRA LREAGVEPQL IETMGSGYRL RGGLTDAQRA EQDIERARQA
AREGRPAEAS DLLSRALAWW RGPVLADLEL PEAEASVSRW EELRLVALEE RAELELTLGR
HRDLTGELAA LLAEQPLRER LRGMLMLALY RSGRRSDALE IYRTGRILLA EELGLDPGPE
LRRLEAAILA DDPGLDIPPQ QQARHTPAPA ELPPDVMGFV GRERDLAALR RYAVPDSSTM
AVVTAVTGAA GVGKTALALR FGHRTADEFP DGQLYIDLRG HSLRPPMAPL EALTRMLGSL
GVPAEQIPGD EERAAGLYRS HLSGRRILVL LDNAHTADQV RPLLPGAPGC LTLVTSRDAL
AGLAASHGAR RLSLGMLGHA ESLRLLESVI GAERLAAERR TAEEIVRLCA HLPLALRVAA
ATLATHPHWS LAGYGTALAA GRLDMLQIDG DMAVRAAFSL SYARLPPPAR RLFRLLGLVP
GPDVTAPAAA ALAGIDVTQA ERLLDRLAAA HLLTEHQPHR HTFHDLLRQY AGELARQEDD
SESRHEAAER LGAWYLARAA WAAEIAYPSI TRLPALEAVV RQPAVQAPDE HAGRSPDGRG
AAADWLRDER ANLIAIAVHA AEHGPRHHAW LIADALRGHL FQHIDIADCV AVAEAALRAA
TAEGEPSGLA TAQLCVGAAA QLRSDYGQAR AAYAAAARYS EQAGWPQGVS AAHNNAASAC
HDQGELQPAV DHLDVALRIN RDIGNVYGET NALSNLGTMH LELGALGEAE RYLRHAVALH
RTLRGSPLSS SLNELATVRR LLGHLDEALA LATEALAHDR ASGLPVPEVK SLATLAEIHR
DAGRLGPALD HAIAAHDLAE STHHVYALCT AANVLGTVRT LGERYGEAAD AHQRALELAT
EAGMRYMRVR ALLGLARAHL GTGRRDLALS LADEALELAR PTNYRLQEGA ALAVVAEIGR
AWEQRSARPS