Gene Sros_7741 details


Gene Information

Locus tag: Sros_7741
Symbol:
ID: 8671063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8536415
End bp: 8539456
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003343153
Protein GI: 271968957
COG category:
COG ID:
TIGRFAM ID:
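
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 8536415
END_BP = 8539456


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 3042
print(protein_length(gene_length(START_BP, END_BP)))  # 1013
```

Under these assumptions the computed values agree with the reported Gene Length (3042 bp) and Protein Length (1013 aa).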


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0132103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGGTTTG GACTCCTTGG CCCGGTGCTG GTCCAGGCCG GTGACTCACC CTTACGGATC 
ACGGCGCCCA AACAGCGCAC GGTTCTCGCC ATGCTGCTCG CGCGCGCCGG CTACGTCGTG
CCGATCCGGT CATTGGTGAC GGAGGTGTGG GACGAGCATC CGCCCCGCTC CGCGGTGGCC
AACCTGCGCA CCTATCTCAT GCAGCTCCGC AGGATGCTGC CCCCCTGCGA GAATCCGGCC
GTCGAGCCGC TGGTCACCTC GGACGCCGGC TACCTGCTGC GGGTCGAGCC TGCCGAGTTC
GACCTCTTCC AGTTCGAGGC GCTCTCCGCG CTCGGCCGCC AGGCGCTGGC CCGGCGGGAT
CTCGTGACGG CGCAGGACGC CTACACCCGG GCACTCGCAC TGTGGCGAGG GGGAGCGGCC
GAGGACGCGC CGCTGGGGCC GACCCTGCGC GAGGTGGTCG CCCGCCTCAC CGACCAGTAT
CTGAGCGCGG TGGAGGAGCA CACCGAGATC CAGCTCGCCC TCGGCAGCCC CACGACGGCG
GTCAGGCGCC TGCGCGAGCT GATCGGCCGC TACCCCCTGC GGGAACGGCT GTACGGCCAG
CTCATGGTCG CCCTGTACCG GTGCGGTGAC GTCGCCGGCG CGCTGGACGT CTTCGGGGTG
GCCCGCCGGA TCCTGGCCGA GGAGCTCGGG CTCGACCCCG GCCCCGAGCT GCGCCTCCTG
CACCAGGCGG TGCTGCGCCG GGACGCGGAT CTGATGGTGC CGGGCGGGCC GCCGGCCGGC
GGGGACACCG TGACCGTGGG AGCCGCCGTG ACCGGGGGAG CGGCCGTGCC GGTGGGAGCC
GCCGTGACCG CGGCGGCGGC CGTGCCGGTG GAGGACACCG TGACCGTGCA CGCCGGGGAC
GGGCCTCCGC GCCCGCGCCA GCTTCCGCGG GAGCCGCCGC TGTTCGTGGG CCGGCCGGCC
GAGCTGGCCG GGATGCTCAC CGCGCTGTGC GGCGATCCGG CGCAGGGCGC GGGACCGCCG
GTGCTGGCGC TGCACGGCCC CGGCGGCGTC GGCAAGTCGA CGCTGGCGCT GCGGGCGGCG
TACGCCGTGG CCGACCGCTA CGCCGACGGC CAGCTCTACG CCGATCTGCA GGGATCGAGC
CCGGGGCTGC CGCCGTTGCG GCCGGCCGAG GTGCTCGGCC GTTTCCTGCG GGCACTGGGG
GTGCCCCACG GCGAGGTCCC CGCCGCACCG GGGGAGGCGG CCGCCCACTA CCAGTCCCTG
CTGGCCGGCC GGCGGGTCCT GGTCGTCCTC GACAACGCCG TCGACGCGGC CCAGGTGGCA
CCGCTGCTGC CGGCCGGCGG CGGCTGCGCG GCGCTGGTCA CCAGCCGGAC GGCGCTGACC
ACCATGGACG CCGTGCCGAT CGCCCTCGAC GTGTTCGACG AGGCGGACTC GGTGCGGATG
CTCACGCTGC TGGCGGGGCA GGACCGGGTG GCCGCCGAGG CCGGGGCGGC GGCCGACGTC
GCGCGCTGGT GCGGCTACCA CCCGCTGGCG CTGCGCATCG CCGGCGCCCG TCTCGCCGGC
CGTCCCGACT GGTCGCTCGT GCGGTTCGGC GAGCGGCTGC GCGACCAGCG GCGACGGCTG
GACGAGCTGC GGGCGGCCGA CCTGGGCATC CGATCCTGTT TCGAGGTCAG CTACGCGGCG
CTGACGGGCG GCGCGGGCCG GGGCGGGGGC GCCGCGGCGC ACGCCTTCCG GCTGTTCGGC
GTGCTCGACG TGCCGGAGAT CAGCGTCGAG CTCGCCGCCG CGCTCCTCGA CGCCGACCTG
AAGGCGGCGG AGGACGCGCT CGACGAGCTG GCGGAGGTCC GCCTGGTCGA GCCGGCCGGC
GGCGGGCGGT TCCGCATGCA CGACCTGCTG AGGCTGTTCG CCGCGGAGCT GGCCGTCGTC
CACGACCCGC CGGACGAGCG CGTGCGGGCC GTACGGCGGG CGCTGGACTG GTACCTCGAC
CTCTGCCATC AGGTGAACGA CCTGCTCCAG CCGCATCTGC GATCCGGGGA CGGGCACCGG
CCGAGCCGGC GGGACACCGG GGTGGCCCTG CGCGACCACG TCGAGGCGGT GCGACGGTTC
GAGACCGAGA TGCCGTGCCT GATCGCGGCC GCGGCCCAGG CGGCGACGGG GGAGCAGGCG
GTCGCGTGCT TCGTCACCGA CCTGATGCCG CTGGTCAGGG CGCTGGCGAC CAAGTGCGGG
CACTGGCGGG AGTTCGAGAC CGTCGCACGG CTCGCCATCG GGGTGGCGCG GCGGCACGGC
GACCGTGCCG GGGAGGCGAC CGCGCTCACG ATGCTGGGAC TGGTGGAGTG GAGGACCGGC
CGGTCCGAGG CGGCCCGCGA CTGCCTGAGC CGCGCCCTTG AACTCCGGCG CGGCCTGGGC
GACCGGGAGG CCGAGGGGAT GGCGCTGCAC AACCTCGGCT GGCTGAGCAC GCGCAGCGGC
GACCTCGACG ACGCCCTCGG TTCCATCACC GCGGGCCTGC GGCTGCTTGA GGCGCACGGG
TCCAGCCGGG TCGGGATGGT CAGGCACAAC CTGGGCGAGG TCCTGCTGCG GCTCGCCCGG
TTCACCGAGG CGGCGGACTG CCTCCAGCGG TGCCTGGCCA TCCGCAGGTC GAACGGCGAC
CGCTTCGGGG AGGGCATCAC CCTGGCCGCG CTCGGCCGCG CCTACTGCCT GCTCGACCGC
AGGGACGAGG CTCTGGCCAC ACTCGGGGAG GCGCTGCGCC ACTGCCGCGA GACCGGCAAC
CGGGAGGACG AGTGGGAGGT TCTGCTCAGC AGGTCGGAGA TATGGCTGCG CCGCGGGGAT
CCGGCCTCGG CCGCCGCCGA CCTCGCCCGG GTGCTGGAGC TGACCGCCCA GGCCGGCGAG
CTCTACGGCC AGGCCGCCGC CACCCGCCAG CTCGCCAGGG CGCGCGCCGC GCTGGGCGAC
CCCGCCGCGG CGGAGGACGC CCGCCGGGCC GGGGAGCTCT TCGCCTCGCC CGCCATGCGG
CCCGATCCGG TGCTGGAGAG GCTGCTCACC GCCCCGCTGT AG
 
Protein sequence
MRFGLLGPVL VQAGDSPLRI TAPKQRTVLA MLLARAGYVV PIRSLVTEVW DEHPPRSAVA 
NLRTYLMQLR RMLPPCENPA VEPLVTSDAG YLLRVEPAEF DLFQFEALSA LGRQALARRD
LVTAQDAYTR ALALWRGGAA EDAPLGPTLR EVVARLTDQY LSAVEEHTEI QLALGSPTTA
VRRLRELIGR YPLRERLYGQ LMVALYRCGD VAGALDVFGV ARRILAEELG LDPGPELRLL
HQAVLRRDAD LMVPGGPPAG GDTVTVGAAV TGGAAVPVGA AVTAAAAVPV EDTVTVHAGD
GPPRPRQLPR EPPLFVGRPA ELAGMLTALC GDPAQGAGPP VLALHGPGGV GKSTLALRAA
YAVADRYADG QLYADLQGSS PGLPPLRPAE VLGRFLRALG VPHGEVPAAP GEAAAHYQSL
LAGRRVLVVL DNAVDAAQVA PLLPAGGGCA ALVTSRTALT TMDAVPIALD VFDEADSVRM
LTLLAGQDRV AAEAGAAADV ARWCGYHPLA LRIAGARLAG RPDWSLVRFG ERLRDQRRRL
DELRAADLGI RSCFEVSYAA LTGGAGRGGG AAAHAFRLFG VLDVPEISVE LAAALLDADL
KAAEDALDEL AEVRLVEPAG GGRFRMHDLL RLFAAELAVV HDPPDERVRA VRRALDWYLD
LCHQVNDLLQ PHLRSGDGHR PSRRDTGVAL RDHVEAVRRF ETEMPCLIAA AAQAATGEQA
VACFVTDLMP LVRALATKCG HWREFETVAR LAIGVARRHG DRAGEATALT MLGLVEWRTG
RSEAARDCLS RALELRRGLG DREAEGMALH NLGWLSTRSG DLDDALGSIT AGLRLLEAHG
SSRVGMVRHN LGEVLLRLAR FTEAADCLQR CLAIRRSNGD RFGEGITLAA LGRAYCLLDR
RDEALATLGE ALRHCRETGN REDEWEVLLS RSEIWLRRGD PASAAADLAR VLELTAQAGE
LYGQAAATRQ LARARAALGD PAAAEDARRA GELFASPAMR PDPVLERLLT APL
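
The gene and protein sequences above can be cross-checked against each other. The following is a rough sketch, not the database's own method: it builds the codon table from the standard NCBI ordering, which translation table 11 shares for elongation codons, and it writes the first codon as M on the assumption that it is a valid initiator (table 11 accepts TTG as an alternative start). The helper names `clean`, `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard NCBI codon ordering (bases T, C, A, G). Translation table 11
# uses the same amino acid assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is an initiator, so it is written as M
    without being checked against the table 11 start codon list.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAGGTTTG GACTCCTTGG CCCGGTGCTG"))  # MRFGLLGPVL
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field reported in the record.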