Gene Sros_5783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5783 
Symbol 
ID8669077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6333915 
End bp6336980 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003341272 
Protein GI271967076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0303431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.017876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTTG ATCAGCCCAC ATTCAGCGTG CTCGGTGCCC TGGACGTCCG TGCCTCGGGA 
CGGCCCCTCC GCATCGCCGG CACGAAACCA CGAATCCTGC TGGCATCGCT GCTGCTCCAC
GCGAACCACG TGGTCGGCGC GGACCTCCTC GTCGAGGTGC TGTGGCCCCG GCACCGGCCG
CGCTCCGCCC ACGCCAACCT CCGCACCTAC GTCAGCTCGC TGCGCGGCGT GCTCGACGCG
GCCGGCGCCC GGATCCAGGC CCGCCCGCCC GGCTACGTGA TCGAGCTGGC ACCCGGACAG
CTCGACGCGC TGCTCTTCGC CGACCTGATC GCGCGGGCGC GCGCGGCGGG CCGTACGGAG
GAGGCCTTCG ACGGGCTCAG CCGGGCGCTC GGGCTGTGGC GCGGCACCCC CCTCGCCGAC
CTGCCGGGCA GCCCGCTCTG GGACGGGCGG CTGCAGTCGC TGGCGGAGCT GCGGCTCGGC
GCGGCCGAGG AGCTGATCGC CCTGAGAATG GCCCGGGGCC GGTACGCCGA CGCGATCGGA
GAGCTGCGCG GGCTGCTGAA GGAGCATCCC TTCCGCGAGG ACCTCTGGCA GCGGCTCATA
CTCGCCCTGC ACTGGAGCGG CCGGCAGGCC GAGGCCCTGC ACGCCTACGC CACGGTCAGG
CGGCAGCTGG TCACCGAGCT CGGCATCGAA CCCGGCACGG ATCTGCGCCG GGCGCACGCC
GCCGTCCTGG CGGGGGAGCT CCCGCCCGCC GCGACCCCGC CCGATCTCCC CCCGCCCGCC
GCCGAGACCG TCTCCGGCCC GCCGTCCCGC GCCGAGACCG TCGCCGGCCC GCCGCCCCCC
GCCGAGACCG TCCCGCACCA CCTGCCGGCC GTCCCCGCCC CCGGCTCCAC TCCTCACCAG
CTCCCGTCGG ACATCCCGGA CTTCACCGGC AGGTCTGAGG ACGTCGCCGT CCTCACGCGG
GCGCTGTCAC CGGTGGAACG GCCGCCGGAC GGACCGCCGT CGATCGTGGT GGTGGTGGGC
CCGCCGGGCG TGGGCAAGTC GGCGCTGGCC GTGCACTGCG CGCACGCTGT ACGGACCGGG
TATCCGGGCG GGCAGCTCTA CCTGGACCTC GGCGGGACCG AGTACGCGCC GGCCGACCCG
GGCGAGCTGC TGGCCGAGGC GCTGCGGGCG CTGGGGGTAG GCGAGGCCGG CCTGCCGTGC
ACCGTGCGCG AACGCTCCGC GCTGTACCGG TCCCTGCTGG CCGAACGCCC GATGCTCGTC
CTCCTCGACG ACGCCGCCGG CGCCGCCCAG GTACGGCCGC TGCTGCCCGG CAACGGCTGC
GCGGTGCTCG TGACCAGCAG GCGGCGGATC ACCGAACTGC CCGGCGCCCT CCAGCTCGAA
CTGGACGTCC TGTCACCGGA GGAGGCCGAG GAGCTCCTGG GCAGGATCGT CGGCTCCGAG
CGGCTGGGAC GGGAGAGGGA GGCGGCCTCG GCGATCCTGC GCGCCTGCGG CTACCTGCCG
CTCGCCGTCA GGGTCGCCGG AGCGCGGCTC GCGGGACGGC CCAGGTGGTC GCTGGGGGTG
CTGCGGCAGC GGCTGGAGGA CGAGGCGGGC AGGCTCGGCG AGCTGCGGGC CGGCGATCTG
GAGGTGCGGG GCAGCTTCGA CCGGAGCTAC CGGCTGCTGC CCGACGACGC GGCCCTGGCC
TTCCGGGCAC TCGGTCTCCT GGGACCGCAG TCCCTGCCCG GCTGGGTGGT CGACGCCGTG
CTGGACCGGC ACCGGGCCGA CGACGTGACG GACGTCCTGG TCGATGTGAA CCTGCTCCAG
CTGGTCGGGA CCGACCCGAT CGGCCAGCCG CGCTACCGGC TGCACGACCT GGTCCGCTGC
AACGCCAGGG AGAAGGCCGG CGGCGCCCCG GAGCGGCACG CCCTCATCAG GGTGCTCGGA
GCGTGGATGG CCACCACCGA GAGTGCCACG GCACGGTTGC CGACCACGCT CTTCAGCCTG
ACATCGGCCA GGGCGACCCG CTGGAACCTG GCGGAGGACA CCCTCGGACG GCTGACCGCC
GACCCGCTGT CGTGGTTCGA CGTCGAGCAC GAGGCGCTCG TGGGCGCGGT GCGTCTGGCC
GTCGACGCCG GGCTGGCCGA GCCGGCGTGG GGGCTCGCCG CGGCCCTCGT CCCCTACTTC
GACCTGCGCT GCCACTTCGA GGAGTGGCAG TCCACCCACC GGATCGCCCT GGACGCCGCC
CGCCTGGCGC AGGACCGCTA CGGTGAGGCG GCCATGCTCC GGGGTCTGGC CCAGGTATGC
CTCTACCAGG ACCGCTACGC CGAGGCGACG GAGATGTTCC GGCGATCCCT CACGATCTTC
CACGAGCTGG GCGACGTGCG GGGCGAGGCG ACCTCGATCT GCGGGCTGGG GGCGGTCAAC
CAGTTCTGCG GCGAGCACCT CAGGGCACTG GCCTACTTCC GGCGGGCCCT GGCCATGTTC
CTCGCCATGG GCGACCAGAG CGGTGAGGCC TACGCGCGGC AGGCCATCGG GCGCGTCTGC
CTGGCCTCCG GCGATCCCGG CCAGGCCTCG AAGTGGCTGG GCGAGGCGCT GCGGCTGGCC
AGGGAGCTCG GCGACTCCCA CCGCGAGGGC TGCGTGTCCA TGCAGTTCGG GCGGCTGCAC
GATCTGGCGG CGGAGCCCGA GCGGGCGATG CGGTTCCAGG GACACGCGCT GGACATCTTC
GAAGGCCTCG GCGACCTCCA CTGCGGGGCT TACGCCATGC AGAGTCTGGG CGGCCTCCAG
GTGGTCCGCG GCGACCAGTC GCACGCCTCC GACCAGCTGG AGCGGTCACT GCTGATCTTC
CAGCGGCTCG GCGACCGGAG CGGGGAGGCG GCCACGGTCC AGAAGCTCGG CGAGCTGCAC
CGGTCGGCGG GCCGTACCCG CCTGGCGCAG GACTACCTCC ACCACGCCCT CGCGCTCCGG
CGCGAGCTGC GGGGAGGCGC CGACCTCGCC GGCGTCGCCG AACCGTCCGC CCCCTCCGGC
GACGGGTGGT GGAGCCGGGA CGCCGTGCCA CCACCGCTCG TCCTCGACGG CAACGCCGTG
GAGTGA
 
Protein sequence
MVVDQPTFSV LGALDVRASG RPLRIAGTKP RILLASLLLH ANHVVGADLL VEVLWPRHRP 
RSAHANLRTY VSSLRGVLDA AGARIQARPP GYVIELAPGQ LDALLFADLI ARARAAGRTE
EAFDGLSRAL GLWRGTPLAD LPGSPLWDGR LQSLAELRLG AAEELIALRM ARGRYADAIG
ELRGLLKEHP FREDLWQRLI LALHWSGRQA EALHAYATVR RQLVTELGIE PGTDLRRAHA
AVLAGELPPA ATPPDLPPPA AETVSGPPSR AETVAGPPPP AETVPHHLPA VPAPGSTPHQ
LPSDIPDFTG RSEDVAVLTR ALSPVERPPD GPPSIVVVVG PPGVGKSALA VHCAHAVRTG
YPGGQLYLDL GGTEYAPADP GELLAEALRA LGVGEAGLPC TVRERSALYR SLLAERPMLV
LLDDAAGAAQ VRPLLPGNGC AVLVTSRRRI TELPGALQLE LDVLSPEEAE ELLGRIVGSE
RLGREREAAS AILRACGYLP LAVRVAGARL AGRPRWSLGV LRQRLEDEAG RLGELRAGDL
EVRGSFDRSY RLLPDDAALA FRALGLLGPQ SLPGWVVDAV LDRHRADDVT DVLVDVNLLQ
LVGTDPIGQP RYRLHDLVRC NAREKAGGAP ERHALIRVLG AWMATTESAT ARLPTTLFSL
TSARATRWNL AEDTLGRLTA DPLSWFDVEH EALVGAVRLA VDAGLAEPAW GLAAALVPYF
DLRCHFEEWQ STHRIALDAA RLAQDRYGEA AMLRGLAQVC LYQDRYAEAT EMFRRSLTIF
HELGDVRGEA TSICGLGAVN QFCGEHLRAL AYFRRALAMF LAMGDQSGEA YARQAIGRVC
LASGDPGQAS KWLGEALRLA RELGDSHREG CVSMQFGRLH DLAAEPERAM RFQGHALDIF
EGLGDLHCGA YAMQSLGGLQ VVRGDQSHAS DQLERSLLIF QRLGDRSGEA ATVQKLGELH
RSAGRTRLAQ DYLHHALALR RELRGGADLA GVAEPSAPSG DGWWSRDAVP PPLVLDGNAV
E