Gene Sros_3391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3391 
Symbol 
ID8666679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3727790 
End bp3730705 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003339071 
Protein GI271964875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.436064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTTG ATCAGCCCGT ATTCAGCGTT CTCGGTTCTC TGCGGGTCCG GATGCCGGAA 
GGGACGCTTC CCGTCGCGGG CACGAAGCCG CGGGTCCTGC TGGCCTCGCT GTTGCTCAAC
GCGAACCAGG TGGTCGGTTC GGACCTGCTC GTAGAGACGC TGTGGCCCCA GCGCCGTCCG
CGGTCCGCCC TTGCGAACCT CCGCACCTAC GTCAGCTTCC TGCGCGGCAC CCTCGGCGCG
GCAGGCGCGC AGATCCTGGC CAGGCCGTCC GGCTACGCGG TCGAGCTGCG GATCGACCAG
CTCGACGCGC TGCTGTTCGA GGACCTGGTC GCCAGGGCGC GGGCGGCGGG CCGTGACGAG
GAGGCCATCG AGTGCCTACG CCGGGCGCTC GCGCTCTGGC ACGGCACGCC GCTGGGCGAC
CTGCCGGCCA GCCCGCAGTG GGACGGGCGG CTGCGGTCGC TCACAGAGGC GCGCCTCGGC
GCCGCCGAGG ACCTGGCCGC CATGAGAATG GAGAGAGGGG AGTACCCGGC CGCGATCGGT
GACCTGCGGG AACTCGTCAA GGTCCATCCC TTCAGGGAGG ACCTCTGGCG GCAGCTGATG
CTCGCCCTGC ACGGGAGCGG CCGGCAGGCC GAAGCGCTGC AGGCCTACGC CACCGTCAGG
CAGCAGCTGG TCGACGAGCT CGGCATCGAA CCGGGGCCGG ACCTGCGCGC GGTGCACGCG
GCCGTCCTCG CGGGAGAGCT CCTGCCCGCC GTGCCGGCCG CATCCGCTCT CCAGCGGCCC
CCGGCGCCCT CCGCGCGAAG CGTGATCGCG CCCCAGCAGC TCCCGCCGGA CATTCCGGAC
TTCACCGGCA GAACGGGCGC CGTCGCCGAC CTGGCGCGAG CCCTGTCGGC CAAGGGGCGG
CCGTCGGACG AGCCTCCGTC GATCGCGGTG GTCGTGGGGC CGCCGGGCGT GGGCAAGTCG
GCGCTCGCCG TGCACTGCGC GAACGCCGTA CGGGCCGACT ACCCCGCCGG CCAGCTCTAC
CTCGGCCTCG GCGGTACGGC CGCCGCCCCT GCCGACCTCG GCGAGCTCCT GGCCGAGGCG
TTGCGGGCGC TGGGGGCCGG TGAGGCGGAC CTGCCGCCCA CGGTGCACGA ACGCTCCGCC
CTGTACCGCT CCCTGCTGGC GGAACGTCCC ATGCTCGTCC TGCTCGACGA CGCCGCCGAC
GCGGCGCAGG TGCGGGCCCT GCTTCCCGGC AACGGTTGCG CGGTGCTCGT GACGAGCCGG
CGGCGGATCA CGGAGCTGCC CAGCTCGCTC CGGCTGGACC TGGGCGTCAT GTCGCCCCCT
GAGGCCGAGG AGTTCCTGGG GAAGATCGTG GGTGCCGAAC GGCTGTCAGA GGAGAGGGAG
GACGCCTCGG CGATCCTCCG CTCCTGCGGA TACCTGCCGC TCGCTGTCAG GATCGCAGGA
GCCAGGCTCG CCGGCCGGCC GGGCTGGCCG CTGAGCGTGC TGCGACAGCG GCTGGACGAC
GAGTCGAACC GGCTCGACGA GCTGCGAGCG GGCGACCTGG AAGTACGGGA CTCCTTCGAC
CGCAGCTACC GGCAGCTGCC CGACGAGGTG GCCAGGACCT ACCGGACGCT GGGCCTTCTC
GGCCCGCAGT CCATGCCGGG CTGGGTGGTC GACGCCGTCC TGGACCGCAC CCGGGCCGAG
ACGGTGATGG ACACTCTCGT GGACGTGAAC CTCGTGCAAC CGGCCGGGAC GGACGCGATC
GGCCAGCGCC GTTACCGGTT GCACGACCTG GCCCGCTGCA ACGCCAGGGA GAAGGCCGGC
GGCGAGCGTC ACACCCTCGT CAGGGTGCTC GGAACATGGA TGACCGCCAT CGAGCAGGCC
GCGTCGCGGC TGCCGACCAC GCTCTTCAGC CTGACGTCCG CGGCGGCACC CCGGTGGGAC
CCGGCGGAAG AGACCCTCAG GCGCCTGACC GCCGACCCGC TGCCGTGGTT CGACGCCGAG
CGGGAGTCAC TGGTGGCGGC GGTGCGGCTG GCCGCCGACG CGGGACTGTC GCAGGCCTCG
TGGGGACTCG CGGCGGCGCT CGTCCCCTAC TTCGACCTCA ACTGCCGGTT CGACGAGTGG
CGGCACACGC ATCAGGTCGC GCTGGACTCC GCGCGCATGG CCGAGGACCT CAACGGCGAG
GCCGCCATGC TCCGCGGCCT GGCTCAGGTC TGCCTCTACC AGGATCGATA CGCCGAGGCG
CGAGAGATGC TCCGGCGATC TCGCGCGATC TTCCACGAGC TGGGCGACCT ACGCGGCGAG
GCGATCTCGA TCTGCGGGCT GGGAGCGGCC AGCCAGTTCT CCGGTGAACA TCTCACGGCG
CTCGGATACT TCCGGCAGGC CCTGGCCATG TTCCTCGCCA TGGACGACAG AAGCGGTGAG
GCCTACGCCC GGCAGGCGAT CGGGCGTGTG TACCTGACGC TGCGCGACTT TCGCCGGGCC
TCGGGATGGC TCGGGGAGGC GTTGCGGCTG GCCGAGGAGC TCGGCGACGC CCATCGTGAA
GGGGGCGTGT CCATGCAGCT CGGGCGGCTG TACGACCTGG TGGCCCAGTC CGACGAGGCG
ATGCGCGTCC AGGGGCGCGC GCTCGACATC TTCGAGACGC TCGGCGATCG TCACTGCGGC
GCCTACGCCA TGCGGAACCT TGGCGGGCTG CAGGTGAAGA AGGGCGATCG GTCCAGCGGT
TCCGACCAGC TGCAGCGCTC GCTGGCGATC TTCCAGCAGC TCGGCGACCG GAGCGGGGAG
GCCGCCGCGT TCCAGACGCT CGGCGAGCTG CACCAGTCGG CGGGCCGTAC CGCTCTCGCC
CAGTACTACC TGCACCAGGC CCTCACGTTG AGGCGCGAGC TGCGAAGCGG CGCGGGAGGT
GGGCAGGGGC CCGCGCTGAT GGCTTCGCAT CCCTGA
 
Protein sequence
MVVDQPVFSV LGSLRVRMPE GTLPVAGTKP RVLLASLLLN ANQVVGSDLL VETLWPQRRP 
RSALANLRTY VSFLRGTLGA AGAQILARPS GYAVELRIDQ LDALLFEDLV ARARAAGRDE
EAIECLRRAL ALWHGTPLGD LPASPQWDGR LRSLTEARLG AAEDLAAMRM ERGEYPAAIG
DLRELVKVHP FREDLWRQLM LALHGSGRQA EALQAYATVR QQLVDELGIE PGPDLRAVHA
AVLAGELLPA VPAASALQRP PAPSARSVIA PQQLPPDIPD FTGRTGAVAD LARALSAKGR
PSDEPPSIAV VVGPPGVGKS ALAVHCANAV RADYPAGQLY LGLGGTAAAP ADLGELLAEA
LRALGAGEAD LPPTVHERSA LYRSLLAERP MLVLLDDAAD AAQVRALLPG NGCAVLVTSR
RRITELPSSL RLDLGVMSPP EAEEFLGKIV GAERLSEERE DASAILRSCG YLPLAVRIAG
ARLAGRPGWP LSVLRQRLDD ESNRLDELRA GDLEVRDSFD RSYRQLPDEV ARTYRTLGLL
GPQSMPGWVV DAVLDRTRAE TVMDTLVDVN LVQPAGTDAI GQRRYRLHDL ARCNAREKAG
GERHTLVRVL GTWMTAIEQA ASRLPTTLFS LTSAAAPRWD PAEETLRRLT ADPLPWFDAE
RESLVAAVRL AADAGLSQAS WGLAAALVPY FDLNCRFDEW RHTHQVALDS ARMAEDLNGE
AAMLRGLAQV CLYQDRYAEA REMLRRSRAI FHELGDLRGE AISICGLGAA SQFSGEHLTA
LGYFRQALAM FLAMDDRSGE AYARQAIGRV YLTLRDFRRA SGWLGEALRL AEELGDAHRE
GGVSMQLGRL YDLVAQSDEA MRVQGRALDI FETLGDRHCG AYAMRNLGGL QVKKGDRSSG
SDQLQRSLAI FQQLGDRSGE AAAFQTLGEL HQSAGRTALA QYYLHQALTL RRELRSGAGG
GQGPALMASH P