Gene Sros_3432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3432 
Symbol 
ID8666720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3775618 
End bp3777945 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003339112 
Protein GI271964916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00818273 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.368034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACA TACGGTTCCG CCTGCTTGGT CCGCTCCGCG TGTGGAGGGG AGAGACCGAA 
GTCAAGATCG GCTCGGACAA GCAACGGGCC GTGCTGGCCC TGCTCCTGCT CCGGGCCGGC
TCGCCGGTCA GACGTCAGGA GATCATCGAC ACGCTCTGGG GCGACGACAC CCCCGAGTCG
GTGGTCAACC TGGTACAGAC CTACGTGGGA AGGCTGCGGC GCCAGATCGA TCCGGGCAAG
GGCGCCTATT CGGCCTCGAC CTGGCTGGCG GGGATGGGCA CCGCCTACGT GGTCCGGCTC
GACCGGTGCG ACGTGGACCT GGTCCGCTTC CGCGCGGGGG TGGCGGGGGC CCGCTCGGCC
ACCTCCCCCG AGGAGTCCCT GGCGCTGCTG CTCTCCGCGC TGCGGATGTG GAACGGGCCA
TGTCTGGCCG ACCTCGACCA CGTGCTGCGC GGTCACCCTT GGGTCCGCGC CATCGAGCAC
GAGCGGATCG ACACGCTGCT GGACGCCGCG AAGACCGCGC AGCGGCTCGG CCGGTCGGCC
GACGTCATCC CGCAACTGCG CGCGGTCGCC GCCGCCGAAC CGCTGAACGA GGCCGTCCAC
ACCGCGCTCG TGCTCGCCCT GGCCGCGTCC GGGATGCAGG CCGAGGCACT GGCCGAGTAC
GGGCTCATCC GCCTCAGGCT GGCCGAGGAG CTGGGCGTCG ACCCCGGATC CCAGCTGCGC
GAGGCGTACT TCCAGGTACT GCGCCAGGAG ACCCGCTACG AGGGCGCCGG CCCCGCCGAG
CCGCCCTGCC CGTCCCTGCT GCCCGCCGAC ATCGCCGACT TCACCGGCAG GGACAAGCTG
GTCGAGCAGC TGAGCGGTCT GATCGCCGAC CGCAGGCCTG GACCGATCCC GGTGTCGACC
ATCACCGGCA GGGCGGGCGT CGGCAAGTCG ACGCTGGCCG TCCACCTGGC GCACCGCATG
ATCGGCGACT TCCCCGGCGG CCAGCTCTAC GCCGACCTTC GCGGCTCCGC CGAGCAGCCG
GCCGATCCCT CCCGGGTGCT CACCCGGTTC CTGCGCTCGC TGGGCATCAG CGGCCAGGCG
ATCCCCGAGG ACGCGGACGA GCGCGCCGAG CTGTACCGCA CGCAGCTCGC CGGCCGCCGT
GTCCTCGTCG TGCTGGACGA CGCCGCCGAC CAGGCCCAGG TACGGCCGCT GCTGCCCGGA
TCGCCCTCCT GCTCCGTCAT CGTGACGAGC CGGTCCCGGA TGGCCGGATG GCCCGGCGCG
CACGCCGTCG ACCTGGACCT GCTGGAGCCT CACCACGCCG GCGACCTGCT CGCGGTGATC
GTCGGCGCGG AGCGTGTCGC GCCCGAGCCC GAGGCCGCCA CCGAGCTCGT CCGGCTCTGC
GGCCGGCTGC CGCTGGCCAT CCGGGGCGCC GCCACCCGGC TCGCCGCCCG CCCACACTGG
ACGCTCGCCA GGATGGCCGG CCGGATGGCC GACGAACGGC ATGGCCTCGA CGAGCTCTCG
GACGTGCGGG CCACCCTCGC GCTCGGCTAC CGCAGGCTCG ACGGGCCGGC CCAGCGGGCC
CTGCGCCTGC TCGGGCTGCT GGACCTGCCG ACCTTCGCCC CGTGGCTGGT CGCGGGGGTG
CTGGAGGCGT CCACGGAGTC GGCCGAGGAC CTCATCGACG CGCTCGCCGA CGCCTACTTC
CTGGACACCG CAGGGGTCGA CGCCGTGGGC CAGCCCCGCT ACCGCTTCCA CGAGCTGGTG
CGCCGCTACG CCCGGGAGCT GGCGCTCAGG GAGGAGAGCG AGGCCACCGT CAGCACGGTC
GTCATCCGCG CCCTGGCCAT CCTGCTCGCG CTGGCCCAGG ACGCCGACGG CCGCCTGCCG
TACACTGTCC GGGCGCCGCT CTACGGGCGG TCGCCGCGCT GGCCGCCGCC GGCGGCCGTC
CGCGAGCCGC TGCTCGCCGA CCCGCTGGCC TGGTTCGACA GCGAGCGGTC CTGCCTGGTC
GCCGCGGTCC TGCAGGCGTC CGACCTGGGC CACGACGAGC TGGCCTGGGA GCTGGCCGCC
GCCACGCTGA ACGCCGCGAT CATCCGGACG CCGTGGGCCG AGATCGGGGC CACCCACCGC
TCGGCGCTCC TGGTCTGCCG TGCCACCGGC AACAGGCGTG GCGAGGCCGT CATGCTGCGC
GGCCTGGGCG AGCTGGACCA CCACCTGGGG CGGCGGCAGG AGTGCCTGGA CACCCTGAAC
CGTGCGCGCG CCCTCTTCGC CGAGATCCGC GACGCCCCGG GCGAGGCCGA CACGGCCGCC
CGGCTCGACG CGCTCCGGGC GAGGGCGGCG CCGGCCCCCG CGACCTGA
 
Protein sequence
MTDIRFRLLG PLRVWRGETE VKIGSDKQRA VLALLLLRAG SPVRRQEIID TLWGDDTPES 
VVNLVQTYVG RLRRQIDPGK GAYSASTWLA GMGTAYVVRL DRCDVDLVRF RAGVAGARSA
TSPEESLALL LSALRMWNGP CLADLDHVLR GHPWVRAIEH ERIDTLLDAA KTAQRLGRSA
DVIPQLRAVA AAEPLNEAVH TALVLALAAS GMQAEALAEY GLIRLRLAEE LGVDPGSQLR
EAYFQVLRQE TRYEGAGPAE PPCPSLLPAD IADFTGRDKL VEQLSGLIAD RRPGPIPVST
ITGRAGVGKS TLAVHLAHRM IGDFPGGQLY ADLRGSAEQP ADPSRVLTRF LRSLGISGQA
IPEDADERAE LYRTQLAGRR VLVVLDDAAD QAQVRPLLPG SPSCSVIVTS RSRMAGWPGA
HAVDLDLLEP HHAGDLLAVI VGAERVAPEP EAATELVRLC GRLPLAIRGA ATRLAARPHW
TLARMAGRMA DERHGLDELS DVRATLALGY RRLDGPAQRA LRLLGLLDLP TFAPWLVAGV
LEASTESAED LIDALADAYF LDTAGVDAVG QPRYRFHELV RRYARELALR EESEATVSTV
VIRALAILLA LAQDADGRLP YTVRAPLYGR SPRWPPPAAV REPLLADPLA WFDSERSCLV
AAVLQASDLG HDELAWELAA ATLNAAIIRT PWAEIGATHR SALLVCRATG NRRGEAVMLR
GLGELDHHLG RRQECLDTLN RARALFAEIR DAPGEADTAA RLDALRARAA PAPAT