Gene Sros_5531 details

Gene Information

Locus tag: Sros_5531
Symbol:
ID: 8668825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6049850
End bp: 6051058
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: transcriptional regulator ROK family
Protein accession: YP_003341028
Protein GI: 271966832
COG category:
COG ID:
TIGRFAM ID:
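
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are hypothetical.
start_bp = 6049850
end_bp = 6051058
gene_length_bp = 1209
protein_length_aa = 402


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues for a CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: the span is 1209 bp, and 403 codons minus the stop codon give 402 residues.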


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.199786
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCCG GAAGCCCGCG CGTCCTGCGC ACCCTGAACG AGCGCGCCAC CCTGGAGCTG 
CTGCTCCGGA GCGGACCGCT CACCCGGGGG GAGCTGGAGA GCCTGACCGG CCTGTCGAAG
GCCTCCGCGG CCGAGGTCCT GCGCCGCCTG GAGAGCGCCC GCCTGGTGAA GAAGGGCGGG
CGGAAGCCGG GCAGCGCGGG ACCGGCCGCG CACATGTGGG CGCTGGACGG CTCCTGCTGC
CATGTGGCCG GGGTCGACGT CACGCCGGAC GCCCTGGACG TGGCCGTCGC CGACCTGACG
GGGCAGGTGG TCGGCGAGCA CCGCATGGCC ACCCCGGGCA TCCACGACCC GATGGGCTCG
CTCGCCGTCG CGGTGGCCGA GGCGGCGCGC GCCGCGGGGC TGCGGACCAC CGACCTCGAC
CAGATCGTCG TGGGGATGCC CGGCGTCATC GACGTCGTCG GCGACCGGCT GGACTCGGTG
ATCCAGCTGC CCAGCTGGGA GCACGTGCAC GACCTCTCGC CCCTGCGCGC GCGGCTCGGC
AACGACCGCG TGCGCATGGA GAACGACGTG AACCTCGTCG CGGTCGAGGA GATGGTCAAG
GGCTCGGCAC GCGACGCGGA GAGCTTCGCG CTGTTCTGGC TGGGCCGGGG CATCGGCGCG
GGCGTCGTGC TGAACGGCGC CCTGCTGAGA GGCGCCACAG GGCGGGGCGG CGAGATCGGC
TCCATCGTCG TCCCCGACCC CGCCGAGCGG GGACGGGTGC TGGGCCCGGA GGGCGGGTCG
CTCGACTCGA TCCTCGGCGC GGAGGCCGTA CTGCGGCTCG CCCGCGCCCA CGGCCTCGCG
GCGGGCACCG GATCCGGCGG CCCCGCGGCG GACAGCGCGG TGAGCGGCGC GGCGGACGCC
GTCAGCCGGG CCGTCGCCGA CGGGAGCACC GGCTTCCTCG AGGCGCTGGC CGCCAGGATG
GCCGTCGGCG TCATCGCGCT GGTCGGCGTC CTCGACCCGC ATCTGGTGGT GCTCGGCGGC
TCGCTCTGCG CGGCGGGCGG CGAGGAGCTC CGCCGGATGG TCGCCGTCCG GCTGGCCACC
ACCGCGCTCG CCCGCACCCC GCTGGTGCTC AGCGCGGTCA GCGGCAACGC CGTGCGGGCG
GGGGCCGTCG AGTTCGCCCT GGGCATCGCG CGCGAGCAGG TTTTCAAGGC CGGTACGGCG
GGCCGGTAG
 
Protein sequence
MSSGSPRVLR TLNERATLEL LLRSGPLTRG ELESLTGLSK ASAAEVLRRL ESARLVKKGG 
RKPGSAGPAA HMWALDGSCC HVAGVDVTPD ALDVAVADLT GQVVGEHRMA TPGIHDPMGS
LAVAVAEAAR AAGLRTTDLD QIVVGMPGVI DVVGDRLDSV IQLPSWEHVH DLSPLRARLG
NDRVRMENDV NLVAVEEMVK GSARDAESFA LFWLGRGIGA GVVLNGALLR GATGRGGEIG
SIVVPDPAER GRVLGPEGGS LDSILGAEAV LRLARAHGLA AGTGSGGPAA DSAVSGAADA
VSRAVADGST GFLEALAARM AVGVIALVGV LDPHLVVLGG SLCAAGGEEL RRMVAVRLAT
TALARTPLVL SAVSGNAVRA GAVEFALGIA REQVFKAGTA GR
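
The relationship between the gene sequence and the protein sequence can be sketched in code. This is a minimal illustration, assuming the gene sequence is given in the coding orientation and read in frame from its first base. The codon-to-residue assignments of translation table 11 are the same as the standard code; the tables differ only in which codons may act as starts, which does not matter here because the gene begins with ATG. The helper name `translate_cds` is hypothetical.

```python
# Compact codon table: codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    Alternative start codons are not special-cased in this sketch.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from the record.
first_row = "ATGTCTTCCG GAAGCCCGCG CGTCCTGCGC ACCCTGAACG AGCGCGCCAC CCTGGAGCTG"
print(translate_cds(first_row))  # MSSGSPRVLRTLNERATLEL
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full 1209-base gene sequence to the same function would be the way to compare against all 402 residues.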