Gene Sros_3061 details


Gene Information

Locus tag: Sros_3061
Symbol:
ID: 8666348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3339524
End bp: 3340747
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: transcriptional regulator, ROK family
Protein accession: YP_003338754
Protein GI: 271964558
COG category:
COG ID:
TIGRFAM ID:
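
The length fields above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates (an assumption; the record does not state its coordinate convention) and if the last codon of the CDS is a stop codon that is not counted in the protein length. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3339524, 3340747)
print(length_bp, protein_length(length_bp))
```

With the values from this record, 3340747 - 3339524 + 1 gives 1224 bp, and 1224 / 3 - 1 gives 407 residues, matching the Gene Length and Protein Length fields.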


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.756873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.372783
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAACG ACGGCGTGCT CCGCAATCTA CGCCGGCAGC ACGAACTACG GGTGCTGGCC 
ATCCTGGTCG AGCGCGGACC CTGTTCGCGC CGGGAGCTCG AGAACGCCAC CGGGCTCTCT
CGGACCACGA TGTCGGCCAT CGTCGCCGAC CTCGTCCGGC GCCGCGCGGT GGCCGACGAC
GCCCAGCGGC CGGCGCCGGG CCGGGGCCGG CCGACGACGC TGGTCCGGCT CAACCCGCAC
GCGGCCGCCG CCGTCGGCGT GGAGCTGGGC CGCGGCCACG TCAGCGTCGC GGTGGCCGAC
ATCGCCCGCA CGGTGCTCGC CCACGTCACC GAGCCCATCG GGATGGAGAG CGGGCTCTCC
GTCCGGATGG ACGCCGCCTT ATCGCTCCTG AAGAGGGTCG CGGCCGGGCG GCGACTCGCA
CTCGAAGGGC TCGCGGCGGC CGGGCTGGGA CTGTGGGGGC ACCACCCCGA TCCCCGTGCC
GGCGGGGGCG ACCCGGCGGG CGATCTCGTC GTCACCGACC TCGCCGAACG TCTGGGCGGA
ACGCTCGACG TGCCGGTGAC CTGGGATAAC AACATCCGGC TCGCCGCCGT CGCGGAGAGA
TACACGGTCG AGTCGCCGGC CTGCGCGGAC CTGGTCTACA TCGCCCTGTC CCACGGCGTC
GGCAGCGGCG TGGTGATCAA CGGCACGCTC GCCCGCGGGG CCTCGGGCAC GGCCGGGGAG
ATCGGCCACG TCAGCGTCGA ACCGAGCGGC CCGCCGTGCT GGTGCGGCGG CAGGGGCTGC
CTTGAGCAGT ATCTCTCGAT CGACGCGGTG CTGGGGCGCG TCAGGGCCGT CGCGCCGGAC
GTCGCCGACG TCGCCGGCCT GGTCGCGGCA CTGGACCGCG GCGACCGGAG CGTCCGCGAC
GTGGTGGACT GGAGCGCCGA GCTGCTCGGC CGGGCTCTCG GCACGGTGGC GATCCTGCTC
GACCCTCACC GGGTGGTCAT CGGCGGCGAG CTGGCCGATC TGGGCGACCA CCTGCTCGAC
CCGGTCCGCG CGTCACTCGC CCGGCAAAAG CTGTCCATCC GCGACCGGCG GCTCGAACTC
GGCACCGCCA GGATCTCCCA GGGAGCGGCG GCGGTCGGCG CGGCACTGGT GGCACTCGAC
CGCCACTCCC CGATCCACGG CATCGGAGAC GTCACGGCCG AGTTCCCGGG CAAGGGCGAC
GGCGATTTAT GTAGGGACTA TTGA
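
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The Python sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field reported for this gene. The helper names are hypothetical, and only the first row of the sequence is included here for brevity; pasting the full sequence in its place would give the whole-gene value.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "GTGAGCAACG ACGGCGTGCT CCGCAATCTA CGCCGGCAGC ACGAACTACG GGTGCTGGCC"

seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```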
 
Protein sequence
MSNDGVLRNL RRQHELRVLA ILVERGPCSR RELENATGLS RTTMSAIVAD LVRRRAVADD 
AQRPAPGRGR PTTLVRLNPH AAAAVGVELG RGHVSVAVAD IARTVLAHVT EPIGMESGLS
VRMDAALSLL KRVAAGRRLA LEGLAAAGLG LWGHHPDPRA GGGDPAGDLV VTDLAERLGG
TLDVPVTWDN NIRLAAVAER YTVESPACAD LVYIALSHGV GSGVVINGTL ARGASGTAGE
IGHVSVEPSG PPCWCGGRGC LEQYLSIDAV LGRVRAVAPD VADVAGLVAA LDRGDRSVRD
VVDWSAELLG RALGTVAILL DPHRVVIGGE LADLGDHLLD PVRASLARQK LSIRDRRLEL
GTARISQGAA AVGAALVALD RHSPIHGIGD VTAEFPGKGD GDLCRDY
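
The protein sequence can be related back to the gene sequence by translating it codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code but permits alternative start codons such as GTG, which is read as methionine at the initiation position. The Python sketch below is a simplified illustration under stated assumptions: the input is assumed to begin at the start codon, the first residue is always written as M without checking that the codon is a valid initiator, and a trailing stop codon is dropped. The function name is hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text):
    """Translate a CDS that is assumed to start at its initiation codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        # Simplification: the initiator codon is always written as Met.
        residues[0] = "M"
    if residues and residues[-1] == "*":
        # Drop the terminal stop codon from the protein string.
        residues.pop()
    return "".join(residues)


# First row of the gene sequence from the record; it covers the first 20 codons.
first_row = "GTGAGCAACG ACGGCGTGCT CCGCAATCTA CGCCGGCAGC ACGAACTACG GGTGCTGGCC"
print(translate_cds(first_row))
```

Applied to the first row of the gene sequence, this yields MSNDGVLRNLRRQHELRVLA, which matches the first 20 residues of the protein sequence listed above.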