Gene Sros_3539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3539 
Symbol 
ID8666827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3926527 
End bp3927714 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional repressor of the xylose operon 
Protein accessionYP_003339218 
Protein GI271965022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.689033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGAC CTGAGCGCAG GACGGTCCGG GATGTCCGCA AGGGCAACCA GTCCATGCTG 
CTGCGGACGC TGTACTTCCA CGGGCCCGCC AGCCGCAACG AGCTCACCAG GCTCACCGGC
CTCAGTGCCG CGACGGTCAG CAGCATGACC GGTGACCTGC TCGCCGAGAA CGTCGTCGTC
GAGGCGGGCC ACGTGGAGTC CGACGGCGGG CGTCCCCGCG TGATCCTGCG GGTCAACCCC
GTCTACGGCT ACGCGATCGG CGTCGACGTG GCCGACACGC ACGTGCGCGT CGAGCTGTTC
GACCTGGAGA TGAACGAGAA GGCCAAGGTC GAGTACGCCC TCCGTCCCGC CAGGCATGAC
ATCGAGCTGG TGGTGCGCCA CATCCTCGCG GGCATCGACG TGGTGCTCGC CGACGGCGGG
GTCTCCGCCG GGCAGGTGCT CGGCGTGGGG GTCGGCGTCC CCGGCATCGT GGAGCGCGGC
GGCGACGTGC TCGTCCACGC CAAGACCTTC GGCTGGGACG GCGTCCCCCT CGGCGCCATG
ATGCGGGCCG GCACCACCTT CCCGGTGTTC ATCGACAACG GGGCCAAGAC GATGGGCCAG
GCGGAGCTCT GGTTCGGCGC GGGGCGCGGG GCCGGTGACG CGGTGATCGT GCTCATCGGC
TCGGGGGTCG GGGCCACCGT CGTCACCGAC GGGACGACCT TCCGCGGGGT GAGCAGCAGC
GCGGGCGAGC TGGGGCACAC CAAGATCGTT GTGAATGGCC GGATCTGCCG GTGCGGGGGG
CGGGGCTGCC TGGAGGCCTA CGTCGGGGCC GAGGCCATCC TCGACCGTGC CGGGATTCCC
ACCCGGACGG CCGACTGGCA GGCCGAGCTG GCCGGCCTGC TCGAAACCGG ATCGCCGGTG
CTCGCGGAGA CCGCCACCTA CCTGGGCGTC GGCCTGTCCA ACCTGATCAA CCTGATCAAT
CCCGAGCGGA TCGTCATCGG CGGCTGGGCC GGTCTCCTGC TCGGCCGGCA CCTGCTCGCC
GAGATCCGCG CGGCCTCGGC GGACAACTCC CTGGCCCAGC CGTACGCGGC CACTTCCATC
GTGCTGGGCC GCCTCGGTCC CGACGCCGTG GCACTGGGGG CGGCCACCCT GGTTCTGGAG
AAGTTCCTGA GCGCCCATCC CGCCGCGCAG GCCTCCGCCG TCCAGTGA
 
Protein sequence
MVRPERRTVR DVRKGNQSML LRTLYFHGPA SRNELTRLTG LSAATVSSMT GDLLAENVVV 
EAGHVESDGG RPRVILRVNP VYGYAIGVDV ADTHVRVELF DLEMNEKAKV EYALRPARHD
IELVVRHILA GIDVVLADGG VSAGQVLGVG VGVPGIVERG GDVLVHAKTF GWDGVPLGAM
MRAGTTFPVF IDNGAKTMGQ AELWFGAGRG AGDAVIVLIG SGVGATVVTD GTTFRGVSSS
AGELGHTKIV VNGRICRCGG RGCLEAYVGA EAILDRAGIP TRTADWQAEL AGLLETGSPV
LAETATYLGV GLSNLINLIN PERIVIGGWA GLLLGRHLLA EIRAASADNS LAQPYAATSI
VLGRLGPDAV ALGAATLVLE KFLSAHPAAQ ASAVQ