Gene Sros_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2139 
Symbol 
ID8665421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2299701 
End bp2301245 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content74% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003337866 
Protein GI271963670 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.351371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGC GGGACCTGTG CCAGGCCGAC CACCTGGGGT TGAAGGTGCT CAGCGGTCAC 
GATCTGCTCG ATCGCCGGGT CCGTGGCGTC GCCACCACCG ACCTGATCGA GCCGGGCCGC
TTCCTCAAGG CCGGCGAGCT CGTCCTCACC GAGCTGACCT GGCACGACGG TCCCGAGTCC
GCCAGACGTT TCGTGGCCGC GCTGGTCGAG GCCAAGGTCG CCGCGCTCTG CTCCGGCACC
GCGCTCAAGG TCCCGCCCGC CGACCTCATC GACGCCTGCG CCGACGCCGG GCTGCCCATG
CTCGCGCTCG GCGTGGAGGT CTCCTTCAGC GCGCTCACCG AGTACGTGCT GCGCGCCCTG
ATCGACGAGT TCGGCTCCGC CCCGAGCCGG CCGTACGGCT CCCGCAGGAG GCTGGCCAGC
ACCCTCGTGG ACGGCGGCAG CCTGAGCAAG GTCGTCACCG CGGTGGCGGG GGAGCTGGAC
GTGCCCTGCT GGGTGGTGTC GGCGACGGGC CGGGCCGTCG TGGGCTCGCA GCCGCTGCCC
GAGGGGGCCG GTGAGCGGCT GGCGCACGCG TTCCTGTCCG CGCGGTTCCT CCCCGGTACG
GCGGTGCGGG CCGACGGCAC CGTGCTGTCG GTCTTCCCGG TCAGCAAGGG GGCGCCGCAC
CGGATCGCCA ACTGGTTCCT GGCCTACGCC GGGGAGCAGG TGCGGGCGGG CGGCGAGCAC
GAGGACCTGA TCGTGGAGCT GGCCGCGCTG GTGGCCCTGG AGCGCTCCCG GCTGGAGGCG
GCGCAGCGGA TCGAACGGCG GGTGCTGGAC CAGCTCCTGG GGCTGCTCAC CTCCGGGGAC
GCCAACCTGC CGGGCGTGGT CTCCCGGCTG CACACCCTGC ACATCGAGAC CGAGGACGGC
CTGCTGGCGG TGGCGCTCGC GGTCGAGGGG ACCGATCGGG CGGACGAGGT GGGCGTCGCC
GTGCTCGACG AGCTGCTGCG CCCGCTGGCG CCGGGGGTCG CGACGGCGGT GGGCGGGGAG
GCGGTCGCGC TGGTCCCGCT GGCGGGCAAC AGCGCCGCCG AGCTGACCTC CCACATGCTG
GCCGGGGCGG CCACGCTGGA GGCCGGGCTG GGCGATGCCC GGATCACGAT CGGCGTCAGC
AGCGTCACGA CGGGACCTGC CGCGCTGAGC AGCCTGATCG AGGAGGCCAG GCACGCCCGC
ACGCTGGCCG AGCTGGGCGA GGGCCGGGTG TCGGCGATCA CCGGTGACGA CGTCAGCTCC
CACCGCTCAC TGATCGCCGC GATCCCGGGA GAGCTGCGTC GCTCGTTCCG GACGCGGCTG
CTGGGGCGGC TGGAGGAATA CGACGCCGCG CACCAGACGG AGCTGGTCGA GACGCTGGAG
ACGTTCCTGG AGGAGTCGGG GTCCTGGGCC GCGACCGCCG ACCGGCTCCA TGTCCACGTC
AACACGTTGC GCTACCGGGT GAAGCGGATC GAGGAGCTGA CCGGCAGGTC GCTCAACGCC
CTGGACGAGC GGGTCGACTT CCTGCTGGCG CTCAGGATGC GTTAG
 
Protein sequence
MRLRDLCQAD HLGLKVLSGH DLLDRRVRGV ATTDLIEPGR FLKAGELVLT ELTWHDGPES 
ARRFVAALVE AKVAALCSGT ALKVPPADLI DACADAGLPM LALGVEVSFS ALTEYVLRAL
IDEFGSAPSR PYGSRRRLAS TLVDGGSLSK VVTAVAGELD VPCWVVSATG RAVVGSQPLP
EGAGERLAHA FLSARFLPGT AVRADGTVLS VFPVSKGAPH RIANWFLAYA GEQVRAGGEH
EDLIVELAAL VALERSRLEA AQRIERRVLD QLLGLLTSGD ANLPGVVSRL HTLHIETEDG
LLAVALAVEG TDRADEVGVA VLDELLRPLA PGVATAVGGE AVALVPLAGN SAAELTSHML
AGAATLEAGL GDARITIGVS SVTTGPAALS SLIEEARHAR TLAELGEGRV SAITGDDVSS
HRSLIAAIPG ELRRSFRTRL LGRLEEYDAA HQTELVETLE TFLEESGSWA ATADRLHVHV
NTLRYRVKRI EELTGRSLNA LDERVDFLLA LRMR