Gene Sros_4174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4174 
Symbol 
ID8667468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4644664 
End bp4646529 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content69% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003339821 
Protein GI271965625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTCT GCATTCTCGG CCCGCTAGCC GTAACCCATG AGGGCCGGGA TATCACTCCG 
ACGGCACCCA AAGTTCGTCA GGTTCTGACG TTTCTTCTGG TGCGCAGGAA TCAGATTGTG
CAAGTCAGCG AATTCGTCGA TGAGCTATGG AGCAGCCATC CGCCCGACAG CGCCATGACG
ACTCTCCAGA CCTACATCTA CAAGCTCAGG AAAGACGTGC TGGACCCTTC CGGGCTGGCC
CGCCTGCACA CCCAGCCCTC CGGATACCTC CTTGACGTCG CCGACGAGAC CATCGACGTC
TGCGACTTCG AACGGCTGTC GCGGCAGGGT CGCCTCGCGC TGGAGAAGGG CGACCCGCTG
GGCGCCAGCG AACTGCTGAC CGAGGCCCTG AGCCTGTGGC GCGGTCAGGC GCTCGTCGGC
GTCACCGCAG GAGAGATCCT GTCCGCCCAC GTGACCCGCC TGGAGGAGAA CAGGCTGCGG
GCGCTGGAGA TGCACATCGA GGCGGACATG CGGCTGGGGC GCTACCAGGA GCTCATCAGC
GAGCTGAAGG TGCTCGTCTA CACCTATCCC CTCCACGAAC GTTTCCACGG CGACCTCATG
ACGGCCCTGA ACCGCTCGGG GCGCCGGTAC GAGGCGCTGG AGGTCTACCG GCAACTGCGC
GGGGTGCTGA TCGACGAGCT CGGGCTTGAG CCGTCCGCCG CCATGCAACG CCTCCACCAG
TCACTGCTGA GCGCCGACTC CGCCGACCCG GCCAGGACCA GGCCGGCCCC GCCGGTGGCC
ACCGCCACGC GGTACGCCGC CACCCTGACG GTCCCGGCAC AGCTACCACC AGACATATCC
GACTTCACCG GCCGGACCGA GCCTCTTGCT CAAATTCGCC GGATACTCGC CGCCGACCAG
GACAACCGCA CCACAGCCCG CGCGGTCTCG ATCTGCGGCA TGGCCGGAGC GGGGAAGACG
ACTCTGGCGC TGCACGCCGC CCACATCAAC CGGGCACAGT ACCCCGACGG GCAGCTCTTC
GCCGACCTGC GCGGCGCCTC CGCCACCCCC ACACCGCAGA CCGACGTCCT CGCCAGCTTC
CTGCGCGCCG TAGGCGTGCC CGACCACCAG ATCCCCCCCT CCCTGGAGGA ACGCAGCAAC
CTCTTCCGCA CCTGGAGCAA CGGCCGGCGG GTCCTGGTCA TCCTCGACGA CGCGTGCGCG
GCCTCCCAGG TCGCCTCGCT GCTGCCCGCG ACACCCCAGT GCACAGTGAT CATCACCAGC
CGCGAGGGGC TGCAGAGCCT GCCCGGCGTG CAGACCGTGG AACTCGGCGT CATGAACCTG
ACCGAAGGCG TGGAGCTGCT CGGCCGCATC ATCGGAGCCG GCCGCGTCGC CGCCGAGCGG
GAGCAGGCCG AAAAGATCGT CGATCTGTGC GGGCACCTGC CGCTGGCGCT GCGGTCCGTC
GGCGCCCGAC TGGCCGCCGC GCGGACCTGG CCCCTGCAGA AGATGGCGGC GCTGATCGAG
TCCGGTCCGG CCCCCCTCGA CCAGCTGCGG TTCGCGGAGT TCGACGTACG GGCCGACTAC
GACGACACCT ACTTCCGGCT CGATCCCCAC GACCGCAGCG CTCTCCGTCT CCTCAGCCTG
CTCCCCCCGC AGGATTTCAC CGCCGCGACA GCCGCCGGCC TGCTCGGCAG CGCCGCCGAC
GCCGTAGAAG CCCAGCTCAC CCGGCTGGTC AGCTGCCACC TGCTCGACGT CAAGTCGGAA
GGCGGCATCG ACGGCATCCG CTACGAGATG CACAAGCTCA CCCGGCTCTA CGCCCGGGAA
CGGCTGAACC GCGAGTTCAT CCAACCCGAG ATGAGCTCTC CCCCGCAGCA CGACCACTCC
ACCTGA
 
Protein sequence
MGFCILGPLA VTHEGRDITP TAPKVRQVLT FLLVRRNQIV QVSEFVDELW SSHPPDSAMT 
TLQTYIYKLR KDVLDPSGLA RLHTQPSGYL LDVADETIDV CDFERLSRQG RLALEKGDPL
GASELLTEAL SLWRGQALVG VTAGEILSAH VTRLEENRLR ALEMHIEADM RLGRYQELIS
ELKVLVYTYP LHERFHGDLM TALNRSGRRY EALEVYRQLR GVLIDELGLE PSAAMQRLHQ
SLLSADSADP ARTRPAPPVA TATRYAATLT VPAQLPPDIS DFTGRTEPLA QIRRILAADQ
DNRTTARAVS ICGMAGAGKT TLALHAAHIN RAQYPDGQLF ADLRGASATP TPQTDVLASF
LRAVGVPDHQ IPPSLEERSN LFRTWSNGRR VLVILDDACA ASQVASLLPA TPQCTVIITS
REGLQSLPGV QTVELGVMNL TEGVELLGRI IGAGRVAAER EQAEKIVDLC GHLPLALRSV
GARLAAARTW PLQKMAALIE SGPAPLDQLR FAEFDVRADY DDTYFRLDPH DRSALRLLSL
LPPQDFTAAT AAGLLGSAAD AVEAQLTRLV SCHLLDVKSE GGIDGIRYEM HKLTRLYARE
RLNREFIQPE MSSPPQHDHS T