Gene Sros_5257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5257 
Symbol 
ID8668551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5770653 
End bp5772620 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content79% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003340769 
Protein GI271966573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0459808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.112263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGGGG TGCTGGGGCC GGTGGCGGCC TGGGACGGCG ACGGGGACGC CATCGCCTTG 
AAGGGGCCGC GGCACCGCGC GGTGCTGGCC CGTCTGATCG TCGCCCGCCG CCGCGTCGTC
CCGGTCACCC TCCTGGCCGA GGACCTGTGG GCGGATCCGC CTCCGGGCGC GGTGGGCGCG
GTACGCACCT TCGTGGCCGC GCTGCGCCGG GCGCTGGAGC CCCGGCGCCC GCCCCGCACC
GCGGCCCGGC TGCTGGTCAC CGAGGGGCCG GGGTACGCGC TGCGCGCGGA GCCGGGCGCG
GTGGACTCCT GGCGCTTCGA GCAGGCGGTG TCCGCCGCCG CGACGCTGCC CCCCGCAGAC
GCGCTCGCAC GGCTGGAGGA GGCGCTGGGG TGGTGGCGCG GGCCCGCCTA CGCCGAGTTC
GCCGACGAGG CCTGGGCCCG TACGGAGCGC TCCCGCCTGG CGGAGCTGCG GCTGCAGGCG
GTGGAGCGCC GGGCCGATGC CCGCCTGACC CTGGGCCTGG CGGCCGAGGC GGTGCCGGAT
CTGGACGCGC ACGTGACCGA GCACCCCTGG CGTGAGGACG CCTGGCGTCT GCTGGCCCTC
GCGCTGTACC GCGCCGGCCG CCAGGGCGAC GCGCTGGCGG TGCTGCGCCG GGCACGAACG
CTCCTGGTCG AGCAGCTGGG GGTGGATCCG GGCCCGGGAC TGCGCCGCCT GGAGACCGAC
ATCCTCAACC ACGCCGGCCA CCTCGACCCC GCAGCCACCG CGGCCGGGGC GGCGGACCGG
GTGTGGGCGC AGGCGGCCGC CGCCTACGAC CGCGCCGTGC CGCCCGGTGC CCGGGCCCGC
CTGGAGTCGA CGGTGGGCCT GCTGCGTGAT CTCGCGGTGA CCGGCGGAGG CGGCCTGGAG
GCGGCCCGCC GCCACCGCGT GGCGGCTGTC ACGGCGGCGC AGGAGATGGG CGACGCGCAA
CTGACCGCCC GCGTGATCGG CGCCTACGAC GTCCCGGCGA TCTGGACCCG CTCGGACGAC
CCGGAGCAGG CGGCACAGAT CGTGGCGGCG GCCGAACGCA CCCTGGCCGC CCTCCCGCCC
GGCGCGCACC AGGCGGCACG GGCCCGCCTG CTGGCCACGA TCGCCCTGGA GTCACGCGGC
ACCCGCGCGG AGCGCGGGCC CCAGGCCGCC CGGCAGGCAG AGGAGATCGC CCGCGGCCTG
GACGATCCCG GGCTGCTGGC CTTCGCCCTC AACGGCGTGT TCATGCAGAC CTTCCACCGG
GCGGGCCTGG CGCCGCGCCG GGACGAGATC GGCGCCGAGC TGGTCGCCCT GTCCGCCCGG
CACGGCCTGG TCACCTTCGA GGTGCTCGGG CACCTCATCC GCCTGCAGGC CCGCAGCGCG
CTTGCGGACT TCCCGGCGGC CGACGGGCAC GCGGCCGCCG CCGAACACCT GGCCGAGCGC
CACGAGCTGC CGCTGGCGGG GGTGTTCGCC CAGTGGTACC GGGCGCTGCG GCTCGCCGCG
ACAGGGCGGG CACCGGAGGC CGAGGTGGAG GCGGCCTACC GGGACGCCGC CGCGCGGCTG
GACGGCGCCG GCATGCCCGG ACTGCGGCGT GGCGTGCTGC CGCTCGCGCT GCTGTGCCTG
CGCCTGCGGC ACGCACGGCC CGCCCAAGCC GACGAGCACA CCGACTGGGG CCCCTACGAG
CCATGGGCCC GCCCCTTGGT GCTGCTGGCC CGAGACCGCC GTGATGAGGC CGCCGCGGCG
CTGCGCCAGG CCCCGGACCC GCCCCGCGAC CTGCTGTTCG AAGCCATGTG GTGCCTCGCC
GCGCGGGCCG CCGTCGCCGT CGGCGACCGG GAGACGATGG AGCGCGCCCG CACCGAGCTC
ACCCCCGCGG CCGCGGAGCT GGCGGGGGCG GGCAGCGGCC TGCTCACCCT GGGCCCCGTC
TCGGAACACC TCGGCGGCCT CGCCGCCGCC CTCCGCCGTC GCAGGTGA
 
Protein sequence
MFGVLGPVAA WDGDGDAIAL KGPRHRAVLA RLIVARRRVV PVTLLAEDLW ADPPPGAVGA 
VRTFVAALRR ALEPRRPPRT AARLLVTEGP GYALRAEPGA VDSWRFEQAV SAAATLPPAD
ALARLEEALG WWRGPAYAEF ADEAWARTER SRLAELRLQA VERRADARLT LGLAAEAVPD
LDAHVTEHPW REDAWRLLAL ALYRAGRQGD ALAVLRRART LLVEQLGVDP GPGLRRLETD
ILNHAGHLDP AATAAGAADR VWAQAAAAYD RAVPPGARAR LESTVGLLRD LAVTGGGGLE
AARRHRVAAV TAAQEMGDAQ LTARVIGAYD VPAIWTRSDD PEQAAQIVAA AERTLAALPP
GAHQAARARL LATIALESRG TRAERGPQAA RQAEEIARGL DDPGLLAFAL NGVFMQTFHR
AGLAPRRDEI GAELVALSAR HGLVTFEVLG HLIRLQARSA LADFPAADGH AAAAEHLAER
HELPLAGVFA QWYRALRLAA TGRAPEAEVE AAYRDAAARL DGAGMPGLRR GVLPLALLCL
RLRHARPAQA DEHTDWGPYE PWARPLVLLA RDRRDEAAAA LRQAPDPPRD LLFEAMWCLA
ARAAVAVGDR ETMERARTEL TPAAAELAGA GSGLLTLGPV SEHLGGLAAA LRRRR