Gene Sros_3384 details

Gene Information

Locus tag: Sros_3384
Symbol:
ID: 8666672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3713298
End bp: 3716360
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003339065
Protein GI: 271964869
COG category:
COG ID:
TIGRFAM ID:
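
The coordinates and lengths above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states, and the helper names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(3713298, 3716360)
print(length)                  # 3063
print(protein_length(length))  # 1020
```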


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.799293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAATCA GGATTCTCGG TCCGGTCGAC ATCTGGCGCG ATGGGCGATC GACCGCGATC 
GTCGGGCTGA AACAACGGAC CTTGCTGGCC GTCATGGTTA TGCACGCCAA TCGGGTGGTC
TCCCACGATC GGCTTCTGAC CGCATTGTGG GGCGCGAAGG CCCCGGCGAC CGGACGGCGA
CTGCTCCACA ACCACCTGTG GTCGCTGCGG CGCCTGCTCG CCGAGGGTGA CGCCGTGGAG
AGCACGCCCA CCGGCTACCT GCTGCGCCTG CGGCCGGGCG CCTCCGACCT CGACGTCTTC
GTCACCGAGA CGGCACGGGC CCGCTCCGCC CTGTCGGAAG GTGACACCGC CCAGGCCGCG
GAGAGGTTCC GCACGGCGCT GTCCCTGTGG CGCGGCCCCG CGCTCGGCGG CACCCACCCC
GAGCTGCAGT CGACGGAGGG GGCGGCCCTG GAGGAGTTGC GCCTCGCCGC GCTCATCGGT
CGCATCGAGG CCGATCTCGC CCTCGGACGT CACCCGGAGC TGATCGGCGA GCTGCGCCTG
CTGGTCGGCG AGCACCCGCT GAACGAGGAA CTGCGCGGCC AGCTCATGCG CGCGCTCCAC
CGTGCCGGCC GTACGGCGGA GGCGCTTGAG GAGTTCCGGG CCGGCCGCCT GCACTTCCGC
GACGAGCTCG GGCTGGACCC CGGCGAGGAA CTCACCCGCG TCCACCAGGC GATCCTCTCC
GGGGAGGCCG CAGCGACCGG AAACTCCGGA GGGAACGGGG AGAACGCTCC CGGCACCGCC
ATCGCGGCAC CCGTCCCCGC GGCACCCGGC TCACCCGTGC CCCGGCAGCT GCCCGCCGAC
GTCACGCGCT TCACCGGCCG CGTGGAGAAA CTCCGCCGGC TCGACATGCT CCTGTCCGAG
GAGGAGGGCA CCGCGACCGT GGTGATCTCG GCCATCGCGG GCACCGCCGG GGTCGGCAAG
ACCGCATTGG CGACACACTG GGGACACCGG GTGGCCGCCC GGTTCCCCGA CGGGCAGCTC
TACGTCAACC TGCACGGCTA CTCACGGGGA CGGGCCACCA CCGGGGCTCA GGCACTGGAC
CGGCTTCTCC GCGGGCTCGG CGTGGTCGAC GACGAGATCC CGCACGACGT CGACGAGCGC
GCGGGGCTTT ACCGCTCGCT GCTGGCGCAC CGGCGGATGC TCATCGTCCT GGACAACGCC
GCCACCCCGG AACAGGTCCG TCCCCTGCTG CCCGGCTCCT CCCCCTCCAG GGTCGTCATC
ACCAGCAGGG ACGCCCTGCG CGGGCTCTCC GTCACCCACG ACGTCCGCGG CATCGTGCTC
GACGTGCTGC CGGCCGACGA GGCGACCGCG CTGCTCAACA AGCTCCTGGG CAGAAACGGA
ACGGATGACG AGACGGATCC GGTCCCCGAG CTGGCCCGGC TGTGCGGATA CCTGCCGCTG
GCACTGCGAC TGGCCGCGGC GCAGCTCGCG GGAGAACCCG CCTCCCGGAT CGGTGACTTC
ATCGCCAAGC TCCGGCAGGA GAACCGGCTG ACCGTTCTGG AGCTCAGGGA AGACCCCGGC
ACCGGGGTCC GCTCCGCTCT CGAACTGTCC TACCGGAGCC TCCCGGAACC GGCACGGCGG
ACGCTCCGGC TGCTCAGCGT GCATCCGGGA CCGGACATCG ACCTCCAGGC GGTGGCCGCT
CTCACCGCCA TGTCCGCCGA GGACGCGTCG GCGGCCGTCG AGTCGCTGCT GAACGCGCAC
CTGCTCCAGC GGGACGGCGA CGGCAGACTG TCGATGCACG ACCTCGTGCG CGTCTACGCG
GGAGAACGGA ACGAGGCCGA CGACAGCGCG GCGGATCGTG ACGGCGCGCT GACGAGGATG
CTCGACTGGT ACCAGTACGC GGTCCTGAAA GCCATGGAAC ACCTGTCCTC CGATGACAGC
GCCTCAATGA CGATCACTCC CGTGGATGAC GGGATACCGG ACCTGCCCGG TGTGGATGAG
GCGATGGCCT GGCTGGAACG GGAGCGCCAC GTTCTCATAG CCGTGATCGT CCACGCCGCG
GAGCGGGATC GGCACATCCA CGCATGGCAG ACCGCCGCCC TGCTGTCGTG GTTCTTCTAC
GCGAAGAACC ACTTCGACGA CCTGTTCATG ACGGGCGAGG TCGGGCTGTC CTCCGCCCGG
CGGATCGGTC ATCGGCACGG TGAGGCCGAG ATCCTGAGCG ACCTGGGCTA CGCCAAGATG
TTCACCGGGC GGTACGCCGA GCACCTGAGC CACCAGCAGC AGGCTCTGGA CATCTGGCGT
GCCGTCAAGG ATCGCAGGGG TGAGGCGAAA GGGCTGCGCC ACGTGAGCTA CGCCCTGCAG
CTGGCCGGAC GGCCCATCGA GGCGATCGAG GTCGGCGAGC GGTGTCTCGC CCTGAGCCGC
GAACTCGGGG ATCGCACGGG CGAGTTCACC GCGTTGGACA ATCTGGCGAT CAGCTACCAC
GTCGCAGGGC GCTACGAGGA GGCGCTTGAG GCCCTGTCGA AGTGCCACGG CTACTGGCGG
GAAGAGGGCA GGGAGTATGA CGAGGCCTAC TGCCTCATCC AGATGGGCGC CGTCCACACG
AAACTGGGTG ATCTGACGAC GGCTCTGGAC TGCTTCGAGA AGGCGCTCCC CCTGGGGCGG
AGCCAGGGCA ATCTCCGGAT CGAGGTCGAC GTGTTCAACG GCATCGCCGT GGTCCTGCGC
CACCAGGGAT CTCATGCGGA GGCTCTCGAC CACCACGAGA AGGCGCTCGC CCTCGCGAGA
ACGCTGCGGA GCAGGCCGCT GGAGGGTGAG CTGCTCAGCA GCCTCGGGGA GACCTGCCTG
GCGAGCGGGG ACAGCCGGGC CGCCCTGGAG CACTACCAGG AAGCGGCGGT CTACGCCGAC
GAGGCGGACG ACGCCTACCA GCGAGGATTC GCCTACGCGG GGCTCGGCAG CGCCCTGCAC
GCCCTGGGGA GGCCCGACGA CGCCGCGAAG CATTGGCGGA CGGCGTTCGA CATACTGTCG
CCCATGGACC TCCCCGAGGC CGGGGTCATC GCCGAGCGGA TGCGCGCGGC GGGCCTGACA
TGA
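
The GC content listed in the record (70%) is presumably the percentage of G and C bases in the gene sequence above, rounded. A minimal sketch of that calculation, assuming whitespace in the grouped sequence is ignored; `gc_content` is a hypothetical helper, not a tool used by the source database:

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above, as an illustration only.
print(gc_content("GTGGAAATCA"))  # 40.0
```

Passing the full gene sequence, including its spaces and line breaks, to the same function gives the value for the whole gene.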
 
Protein sequence
MEIRILGPVD IWRDGRSTAI VGLKQRTLLA VMVMHANRVV SHDRLLTALW GAKAPATGRR 
LLHNHLWSLR RLLAEGDAVE STPTGYLLRL RPGASDLDVF VTETARARSA LSEGDTAQAA
ERFRTALSLW RGPALGGTHP ELQSTEGAAL EELRLAALIG RIEADLALGR HPELIGELRL
LVGEHPLNEE LRGQLMRALH RAGRTAEALE EFRAGRLHFR DELGLDPGEE LTRVHQAILS
GEAAATGNSG GNGENAPGTA IAAPVPAAPG SPVPRQLPAD VTRFTGRVEK LRRLDMLLSE
EEGTATVVIS AIAGTAGVGK TALATHWGHR VAARFPDGQL YVNLHGYSRG RATTGAQALD
RLLRGLGVVD DEIPHDVDER AGLYRSLLAH RRMLIVLDNA ATPEQVRPLL PGSSPSRVVI
TSRDALRGLS VTHDVRGIVL DVLPADEATA LLNKLLGRNG TDDETDPVPE LARLCGYLPL
ALRLAAAQLA GEPASRIGDF IAKLRQENRL TVLELREDPG TGVRSALELS YRSLPEPARR
TLRLLSVHPG PDIDLQAVAA LTAMSAEDAS AAVESLLNAH LLQRDGDGRL SMHDLVRVYA
GERNEADDSA ADRDGALTRM LDWYQYAVLK AMEHLSSDDS ASMTITPVDD GIPDLPGVDE
AMAWLERERH VLIAVIVHAA ERDRHIHAWQ TAALLSWFFY AKNHFDDLFM TGEVGLSSAR
RIGHRHGEAE ILSDLGYAKM FTGRYAEHLS HQQQALDIWR AVKDRRGEAK GLRHVSYALQ
LAGRPIEAIE VGERCLALSR ELGDRTGEFT ALDNLAISYH VAGRYEEALE ALSKCHGYWR
EEGREYDEAY CLIQMGAVHT KLGDLTTALD CFEKALPLGR SQGNLRIEVD VFNGIAVVLR
HQGSHAEALD HHEKALALAR TLRSRPLEGE LLSSLGETCL ASGDSRAALE HYQEAAVYAD
EADDAYQRGF AYAGLGSALH ALGRPDDAAK HWRTAFDILS PMDLPEAGVI AERMRAAGLT