Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3384 |
Symbol | |
ID | 8666672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 3713298 |
End bp | 3716360 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339065 |
Protein GI | 271964869 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.799293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAATCA GGATTCTCGG TCCGGTCGAC ATCTGGCGCG ATGGGCGATC GACCGCGATC GTCGGGCTGA AACAACGGAC CTTGCTGGCC GTCATGGTTA TGCACGCCAA TCGGGTGGTC TCCCACGATC GGCTTCTGAC CGCATTGTGG GGCGCGAAGG CCCCGGCGAC CGGACGGCGA CTGCTCCACA ACCACCTGTG GTCGCTGCGG CGCCTGCTCG CCGAGGGTGA CGCCGTGGAG AGCACGCCCA CCGGCTACCT GCTGCGCCTG CGGCCGGGCG CCTCCGACCT CGACGTCTTC GTCACCGAGA CGGCACGGGC CCGCTCCGCC CTGTCGGAAG GTGACACCGC CCAGGCCGCG GAGAGGTTCC GCACGGCGCT GTCCCTGTGG CGCGGCCCCG CGCTCGGCGG CACCCACCCC GAGCTGCAGT CGACGGAGGG GGCGGCCCTG GAGGAGTTGC GCCTCGCCGC GCTCATCGGT CGCATCGAGG CCGATCTCGC CCTCGGACGT CACCCGGAGC TGATCGGCGA GCTGCGCCTG CTGGTCGGCG AGCACCCGCT GAACGAGGAA CTGCGCGGCC AGCTCATGCG CGCGCTCCAC CGTGCCGGCC GTACGGCGGA GGCGCTTGAG GAGTTCCGGG CCGGCCGCCT GCACTTCCGC GACGAGCTCG GGCTGGACCC CGGCGAGGAA CTCACCCGCG TCCACCAGGC GATCCTCTCC GGGGAGGCCG CAGCGACCGG AAACTCCGGA GGGAACGGGG AGAACGCTCC CGGCACCGCC ATCGCGGCAC CCGTCCCCGC GGCACCCGGC TCACCCGTGC CCCGGCAGCT GCCCGCCGAC GTCACGCGCT TCACCGGCCG CGTGGAGAAA CTCCGCCGGC TCGACATGCT CCTGTCCGAG GAGGAGGGCA CCGCGACCGT GGTGATCTCG GCCATCGCGG GCACCGCCGG GGTCGGCAAG ACCGCATTGG CGACACACTG GGGACACCGG GTGGCCGCCC GGTTCCCCGA CGGGCAGCTC TACGTCAACC TGCACGGCTA CTCACGGGGA CGGGCCACCA CCGGGGCTCA GGCACTGGAC CGGCTTCTCC GCGGGCTCGG CGTGGTCGAC GACGAGATCC CGCACGACGT CGACGAGCGC GCGGGGCTTT ACCGCTCGCT GCTGGCGCAC CGGCGGATGC TCATCGTCCT GGACAACGCC GCCACCCCGG AACAGGTCCG TCCCCTGCTG CCCGGCTCCT CCCCCTCCAG GGTCGTCATC ACCAGCAGGG ACGCCCTGCG CGGGCTCTCC GTCACCCACG ACGTCCGCGG CATCGTGCTC GACGTGCTGC CGGCCGACGA GGCGACCGCG CTGCTCAACA AGCTCCTGGG CAGAAACGGA ACGGATGACG AGACGGATCC GGTCCCCGAG CTGGCCCGGC TGTGCGGATA CCTGCCGCTG GCACTGCGAC TGGCCGCGGC GCAGCTCGCG GGAGAACCCG CCTCCCGGAT CGGTGACTTC ATCGCCAAGC TCCGGCAGGA GAACCGGCTG ACCGTTCTGG AGCTCAGGGA AGACCCCGGC ACCGGGGTCC GCTCCGCTCT CGAACTGTCC TACCGGAGCC TCCCGGAACC GGCACGGCGG ACGCTCCGGC TGCTCAGCGT GCATCCGGGA CCGGACATCG ACCTCCAGGC GGTGGCCGCT CTCACCGCCA TGTCCGCCGA GGACGCGTCG GCGGCCGTCG AGTCGCTGCT GAACGCGCAC CTGCTCCAGC GGGACGGCGA CGGCAGACTG TCGATGCACG ACCTCGTGCG CGTCTACGCG GGAGAACGGA ACGAGGCCGA CGACAGCGCG GCGGATCGTG ACGGCGCGCT GACGAGGATG CTCGACTGGT ACCAGTACGC GGTCCTGAAA GCCATGGAAC ACCTGTCCTC CGATGACAGC GCCTCAATGA CGATCACTCC CGTGGATGAC GGGATACCGG ACCTGCCCGG TGTGGATGAG GCGATGGCCT GGCTGGAACG GGAGCGCCAC GTTCTCATAG CCGTGATCGT CCACGCCGCG GAGCGGGATC GGCACATCCA CGCATGGCAG ACCGCCGCCC TGCTGTCGTG GTTCTTCTAC GCGAAGAACC ACTTCGACGA CCTGTTCATG ACGGGCGAGG TCGGGCTGTC CTCCGCCCGG CGGATCGGTC ATCGGCACGG TGAGGCCGAG ATCCTGAGCG ACCTGGGCTA CGCCAAGATG TTCACCGGGC GGTACGCCGA GCACCTGAGC CACCAGCAGC AGGCTCTGGA CATCTGGCGT GCCGTCAAGG ATCGCAGGGG TGAGGCGAAA GGGCTGCGCC ACGTGAGCTA CGCCCTGCAG CTGGCCGGAC GGCCCATCGA GGCGATCGAG GTCGGCGAGC GGTGTCTCGC CCTGAGCCGC GAACTCGGGG ATCGCACGGG CGAGTTCACC GCGTTGGACA ATCTGGCGAT CAGCTACCAC GTCGCAGGGC GCTACGAGGA GGCGCTTGAG GCCCTGTCGA AGTGCCACGG CTACTGGCGG GAAGAGGGCA GGGAGTATGA CGAGGCCTAC TGCCTCATCC AGATGGGCGC CGTCCACACG AAACTGGGTG ATCTGACGAC GGCTCTGGAC TGCTTCGAGA AGGCGCTCCC CCTGGGGCGG AGCCAGGGCA ATCTCCGGAT CGAGGTCGAC GTGTTCAACG GCATCGCCGT GGTCCTGCGC CACCAGGGAT CTCATGCGGA GGCTCTCGAC CACCACGAGA AGGCGCTCGC CCTCGCGAGA ACGCTGCGGA GCAGGCCGCT GGAGGGTGAG CTGCTCAGCA GCCTCGGGGA GACCTGCCTG GCGAGCGGGG ACAGCCGGGC CGCCCTGGAG CACTACCAGG AAGCGGCGGT CTACGCCGAC GAGGCGGACG ACGCCTACCA GCGAGGATTC GCCTACGCGG GGCTCGGCAG CGCCCTGCAC GCCCTGGGGA GGCCCGACGA CGCCGCGAAG CATTGGCGGA CGGCGTTCGA CATACTGTCG CCCATGGACC TCCCCGAGGC CGGGGTCATC GCCGAGCGGA TGCGCGCGGC GGGCCTGACA TGA
|
Protein sequence | MEIRILGPVD IWRDGRSTAI VGLKQRTLLA VMVMHANRVV SHDRLLTALW GAKAPATGRR LLHNHLWSLR RLLAEGDAVE STPTGYLLRL RPGASDLDVF VTETARARSA LSEGDTAQAA ERFRTALSLW RGPALGGTHP ELQSTEGAAL EELRLAALIG RIEADLALGR HPELIGELRL LVGEHPLNEE LRGQLMRALH RAGRTAEALE EFRAGRLHFR DELGLDPGEE LTRVHQAILS GEAAATGNSG GNGENAPGTA IAAPVPAAPG SPVPRQLPAD VTRFTGRVEK LRRLDMLLSE EEGTATVVIS AIAGTAGVGK TALATHWGHR VAARFPDGQL YVNLHGYSRG RATTGAQALD RLLRGLGVVD DEIPHDVDER AGLYRSLLAH RRMLIVLDNA ATPEQVRPLL PGSSPSRVVI TSRDALRGLS VTHDVRGIVL DVLPADEATA LLNKLLGRNG TDDETDPVPE LARLCGYLPL ALRLAAAQLA GEPASRIGDF IAKLRQENRL TVLELREDPG TGVRSALELS YRSLPEPARR TLRLLSVHPG PDIDLQAVAA LTAMSAEDAS AAVESLLNAH LLQRDGDGRL SMHDLVRVYA GERNEADDSA ADRDGALTRM LDWYQYAVLK AMEHLSSDDS ASMTITPVDD GIPDLPGVDE AMAWLERERH VLIAVIVHAA ERDRHIHAWQ TAALLSWFFY AKNHFDDLFM TGEVGLSSAR RIGHRHGEAE ILSDLGYAKM FTGRYAEHLS HQQQALDIWR AVKDRRGEAK GLRHVSYALQ LAGRPIEAIE VGERCLALSR ELGDRTGEFT ALDNLAISYH VAGRYEEALE ALSKCHGYWR EEGREYDEAY CLIQMGAVHT KLGDLTTALD CFEKALPLGR SQGNLRIEVD VFNGIAVVLR HQGSHAEALD HHEKALALAR TLRSRPLEGE LLSSLGETCL ASGDSRAALE HYQEAAVYAD EADDAYQRGF AYAGLGSALH ALGRPDDAAK HWRTAFDILS PMDLPEAGVI AERMRAAGLT
|
| |