Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5547 |
Symbol | |
ID | 8668841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 6067101 |
End bp | 6068378 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | ECF subfamily RNA polymerase sigma-70 factor |
Protein accession | YP_003341043 |
Protein GI | 271966847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.151151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.328912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCTG ACGATCCGGC CGTCGACGAC CTGCTGCGCG AACTCGCGCC GCAGGTCGTC GGCGTGCTCA CCCGCCGCAC CGGGGACTTC GACACCGCCG AGGACGCGGT CCAGGAGGCG CTGCTGAACG CCGCGGCCCA GTGGCCGGAG GAGGGTGTGC CGGGCAATCC ACGCGGCTGG CTCATCCAGG TCGCGTTCCG CCGGATGACC GAGCAGGTGC GCAACGAGCA GGCCCGCCGC CGCCGGGAGG AGCTGGTGGC CACGCAGGCG CCCCCGGAAC GGCGGACCGC GCCCCCCGCC GACGACGCCC ACGAGACGGA CCGGGACGAC ACGCTGATCA TGCTGTTCCT GTGCTGTCAT CCCGCGCTCA CCCCGGCCTC CGCCATCGCT CTCACCCTGC GGTCGGTGGG CGGCCTGACC ACGGCCGAGA TCGCCAAGGC GTTCCTGGTG CCCGAGGCGA CCATGGCGCA GCGGATCAGC CGGGCCAAGC AGCGCATCAA GGCATCGGGG GTGCCGTTCC GGATGCCGGC CTCCCGGGAG CGGGCGCGGC GGCTCGGCTC GGTGCTGCAC GTCCTCTACC TCATCTTCAA CGAGGGCTAC GCCAGCAGCG CCGGCCCCGA CCTGCACCGC GTGGAGCTGT CCCGCGAGGC GATCCGGCTG GCCAGGACGG TCCACGCCCT GCTCCCGGAC GACTGCGAGG TGGCGGGGCT TCTCGCGCTG ATGCTGCTCA CCGACGCCCG GCGCGCCGCG CGGACCGGTC CGCGCGGCGC CCCGATCCCG CTGGCCGAGC AGGACCGGAG CCGGTGGGAC GGCGACGCCG TCGCCGAAGG GGTCGCGCTC ATCACCGCCA CCCTCGCCAA GGGCGCGGTC GGCCCCTATC AGCTCCAGGC GGCGATCGCG GCGCTGCACG ACGAGGCGGC GAGTGCCGAG GAGACCGACT GGCCGCAGAT CCTCGCGCTG TACGGCCTGC TGGAACGCAT GTCCGACAAC CCCGTGATCT CGCTCAACCG CGCCGTCGCC GCCGCCATGG TGCACGGCGC CGCGACCGGC CTCGACATGC TGAAGGCACT CGACGCCGAC GGACGCCTGG CGGGGCACCA CCGCTTCCAC ACCGCGCGGG CCCATCTGCT GGAGATGGAC GGCGACCTGC AGGCGGCCGT CGACGACTAT CGGGTGGCGG CCGGCCGTAC GATGAGCGTC CCGGAGCGGG ACTACCTCAC CGCCCGTGCC GCCAGGCTCG CCGCGACGGT GTCCGGATCG GTCCGCCGGG ACCTGTAA
|
Protein sequence | MSADDPAVDD LLRELAPQVV GVLTRRTGDF DTAEDAVQEA LLNAAAQWPE EGVPGNPRGW LIQVAFRRMT EQVRNEQARR RREELVATQA PPERRTAPPA DDAHETDRDD TLIMLFLCCH PALTPASAIA LTLRSVGGLT TAEIAKAFLV PEATMAQRIS RAKQRIKASG VPFRMPASRE RARRLGSVLH VLYLIFNEGY ASSAGPDLHR VELSREAIRL ARTVHALLPD DCEVAGLLAL MLLTDARRAA RTGPRGAPIP LAEQDRSRWD GDAVAEGVAL ITATLAKGAV GPYQLQAAIA ALHDEAASAE ETDWPQILAL YGLLERMSDN PVISLNRAVA AAMVHGAATG LDMLKALDAD GRLAGHHRFH TARAHLLEMD GDLQAAVDDY RVAAGRTMSV PERDYLTARA ARLAATVSGS VRRDL
|
| |