Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4226 |
Symbol | |
ID | 8667520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4707978 |
End bp | 4710020 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | RNA polymerase sigma factor containing a TPR repeat domain-like protein |
Protein accession | YP_003339871 |
Protein GI | 271965675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.338891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0144943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCCC GCGGGAGCGC CGCCCCCGGT TTGCGCACGA GCCGGGGCGG TGCTCTTATC GTGCCCATGA CGGCCCGGAA CGTCCATGGT GCGATCGACG CGGTCTGGAA GTTCGAGTCC GCGAGGATCA TCGCCGGGCT CACCAGAATG GTCCGCGACG TCGGCCTCGC CGAGGAGCTC GCGCAGGACG CGCTGGTAGC CGCGCTCGAA CAGTGGCCCG GACAGGGCGT CCCGGATAAT CCGGGCGCAT GGCTCATGAC CACCGCCAAA CGCCGGGCCA TCGATCACCT CCGCCGCGGC GAGCGGCTGG AACGCAAGCA CGAAGAGATC GCCCACGCGC TGGAACAGCA GGGTCTCGAA GAAGGTGTGG ACGACGACGT CCTGCGGCTG ATGTTCGTCT CCTGCCACCC GGTGCTGCCG GCCGAGGCAC GGGCCGCGCT GACGCTCCGG CTGCTCGGCG GTCTGACCGC CCTGGAGATC GCCCGGGCCT TCCTCGTGTC CGAGCAGGCC GTCGCCCAGC GCATCGCGAA GGCCAAGCGC ACCCTCGCCG AAGAGCGGGT CCCGTTCGAA CTGCCTCCCG GACCGGAACT CGCCGGACGG CTGGCGTCCG TGCTCGAAGT CATTTACCTG ATCTTCAACG AGGGCTATTC GGCGACGTCC GGCGACGACC TGATGCGCCC GTCGCTGTGC CTGGAGGCGC TCCGGCTGGG CCGGCTGCTG GCCGAGCTGG CGCCCCGGCA GGCCGAGGTC CACGGCCTGG TCGCGCTGAT GGAGATCCAG GCGTCGCGTT CGGCCGCGCG GACCGGCCCC TCGGGCGAGC CCATCCAGCT CCACGAGCAG AACCGCGGAC GCTGGGATCA GCTCCTCATC CGCCGCGGTT TCGCGGCGAT GTTACGGGCG AAGGAGGCCG GGGGCCCGCC CGGCCCGTAT GTGCTGCAGG CCGCGATCGC CGTCTCGCAC GCGCAGGCGA AGACCGCGCA GGAGACGGAC TGGGGTCAGA TCGCGGCCCT CTACGGAGCG CTGGTCCGGC TGGTGCCGTC GCCCGTGGTC CAGCTCAACC GTGCCGTGGC GCTGGGGATG GCCCGCGGGC CCCAGGCGGG GCTGGACATC GTCGACACGC TGACCTCCGA TCCCGCGCTC AAGAACTACC ACCTGCTGTC CGGCGTGCGC GGCGACTTCC TGGCCAAGCT CGGACGGCAC GACGAGGCCA GGACGGAGTT CGAGCGCGCG GCCTCGCTCA CTCACAACGC CCCCGAACGC GCTTTCCTGC TCAGGCGGGC CGCTACCGGC ACAACCCCGG TGGCGGGAGT CACCCTGGGC CAGGCCGCGG AGAGCTTCCT GGCCCGCGAG GACCTGGACG CCGGGACGAT CCGCTCCTAC GGCCAGACCC TGCGCCGGAT GTGCCTGGAC CTTGGAGCCG GCACCCCGCT GGCCGAGGTG ACCGCCGGGA AGTTGTCCAC GGTCTTCTCC GTCGCCTGGG ACGGGGCGGC GGCCAAGACC TGGAACCGGC ACCGCGCGGC CGTCCGCTCC TTCTCCTCCT GGGCGTCCGT CGACGACCTC TCCGCCGGAC TGGCCCGGAA GCCGGAGAGC CGCGAGCGCA GGCCGTCCAT CGGCCCGTCA CAGCTCGACG CCCTCTGGGA GCGCCCGGGC ACGGCGCTAC GCGAGAAGGT GCTGTGGCGG CTCCTCCACG AGTCGGCGGC CGCCGCCAGA ACGGCCCTGT CCCTCAACGT CGAAGACCTC GACCTGGACG ACCGGCGCGG ACGCGTCGCC GCCAAGAACG GGCAGGTCTG GCTGTCCTGG CAGTCCGGCA CCGCCCGCCT GTTACCCCAC CTCGTGGCGG GCCGGACGCG AGGCCCGCTG TTCCTGGCCG ACCGGAGACC CGCACCGGCC CGGATGCCCG GACCCGCCGA CCTCTGCCCC GACACCGGAC GGGGACGGCT CTCCTACGAA CGCGCCGAAT ACCTCTTCAA ACAGGCCACC AGGCCGCTCG ACCCGTCAGG CGCGGGCTAC ACCCTCCACC AGCTCAGCCA CTCCCGCCCT TGA
|
Protein sequence | MTPRGSAAPG LRTSRGGALI VPMTARNVHG AIDAVWKFES ARIIAGLTRM VRDVGLAEEL AQDALVAALE QWPGQGVPDN PGAWLMTTAK RRAIDHLRRG ERLERKHEEI AHALEQQGLE EGVDDDVLRL MFVSCHPVLP AEARAALTLR LLGGLTALEI ARAFLVSEQA VAQRIAKAKR TLAEERVPFE LPPGPELAGR LASVLEVIYL IFNEGYSATS GDDLMRPSLC LEALRLGRLL AELAPRQAEV HGLVALMEIQ ASRSAARTGP SGEPIQLHEQ NRGRWDQLLI RRGFAAMLRA KEAGGPPGPY VLQAAIAVSH AQAKTAQETD WGQIAALYGA LVRLVPSPVV QLNRAVALGM ARGPQAGLDI VDTLTSDPAL KNYHLLSGVR GDFLAKLGRH DEARTEFERA ASLTHNAPER AFLLRRAATG TTPVAGVTLG QAAESFLARE DLDAGTIRSY GQTLRRMCLD LGAGTPLAEV TAGKLSTVFS VAWDGAAAKT WNRHRAAVRS FSSWASVDDL SAGLARKPES RERRPSIGPS QLDALWERPG TALREKVLWR LLHESAAAAR TALSLNVEDL DLDDRRGRVA AKNGQVWLSW QSGTARLLPH LVAGRTRGPL FLADRRPAPA RMPGPADLCP DTGRGRLSYE RAEYLFKQAT RPLDPSGAGY TLHQLSHSRP
|
| |