Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5166 |
Symbol | |
ID | 8668460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5680088 |
End bp | 5681605 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003340687 |
Protein GI | 271966491 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0248944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAAC CGGAAAGTGA GGACCCGGCC CTGACCCTAC TGGTCAGGGC CGGGGACGAC CGTGCCGCCT CCGAGTTGTA CGAGCGGCAC TACCCGGCCG TCATCGCTTT CGCCCGCCGC CTGTGCCAGG ACCTGCACAC CGCCGAGGAC CTGGCGAGCG AGGCGTTCGC GCGGACCCTG CGCACCGTCC GGAACGGCCC GGCTGGTCCG ACCGGTGACT GGCGTCCCTA CCTGTACGCG GTGGTCCGCA ACACGGCGGC GGAGTGGGCA CGCTCCGATC AGCGGTTCGT CCTGACCGAC GAGTTCCGCG AGGACGATCT CACGACGGCC GCGCCGGAGC CGCCGGACGA TCTCGTGACC CGCGCCTACC GCTCGCTGCC GCCACGCTGG CAGACCGTGC TCTGGCACAC CCTGATCGAG GACGAGGAGC CCGAACGGGT CGCGAAGATC CTCGGCATCA CTCCCGGCAA CGTCGGCGTG CTGGCCTTCC GCGCCCGCGA GGGACTGCGC AAGGCGTACC TGGCCGCGCA CGTATCCAGC GCGTCACCCC GCTGCCAGGA GTACGCGGAG CCGCTGGCGG CGATCGTGCG CAAGAGAAGC GGCCGCCTTC CGCGGGCGCT GCGCGCGCAC CTGGAATCCT GCGCGGGCTG TGCCCGGGCA CACACGGAGC TGCTCGACCT CAACGCCACA CTCCGCGCAG CGCTGCCGAT CGCACTGTTC CCGCTCGCCC TGGGAGCGGG GAAGTGGACC GCGGCGGGAG CGGGGACGCC GGGCACGGCA GGGGCTGGGA CATCAGCCGG GGCGGGGACC GGGAAGTCGG CCGCCGCGCA GAAGGGTGCC GCGACGCCGG GCTGGGCGAT CCCGGTGTCG GGAGCGGCGG CGATCGTCGC CGCCGCCGTG GCCGTGTTCA GCCTGTCATG GGACCCGGCA CCCTCCCCGC CTACGCAGGC CGCCGCCCCC GCGCCGAGCG CGTCACCCAC CCCGGAGCCC ATCCCGAAGC CCACCCGCGT GAAGACCCGG GCGCCCGCCC GCGAGAAAGC CCCGGCCATC CGCGTGACGA CGCCGAGACC TGCGTCGCAG TCCCCCCGCA AGCCCACTCC CCAGCCGGGC ACCCGGATCG CCCACGCGGG CCGATGTGCC GGCGCGGCGG GCGGGCTGGT GGCGCTGCCG TGCGCCGACC CGCGTACGGC CTGGCGTACG CGGGGCGGTG CCCAGCGGTT CCAGCTCGTC AACGTCGCCA GCGGCCGCTG CCTGGCCGCC GGCGAGCAGT ACGACACCGT CGCCTTCAAC GGCGGCGGCA TGCTCGCGGT CCGGCTCCAG CCCTGCTCCT CCGCCCCGGC CCAGCGCTGG CACCGCCCGG CCTTCAGCGA CGGCGTGCGC CGGCTGGTGA GCGTCCCTTC CGGCAAGGCG CTGTCCATCG GCAAGGAGTT CGCCGGCAAG CGTCCGCCGA CGGCGTTCAT CCTCTACGGT CCCTACACCG GCTCAGCCGA TCAGCGCATC ACCCTCGTGG ACGGCTGA
|
Protein sequence | MIEPESEDPA LTLLVRAGDD RAASELYERH YPAVIAFARR LCQDLHTAED LASEAFARTL RTVRNGPAGP TGDWRPYLYA VVRNTAAEWA RSDQRFVLTD EFREDDLTTA APEPPDDLVT RAYRSLPPRW QTVLWHTLIE DEEPERVAKI LGITPGNVGV LAFRAREGLR KAYLAAHVSS ASPRCQEYAE PLAAIVRKRS GRLPRALRAH LESCAGCARA HTELLDLNAT LRAALPIALF PLALGAGKWT AAGAGTPGTA GAGTSAGAGT GKSAAAQKGA ATPGWAIPVS GAAAIVAAAV AVFSLSWDPA PSPPTQAAAP APSASPTPEP IPKPTRVKTR APAREKAPAI RVTTPRPASQ SPRKPTPQPG TRIAHAGRCA GAAGGLVALP CADPRTAWRT RGGAQRFQLV NVASGRCLAA GEQYDTVAFN GGGMLAVRLQ PCSSAPAQRW HRPAFSDGVR RLVSVPSGKA LSIGKEFAGK RPPTAFILYG PYTGSADQRI TLVDG
|
| |