Gene Sros_5166 details


Gene Information

Locus tag: Sros_5166
Symbol:
ID: 8668460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5680088
End bp: 5681605
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: RNA polymerase, sigma-24 subunit, ECF subfamily
Protein accession: YP_003340687
Protein GI: 271966491
COG category:
COG ID:
TIGRFAM ID:
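
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5680088, 5681605)
print(length)                     # 1518
print(protein_length_aa(length))  # 505
```

Both values agree with the Gene Length and Protein Length fields of the record.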


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0248944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAAC CGGAAAGTGA GGACCCGGCC CTGACCCTAC TGGTCAGGGC CGGGGACGAC 
CGTGCCGCCT CCGAGTTGTA CGAGCGGCAC TACCCGGCCG TCATCGCTTT CGCCCGCCGC
CTGTGCCAGG ACCTGCACAC CGCCGAGGAC CTGGCGAGCG AGGCGTTCGC GCGGACCCTG
CGCACCGTCC GGAACGGCCC GGCTGGTCCG ACCGGTGACT GGCGTCCCTA CCTGTACGCG
GTGGTCCGCA ACACGGCGGC GGAGTGGGCA CGCTCCGATC AGCGGTTCGT CCTGACCGAC
GAGTTCCGCG AGGACGATCT CACGACGGCC GCGCCGGAGC CGCCGGACGA TCTCGTGACC
CGCGCCTACC GCTCGCTGCC GCCACGCTGG CAGACCGTGC TCTGGCACAC CCTGATCGAG
GACGAGGAGC CCGAACGGGT CGCGAAGATC CTCGGCATCA CTCCCGGCAA CGTCGGCGTG
CTGGCCTTCC GCGCCCGCGA GGGACTGCGC AAGGCGTACC TGGCCGCGCA CGTATCCAGC
GCGTCACCCC GCTGCCAGGA GTACGCGGAG CCGCTGGCGG CGATCGTGCG CAAGAGAAGC
GGCCGCCTTC CGCGGGCGCT GCGCGCGCAC CTGGAATCCT GCGCGGGCTG TGCCCGGGCA
CACACGGAGC TGCTCGACCT CAACGCCACA CTCCGCGCAG CGCTGCCGAT CGCACTGTTC
CCGCTCGCCC TGGGAGCGGG GAAGTGGACC GCGGCGGGAG CGGGGACGCC GGGCACGGCA
GGGGCTGGGA CATCAGCCGG GGCGGGGACC GGGAAGTCGG CCGCCGCGCA GAAGGGTGCC
GCGACGCCGG GCTGGGCGAT CCCGGTGTCG GGAGCGGCGG CGATCGTCGC CGCCGCCGTG
GCCGTGTTCA GCCTGTCATG GGACCCGGCA CCCTCCCCGC CTACGCAGGC CGCCGCCCCC
GCGCCGAGCG CGTCACCCAC CCCGGAGCCC ATCCCGAAGC CCACCCGCGT GAAGACCCGG
GCGCCCGCCC GCGAGAAAGC CCCGGCCATC CGCGTGACGA CGCCGAGACC TGCGTCGCAG
TCCCCCCGCA AGCCCACTCC CCAGCCGGGC ACCCGGATCG CCCACGCGGG CCGATGTGCC
GGCGCGGCGG GCGGGCTGGT GGCGCTGCCG TGCGCCGACC CGCGTACGGC CTGGCGTACG
CGGGGCGGTG CCCAGCGGTT CCAGCTCGTC AACGTCGCCA GCGGCCGCTG CCTGGCCGCC
GGCGAGCAGT ACGACACCGT CGCCTTCAAC GGCGGCGGCA TGCTCGCGGT CCGGCTCCAG
CCCTGCTCCT CCGCCCCGGC CCAGCGCTGG CACCGCCCGG CCTTCAGCGA CGGCGTGCGC
CGGCTGGTGA GCGTCCCTTC CGGCAAGGCG CTGTCCATCG GCAAGGAGTT CGCCGGCAAG
CGTCCGCCGA CGGCGTTCAT CCTCTACGGT CCCTACACCG GCTCAGCCGA TCAGCGCATC
ACCCTCGTGG ACGGCTGA
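
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before any analysis. A minimal Python sketch of that cleanup, together with a GC fraction calculation, might look like this. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence in this record.
first_line = "ATGATCGAAC CGGAAAGTGA GGACCCGGCC CTGACCCTAC TGGTCAGGGC CGGGGACGAC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Applied to the full gene sequence, the cleaned length can be compared with the listed gene length of 1518 bp, and the GC fraction with the listed GC content of 74%.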
 
Protein sequence
MIEPESEDPA LTLLVRAGDD RAASELYERH YPAVIAFARR LCQDLHTAED LASEAFARTL 
RTVRNGPAGP TGDWRPYLYA VVRNTAAEWA RSDQRFVLTD EFREDDLTTA APEPPDDLVT
RAYRSLPPRW QTVLWHTLIE DEEPERVAKI LGITPGNVGV LAFRAREGLR KAYLAAHVSS
ASPRCQEYAE PLAAIVRKRS GRLPRALRAH LESCAGCARA HTELLDLNAT LRAALPIALF
PLALGAGKWT AAGAGTPGTA GAGTSAGAGT GKSAAAQKGA ATPGWAIPVS GAAAIVAAAV
AVFSLSWDPA PSPPTQAAAP APSASPTPEP IPKPTRVKTR APAREKAPAI RVTTPRPASQ
SPRKPTPQPG TRIAHAGRCA GAAGGLVALP CADPRTAWRT RGGAQRFQLV NVASGRCLAA
GEQYDTVAFN GGGMLAVRLQ PCSSAPAQRW HRPAFSDGVR RLVSVPSGKA LSIGKEFAGK
RPPTAFILYG PYTGSADQRI TLVDG
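
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a plain-Python illustration, not the tool used to produce this record. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons), and it stops at the first stop codon. The `translate` helper and the `CODON_TABLE` name are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly space-grouped) DNA sequence up to the first stop."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGATCGAAC CGGAAAGTGA GGACCCGGCC"))  # MIEPESEDPA
print(translate("ACCCTCGTGG ACGGCTGA"))               # TLVDG
```

The two fragments reproduce the first ten and last five residues of the protein sequence shown in this record.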