Gene Sros_4226 details


Gene Information

Locus tag: Sros_4226
Symbol:
ID: 8667520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4707978
End bp: 4710020
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: RNA polymerase sigma factor containing a TPR repeat domain-like protein
Protein accession: YP_003339871
Protein GI: 271965675
COG category:
COG ID:
TIGRFAM ID:
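
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length_bp = gene_length(4707978, 4710020)
print(length_bp)                  # 2043
print(protein_length(length_bp))  # 680
```

Under these assumptions the computed values agree with the Gene Length (2043 bp) and Protein Length (680 aa) fields of the record.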


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.338891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0144943
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCCC GCGGGAGCGC CGCCCCCGGT TTGCGCACGA GCCGGGGCGG TGCTCTTATC 
GTGCCCATGA CGGCCCGGAA CGTCCATGGT GCGATCGACG CGGTCTGGAA GTTCGAGTCC
GCGAGGATCA TCGCCGGGCT CACCAGAATG GTCCGCGACG TCGGCCTCGC CGAGGAGCTC
GCGCAGGACG CGCTGGTAGC CGCGCTCGAA CAGTGGCCCG GACAGGGCGT CCCGGATAAT
CCGGGCGCAT GGCTCATGAC CACCGCCAAA CGCCGGGCCA TCGATCACCT CCGCCGCGGC
GAGCGGCTGG AACGCAAGCA CGAAGAGATC GCCCACGCGC TGGAACAGCA GGGTCTCGAA
GAAGGTGTGG ACGACGACGT CCTGCGGCTG ATGTTCGTCT CCTGCCACCC GGTGCTGCCG
GCCGAGGCAC GGGCCGCGCT GACGCTCCGG CTGCTCGGCG GTCTGACCGC CCTGGAGATC
GCCCGGGCCT TCCTCGTGTC CGAGCAGGCC GTCGCCCAGC GCATCGCGAA GGCCAAGCGC
ACCCTCGCCG AAGAGCGGGT CCCGTTCGAA CTGCCTCCCG GACCGGAACT CGCCGGACGG
CTGGCGTCCG TGCTCGAAGT CATTTACCTG ATCTTCAACG AGGGCTATTC GGCGACGTCC
GGCGACGACC TGATGCGCCC GTCGCTGTGC CTGGAGGCGC TCCGGCTGGG CCGGCTGCTG
GCCGAGCTGG CGCCCCGGCA GGCCGAGGTC CACGGCCTGG TCGCGCTGAT GGAGATCCAG
GCGTCGCGTT CGGCCGCGCG GACCGGCCCC TCGGGCGAGC CCATCCAGCT CCACGAGCAG
AACCGCGGAC GCTGGGATCA GCTCCTCATC CGCCGCGGTT TCGCGGCGAT GTTACGGGCG
AAGGAGGCCG GGGGCCCGCC CGGCCCGTAT GTGCTGCAGG CCGCGATCGC CGTCTCGCAC
GCGCAGGCGA AGACCGCGCA GGAGACGGAC TGGGGTCAGA TCGCGGCCCT CTACGGAGCG
CTGGTCCGGC TGGTGCCGTC GCCCGTGGTC CAGCTCAACC GTGCCGTGGC GCTGGGGATG
GCCCGCGGGC CCCAGGCGGG GCTGGACATC GTCGACACGC TGACCTCCGA TCCCGCGCTC
AAGAACTACC ACCTGCTGTC CGGCGTGCGC GGCGACTTCC TGGCCAAGCT CGGACGGCAC
GACGAGGCCA GGACGGAGTT CGAGCGCGCG GCCTCGCTCA CTCACAACGC CCCCGAACGC
GCTTTCCTGC TCAGGCGGGC CGCTACCGGC ACAACCCCGG TGGCGGGAGT CACCCTGGGC
CAGGCCGCGG AGAGCTTCCT GGCCCGCGAG GACCTGGACG CCGGGACGAT CCGCTCCTAC
GGCCAGACCC TGCGCCGGAT GTGCCTGGAC CTTGGAGCCG GCACCCCGCT GGCCGAGGTG
ACCGCCGGGA AGTTGTCCAC GGTCTTCTCC GTCGCCTGGG ACGGGGCGGC GGCCAAGACC
TGGAACCGGC ACCGCGCGGC CGTCCGCTCC TTCTCCTCCT GGGCGTCCGT CGACGACCTC
TCCGCCGGAC TGGCCCGGAA GCCGGAGAGC CGCGAGCGCA GGCCGTCCAT CGGCCCGTCA
CAGCTCGACG CCCTCTGGGA GCGCCCGGGC ACGGCGCTAC GCGAGAAGGT GCTGTGGCGG
CTCCTCCACG AGTCGGCGGC CGCCGCCAGA ACGGCCCTGT CCCTCAACGT CGAAGACCTC
GACCTGGACG ACCGGCGCGG ACGCGTCGCC GCCAAGAACG GGCAGGTCTG GCTGTCCTGG
CAGTCCGGCA CCGCCCGCCT GTTACCCCAC CTCGTGGCGG GCCGGACGCG AGGCCCGCTG
TTCCTGGCCG ACCGGAGACC CGCACCGGCC CGGATGCCCG GACCCGCCGA CCTCTGCCCC
GACACCGGAC GGGGACGGCT CTCCTACGAA CGCGCCGAAT ACCTCTTCAA ACAGGCCACC
AGGCCGCTCG ACCCGTCAGG CGCGGGCTAC ACCCTCCACC AGCTCAGCCA CTCCCGCCCT
TGA
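
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined and the GC content recomputed for comparison with the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGACACCCC GCGGGAGCGC CGCCCCCGGT TTGCGCACGA GCCGGGGCGG TGCTCTTATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the full sequence text would give a figure to compare against the reported GC content.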
 
Protein sequence
MTPRGSAAPG LRTSRGGALI VPMTARNVHG AIDAVWKFES ARIIAGLTRM VRDVGLAEEL 
AQDALVAALE QWPGQGVPDN PGAWLMTTAK RRAIDHLRRG ERLERKHEEI AHALEQQGLE
EGVDDDVLRL MFVSCHPVLP AEARAALTLR LLGGLTALEI ARAFLVSEQA VAQRIAKAKR
TLAEERVPFE LPPGPELAGR LASVLEVIYL IFNEGYSATS GDDLMRPSLC LEALRLGRLL
AELAPRQAEV HGLVALMEIQ ASRSAARTGP SGEPIQLHEQ NRGRWDQLLI RRGFAAMLRA
KEAGGPPGPY VLQAAIAVSH AQAKTAQETD WGQIAALYGA LVRLVPSPVV QLNRAVALGM
ARGPQAGLDI VDTLTSDPAL KNYHLLSGVR GDFLAKLGRH DEARTEFERA ASLTHNAPER
AFLLRRAATG TTPVAGVTLG QAAESFLARE DLDAGTIRSY GQTLRRMCLD LGAGTPLAEV
TAGKLSTVFS VAWDGAAAKT WNRHRAAVRS FSSWASVDDL SAGLARKPES RERRPSIGPS
QLDALWERPG TALREKVLWR LLHESAAAAR TALSLNVEDL DLDDRRGRVA AKNGQVWLSW
QSGTARLLPH LVAGRTRGPL FLADRRPAPA RMPGPADLCP DTGRGRLSYE RAEYLFKQAT
RPLDPSGAGY TLHQLSHSRP
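
The protein sequence corresponds to the gene sequence translated with the translation table listed in the record (table 11). The following is a minimal sketch in plain Python, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between ten-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGACACCCC GCGGGAGCGC CGCCCCCGGT"))  # MTPRGSAAPG
```

The output matches the first ten residues of the protein sequence shown above.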