Gene Sros_4481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4481 
Symbol 
ID8667775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4998620 
End bp5000269 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content76% 
IMG OID 
ProductSerine phosphatase RsbU regulator of sigma subunit-like protein 
Protein accessionYP_003340091 
Protein GI271965895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00461161 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.901449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CACGGCCCGC CGGCGACACC GATCGCAGAC GACAGAGCAC CCACGCGGCG 
CACGACGCCC CCGGTGGACC GGGGGAGGAA CTGCTGCGCG CCATCGTGGA GGCCAGCGGC
GAGGGCGTGG TGCTCTGCGA CACCGACGGT GTCGTCGTCA TGGTCAACGC CGCGGCGGCG
CGGATGGCGC CGGAGATCGT GCCCGGCCGG CCCGCGGCCT CGGGCGGTGC GGCGGCGCTG
TTCGGCCACG ACCACCCCGA GGGCGACAGC AGGGAGGTCG GCCGCGGCGA CCTGCGGCTG
CGGGTGCGCC GGGAGGCCTT CGGCGTCGCC CACCACGCCT GGTACCTGCG CGACGTCACC
GCCGAGTCGG CCAGGACCGA CGCGCTGCTG GCCGAGCGCC GGCGTACCGA GTTCCTGGTG
GAGGCCGGGC GGCGGCTGTC GGCCTCGCTC AACACGCGCC GCTGCGCGCG TACCGCGGTC
GAGCTCGCCG TCTCCTTCCT GGCCGATGCC GCCATGGTCG TGCTGCCGCC CGCCGGGCGC
CGCCAGGCGG GCTGGCTGCG CTCGGTCGCC GGGCAGCACA GTCTGCAGGA GGGCGCGGTC
CCCGCGGCCT CCACCGCGCA GGTGCCGGGC CTGGCCGAGG CGCTGGCGGG TTTCCCGCCG
GTGCCCAGCC GGTGGCTGGA TCCCGGCCAG GCCCCCCAGT GGCTTTTGCC CGAGGGGTTC
GGCCAGGCCG GTCACCTGCT CGTCACCCCG CTGCCCGGCA ACGGCGTCCC GGCCGGGGCG
CTGGTGCTGG CCCGCCGCGA GGGCGGACCG GCCTTCGACG AGGAGGCCGA GATCCTGGTG
CGGGTCTTCG CCGCCCGCGC GGGCGCGGCG ATCTCGGCCG CCGCCCTGTA CCAGGAGCAG
AGCGCCACCA ACGCGATCTT GATCAGGGAC CTGCTGCCGC CCGCCCTGCC CGCCCTGGCC
GGGATGGAGC TGGCCGGCTC GTTGCGCTCG GCCCAGCAGG CCGGGCTGAT CGGCGGTGAC
TTCTACGACC TCTACCTGCC CGCCCCCGGC GCCCCGGGCG AGCCGCTGGT GGTCCTGGGC
GACGTCTGCG GCAAGGGGGC GCGCGCCGCC GTCCTGGCCG GGCAGGTCCG CCAGTCCCTG
CGCACGCTGC TGCTGCTGGA AAGGCGCCCC GAGCGGCTGA TGTCGCTGCT CAACCGCTCC
CTGCTGGCCT CCCCCTCGCC CAACCCCTAC GTCACCCTCG TGCTGGGCGC GCTGCGGGCC
GGGCAGCGGG GCCACGTGCT GCTGGATTTG GCCGTGGCCG GGCATCCCCC GCCGCTCATC
CTGCGCCGGG ACGGCACCGT GCAGGAGGCA GGCGCCGGCG GCTCCCTGCT CGGCGCGCTG
CAGGAGACCG TCTTGACCCC GGTCACCGTC GACCTGGCCC CCGGCGAGAT GTGCCTGCTC
TACAGCGACG GCATCACCGA GGCCGTCGGC GGTCCCACCG GCCGGGAGAT GTACGGCGGG
CAGCGGCTCA AGAACGCCCT GTCCACCTGC GCGGGCATGC CGGTCTCGGC GGCGGTGGAG
CGGCTGGAGC AGATCAGCAC CGAATGGCTG GCCGGCGACG TCCAGGACGA CCGGGCGCTG
CTCGCCGTAC AGGCCAGGCC CGCGCGATGA
 
Protein sequence
MTATRPAGDT DRRRQSTHAA HDAPGGPGEE LLRAIVEASG EGVVLCDTDG VVVMVNAAAA 
RMAPEIVPGR PAASGGAAAL FGHDHPEGDS REVGRGDLRL RVRREAFGVA HHAWYLRDVT
AESARTDALL AERRRTEFLV EAGRRLSASL NTRRCARTAV ELAVSFLADA AMVVLPPAGR
RQAGWLRSVA GQHSLQEGAV PAASTAQVPG LAEALAGFPP VPSRWLDPGQ APQWLLPEGF
GQAGHLLVTP LPGNGVPAGA LVLARREGGP AFDEEAEILV RVFAARAGAA ISAAALYQEQ
SATNAILIRD LLPPALPALA GMELAGSLRS AQQAGLIGGD FYDLYLPAPG APGEPLVVLG
DVCGKGARAA VLAGQVRQSL RTLLLLERRP ERLMSLLNRS LLASPSPNPY VTLVLGALRA
GQRGHVLLDL AVAGHPPPLI LRRDGTVQEA GAGGSLLGAL QETVLTPVTV DLAPGEMCLL
YSDGITEAVG GPTGREMYGG QRLKNALSTC AGMPVSAAVE RLEQISTEWL AGDVQDDRAL
LAVQARPAR