Gene Sros_1009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1009 
Symbol 
ID8664283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1025374 
End bp1027605 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content71% 
IMG OID 
ProductSerine phosphatase RsbU regulator of sigma subunit-like protein 
Protein accessionYP_003336753 
Protein GI271962557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGT GGGGGGACGT GACACAGGAC ACCGCCGACC ACCTTTCGGC AGGTTCCGTT 
CTTGACCATG CCGAGATGGC GGTGGTCGTC ACCGACAGGT TCAGCAACCT TCTCTACTGG
AACCCTTTCG CGGAGAAGCT GTTCGGCCGC CCCGGAGTCC CCGGGTCGCG GGACCAGGCG
CTCTCGCTGG GCATCATGGA GAAGGACCAC CCACTCGCCA TCGAGCTCGC CAAACACGTG
CTCAAGGGTG GCGTCTGGGA GGGCACGTTC GACGTCAGGC GCGGTGACGG GACGATCATC
TACGTCCGCG CCCAGGCCGT GCCGCTGCGG CATCCGTCGG GATCGGTGAC GGGCATCGTC
ATCACCGCCC GCGAGGCGAT GCGCAGCAAC GAGCGGGAGA AGGACCGCTT CGGCCTGCTG
GAGCGGATCG GCGAGCGGCT CGCCGGCTCG CTCTACGTCG AGGAGACGCT CAAGCGCGTC
GCCGACATGC TCGTCCCGCA GTTCGCCGAC CACTGCTTCA TCGAGCTGAT GGAGGGCCAG
CGGCTGACCC GGAGGGTCTC CACCCACGTC CAGGGCTGGA GCCCGCCGCC CAACACCTGG
GCGCCGCTCG GCGCGGAGAT CCGCTACCCG GTGGGGCACT ACGCCGAGAT CGCGCTGCGC
CGCCAGGAGA CGATCCTGGT CGAGGACTTC TCCCAGACCA ACTATCCCAG CCCCGGCGAG
GCCAACACCC GCCTGGCGGC CGAGATCGGG ATGACCTCGG CCATCGTGGC GCCGCTGTGC
GTGCGGGGCG AGGTCCTCGG GCTGATGTAC CTGGGCCTGT CGAACCTCAC CGACCGGCGC
AGCCCGCACT ACGACGCCTT CGACCGCGAC TTCGTGGGGG CCATCGCCAC CCGGGTCGCC
CTGGCCGTGG ACAACGCGCT GCTGTTCGAG GAGGAGCGGC ACACCGCCGA GTCCTTCCAG
AAATACCTGC TGCCCCCCGC GCCGCTGCCC GAGCTCGACG GGCTGGAGAT CGCGGTCCGC
TACTACCCGG CGGCGCCGCT CGCCTCGCAC GGGCAGGGCA TCCAGACCCA GGTCGGCGGC
GACTGGTACG ACGTCATCCC GCTGTCGGCG GGCCGGGTCG GCATCGTCAT CGGCGACGTC
GAGGGCCGGG GGGCCAAGGC CGCCGCCGTC ATGGGGCAGC TCCGCGCCGC GCTGCGCGCC
TTCGCCCAGG ACGACAAGCC GCCCGCGGAG ATCCTCGCCC GGCTCGACGA GTGGACGCGC
ATCATCGCCA CCCCCGAGCA GGACGACAGC GGCGAGGACA TCAGCGTCCC GCCCATCGTC
ACCTGCCAGT ACCTCGTCTA CGACGCCTGG TCGCGGCAGC TGTCGTTCGC CAACGCCGGT
CACGCGCCGC CGCTGCTGCT CAACGACGGC ACCTGCGCGG AGCTCGACAT CAAGGAGGTC
GGCCAGCCGC TCGGCGTCCG CGCCAAGGGC GTCCACGCCG ACCTGGTCTA CAAGGAGGAG
ACGCGGACGC TGCCCCCCGG GGCCGCGCTG CTGCTCTACA CCGACGGCCT CGTGGACCGC
CGCCCCGTGC GCGACGCCGA CGGCAGGCCG CCCAGCGACG AGGAGACCCT CGCGCTGCTC
GCGGGCAAGC TCATCGAGGT CTCCGACTCC TCGGTGGAAC GGATCGCGGA CGCCGCGACG
GTCGCCGTGC CCGGCGAGAT CGACGACGAC ATGGCCATCC TGGTGGTCAG GTCCGCCCCC
GACGACCTTG AGGTCGAGGA GCGCACCTTC CCGGCCCAGC CGATCATGGT CGGTGAGGCC
CGCCGGATGG CGGCGGAGGC GTTCACGGGC TGGAACGTCC CCGAGGAGCG CGCCGAGCTC
GCCTGCCTGC TGGTCTCCGA GGTGGTGACG AACGTGGTGC TGCACGCCGC CAGCGCCAGC
GTCCCGCGCC GCGAGCTGGT GCTGGACAGC GCCCCGATGC CGTTCGACGA GACCTGGGAC
GACCTCCCGG GCCTGGAGAA CGAGGTCGTC AACGAGAAGG AGTTCACGCT CCGGCTCCGC
CGGGGCGGGG AGGCCGTCTG GGTGGAGGTC TTCGACCAGG ATCTGCGGCT TCCCCGCATC
CGCAGCGCGG GGGAGAACGA CGAGGGCGGC CGGGGCCTCT ACCTCGTCGA CCAGCTCGCC
AAGCGCTGGG GCTCGCGTCC CACCAAGGAG GGCAAGGCCG TCTGGTTCGA GATCCCCACC
AAGTCCCGGT GA
 
Protein sequence
MKRWGDVTQD TADHLSAGSV LDHAEMAVVV TDRFSNLLYW NPFAEKLFGR PGVPGSRDQA 
LSLGIMEKDH PLAIELAKHV LKGGVWEGTF DVRRGDGTII YVRAQAVPLR HPSGSVTGIV
ITAREAMRSN EREKDRFGLL ERIGERLAGS LYVEETLKRV ADMLVPQFAD HCFIELMEGQ
RLTRRVSTHV QGWSPPPNTW APLGAEIRYP VGHYAEIALR RQETILVEDF SQTNYPSPGE
ANTRLAAEIG MTSAIVAPLC VRGEVLGLMY LGLSNLTDRR SPHYDAFDRD FVGAIATRVA
LAVDNALLFE EERHTAESFQ KYLLPPAPLP ELDGLEIAVR YYPAAPLASH GQGIQTQVGG
DWYDVIPLSA GRVGIVIGDV EGRGAKAAAV MGQLRAALRA FAQDDKPPAE ILARLDEWTR
IIATPEQDDS GEDISVPPIV TCQYLVYDAW SRQLSFANAG HAPPLLLNDG TCAELDIKEV
GQPLGVRAKG VHADLVYKEE TRTLPPGAAL LLYTDGLVDR RPVRDADGRP PSDEETLALL
AGKLIEVSDS SVERIADAAT VAVPGEIDDD MAILVVRSAP DDLEVEERTF PAQPIMVGEA
RRMAAEAFTG WNVPEERAEL ACLLVSEVVT NVVLHAASAS VPRRELVLDS APMPFDETWD
DLPGLENEVV NEKEFTLRLR RGGEAVWVEV FDQDLRLPRI RSAGENDEGG RGLYLVDQLA
KRWGSRPTKE GKAVWFEIPT KSR