Gene Sros_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3356 
Symbol 
ID8666644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3685738 
End bp3687816 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content72% 
IMG OID 
ProductSerine phosphatase RsbU regulator of sigma subunit-like protein 
Protein accessionYP_003339038 
Protein GI271964842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.223934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0312208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTG CCCGGGGGGA GAGATCAAGC GGGCTGGGGC TGGAGGTGTT CGATCCGGCC 
CCCGTCGGGG TGTCGGTCAC CAGTGCTCGG GACCACCGGC TGGTGTACGC CAACGCGGTC
TACCGGTCGG TCTTCGGCGA CAGATCGCTC TGGGCCCTCG CTCGCGAGGC GTTCAGCGAC
CACGTCGATC GGGACTACTA CTTCGGGCTG TCCGACCAGG TCCTGAGCAC CGGCGAGCCC
GTGACCGTCA CCGACGCCCC CGTGGCCCTC GCCTACGCCG ACACCGGCCC GCAGGAGCGC
TTCTTCACCT TCAGCCTCTC GGAGATCTCT CTCGGCGGCG GCGAGCGCGG GGTGCTGCTG
GTGTCGGTGG AAATGACCGA GCAGGTCAGC GCCGCGCAGC GGATCCAGGC CCTCTCCGAG
GAACGGCGCC GCCTTCTCCA GCGCTACCAG AAGCTGGGGA GCGTCGGCGC GGAGATGGTC
TGGGTGACCG CGGCCGACGG CCGCGTCATC GAACCGAGCC CCGGCTGGGA GAGGATGACC
GGGCAGTCGT GGGAGGAATT CGCGGGAGAG GGGTGGCGGG ATGCCGTCCA TCCCGATGAC
CGCGAGCCGA CCGACGAGTC GTGGTCCCGG GCGACTCGGG AGGTGCCCGA CCTGTGGGAG
CACATCCACC GGATACGCCT GGTGGACGGC ACCTACCGGC ACTTCAGCGT CCGCGCCGTA
CCGGTGCTCG AAGGCGACAC CGTGGTGGAG TGGGTGGGCA GCTGCACCGA CATCGAGCAG
GCGTGGCAGG AGGACCGGCG CTGGGAGCTG CTCAACCGCG CCGCCACCGC CACCGCCGAT
GTCACCCGCC TGGAGGAGAT GCTCTCCGCG CTGGCCCGCG CGATGGTGCC GGACCTGGCC
GACGGGTGCA TCATCTACCT CCTCTCCGAC TCGCCGGACC GTCCCGAGAA CGCGTCGCTG
GTGGCGCACC TCGTCGCCTC CGCCGCCCGG CCCGGCCTGC CCGAGCTGCC GGAGTACGGC
GAGGAGGTCT TCGCCCCCGA CAACGTTTTC ACCCGCACGG TGCAGCGCCG CCGCCCCATC
CACCGGACAT TCCCCACGGG GTGCCCCGCC CCCGGTGTCG CACCGGCGGG AGCCGAGGCC
TGGCTCGTCT CGGCCGAGGC CAACAGCGTG GTCCTCGTGC CGGTCGTCGT CGACGGCACC
GTGGCGGCGG TGGTGGTGGC CTCCGTCTGC GGTGACCGCC CCCCGATCAG CTCGGCCGAC
GTCGCCCTGA TGGACCAGAT GCTCGACAAC GCGCACGACT CCCTCAGCAA CGCCATCGAG
TTCCGGCGCA CGAAGCGGGT GGCGCTCGCC CTGCAGCGCA GCCTGCTCCC GAAACCGCCC
GCCGTGCCGG GCCTGCAGAT CACCGCCCGT TACCGGCCGA GCGCCGCCGC CGCCGAGGTC
GGCGGGGACT GGTACGACTG CTTCCGGCTG CGGGACGGCG CCACCATGCT GACGATCGGC
GACGTGGCCG GCCACGACCT GCCCGCCGCC GTCACCATGA GCCAGATCCG CAACCTGCTC
CGGGGGCTCG CCGTCGACCG CGAGGAGCCC GTCGGCGACA TCCTCAGACG CCTGGACATC
GCGATGGAGA CCCTCTACGA GGAGCAGACC GCGACCTGCG TCCTGGCGCG TGTGGAGTGT
TCCGAGGAGA GCGGCTGGCA GCTGAACCAC TCCGTGGCCG GGCATCCACC GCCTCTGCTG
ATCACCGGCG ACGGCGGGGG CCGCTTCCTC GACAGCTCCA CCGACCCCCT GCTGGGCCTG
CTCCCCGACC GGCCGCGGAG CAGCGTCATC GAGCCGCTGC CGCCGGACGG CACGCTGCTC
CTCTACACCG ACGGTCTCGT CGAACGCCCG GGGGAGGACA TCGACGAGGG GCTGGCCCGG
CTGTGCCGCC ACGCCGCGTC GCTCGCCAGG GCCCCCCTGG AGACGTTCTG CGACGCGCTG
CTGTCCGAGC TGGCCGTCGA CGGCAAGGAC GACATCGCCA TGATCGCCGT ACGCCTGCCG
CCCGCACGGC TCACAGCGGC AGAACGCCGC CCGGCTTGA
 
Protein sequence
MKVARGERSS GLGLEVFDPA PVGVSVTSAR DHRLVYANAV YRSVFGDRSL WALAREAFSD 
HVDRDYYFGL SDQVLSTGEP VTVTDAPVAL AYADTGPQER FFTFSLSEIS LGGGERGVLL
VSVEMTEQVS AAQRIQALSE ERRRLLQRYQ KLGSVGAEMV WVTAADGRVI EPSPGWERMT
GQSWEEFAGE GWRDAVHPDD REPTDESWSR ATREVPDLWE HIHRIRLVDG TYRHFSVRAV
PVLEGDTVVE WVGSCTDIEQ AWQEDRRWEL LNRAATATAD VTRLEEMLSA LARAMVPDLA
DGCIIYLLSD SPDRPENASL VAHLVASAAR PGLPELPEYG EEVFAPDNVF TRTVQRRRPI
HRTFPTGCPA PGVAPAGAEA WLVSAEANSV VLVPVVVDGT VAAVVVASVC GDRPPISSAD
VALMDQMLDN AHDSLSNAIE FRRTKRVALA LQRSLLPKPP AVPGLQITAR YRPSAAAAEV
GGDWYDCFRL RDGATMLTIG DVAGHDLPAA VTMSQIRNLL RGLAVDREEP VGDILRRLDI
AMETLYEEQT ATCVLARVEC SEESGWQLNH SVAGHPPPLL ITGDGGGRFL DSSTDPLLGL
LPDRPRSSVI EPLPPDGTLL LYTDGLVERP GEDIDEGLAR LCRHAASLAR APLETFCDAL
LSELAVDGKD DIAMIAVRLP PARLTAAERR PA