Gene Sros_5361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5361 
Symbol 
ID8668655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5873303 
End bp5875585 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content68% 
IMG OID 
Productpenicillin acylase 
Protein accessionYP_003340867 
Protein GI271966671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.360999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTCT CGCCGCGCGA CAAGGTGGAC CAGCTCATGA ACCACCCGCA GAACGAAGGG 
CTGAGCCGAC GCTGCGTCCT GGGCGGAGCA GCCGGACTGG CCGCTGGAGC CGTAGCCGCC
GCGGCGGGGC TGCCGTCCGG CGACGCGTCC GCCGCTTCCA CGCGAACCGC GCCCGATCTC
GCCCGCTGGC GCATGCAGGC GGCGCGCGTG ACGATCACCC GCGATGACTG GGGCATCCCG
CATGTGGTGG GCGAGACCGA CGCCGATGCG GTGTTCGGGA TGATCTATGC CCAGGCCGAG
GACGACTTCA ACCGGATCGA GACCAACTAT CTGGTCAGCC TCGGCCGCCT CGCCGAGGCC
GAGGGCGAGA GTGCGATCTG GCAGGACCTG CGCCAGCGGC TGTTCCTCGA TCCCGAAGTG
CTGAAGAAGG ATTGTGCGAA GAGCCCGGCC TGGTTGCGCA AGCTCATGCA GGCCTGGGCG
GACGGGCTGA ACTGGTATCT CGCGACGCAC CGCGAGGTGC ATCCGCGCGT GATCAAGCGG
TTCGAGCCTT GGATGGTGCT GAGCTTCTCC GAAGGCAGCA TCGGCGGCGA CATCGAGCGG
GTGCCGCTCA CCCGGCTCGA GGCCTTCTAC GGCAACCGCG CGGTGGCGAT GACCGACGAG
GAGCGCGAGC TGCTGCTCCA CAAGCCCACG GGCTCGAACG GCATGGCGAT CGCCCCGGGC
CGCACGCGGA ACGGCCATGC GCTGCTGCTG ATCAACCCGC ACACCAGCCT CTTCTTCCGC
TCCGAGCAGC AGGTGACAAG CGGCGAAGGG CTCAACGTCT ATGGCGCGGC CACCTGGGGG
CAGTTCTTCA TCTACCAGGG CTTCAACGCG GACGCGGGCT GGATGCACGC GTCGAGCGGT
GTCGACAATG TCGACGAGTT CGCCGAGACG ATCGTGACCG GGGCGGATGG CAGCCGCTCC
TACCGTTACG GCAACGCGCT GCGGCCGGTG AAGACGGAGA CGATCACGTT GTCCTATCGC
ACTGCGGATG GCCGGCGGGG GCAACGCAGC TTCACCACCT TCGCCACGCA TCACGGCCCG
ATCGTGCGCG AGACGGACGG CAGATGGATC GCGTCCGCGC TGATGAACAG GCCCGTCGAG
GCGCTGCAGC AGAACTTCCT GCGCACCAAG ACGCGGGATT ACGCGGACTT CGTCGAGGTG
GCGGCCTTCA AGGCCAACAG CTCGAACAAC ACGCTGTTCG CGGACTCGAA GGGCGAGATC
GCCTTCCTGA TGCCGCCGTT CATGCCGCTC CGCGACGATC GCTTCGACTA TCGCAAGCCC
GTCGACGGCA GCGATCCGGC GACCGACTGG CGCGGGCTGC ACAGCCTCGA GAGCCTGCCT
CAGGCAGTGA ATCCGAAGAA TGGCTGGGTG TTCAACACGA ACAACTGGCC CTGGACCGCG
GCCGGCGCGG ACAGCCCCAA GGCCGCCGAC TATCCGCGCT ATTTCGACCA GGCCGGGGAG
AACCCGCGCG GGCCGCAAGC GATCCGGTTG CTGAACGCGC GGAGCGACTT CACCCCGCAG
ACGCTGATCG CAGCCGCCTT CGATCCCTAC CTCACCGCCT TCGCCCGGCT CGTGCCGGGG
CTGATCGCGG CGTGGGACAG GTTGCCCGGG GGCGACCGGC GAAAGGCGAA GCTCGACGGC
CCGATCGGCC TGCTGCGCGG CTGGGACTAT CGCTGGGGCG TGGATTCGGC TGCGACCTCG
CTGGCCGTGT TCTGGGGCGA GGAGCTCTGG ACGTCCTCGG TCCAGCCGGC GAAGGATGCG
GGCTTGTCGG TGTGGGATCA CATGGCCGGC CACGCCACCG ACGCGCGGCG GCTCGCTGCG
CTCGAGGCGG CGGCGGAACG GCTGACGCAA GGTTTCGGCA GCTGGCAAGT GCCCTGGGGA
GAGATCAACC GCTTTCAGCG CATCGACGGC GCGATCGTCC AGAAGTTCGA CGATGCCAAG
CCGAGCATCC CGGTGCCTTT CACCTCAGCG CGATGGGGCT CCCTCGCCTC GTTCAGGACG
AAGTGCCGGC CGGCAACGGA GCGCTGTTAC GGCACCAGAG GCAACAGCTT CGTGGCGGTC
GTGGAGTTCG GGCCGAGGGT GCGCGCCTGG GCGGTAACGG CGGGGGGCGC GAGCGGGCAT
CCCGGCTCGC CGCACTTCAA CGACCAGGCC GAACGCTACG CGAGCGGCAA CCTCAGGCCC
GTCTACTTCT ATCCCGACGA CCTCCGGGGC CACATCGAGC GGAGCTACAA GCCGGGCAAT
TGA
 
Protein sequence
MPLSPRDKVD QLMNHPQNEG LSRRCVLGGA AGLAAGAVAA AAGLPSGDAS AASTRTAPDL 
ARWRMQAARV TITRDDWGIP HVVGETDADA VFGMIYAQAE DDFNRIETNY LVSLGRLAEA
EGESAIWQDL RQRLFLDPEV LKKDCAKSPA WLRKLMQAWA DGLNWYLATH REVHPRVIKR
FEPWMVLSFS EGSIGGDIER VPLTRLEAFY GNRAVAMTDE ERELLLHKPT GSNGMAIAPG
RTRNGHALLL INPHTSLFFR SEQQVTSGEG LNVYGAATWG QFFIYQGFNA DAGWMHASSG
VDNVDEFAET IVTGADGSRS YRYGNALRPV KTETITLSYR TADGRRGQRS FTTFATHHGP
IVRETDGRWI ASALMNRPVE ALQQNFLRTK TRDYADFVEV AAFKANSSNN TLFADSKGEI
AFLMPPFMPL RDDRFDYRKP VDGSDPATDW RGLHSLESLP QAVNPKNGWV FNTNNWPWTA
AGADSPKAAD YPRYFDQAGE NPRGPQAIRL LNARSDFTPQ TLIAAAFDPY LTAFARLVPG
LIAAWDRLPG GDRRKAKLDG PIGLLRGWDY RWGVDSAATS LAVFWGEELW TSSVQPAKDA
GLSVWDHMAG HATDARRLAA LEAAAERLTQ GFGSWQVPWG EINRFQRIDG AIVQKFDDAK
PSIPVPFTSA RWGSLASFRT KCRPATERCY GTRGNSFVAV VEFGPRVRAW AVTAGGASGH
PGSPHFNDQA ERYASGNLRP VYFYPDDLRG HIERSYKPGN