Gene Sros_4390 details


Gene Information

Locus tag: Sros_4390
Symbol:
ID: 8667684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4900488
End bp: 4902809
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: peptidase S45, penicillin amidase
Protein accession: YP_003340010
Protein GI: 271965814
COG category:
COG ID:
TIGRFAM ID:
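
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4900488, 4902809)
print(length, protein_length_aa(length))  # 2322 773
```

Both results agree with the Gene Length and Protein Length fields of the record.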


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0446871
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0863607
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTCA GATCACGACT GGCCGTGCTG ATCGCGCTGG TCCTGCTCCT CTCCGCCGGG 
GTGGTGGTGC GTGCGAACGC GCCGGCCGTG GCCGGCGGCG AGCGCTACTC GGCCACGATC
CGGCGCACCG AGTACGGCAT CCCGCACATC GCCGCCAAGG ACTACGGCGG ACTCGGATTC
GGGTACGGCT ACGCATTCGC GCAGGACAAC CTGTGCGTGC TGGCCTCGTG GCTGGTGACG
CTGCGGGGCG AGCGGTCCCG GCACTTCGGC GCCGCGGCCA TGTCCGACGA CCCGATCCGC
CCGGTCACCA ACTTCGCCAG CGATGTCTAC TACAGGAGCG TCGCGGACTC GGGCGTCGTG
CGGCGGCTGC TGGCCCGGCC CGCGCCGGTG GGGCCAAGCG ACGAGCTGCG CCGCATGGTC
GACGGCTACG CCGCGGGATA CAACCGCTAC CTGGCGGACA CCGGCGTGGC GAACCTGCCC
GATCCCACCT GCCGGGGCAA GGCCTGGGCC GCCCCCATCA CCGCCACGGA CATCTGGAAC
AACCTGCTCG ACCTCAACCG CATGGTCGGA AGCTCCCAGC TCAAGGACGC CATCACCGGG
GCCGGCGCGC CCGAGGCCGA AGGCCCGGTC AAGAGACCGG CCGCGGGCAG CAACGGCTGG
GCGCTGGGCC GCCAGGCGAC GCGCGACGGC CACGGCATGC TGCTGGCCGA CCCGCACTTC
CCGTGGAACG GCAGCCGCCG CTTCTACCAG GTACAGCTGA CCATCCCCGG CGTGCTCGAC
GTCTCCGGCG GCAGCCTGTA CGGCACGCCG GTCGTCCAGA TCGGCCACAA CGCGAGCGTG
GCCTGGACCC ACACCGTCTC CCATGCCCAG CGCTTCACCC TGTACCGGCT GAAGCTCGCG
GGCGACCGTA GCAGCTACAT GGTGGACGGC CGGGCCGAGC CGATCGGCCG CCAGGAGGTG
GAGGTCACCC TGCAGGACGG CACCGTCGCC GGCCACACGC TCTACACCTC GCGCTACGGC
CCGGTGCTCG CCGTCGGGCG CACCGACAAG GTCGCCTACG CGCTCGCCGA CGCCAACGCC
GCCAACCTGC GCGCGGCCGA CGAATGGCTG GCGATGGCCA AGGCGAGTGA CCTCACCCGG
CTGCGCGCGG CCCAGGCCAC CCATCAGGGC ATGCCGTTCG TCCACACGCT CGCCACCGAC
ATCGGCGGGA CCGCCTACTT CGCCGACGCC TCGGTCGTCC CCCACATCAG CGACGCGAAG
GCGCGCCGCT GCGCCAAGCC CTCGCCGGTT CCGGGGCTGG AGGCATACGT CCTGGACGGC
TCGACCTCCG CGTGCCTGTG GGGCAAGGAT CGCGACGCCG TCGTGCCCGG CATCTTCGGG
CCGGGCAACC AGCCGGAACT GACCCGCGCC GACTACGTGG CCAACTCCAA CAACACCGCC
TGGATGACGA ACCCGTCGGC GCCGCTGACC GGCTACCAGG GCGTGTGGGG CAAGGACCGC
ACGGAGCTGG AGCCACGTCC CCGGGTCAGC CTGGACATGA TCGCCCAGAG GCTGTCCGGG
GCCGACGGGC TCGGCGCGCC CGGCTTCACC CTGGAGACAC TGCAGGCCAC CGCGCTCGCC
AAGCGCAACC ACACCTTCGA GCTCATGCGC TCGGACCTGC TGAAGCTGTG CCGTACCCAC
CCCACGCCGA CCGCGTCCGA CGGCACGCGG GTGAACGCGC GCGAGGCCTG CGGGATCCTG
GGGAAGTGGG ACGGGCGCGC CACCCTGGAC GGCCAGGGCG CCATCCTGTG GCGCGAGTTC
TTCACCCGGC TGCGGCGTCC CAGCAAGCCC GGAGACTCCG AATTCACCCC ATGGCGGGTG
CCGTTCGACC CCGCGCGTCC GCTGACCACC CCCCGGGGGC TCAAGCACGA CAGCCCGGCG
GTGCGGCAGG CCCTGGCCGA TGCCGTGCGC TTCTTCCAGG CCAACCGCAT CTCCCTGACG
CTCACCCCGG GCCGGTCGCA GCACTACTCC TCGATCCCGC TTCCCGGCTG TACCGAGGGG
GAGGGCTGTT TCGACAGGGT GCGCATGCGC GGCCCGCTCG GCACGGACGG GCGCTACCCG
GAGGTGGACA CCGGCTCCAG CTTCATGATG GCCGTCGAGC TCGGCCCTGA CGGCCCCCGT
TCCCGCACCA TCCTCACCTA CTCCCTCTCC GCCGATCCCA CCTCGGCCCA CCACACCGAC
CAGACGGTCC TGTTCTCGCG CGGCCAGTGG GTCACCGGGC GCTTCACCGA AGCCGAGATC
GCCGCCGACC CGCAGCTGAG GACCACCACG GTGCGGGGGT GA
 
Protein sequence
MTLRSRLAVL IALVLLLSAG VVVRANAPAV AGGERYSATI RRTEYGIPHI AAKDYGGLGF 
GYGYAFAQDN LCVLASWLVT LRGERSRHFG AAAMSDDPIR PVTNFASDVY YRSVADSGVV
RRLLARPAPV GPSDELRRMV DGYAAGYNRY LADTGVANLP DPTCRGKAWA APITATDIWN
NLLDLNRMVG SSQLKDAITG AGAPEAEGPV KRPAAGSNGW ALGRQATRDG HGMLLADPHF
PWNGSRRFYQ VQLTIPGVLD VSGGSLYGTP VVQIGHNASV AWTHTVSHAQ RFTLYRLKLA
GDRSSYMVDG RAEPIGRQEV EVTLQDGTVA GHTLYTSRYG PVLAVGRTDK VAYALADANA
ANLRAADEWL AMAKASDLTR LRAAQATHQG MPFVHTLATD IGGTAYFADA SVVPHISDAK
ARRCAKPSPV PGLEAYVLDG STSACLWGKD RDAVVPGIFG PGNQPELTRA DYVANSNNTA
WMTNPSAPLT GYQGVWGKDR TELEPRPRVS LDMIAQRLSG ADGLGAPGFT LETLQATALA
KRNHTFELMR SDLLKLCRTH PTPTASDGTR VNAREACGIL GKWDGRATLD GQGAILWREF
FTRLRRPSKP GDSEFTPWRV PFDPARPLTT PRGLKHDSPA VRQALADAVR FFQANRISLT
LTPGRSQHYS SIPLPGCTEG EGCFDRVRMR GPLGTDGRYP EVDTGSSFMM AVELGPDGPR
SRTILTYSLS ADPTSAHHTD QTVLFSRGQW VTGRFTEAEI AADPQLRTTT VRG
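
The two sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in permitting alternative start codons, which does not matter here because the gene starts with ATG). All names in the block are hypothetical helpers, not part of any database API.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three and last blocks of the gene sequence shown above.
print(translate("ATGACGCTCA GATCACGACT GGCCGTGCTG"))  # MTLRSRLAVL
print(translate("GTGCGGGGGT GA"))                     # VRG
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.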