Gene Sros_4533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4533 
Symbol 
ID8667827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5051049 
End bp5052383 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID 
Productdyp-type peroxidase family protein 
Protein accessionYP_003340141 
Protein GI271965945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.253706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.544245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC AGACCCACGT CAAGCTCGAA CTCGACGACA TCCAGCGTGG GGTGCTCAGT 
CCCCGGCCGA GCCCGTACGC GGCGACCTAC ATCCTGTTCC GCATCGACGA CCGCGCGCAC
GGACGGGAGC TGATGCGGCG GATGAGCGCG ATGGTCACCT CGGCCGCCGA CACGGTAAGC
CCCCTGGGCC AGACCTGGGT CAGCGTCGCC CTCACCCGCC ACGGCCTCAG CGCGCTGGGG
GTGCCGCGGG AGTCGCTGGA GACGTTCGCC TGGGAGTTCC GGCAGGGAAT GGCCGCGCGC
GCCAACGCGC TGGGCGACAT CGGCGAGAGC GCCCCCGAGA ACTGGGAGGC ACCCCTGGGC
AGCCCTGACG TCCACGTGGT GCTGGTGGCC CTCGCGCCCG ACGGCGCACG ACTGGAGGCG
GCCCTCGACC GCGCCCGCCC GGCCTACCAC GCCCTGCCCG GCGTGACGGC GATCTGGCGG
CAGGACTGCC ACGCGCTGCC CACCGAGACC GAACCCTTCG GCTACCGCGA CGGCATCAGC
CATCCGGCCG TCGAAGGCAG CGGCTTCGCC GGGTCCAACA AGCTGGAGGA GCCGCTCAAG
GCCGGGGAGT TCGTGCTCGG CTACCCTGAT GAGCTCGGCG GCGTCCAGAC CCTCCAGCCC
GAGGTGCTGG GACGGAACGG CACCTATGCG GTCTTCCGCA AACTCCATCA GCGCGTCGCG
GTCTTCCGGC GCTATCTGAG GGACAACGCC ACCGGCCCCG AGGACGAGGA CCTGCTGGCG
GCGAAGATCA TGGGCCGCTG GCGCAGCGGC GCGCCCCTGG CGCTCAGCCC GCTGCGCGAC
GATCCCGACC TCGGCGCCGA CCCGTACCGC AACAACAGCT TCCTCTACCA GCAGGACGAT
CCGGTGGGGT TCACCACCCC CGGCGGCTGC CACATCCGCC GGGGCAACCC CCGGGACGCG
GCGGTGGCGG GCGCGCCGAG GCTGCACCGG ATGATCAGAC GCGGCACCGC CTACGGCCCG
CCGCTGCCCG AAGGGATCCT GGAGGACGAC GGAGCCGACC GCGGGCTGAT GTTCGCCTTC
ATCGGGGCGC ACCTGGGACG GCAGTTCGAG TTCGTCCAGT CCGAATGGAT GAACGACGGC
GTCTTCTTCG GCGCGGGCGA CGCGAGAGAT CCCATCGTCG GATCCGGCGA CGGCCCCGGT
GACTTCACGA TTCCCCGCAG GCCGCTGCGG CGGTGCCTGC GGGCGCTGCC GCGGTTCGTG
GTCACGCGCG GCGGCGAGTA CTGCTTCATG CCCTCGCTGA GCGCCCTGCG CTGGCTCGGC
GACCTCGGAG ACTGA
 
Protein sequence
MSEQTHVKLE LDDIQRGVLS PRPSPYAATY ILFRIDDRAH GRELMRRMSA MVTSAADTVS 
PLGQTWVSVA LTRHGLSALG VPRESLETFA WEFRQGMAAR ANALGDIGES APENWEAPLG
SPDVHVVLVA LAPDGARLEA ALDRARPAYH ALPGVTAIWR QDCHALPTET EPFGYRDGIS
HPAVEGSGFA GSNKLEEPLK AGEFVLGYPD ELGGVQTLQP EVLGRNGTYA VFRKLHQRVA
VFRRYLRDNA TGPEDEDLLA AKIMGRWRSG APLALSPLRD DPDLGADPYR NNSFLYQQDD
PVGFTTPGGC HIRRGNPRDA AVAGAPRLHR MIRRGTAYGP PLPEGILEDD GADRGLMFAF
IGAHLGRQFE FVQSEWMNDG VFFGAGDARD PIVGSGDGPG DFTIPRRPLR RCLRALPRFV
VTRGGEYCFM PSLSALRWLG DLGD