Gene Sros_3487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3487 
Symbol 
ID8666775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3857372 
End bp3858652 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content74% 
IMG OID 
Productselenocysteine lyase / isopenicillin N epimerase 
Protein accessionYP_003339166 
Protein GI271964970 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0293361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0990887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATG GCATGAAGTC GCCGGTGTCG CCGCTCGCGG AACCGGACGG TCCGGCCCGT 
TCCGGCGGCA TCTCCCGCCC GGATCCGCTG CTCGCGCCGT CCGGTGAGCC CGCCCTCTCC
GACTGGTCGC TCGACCCGGC CGTCCGGCAC CTCAATCACG GATCGTTCGG GGCCGTTCCG
CTCGCCGCTC AGCGCGCGCA ACGCGAGTAC CGGACGATCA TGGACGCGAA CCCCTGCGCA
TGGTTCACGG GCGTGGTGGG CCGCGTCGGC GCCGCGCGCG CGGAGATCGC CGCCTACCTC
GGCGCGAGCC CCGACGCGAC GGCCCTCGTC CCCAACGCCA GCGGCGGTGC GAGCGTGGTG
TTCGACAGCG TGCCGGCGTG GCGGGGGATG CGGATCGTGA CGACCGACCA CGGCTATGGC
GCGGTCCTGA TGGGGGCCGG ACGGCTTGCC CGCCGGTGGG ACGGCTCCGT GACCACCGTG
CGCATCCCCC TCGACGCCAC CGACGACGAG GCGTTCGCGG CGGTCGCCGC CGAGATGGCG
GACGATGTCG CCCTCGTCGT CATCGACCAT GTCACCTCCG CGACCGCCCG CCGGTTGCCG
GCCGGGCGCG TCGCCGCCCA CGGCCGCCGT CTCGGCATCC CCGTGCTCGT CGACGCCGCG
CACGCGCCCG GTCTCGTCGC GGACCCGCTG GCCGGTATCG ACGCCGATTT CTGGGTCGGC
AACCTGCACA AGTTCGCGTG CGCGCCGCGT GGAACGGCGG CCCTCGTCGC TTCGGGACCG
CACGCGCGGT CCCTGCACCC GCTCATCGAC TCGTGGGCCG CTCCGGAACC GTTTCCGGCG
CGCTTCGACC AGCAGGGGAC CATCGACGTG ACCTCCTATC TGGCCGCGCC GGTCGCGTTC
GCCACCGTCG AGGAGCACTA CGGCTGGGAC ACGGCGCGGC GCTACATCGC GGAGCTCGGC
GACTACGCGC AGGCCATCGT CACCGAGGCG CTGTCCGGGC TGACCGGCAC GGACGCGTCG
GCGCAGGTCG GCGAGCCCGT CGGCGGCCTG CGTCTCGTCC GCCTGCCGCG CGGCGTGGCG
GCGGACCCGG AGGCCGCCCA CGGGCTGCGG CACGACATCG CCGTACGGCT CGGCATCGAG
ACCGCCATCA CCTCGTGGGG AGGTCAGGGG TTCCTGCGGC TCTCCACGCA CGTGTACAAC
ACGGCGGAGG ACTTCGAGGA CTTCGTCGAG CGCGGCGTCC CGTTCATCGT CGAACGGAGC
CGCGCGGCGC ACGCGGGGTG A
 
Protein sequence
MIDGMKSPVS PLAEPDGPAR SGGISRPDPL LAPSGEPALS DWSLDPAVRH LNHGSFGAVP 
LAAQRAQREY RTIMDANPCA WFTGVVGRVG AARAEIAAYL GASPDATALV PNASGGASVV
FDSVPAWRGM RIVTTDHGYG AVLMGAGRLA RRWDGSVTTV RIPLDATDDE AFAAVAAEMA
DDVALVVIDH VTSATARRLP AGRVAAHGRR LGIPVLVDAA HAPGLVADPL AGIDADFWVG
NLHKFACAPR GTAALVASGP HARSLHPLID SWAAPEPFPA RFDQQGTIDV TSYLAAPVAF
ATVEEHYGWD TARRYIAELG DYAQAIVTEA LSGLTGTDAS AQVGEPVGGL RLVRLPRGVA
ADPEAAHGLR HDIAVRLGIE TAITSWGGQG FLRLSTHVYN TAEDFEDFVE RGVPFIVERS
RAAHAG