Gene Sros_5236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5236 
Symbol 
ID8668530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5751861 
End bp5753060 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID 
Productcytochrome P450 CYP109C2 
Protein accessionYP_003340748 
Protein GI271966552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.134393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA TCGTCGAGCG GTACAACATC CCCGCACAGC ACTTCTGGCT GCATGGGCCG 
CGATCGCGCC AACCGGTCGA GTACGACGCC GACGCCGGCA TGTGGAACGT CTACGGCCAC
CCCGAACTGC AGGAGATCCT CGGCGATCCC GCGACGTTCT CCTCCGACAC CATGCGCCTG
ATCCCCAAGG ACCTGATGCC GGGCATCGAG GAATTCTCGA TGGCCGGCTT CATCACCCAG
ATCGACCCGC CGGAGCACGG CAAGCTGCGC AAGCTGGTCA GCAACGCCTT CACCCGGAAG
GTCGTCGCGG ATCTCGAACC GAGGATCGCC GCCCTCACCC ACGAACTGCT CGACGCGGCA
CACGATCGCG GCCGGTTGGA ACTGGTGACC GATCTGGCCT ATCCGCTCCC GGTCATCGTC
ATCGCCGAAC TGCTGGGGGT GCCCAGCAGC GATCGCGCCC TGTTCAAACA ATGGGCCGAT
GCGCTGTTCC AGCGCGACGC CAAGATCTCA CTGGCCAAAC CCGCCGAACA ACAGGACGTG
GACCTGCAGG CCACGCTGAA GCCGTGGAAG GAGATGTCGG CCTATCTCGC CGGCCACGCC
GCGGAGCGCA GGCGACAGCC GCGCGCCGAC CTGCTCACCA GGCTGGTCGA GGCCGAGGTG
GACGGCGAAC GCCTGCCCGA CGAGGAGGTG GTCAACTTCG CGATCATCCT GCTGCTCGCC
GGGCACATCA CCACGACGAT GCTGCTCGGC AACACGGTGC TGTGCCTGGA CGCCTTCCCC
GAGCAGCAGG ACAAGGTGCG GGCCGACCGA TCCTCGATCC CGGCCGTCAT CGAGGAATCC
CTGCGCCTGT TCACCCCGTT CGCCGCCCTC GGCCGCGCCA CCACCCGCGA CGTCGAGCTC
GGCGGCGTGA CGATACCGGC CGATCACATG GTCATGGCCT GGCTCGGAGC GGCCAACAGG
GACCCCCGGC AGTTCCCCGA CCCCGACGTC TTCGACCCCG GTCGCGACCC CAACCCGCAT
CTCGGGTTCG GCCGCGGCAT CCACTTCTGC CTAGGCGCCC CCTTGGCCCG GCTGGAGGGA
CGGGTCGCCC TGAACATCCT GCTCGACCGC GTCGACCCTC TGCGCACCGA TCCGGACGAC
CCCGTGGAGT TCATGCCCAC GCCGACCATG ACAGGGGTGC GCCGCCTCCC GTTGATCTGA
 
Protein sequence
MADIVERYNI PAQHFWLHGP RSRQPVEYDA DAGMWNVYGH PELQEILGDP ATFSSDTMRL 
IPKDLMPGIE EFSMAGFITQ IDPPEHGKLR KLVSNAFTRK VVADLEPRIA ALTHELLDAA
HDRGRLELVT DLAYPLPVIV IAELLGVPSS DRALFKQWAD ALFQRDAKIS LAKPAEQQDV
DLQATLKPWK EMSAYLAGHA AERRRQPRAD LLTRLVEAEV DGERLPDEEV VNFAIILLLA
GHITTTMLLG NTVLCLDAFP EQQDKVRADR SSIPAVIEES LRLFTPFAAL GRATTRDVEL
GGVTIPADHM VMAWLGAANR DPRQFPDPDV FDPGRDPNPH LGFGRGIHFC LGAPLARLEG
RVALNILLDR VDPLRTDPDD PVEFMPTPTM TGVRRLPLI