Gene Sros_7140 details

Gene Information

Locus tag: Sros_7140
Symbol:
ID: 8670451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7880919
End bp: 7882205
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: cytochrome P450 CYP124E1
Protein accession: YP_003342578
Protein GI: 271968382
COG category:
COG ID:
TIGRFAM ID:
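
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(7880919, 7882205)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Under these assumptions the computed values agree with the listed gene length (1287 bp) and protein length (428 aa).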


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.946745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.358768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGACG ATATGAGCGT GACGCCAGCC GAGCGGAAGC CGATCGTCTC CGCCGCCGAC 
TTCGACCTGT CGGACTACGA CTTCTGGGCC AGGCCCATGC CGGAGCGCGA GCACGCCTTC
CGGCTCCTGC GCGGCCTGGA CCACCCCGCC TTCTACGAGG AGATGGAGGT CCCCTTCGCT
CCCAAGGGGG CCGGTTACTA CGCGCTGGTC AAGCACGCCG ACATCCTGGA GGCCAGCCGC
AACCCCGAGG TGTTCTGCTC GGGGGACGGC GGCGCCACCA ACATCCCGGA CATGCCGCCG
GAGTTCACCG AGTACTTCGG CTCGATGATC AACATGGACG ACCCGAGGCA CGCCCGGCTC
CGCCGGATCG TCTCCCGGGC GTTCACCCCC AAGATGATCA AACAGTTCGA GGCCGACGTC
GAGGCCGCCG CCGTCCGGAT CGTCGACGAC CTGATCGCCG GAGGACCCGG CTGCGACTTC
GTCACCGAGG TGGCCGCCAA GCTCCCCCTG AAGATCATCT GCGACATGAT GGGGATCCCG
GAGAGGGACT ACGGCTCCGT CTTCGACCGG TCCAACGTGA TCCTCGGCGG ATTCGACCCC
GAATACGGCG GCGACGACAT GACCACGGTC GCCGAGAGGC TGCTGGCCGC CGGGATGGAG
CTGCAGCAAC TCGTCCAGGA CCTCGCCGCG CACCGGGTCG AGCACCCCAC GGGCGACCTC
ACCTCGTCAC TGGTCAACGC CAACATCGAC GGCGAGAAGC TCACCGCCCA GGAGCTCGGC
TCGTTCTTCA TCCTGCTGGT GGTGGCGGGC AACGAGACCA CGCGCAACGC CATCTCCTAC
GGCCTGCGCC TGCTCACCCA GAATCCGGAC CAGCGCGCGC TGCTCCTGGC CGACCTCGAC
GGCCGCCTCC CCGGCGCGGT CGAGGAGATC GTCCGCCTGG CCTCCCCGGT GAACTTCATG
CGCCGCAAGG TCACCCGGGA CCACGAGATG AACGGCCACC TCTACCGCAA GGGCCAGAAG
GTCGTCCTGT ACTACTGGGC GGCGAACCGG GACGAGAGCG TCTTCCCCGA CCCGCTCCGC
TTCGACGTCA CCCGCGACCC CAACCCGCAC GTCGGCTTCG GCGGCCCGGG GCCGCACTTC
TGCCTCGGCG CCCACCTGGC CCGCCGCGAG ATCACCGTGA TGTTCCGCGA GCTGCTCCGG
CGGATCCCGC AGATCGAGGG CGGCAGACCG CAGCGGCTCT ACTCCAGCTT CATCAACGGC
ATCAAGCACA TGGACTGCGC GTTCTGA
 
Protein sequence
MIDDMSVTPA ERKPIVSAAD FDLSDYDFWA RPMPEREHAF RLLRGLDHPA FYEEMEVPFA 
PKGAGYYALV KHADILEASR NPEVFCSGDG GATNIPDMPP EFTEYFGSMI NMDDPRHARL
RRIVSRAFTP KMIKQFEADV EAAAVRIVDD LIAGGPGCDF VTEVAAKLPL KIICDMMGIP
ERDYGSVFDR SNVILGGFDP EYGGDDMTTV AERLLAAGME LQQLVQDLAA HRVEHPTGDL
TSSLVNANID GEKLTAQELG SFFILLVVAG NETTRNAISY GLRLLTQNPD QRALLLADLD
GRLPGAVEEI VRLASPVNFM RRKVTRDHEM NGHLYRKGQK VVLYYWAANR DESVFPDPLR
FDVTRDPNPH VGFGGPGPHF CLGAHLARRE ITVMFRELLR RIPQIEGGRP QRLYSSFING
IKHMDCAF
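
The gene and protein sequences above can be compared with a short script. This is a sketch only, not the method used to produce this record: it assumes the space-separated 10-character blocks are purely cosmetic, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence above.
print(translate("ATGATTGACG ATATGAGCGT"))  # MIDDMS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence if the assumptions hold, and `gc_fraction` on the same input can be compared with the listed GC content.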