Gene Sros_5331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5331 
Symbol 
ID8668625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5844620 
End bp5845831 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content71% 
IMG OID 
Productputative cytochrome P450 
Protein accessionYP_003340838 
Protein GI271966642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.463394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCA ACGAGCAGGT TCTGAGCTAC CCGATCCCCG CTGACGCGGC GCTGGAGCCA 
CCCGCCGAAT GGGCGGAGCT GCGCGGCAAG TGCCCGGTCG CCCACGTGAC CTTGCCCAGT
GGCGACCGGG CGACGCTGCT GACCCGCTAC GCGGACGTCA AGCAGGTGCT CGCCGACCCG
CGCTTCACCC GCCAGCTGAA CGCGCCCGAC GCGGCCAGGC TGTCGGCAGA GGGAGGCGGG
GTGTTCAACA GCGAGATAGC GACGATCATC CCGGACGGCG GCGAGGAGCA CCAGCACTGG
CGGCGCCTGG TCGGCAAGTG GTTCACCGCC AAGCGCATGA CAGCCCTGCG GCCCTCGATG
ACGGAGATCG CCGATCAGCT CATCGACGAC ATGGTCAAGC GCGGCCTGCC CGGTGACCTC
AGGGCCGGCC TGGGCTTCCC CCTGCCGGTG TACGTCATCT GCGACATGCT CGGCGTGCCC
GCCGAGGACC GCGACCGGTT CTCCTACTGG TCCGACACGC TGCTCAACCT CACCCGCTAC
GGCAAGGCCG AGATCGACGC CGCCCAGGGC GAGTTCTTCC AGTACATGTC CGACCACGTC
GCCGCCAAGC GGTCCGAGCC GGGGGAGGAC CTGCTGAGCG AGCTGATCGC GGCCGGCGGC
CCCGAGGACG GCGGGCTGGC CGACATCCAG ATCCTGGTCA CCGGCATGGC GTTGCTGGTC
GCCGGGCACG AGACCACCGC CAACATGATC GGCAAGATGG TCTCCATGCT GCTTGCCGAC
CGCAGCCGGT GGGAGGCGCT GCTCGCCGAC CCGTCGCTGA TCCGCACGGC GGTGGAGGAG
TCGCTGCGCC TGGACGCCAA CTCCGGCTTC GGCCTGCCGC GCTACCTGCG CGAGGAGACC
GAGGTCAGCG GCACGGTCCT GCCCCGGGGC ACCACCGTGG TGTGCAGCAT GGCCGCCGCC
AACCGCGACG AGAGCGCCTT CGAGGCCGCC GCCGAGATGG ACCTGAGTCG CAGCCCGAAC
CCGCACCTGT CCTTCGGCAG CGGCGCCCAC TCCTGTCTGG GCCAGGCACT GGCCCGCACC
GAGCTGCAGG TCGTGCTGGA AGTGCTGCTG CGCAAGCTTC CGACCCTGGA GCTGGCCGTA
CCGGTGGAGG AGCTGGAGCG GGTGGAGGGG CTGGCCGTCG GCGGCCTGCG CACGGTTCCG
GTCCGCTGGT GA
 
Protein sequence
MSLNEQVLSY PIPADAALEP PAEWAELRGK CPVAHVTLPS GDRATLLTRY ADVKQVLADP 
RFTRQLNAPD AARLSAEGGG VFNSEIATII PDGGEEHQHW RRLVGKWFTA KRMTALRPSM
TEIADQLIDD MVKRGLPGDL RAGLGFPLPV YVICDMLGVP AEDRDRFSYW SDTLLNLTRY
GKAEIDAAQG EFFQYMSDHV AAKRSEPGED LLSELIAAGG PEDGGLADIQ ILVTGMALLV
AGHETTANMI GKMVSMLLAD RSRWEALLAD PSLIRTAVEE SLRLDANSGF GLPRYLREET
EVSGTVLPRG TTVVCSMAAA NRDESAFEAA AEMDLSRSPN PHLSFGSGAH SCLGQALART
ELQVVLEVLL RKLPTLELAV PVEELERVEG LAVGGLRTVP VRW