Gene Sros_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4555 
Symbol 
ID8667849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5072392 
End bp5073576 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content68% 
IMG OID 
Productputative cytochrome P450 
Protein accessionYP_003340161 
Protein GI271965965 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.268858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.410272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA ACGTTCCGTT GCCGATCCGC AGGGACGCGG CCTGCCCATT CAGCCCCGAT 
CCGGAGATGG CCCGGCGGCG TGAAACCGAT CCGGTCTCGA CCTCCCAGTC GCTCCTGCCG
ACCGGCGACC TCATCGAAGT CCGGCTGGTC ACCGGGTACG GCGCCGCCCG CGAAGCGCTC
GCCTGCCCGC ACCTGAGCAG CCGCCCCGAC GCCAAGCTGA GGAAGTTGAT CGGCGAGCAG
GCCGGCTTCC TGGTGGCCAT GGACCCGCCC GACCACATCC GCGTCAGACG GCTGTTGACC
GGCGAGTTCA CCGTCCGGCG GATCAATGCC CTCCGCCCCC GCTTCGTCGA ACTTGTGGAC
GAAGCCCTGG ACCGGATGGA ACAGGCCGGG GGGCCGGTGG ACCTAATGAC CGCGTTCGCG
CTGCCGCTGC CGTCACTGAT GATCTGCGAG CTGCTCGGCG TCCCGTATGA GGAGCGCGCG
GACTTCCAAC GGCACAGCAA CACCATGCTC GACCTCAGCC TCACCATGGA GGAGCAGTTC
GCCAACGCCA TGGAGATGCA CACTTACATG GGCGATCTGG TGGCCGTCCA GCGCGAGAAT
CCCGGCGCGG ACATCCTCGG CATGCTGGTC CGGGAACACG GCGACGAACT CAGCGATGAC
GACCTGATCG GCATCGGCAA CCTCCTGCTG ATCGCCGGGC ATGAGACCAC CGCCAACATG
CTCGGCCTCG GCACCCTGCT CCTGCTGCGT CACCCCGACC AGCTCGTGCG AGTGCAGGAG
GAGCCCGAGG TGGTCAACGG CGCGATCGAG GAGATGCTGC GCTACCTGTC GATCGTCAAC
AACGGCGCCA TCCGCACCGC GACCGAGGAG TTCGCCCTGG CCGGCCAGGT GATCCACGAG
GGCGAGCGGG TGGCAGTCTC CCTGCCCTCG GCCAACCGAG ACCCGGCACT GATGGCCGAG
CCCGACACCT TCGACGTCAC CCGCCGCCCC AGCGCTCACG TCGCCTTCGG ACATGGCATC
CACCAATGCC TCGGCCAGCA ACTGGCACGC ATGGAGCTCC GCCTCGCCCT GCCGGCCCTC
CTGCGCCGAT TTCCCACGCT ACGGCTCGCC GTGCCACATG AGGAGCTGCG CTACCGCGAG
CTGGCACCCG TCAACGGCGT GCTCTCCCTC CCGGTGACCT GGTAA
 
Protein sequence
MPENVPLPIR RDAACPFSPD PEMARRRETD PVSTSQSLLP TGDLIEVRLV TGYGAAREAL 
ACPHLSSRPD AKLRKLIGEQ AGFLVAMDPP DHIRVRRLLT GEFTVRRINA LRPRFVELVD
EALDRMEQAG GPVDLMTAFA LPLPSLMICE LLGVPYEERA DFQRHSNTML DLSLTMEEQF
ANAMEMHTYM GDLVAVQREN PGADILGMLV REHGDELSDD DLIGIGNLLL IAGHETTANM
LGLGTLLLLR HPDQLVRVQE EPEVVNGAIE EMLRYLSIVN NGAIRTATEE FALAGQVIHE
GERVAVSLPS ANRDPALMAE PDTFDVTRRP SAHVAFGHGI HQCLGQQLAR MELRLALPAL
LRRFPTLRLA VPHEELRYRE LAPVNGVLSL PVTW