Gene Sare_4553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4553 
Symbol 
ID5705815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5147696 
End bp5148910 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID641273965 
Productcytochrome P450 
Protein accessionYP_001539312 
Protein GI159040059 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.174977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00837132 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTCATG GCGAGCCTAC CCTCACGTTG TACGCCAACG AACTCTGCGA TCCGGAGCTG 
TACCGGCAAG GAAATCCGGA AGACCTGTGG CGGCGGATGC ACGCGGCGGC GCCCGTTCAC
GAGGGCGCTT TCGAGGGCCG GCGGTTCCAC GCCGTCATCT CGCACGCGCT GATCTCGCGG
ATGCTGAAGG ACCCGAAGGG GTTCTCCTCC GAGCGGGGGA TGCGGCTCGA CCAGAACCCG
GCCGCCACCT CGCTCGCGGC CGGCAAGATG CTGATCATCA CAGACCCGCC GCGGCACGGC
AAGATCCGGC GCATCGTCAA CTCGGTCTTC ACACCGCGCA TGGTGGCCCG CCTCGAAGAG
AACATGCGGG TCACAGCCGC CGGCATCGTT GACCAGGCGA TCGAGGAAGG CGAGTGCGAC
TTCACCGACG TGGCCGCGCG GCTGCCGCTC TCGGCCATCT GCGACATGCT CGGTGTGCCG
CCGGAGGACT GGGACTTCAT GCTGGACCGG ACCATGGTGG CGTTCGGGTC GGGCGAGGCC
GACGAGCTCG CGATGGCCGA GGCGCACGCC GACATCCTGT CGTACTACGA GGACTTGATC
CGCCGTCGTC GGCGCGAACC ACGCGAGGAC GTGGTGACCG CGCTGGTCAA CGGCGTCGTC
GACGGCACCA AGCTGACCGA TGAGGAGATC TTCCTCAACT GCGACGGCCT GATCTCCGGC
GGCAACGAGA CGACTCGGCA CGCCACGATC GGCGGGTTCC TGGCGCTGCT CGACAACCCG
GAACAGTGGG AGACGTTGCG CGACGATCCC GGGCTATTGC CAGGCGCGGT GCAGGAGATC
CTCCGATACA CCAGCCCGGC GATGCACGTT CTGCGGACCG CGGTCGCGCC GACCCGGATC
GGGGAGTACG CGCTGAACCC GGGGGATCCG GTCGCACTGT GGCTGTCGGC CGGCAACCGC
GACCCCCAGG TGTTCGCCGA TCCCGATCGC TTCGACATCA CCCGGAGCCC CAACCCGCAC
CTCACCTTCT CGACCGGCGC GCACTACTGC CTCGGGTCCG CACTGGCCAC GTCGGAGCTC
ACGGTGCTCT TCGACCGGCT GCTGCGGCGG GTGGACAGCG CCGAACTCAC CGGGCCACCC
CGGCGTACGC GATCGATCCT GATCTGGGGT TACGACTCGG TACCCGTGCG GCTGACGGCC
GGATCGGAGC GATGA
 
Protein sequence
MPHGEPTLTL YANELCDPEL YRQGNPEDLW RRMHAAAPVH EGAFEGRRFH AVISHALISR 
MLKDPKGFSS ERGMRLDQNP AATSLAAGKM LIITDPPRHG KIRRIVNSVF TPRMVARLEE
NMRVTAAGIV DQAIEEGECD FTDVAARLPL SAICDMLGVP PEDWDFMLDR TMVAFGSGEA
DELAMAEAHA DILSYYEDLI RRRRREPRED VVTALVNGVV DGTKLTDEEI FLNCDGLISG
GNETTRHATI GGFLALLDNP EQWETLRDDP GLLPGAVQEI LRYTSPAMHV LRTAVAPTRI
GEYALNPGDP VALWLSAGNR DPQVFADPDR FDITRSPNPH LTFSTGAHYC LGSALATSEL
TVLFDRLLRR VDSAELTGPP RRTRSILIWG YDSVPVRLTA GSER