Gene Sare_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2331 
Symbol 
ID5704255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2683326 
End bp2684579 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content67% 
IMG OID641271809 
Productcytochrome P450 
Protein accessionYP_001537180 
Protein GI159037927 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.24175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCCG CGACGTTGCC GCGGTTCGCC CTGACCGGCT GGAGCAGGGA GAACATCGTC 
AATCCCTACC CGGTGTACCA GCGCTACCGG GAGGTCGCTT CGGTGCACCG GGGCGAACCA
GGCGGCGACG CCCCGGACAC CTTCTACGTG TTCTCCTACG ACGAGGTTGT CCAGGTGCTG
TCGAGTAACT GTTTCGGTAG GGGGAGGTCC CTCGACGCCG CGAAGGCATC GGTCCCGGTA
CCGGCCGAGC AGAAGGCCCT CCGGGCAATC GTCGAGAACT GGCTGGTGTT CATGGACCCG
CCACGCCACA CCGAACTACG CTCGCTACTC AACCGGAGCT TTTCTCCCCG GATCGTGACC
GAACTGCGGC CCCGCATCGC GCGGATCGCA CAGGAACTCC TGTCCCGGCT CGGCCAGCAG
GTGGACGTCG ATCTCGTCGA GAGCTTCGCC GCGCCGTTGC CCATCCTTGT CATCTCCGAG
CTGCTGGGGA TTCCGGAGGA GCGTCGCGCA TGGTTGCGTG CCAACGCGTT GGCGCTGCAG
GAAGCCAGTT CCTCCCGTGC GGGTCGGGAC GTGGATGGCT ACGCACAGGC CGAAGTGGCC
GCGCAGGAGT TCACCGAGTA CTTCCGGGAG CAGGTGCGGC TACGGCGGGG TCGTGCCGGT
GGAGATCTGA TCACGATCCT CGCCAACGCA CAGGAGCGCG GTGCACCAGT GAGTCTGGAC
GCGATCGTGG GGACCTGCGT CCACCTCCTG ACCGCCGGGC ACGAGACCAC CACCAATTCG
CTGGCGAAGG CCGTGCTTGC GCTGCGGGAA CATCCGGCGG TACTCGACGA GCTCCGCGGC
GCCGAGGGGC TGACGACGGA TGCGGTCGAG GAGTTCCTGC GCTATGACCC GCCCGTGCAG
GCCGTGACCC GATGGGCGCA CCAGGACACC ACCCTCGGCG GGTGTGACAT ACCACGCGGC
AGCCGAGTGG TCGCGCTGCT GGGCTCGGCG AATCGGGATC CGGCACGCTT CCCGTCACCT
GATGTCCTGG ACGTACGTCG CCCTGCCGAC CGGCACCTCA GTTTCGGTCT GGGTATCCAC
TACTGCCTCG GCGCGACGCT GGCCCGCGCC GAGCTCGAGA TCGGGCTCCA GGCACTGCTG
GACGGTGTTC CCACGCTGGG CTACGGCACC CAGCACGTCG ACTACGCCGA CGACCTGGTC
TTCCACGGAC CGAGCCGGCT GGTACTCGTC AACCTAGGAG AAAGGTGTAA ATAA
 
Protein sequence
MPSATLPRFA LTGWSRENIV NPYPVYQRYR EVASVHRGEP GGDAPDTFYV FSYDEVVQVL 
SSNCFGRGRS LDAAKASVPV PAEQKALRAI VENWLVFMDP PRHTELRSLL NRSFSPRIVT
ELRPRIARIA QELLSRLGQQ VDVDLVESFA APLPILVISE LLGIPEERRA WLRANALALQ
EASSSRAGRD VDGYAQAEVA AQEFTEYFRE QVRLRRGRAG GDLITILANA QERGAPVSLD
AIVGTCVHLL TAGHETTTNS LAKAVLALRE HPAVLDELRG AEGLTTDAVE EFLRYDPPVQ
AVTRWAHQDT TLGGCDIPRG SRVVALLGSA NRDPARFPSP DVLDVRRPAD RHLSFGLGIH
YCLGATLARA ELEIGLQALL DGVPTLGYGT QHVDYADDLV FHGPSRLVLV NLGERCK