Gene Sare_2811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2811 
Symbol 
ID5707003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3193200 
End bp3194426 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID641272267 
Productcytochrome P450 
Protein accessionYP_001537637 
Protein GI159038384 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000999119 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGAGC CGCGTATTCC GGCGGGATTC GACTTCACCG ATCCCGAGGT CCTGGCGCAC 
CGAGTGCCCC GGGAGGAGTT CGCCGAGCTT CGCCGGACCG CTCCGGTCTG GTGGAACGCC
CAACCGAGGG GCTCGGCCGG CTTCGACGAC GACGGGTACT GGGTGGTGAC CCGCTACGCC
GACGTGATGA CGGTGTCCCG GGACAGCGAC ACGTACTCGA CGCGGGAGAA CACCGCGATA
GCCCGGCTCC GGCCGGACAC CACCCGCGAG GACATCGAGA TGCAGCGGGT CATCATGCTC
AACGTCGACC CGCCGGAGCA CACCAAGCTG CGTGCCATCG TGTCCCGTGG CTTCACGCCC
AGAGCAATCA ACGCACTACG CGGATCATTG GCGGAGCGGG CCGAGCACAT CGTGCGTGAC
GCAGCCGTGC GTGGTGTCGG TGACTTTGTC ACCGACGTCG CTTGCGAGTT GCCACTGCAG
GCGATCGCGG AACTGATCGG GGTTCCGCAA CACCACCGAC GCAAGGTCTT CGACTGGTCG
AACCAGCTGA TCGGATACGA CGATCCCGCC TACGGAACGG ATCCACTGAC CGCCTCGGCC
GAGCTACTCG CGTACGCCAT GGAGATGGCG GAGGAGCGGC AGCGCAGTCC GAGCGACGAT
CTGGTGACCA AACTGGTCAA TGCGCAGATC GACGGTGAGC ACCTGACGAC CGACGAGTTC
GGCTTCTTCG TGATGCTCCT GGCGGTCGCG GGCAACGAGA CGACCCGGAA CGCGATCACC
CATGGCATGG TGGCCTTCCT CGACAACCCA GAACAGTGGG AGCTGTTCAA GGCCGAGCGC
CCCAAGAGTG CGGTCGAGGA GATCATCCGC TGGGCCACCC CGGTGAACGT GTTCCAACGT
ACCGCGTTGG TCGACACCGT GCTGGGGGGA CAGGCCATCT CCGCCGGCCA ACGGGTAGCG
CTCTTCTACG GTTCGGCGAA CTTCGACGAG GCGGTGTTCG AGGATCCCGA ACGGTTCGAC
ATCACCCGTA GCCCCAACCC GCACCTCGGG TTCGGCGGCA GCGGCGCGCA CTTCTGCCTG
GGCGCGAACC TCGCCCGCCT GGAGATTGAG CTGATCTTCA ACTCGATCGC CGACCACCTG
CCGGACATCC GCAAGGTGGC CGCACCGCAG CGGCTGCGGT CGGGCTGGAT CAACGGCATC
CGGCAGATGC CGGTGCGGTA CCGCTGA
 
Protein sequence
MTEPRIPAGF DFTDPEVLAH RVPREEFAEL RRTAPVWWNA QPRGSAGFDD DGYWVVTRYA 
DVMTVSRDSD TYSTRENTAI ARLRPDTTRE DIEMQRVIML NVDPPEHTKL RAIVSRGFTP
RAINALRGSL AERAEHIVRD AAVRGVGDFV TDVACELPLQ AIAELIGVPQ HHRRKVFDWS
NQLIGYDDPA YGTDPLTASA ELLAYAMEMA EERQRSPSDD LVTKLVNAQI DGEHLTTDEF
GFFVMLLAVA GNETTRNAIT HGMVAFLDNP EQWELFKAER PKSAVEEIIR WATPVNVFQR
TALVDTVLGG QAISAGQRVA LFYGSANFDE AVFEDPERFD ITRSPNPHLG FGGSGAHFCL
GANLARLEIE LIFNSIADHL PDIRKVAAPQ RLRSGWINGI RQMPVRYR