Gene Sare_4949 details

Gene Information

Locus tag: Sare_4949
Symbol:
ID: 5706499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5617189
End bp: 5618427
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 67%
IMG OID: 641274344
Product: cytochrome P450
Protein accession: YP_001539686
Protein GI: 159040433
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
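
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(5617189, 5618427)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.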


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAACAGA GCTGTCCTTA CAAGTTGGAT GTGACCGGTC GAGACGTCCA CGCTGAGGGC 
GAGGCGATCC GCGCTCGGGG GCCGGTCGCC CAGGTCGAAC TGCCGGGCGG CGTGCAGGGA
TGGTCAGTAA CGGGCTACCA GGCCGCCCGT CAGGTGTTGG CTGACCCCCG CTTCGCGAAG
GACCCGAAGA AGTGGCCGGC GTACACCTCC GGCGCGATTC CCCCGAACTG GCCGCTCATC
GGGTGGCTAC TGATGGACAA CATGACGACC AACGACGGTG CGGACCATCA GCGTCTGCGC
AAGCTGGTGT CGCACGGCTT CACGCCACGG CAGGTGGAGC GTACCCGTCC ACTGATTGTG
AAGATCGTGA ATGACCTGCT GGACGGGTTG TCCTCGGCTG GGCCGGACGA GGTCGTCGAC
CTGAAGGGCC GGTTCGCCAC CCCGCTGCCA GCCCGCGTCA TCTGCGACAT GTTTGGTGTG
CCAGAGGCGC TGCGGGCTTC CGTGCTGCGT GGTGCCCAGG TCAACGTCAC CTCGTCGATC
AGCGGCGAGG AGGCCGAGGC GAACGTCGAG CAGTGGCACC GTGAGCTGTT GGAGCTGGTC
GAAGCCAAGC GCGAGAAACC AGACGAGGAC ATGGCCAGTC TGCTGATCGC GGCCAAGGAG
GAGGACGGCA GCACGCTCAC CCAGGAGGAG GTCGTCGGCA CGCTGCACCT GATGCTCGGC
GCCGGCTCGG AAACCCTGAT GAACGCGCTC AGCTACGCCG TGCTCGGCAT GCTGAGCAAT
CCGGGCCAGT ACGAGATGGT CCGCAACGGG ACCTCGTCGT GGGACGACGT CATCGAGGAG
ACGCTGCGGG CCCAGGCGCC GGTGGCCCAA CTCCCGCTGC GCTACGCCAC CGAGGACGTC
GCCGTCGGCG GTGCTGTCAT CAAGGCGGGT GACCCGGTCC TGATGGGCTT CACCGCCATC
GGTCGGGATC CGGCGGTGCA CGGTGAGACG GCCGGCGACT ATGACATCAC TCGAGAGGAC
AAGACGCATC TGTCCTTCGG GCACGGAGTG CACTTCTGCC TTGGCGCGCC GCTCGCCCGC
CTCGAGCTGA AGATCGCCCT ACCCGCGCTG TTCGAACGCT TTCCCCACAT GACGCTCGCG
GTCCGTCCCG ACCAGTTGGA GCCGCAGGGC ACCTTCATCA TGAACGGCCA CCGTGAGTTG
CCCGTCCGGC TCGGTCAACC AGCCACCGTC CTCGCCTGA
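
A GC content figure such as the one in the Gene Information table can be recomputed from the gene sequence. The sketch below is a minimal illustration, assuming GC content means the fraction of G and C bases over the whole sequence; the function name `gc_content` is hypothetical. It is applied here only to the first 60-base row, so its result is not expected to equal the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence only; pass all rows for the whole-gene value.
first_row = "GTGGAACAGA GCTGTCCTTA CAAGTTGGAT GTGACCGGTC GAGACGTCCA CGCTGAGGGC"
print(f"{gc_content(first_row):.1%}")
```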
 
Protein sequence
MEQSCPYKLD VTGRDVHAEG EAIRARGPVA QVELPGGVQG WSVTGYQAAR QVLADPRFAK 
DPKKWPAYTS GAIPPNWPLI GWLLMDNMTT NDGADHQRLR KLVSHGFTPR QVERTRPLIV
KIVNDLLDGL SSAGPDEVVD LKGRFATPLP ARVICDMFGV PEALRASVLR GAQVNVTSSI
SGEEAEANVE QWHRELLELV EAKREKPDED MASLLIAAKE EDGSTLTQEE VVGTLHLMLG
AGSETLMNAL SYAVLGMLSN PGQYEMVRNG TSSWDDVIEE TLRAQAPVAQ LPLRYATEDV
AVGGAVIKAG DPVLMGFTAI GRDPAVHGET AGDYDITRED KTHLSFGHGV HFCLGAPLAR
LELKIALPAL FERFPHMTLA VRPDQLEPQG TFIMNGHREL PVRLGQPATV LA
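
The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG is one of the permitted start codons and is read as methionine at the initiation position. The following sketch shows that relationship on the first 30 bases. It uses only the Python standard library; the codon table is built from the standard NCBI amino acid ordering, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codons in NCBI order: each position cycles T, C, A, G, last position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence that begins at its start codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    # The initiation codon is read as Met regardless of its usual meaning.
    protein = ["M"] + [CODON_TABLE[codon] for codon in codons[1:]]
    if protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


print(translate_cds("GTGGAACAGA GCTGTCCTTA CAAGTTGGAT"))  # MEQSCPYKLD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in place of the 30-base excerpt would translate the whole coding region.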