Gene Sare_4560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4560 
Symbol 
ID5705418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5156489 
End bp5157676 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content70% 
IMG OID641273972 
Productcytochrome P450 
Protein accessionYP_001539319 
Protein GI159040066 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.924133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0532868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACGG CACCGGCGTA CTCGCTGGAC GGGGGACGCT CGCTGCATCG GTGGCTGCGG 
GACATGCGAG AGCATCACCC GGTTCATCGG GAGCTGCTCA CTCGCGTGTG GACGCTTTAC
CGATACCGCG ACATCACCCA GGCCACCGCC GATCCGGCCG TCTTCTCGTC GGAGCTGTGG
CGGTACCTGC CCGGGGATAG GGGCGACGAC GCCCTGACCG CAGGCAACCT GACCGCAATG
GACCCGCCCC GGCACCGCCT CGTGCGGGAC CTGGTCAGCC GCTCGTTCAC GGCTCGCGCG
GTGGGCGCGC TGCGGCCCCG GATCGCCGCG ATCGCGGCCG AGCTGATCGG CGCCGTCGCC
GACCGCGGCG AGATGGACGT CGTCGCCGAC CTGTCCGACC CGCTGCCCGT CCTGGTCATC
GGGGAGCTGC TCGGCCTGCC GATGGCGGAC CGCGAGCTGT TGAGCGACTG GGCGCGGCGC
CTGCTCTCCT TCGACAAGGG CGACCTGACC GACGAGGTGG TCCGCAAGCG TGTCGCCGAC
ACTCAGCAGG AGCTGCTGGA CTATCTCCGG GTCCACTGCC GGCGTCGCCG GACGAATCCG
CAGGACGATC TGATCAGCCG GCTGATCCGG GCCGAGGTTG ACGGGCAGCG GCTCACCGAG
GACGAGGTGG TCAACTTCGC CAACCTCCTC CTGCTCGCCG GTCACGTGAC GACGACGCTG
CTGCTGGCGA ACATCGTCCT GACACTCGAC GAGCACCCCG CCGTGGCGGC GGAGGCACGC
GCCGACCGCG GGCTGATCCC GGGACTCATC GAGGAGACCC TGCGATACCG GCCGGTCATC
GTCAGCAACA TGCGGGTCAC CACCCGCGCG GTCACGGTGG GCACAGAGCA GCTACCGGCC
GGCCAGCTCG TGTCGCTGTC GTTCATCTCC GGCAACCGCG ACGAGCAGTA CTTCACCGAC
CCCGACCGGT TCGACATCCA CCGCGACGCC CGCAAGCACC TGGGGTTCGG CCATGGGATC
CACTACTGCC TGGGTGCGCC GCTGGCCCGC CTCGAACTGG GGATCGCCCT CGATGCGATG
TTCGACCGCT TCAGCCGGAT CGAGGTGACG GGCGTTCCCG TCGACTACTA CGACACGCCC
GGGGTCGCCG GTCCGCGTTC CCTTCGCATC GCCTTCCGTC ACCACTGA
 
Protein sequence
MGTAPAYSLD GGRSLHRWLR DMREHHPVHR ELLTRVWTLY RYRDITQATA DPAVFSSELW 
RYLPGDRGDD ALTAGNLTAM DPPRHRLVRD LVSRSFTARA VGALRPRIAA IAAELIGAVA
DRGEMDVVAD LSDPLPVLVI GELLGLPMAD RELLSDWARR LLSFDKGDLT DEVVRKRVAD
TQQELLDYLR VHCRRRRTNP QDDLISRLIR AEVDGQRLTE DEVVNFANLL LLAGHVTTTL
LLANIVLTLD EHPAVAAEAR ADRGLIPGLI EETLRYRPVI VSNMRVTTRA VTVGTEQLPA
GQLVSLSFIS GNRDEQYFTD PDRFDIHRDA RKHLGFGHGI HYCLGAPLAR LELGIALDAM
FDRFSRIEVT GVPVDYYDTP GVAGPRSLRI AFRHH