Gene Sare_2538 details


Gene Information

Locus tag: Sare_2538
Symbol:
ID: 5706860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2891511
End bp: 2892737
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 65%
IMG OID: 641272001
Product: cytochrome P450
Protein accession: YP_001537371
Protein GI: 159038118
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
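
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2891511
end_bp = 2892737
reported_gene_length = 1227     # bp
reported_protein_length = 408   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count, leftover_bases = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the reported gene length (1227 bp) and protein length (408 aa).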


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.107795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000443731
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGAGA CTGCCTCCAG CCGGCTCACC GACACCGAGT TTCCGGTCCA GCGCGAATGC 
CCATTCGCCG AGCCCGTCGA GTACGAGCAG ATCAGGGAAC AATCGTCGAT CGCCATGGTC
CGCCTGACGG GTGGTGGTGA GGCGTGGTGG ATCTCCGGAC ACGAGCAGGG GCGCGCCGTC
CTGGCCGACC GACGGTTCTC CTCCGACCGC CGTAAGGCCA ACTTCCCGTT CGTCAGCACC
GATCCGGCGA TAAGGAAACG GTTACACGCC CAGCCCCTGT CGCTGATCAG CATGGACGGC
GCCGAGCACA CCCAGGCACG GCGGGCCCTC ATCGGCGAGT TCACCGTCCG GCGCCTGGCC
GCGCTGCGAC CGCGGATCCA GCAAATCGTC GACCAGTGCA TCGACGAGAT GCTGACCACC
GACCAGCACC GCGCCGATCT GGTCAAAACG CTGTCGCTGC CAGTGCCATC GCTGGTCATC
TGTGAGTTGC TCGGCGTCCC CTATGCTGAC CACGACTTCT TCCAGGAACA CACCGCCACC
TTGGTCCGCC GCAACACCGC ATCGGAGGTT CGACAACACA GCATCGACGA GCTGAACGCA
TACCTCGGCG CGCTGATCGA CCGCAAGCTC GCCAGCCCCG ACGACGACCT GCTCGGTCGG
CAGATCGCCA GACAACACCG GGACGGCACC TTCGATCGAT CGAGCATGGT CAGTCTGGCC
TTCCTACTGC TCGTCGCCGG TCACGAAACC ACGGCGAACA TGATCTCCCT GGGCGTTGTC
GGGCTGCTAC AGCATCCCGA GCAGTTGGCC ATGATCAAGG ACGACCCGGA CAAGACGCCG
CTGGCGATCG AGGAACTGCT GCGCTTCTTC ACCATCGTCG ACAGTGTCAC CTCCCGCGTG
GCCACCGAGG ACGTACGGTT CGGCGACACC ACCATCAACG CGGGCGACGG AGTGGTCGTC
TCCGGACTGT CCGCCGACTG GGATCCCACG GTCTTCGCAG ACCCGGACCG ACTCGACCTC
GAACGCGGCG CCCGCCACCA CCTTGCTTTC GGCTTCGGTC CGCACCAGTG CCTCGGCCAG
AACCTGGCCC GCCTCGAGCT GCAGATTGTG TTCGACACAC TGTTCCACCG CATTCCCACC
CTCCGCCTGG CCGCACCGCT CGACAAGATC CCGTTCAAGA CGGACGCGGC CATCTACGGC
GCCCGGGAAC TCCCGGTCGC CTGGTGA
 
Protein sequence
MTETASSRLT DTEFPVQREC PFAEPVEYEQ IREQSSIAMV RLTGGGEAWW ISGHEQGRAV 
LADRRFSSDR RKANFPFVST DPAIRKRLHA QPLSLISMDG AEHTQARRAL IGEFTVRRLA
ALRPRIQQIV DQCIDEMLTT DQHRADLVKT LSLPVPSLVI CELLGVPYAD HDFFQEHTAT
LVRRNTASEV RQHSIDELNA YLGALIDRKL ASPDDDLLGR QIARQHRDGT FDRSSMVSLA
FLLLVAGHET TANMISLGVV GLLQHPEQLA MIKDDPDKTP LAIEELLRFF TIVDSVTSRV
ATEDVRFGDT TINAGDGVVV SGLSADWDPT VFADPDRLDL ERGARHHLAF GFGPHQCLGQ
NLARLELQIV FDTLFHRIPT LRLAAPLDKI PFKTDAAIYG ARELPVAW
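
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in the start codons they allow). The helper names clean, translate, and gc_fraction are hypothetical, and only the first and last blocks of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First 30 bases and last 27 bases of the gene sequence in this record.
gene_head = "ATGACCGAGA CTGCCTCCAG CCGGCTCACC"
gene_tail = "GCCCGGGAAC TCCCGGTCGC CTGGTGA"

print(translate(gene_head))  # compare with the start of the protein sequence
print(translate(gene_tail))  # compare with the end of the protein sequence
```

Passing the full gene sequence to translate and gc_fraction allows a comparison with the complete protein sequence and the reported GC content.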