Gene Sare_1260 details

Gene Information

Locus tag: Sare_1260
Symbol:
ID: 5703488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1457669
End bp: 1458934
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 66%
IMG OID: 641270775
Product: cytochrome P450
Protein accession: YP_001536156
Protein GI: 159036903
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
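
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1457669
end_bp = 1458934
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the Gene Length field
print(remainder == 0)                            # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # matches the Protein Length field
```

Under these assumptions all three checks print True for this record.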


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000262244
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGACGATCG AGACGACGGA AACTCCGCCC GCCGACGACT CACTTCGGGC GCCGCTTCCT 
CGGCAGTTCA TGCAGCGCGA CGATCCGTCG AAGCTGCCGC CGGCGCTGGC GGCGTTGGCC
GAGCAGTCAC CGGTCGGCAG GTCGACGCTG CCCGACGGGG ATCCGTTCTG GATGGTGTCC
GGGTACGACG AGGCCCGCGC GGTCCTGTCG GATCCACGGT TTTCCTCCGA CCGGTTTCGT
TACCACCCCA GGTTCAAGAA ACTCTCCGGT CAGCTTGGCG AGCGGCTACG AAACGACAAG
GCCCGGGCCG GATCGTTCAT CAACATGGAT CCACCTGAGC ACACCCGCTA CCGTAAGTTA
CTCACCGGCC AGTTCACCGT ACGGAGAATG CGCCAACTCA CCGTCCGGAT CGAGCAGATC
GTCACCGAGC AGGTGGATGT GATGCTGGCG GAGGGAAACA GCGCCGACCT CGTTTCGGCG
TTCGCGGTTC CGGTGCCCTC GTTGATGATC TGCGAGCTGC TGGGGGTGCG CTACGAGGAT
CGTACGGAGT TCCAGCGCCG CGCGGCGGGC CTGCTGCAGA CGGATTTGCC GATCAAACAG
GCGGTGGAAA ACCTCGAAGC TCAGCGCGCG TTCATGCAGC GGCTGGTGAC GGACAAGCGG
AGGACTCCCG CGGACGACAT GATCTCCGGT CTGGTGCACC ACGCGGGTGC TGAACCCCCA
TTGACCGACG ACGAGCTGGT CGGCATCGCT ACCCTGTTGC TCTTCGCCGG CCTCGACACC
ACCGCGAGCA TGCTGGGGCT CGGCATGTTC ATGCTGTTGC AGCGGCCCGA GCAAATGGCT
GTGCTGCGCG ACGACCCGTC CCGGATCGGG GACGCCGTCG AGGAGTTGCT GCGCTACCTG
ACTGTCGTCA GCACCGGGCT CTTCCGGTTC GCCAAGGAGG ACGTGGTGCT CGGTGACGAG
CACATCCCGG CCGGGTCGAC AGTGGTGGTC TCCCTGATGG CCGCGAACCG CGACGGGCGG
CACTGGCCGG AGCCAGAGAC GCTGGACGTG ACCCGGGTGC GGAGCTCGCA CCTGGCGTTC
GGCCACGGCG TGCACCAGTG TCTCGGTCAG CAGTTGGCGC GGATCGAGTT GACGGTCGGC
ATCACCGAGC TGCTGCGTCG CCTGCCCAAC GTCCGGCTCG CCGTACCACC CGCAGACGTG
CCACTGCGCA ATGACATGAT CACTTATGGC GTGCACCGTC TGCCGATCCT GTGGGACACG
CCGTGA
 
Protein sequence
MTIETTETPP ADDSLRAPLP RQFMQRDDPS KLPPALAALA EQSPVGRSTL PDGDPFWMVS 
GYDEARAVLS DPRFSSDRFR YHPRFKKLSG QLGERLRNDK ARAGSFINMD PPEHTRYRKL
LTGQFTVRRM RQLTVRIEQI VTEQVDVMLA EGNSADLVSA FAVPVPSLMI CELLGVRYED
RTEFQRRAAG LLQTDLPIKQ AVENLEAQRA FMQRLVTDKR RTPADDMISG LVHHAGAEPP
LTDDELVGIA TLLLFAGLDT TASMLGLGMF MLLQRPEQMA VLRDDPSRIG DAVEELLRYL
TVVSTGLFRF AKEDVVLGDE HIPAGSTVVV SLMAANRDGR HWPEPETLDV TRVRSSHLAF
GHGVHQCLGQ QLARIELTVG ITELLRRLPN VRLAVPPADV PLRNDMITYG VHRLPILWDT
P
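
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not part of the original record. It assumes the standard amino acid assignments used by translation table 11 and treats ATG, GTG, TTG, CTG, ATT, ATC and ATA as initiation codons that are read as M in the first position. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first row of the gene sequence is embedded to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon such as TTG is still read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown in this record.
first_row = "TTGACGATCG AGACGACGGA AACTCCGCCC GCCGACGACT CACTTCGGGC GCCGCTTCCT"
print(translate(first_row))    # MTIETTETPPADDSLRAPLP
print(gc_fraction(first_row))  # GC fraction of this row only, not the whole gene
```

The translated row matches the first 20 residues of the protein sequence, including the leading M that comes from the TTG start codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.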