Gene Sare_3008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3008 
Symbol 
ID5707618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3416940 
End bp3418145 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID641272455 
Productcytochrome P450 
Protein accessionYP_001537823 
Protein GI159038570 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000739402 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGACA GCGTCGCGTT CCCGCAGGGT CGCGTCTGCC CCCACCAGCC CGCACCGGGC 
TACCGACCGT TGGCCGTACA ACGGCCACTC GCCCAGGTGA CCCTCTACGA TGGCCGGCGG
GTCTGGGCTG TCACCACTCG TGATCTGGCC CGCCGGCTCC TGGTCGACCC TCGCATTTCC
AGCGACCGCA CCAACCCCGC GTGGCCCGCG ATAGTGCCGA TCGTCGCCGC CGCCGTCAAC
GACGCCCAGC AGAAGGTCCT GAAGATCGCC ACCGCGTTGG TGGGCACCGA CGGCCCGGAG
CACAAAGCCC AGCGCAAGAT GCTGATCCCC AGCTTCACGT TCAGACGCAT GAACGCCCTG
CGTCCAATGA TCCAGGAGAT CGTCGACCAG CAGCTGGACG AGATGATCCA GAGTGGCGCT
CCCACGGACC TGATCCCGGC GTTCGCCTCG GCTGTGCCCG TGACGGTGTT GTACCGACTG
ATGGGCATAC CGGACGACGA CCACGGAATC TTCGAGAAAC TGTCACACCA GCTCCTCGCC
GGCCCGAACG CCAACGAGGC ATATGACCAG CTCATGGGCT ATATGAGCAG GCTGATCGCG
GAGCGGCGGC GCAACCCCGG TGAGGGGGTT CTCGACGACC TCCTGGCGCA GCACGGTGCC
AACGATGATG CGGACCACGA CGAGCTGGTC TCGACGCTGG TCGTGCAGGT GGCGGGGAAC
CACGGCACCA CCGGGAGCAT GATCGCGCTC GGCCTGTTCG CCCTGCTGCA ACACCCCGAG
CAGCTCGCCG AGCTGCGAGC CGATCCCTCG CTGATGCCCA CCGCTGTGGA CGAGCTGCTG
CGGTTCCTGT CCGTCCCTGA CGCTGTCACG CGATTGGCAG CGGACGACAT CGAGGTCGAG
GGAACCATCA TCCGCCAGGG CGACGGCGTG TTCTTCATAA CCTCTCTCAT CAACCGCGAC
ACCGACGTTC ACGACGCGCC GAACTCCCTG GGTTGGCATC ACGCCTCCGC CGCCGACCAC
CTGACATTCG GGTTCGGTGC CCACCAGTGC CTCGGCCAGA GCCTCGCGCG CATCACGATG
GAGATCGCCC TCGGCGCGCT GATAGATCGG CTTCCCAGCC TGCGTCTCGC GGTTCCGGCG
GAGGAGGTCC CGTTCCTTCC GGCTGCGAGC CTTCAGGTGA TTGCCGAACT TCCCATCACC
TGGTAG
 
Protein sequence
MTDSVAFPQG RVCPHQPAPG YRPLAVQRPL AQVTLYDGRR VWAVTTRDLA RRLLVDPRIS 
SDRTNPAWPA IVPIVAAAVN DAQQKVLKIA TALVGTDGPE HKAQRKMLIP SFTFRRMNAL
RPMIQEIVDQ QLDEMIQSGA PTDLIPAFAS AVPVTVLYRL MGIPDDDHGI FEKLSHQLLA
GPNANEAYDQ LMGYMSRLIA ERRRNPGEGV LDDLLAQHGA NDDADHDELV STLVVQVAGN
HGTTGSMIAL GLFALLQHPE QLAELRADPS LMPTAVDELL RFLSVPDAVT RLAADDIEVE
GTIIRQGDGV FFITSLINRD TDVHDAPNSL GWHHASAADH LTFGFGAHQC LGQSLARITM
EIALGALIDR LPSLRLAVPA EEVPFLPAAS LQVIAELPIT W