Gene Sare_3162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3162 
Symbol 
ID5706111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3650185 
End bp3651399 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content66% 
IMG OID641272594 
Productcytochrome P450 
Protein accessionYP_001537961 
Protein GI159038708 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.411247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000117397 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCGAGTC TTCCGCTGCC CACATATCCG AAGTTGCGTG ACCCGGCAGA CCCGTTGCTT 
CCGCCGGCCG AATACCTCGC GATCCAGTCG GAGAAGCCGA TTGCGAAGGT GCTTCTGCCA
TCAGGCCGGC CGACCTGGCT GATCACCGGC CACGCGCTCG CCCGTCAGGT CCTCACCGAG
CCCTGTGTCA GCGTCGACCG CAGGCACCCG AACTTTCCCT ACCCGGTACC GAACCCCGAT
GCGGTCGTCG CGCAGGTGGC TCGGTGGACA TACATCTTGT TGGGCGACGA CCCGCCCCTG
CACACCGAAC GGCGGCGTCT GTTGATCAGC GAGTTCACCG TGCGGCAGGC GCAGGCAATG
CGGCCCCGCA TTCAGCAGCT CGTCGACTTC CACCTCGAAC AGCTCATCGC GGCAGGCCCG
GGAGCGGACT TCAGCAAGCA CTTTGCGATG AAGGTTCCCT CAGCGGTCAT CTGCGAGATG
CTCGGCGTAC CGTTTGCCGA CCACGACTAC TTCCAGGAAC GAACCGCGCT TCAGCTACGG
CGGGACGTGC CCGTCGCCGC CCAGAAGCAG GCGATCGACG AACTGCTGGC CTACTTCGAG
CAGCTCATCC AGGAAAAGAG CAGCCACCCC GGCGACGACG TGCTGAGCCG TCTGATCGTC
AGCAACCGCG AGACAGAGGC GTTCGACCAC GAGGCGCTGG TCGCTCTCGG ACTGCTACTG
CTGGTGGGGG GGCACGAGAC GACCGCCAAC ACCCTCACGC TCGCCACCGC GACCATGCTG
GAACGGCCGG AAATCGCCGA GCAGCTACGG ACCGACCCGT CGCTGATGCC CTCGGCGGTG
GAGGAGTTTC TGCGTTACTT CAGCGTCGCC GTCGCCGTGT CCCGGATCGC GACGGCCGAC
CTGCAGGTCG GCGGCCAGTT GGTCCGTGCG GGTGAGAGCA TGCTGTTGGT GCTCAACACC
ATCGCCCGCG ACGGCACGGT TTTCCCCGAG CCGCACAGGT TGGACATTCG TCGCAACGCC
CGCAACCACC TGGCCTTCAG CCACGGCATT CACCAGTGCA TGGGGCAGAA TCTGGCACGG
GTCGAGATGC AGATCGCCCT TGACACCGTG CTGCGGCGGC TGCCTGGGCT TCACCTGGTC
GCCCCGTTCG AGGAGTTGCC GTTCAAATAC CGGCACCTGG TGTGGGGCAT CGAGGAACTC
AGGGTGGCGT GGTGA
 
Protein sequence
MSSLPLPTYP KLRDPADPLL PPAEYLAIQS EKPIAKVLLP SGRPTWLITG HALARQVLTE 
PCVSVDRRHP NFPYPVPNPD AVVAQVARWT YILLGDDPPL HTERRRLLIS EFTVRQAQAM
RPRIQQLVDF HLEQLIAAGP GADFSKHFAM KVPSAVICEM LGVPFADHDY FQERTALQLR
RDVPVAAQKQ AIDELLAYFE QLIQEKSSHP GDDVLSRLIV SNRETEAFDH EALVALGLLL
LVGGHETTAN TLTLATATML ERPEIAEQLR TDPSLMPSAV EEFLRYFSVA VAVSRIATAD
LQVGGQLVRA GESMLLVLNT IARDGTVFPE PHRLDIRRNA RNHLAFSHGI HQCMGQNLAR
VEMQIALDTV LRRLPGLHLV APFEELPFKY RHLVWGIEEL RVAW