Gene Sare_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3149 
Symbol 
ID5706207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3586860 
End bp3588086 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content70% 
IMG OID641272581 
Productcytochrome P450 
Protein accessionYP_001537948 
Protein GI159038695 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.58668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0958953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATCG GTCAAACCCT GCCGGACCTG GTCTACAGCC CGGAGTTCAC CCGTGACCCG 
TACGCGATCT TCGCCCGGCT GCGCGAGCAG GCCCCGGTCT GCCGCGTGAC GACCCACCGT
GGGATGAGCG CCTGGATGGT GACCCGTCAC GCCGACGTGC GGGCGCTGCT CGCGGACAAC
CGGTTGGCAA AGGACGGCAA CCGAATCGGC GAGTTGATGC CCCGGCACAG CACGCTCACG
GGTGCGGCCA CCGGGTTCCC GCCCGGACTG ACGACCAACA TGGTCAACAG TGACCCGCCC
GACCACACCC GGCTGCGGCA CCTGGTCGGC CGCGAGTTCA CCGGGCACCG CGTCGAGGGC
CTGCGCCCGC GGATCGAGGA GATCGTTGAC GACCTGCTCG ACGGCGTCGC CGCCTGCGGG
GACGAGGCTG ACCTGGCGGA GACCCTCGCA CGGCGCCTGC CGATCGCGGT GATCGGCGAA
CTGCTCGGCG TGCCCGAAGC CGACCGCGCG GAGTTCTTCC GCTGGGCCGA CACCCTGTAC
GGCGGCACTG CGTCACCGGA AGCGCTGGGC CAGGCGTACA ACGCGATCGT CGACTACCTC
GGCCGGCTCT GCGACGCCAA ACGTGACGTG CCCGCCGACG ACCTGCTCAC CGCGCTGGTG
CAGGTCAGCG CCGACGAGGA CCGGCTGTCA CGCGAGGAAC TCGTGTCGAT GGCCCTGCTG
CTGTTGGTGG CCGGGCACGA GACGACCAGC AAGCAGATCA GCAACGGGGT GCTGGCCCTG
CTGCTCAACC CGGAGCAGCT GAAGCTGCTG AAGGCGCAAC CCGCGCGAAC CGCCGGTGCG
GTCGAGGAAC TGCTGCGGTT CGAGGGCCCG AGCCTCTCGG CCAGCCTGCG CTTCACCACC
GAGCCGGTGG AGGTAGCCGG TGTGGTCATC CCCGAGGGGG AGTTCGTCCT GCTGTCGCTG
GCGTCGGGCA ACCGTGACCC GGAGAAGTTC CCCGACCCCG ACCGGCTCGA CATCACCCGC
TCCACCCAGG GCAATCTGGC AATGGGACAC GGCATCCACC ACTGTGTCGG CGCTGCCCTC
GCCCGCCTCG AACTGGAGAT CGTCCTCAGC CGTCTGGTGG CGCGGTTCCC GCAGATGCAA
CTGGCCGTCG AGGCGGATGA CCTTGAGTGG CTGGTGAATT CCTTCTTTCG CGCGCCCCTG
CACCTGCCGG TGTCACTCCG GCGGTGA
 
Protein sequence
MTIGQTLPDL VYSPEFTRDP YAIFARLREQ APVCRVTTHR GMSAWMVTRH ADVRALLADN 
RLAKDGNRIG ELMPRHSTLT GAATGFPPGL TTNMVNSDPP DHTRLRHLVG REFTGHRVEG
LRPRIEEIVD DLLDGVAACG DEADLAETLA RRLPIAVIGE LLGVPEADRA EFFRWADTLY
GGTASPEALG QAYNAIVDYL GRLCDAKRDV PADDLLTALV QVSADEDRLS REELVSMALL
LLVAGHETTS KQISNGVLAL LLNPEQLKLL KAQPARTAGA VEELLRFEGP SLSASLRFTT
EPVEVAGVVI PEGEFVLLSL ASGNRDPEKF PDPDRLDITR STQGNLAMGH GIHHCVGAAL
ARLELEIVLS RLVARFPQMQ LAVEADDLEW LVNSFFRAPL HLPVSLRR