Gene Sare_2105 details

Gene Information

Locus tag: Sare_2105
Symbol:
ID: 5704719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2426466
End bp: 2427689
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 67%
IMG OID: 641271590
Product: cytochrome P450
Protein accession: YP_001536961
Protein GI: 159037708
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
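
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length = gene_length_bp(2426466, 2427689)
print(length, protein_length_aa(length))  # prints: 1224 407
```

Both results agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields in the record.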


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.34341
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0115341
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAAT CAATGCCGGT TCAGGACCTG CCGGCGTTCC CGATCCCCCG GGAGTGCCCG 
TACCGGCCCT CGGCGCAGCA CGTGTCACTG CGATCCGGCG GCCCGATGGC GAAAGTGCGG
CTCTACAACG GCCGCACCGC GTGGCTGGTG ACCGACTCCG CGCATGCGCG CGCGGTCCTG
TCTGACTATC GCCGTGTGTC GATCAAGCCC TACCACGGCA ACTACCCACT CCTGAACGAG
GAGTTCGAGA AGGTCGTCGA CAGCGGGTAC GCGGACGTGT TGTTCGGCGT CGACCCGCCC
GAGCACACCC GCCAACGACA GATGATCATG CCGAGCTTCA CGTTGCGGCG AACGGCGGTG
CTCCGCCCGG ACATCCAGCG CATCGTTGAC GAAAAGCTCG ACGAGATGAT GCGCCACGGC
GCCCCCGGCG ACCTGGTCAC CGAATTCGCC CAGCCCGTGC CGTCGATGGT GATGAGTTTC
CTGCTCGGCG TTCCGTGGGA GGACCACGAG GAGTTCGAGA CCCCGGCGCA CAAGCTGTTC
GTCCCGGAAC TCGCCGAGGA GGCAACCACC GAACTCGGCG CATACCTCGA ACGGCTGATC
CAGAAGAAGG AACAGCCTGG TGGAACCCCC GGCGGGACCG GCCTGCTCGA CGACCTGATC
CGGGATCACC TGCGGGCCGG CGCGCTGAGC CGGGACGAAC TCGTCCACAT CGCGATGGCG
ATGCTGGTCG CCGGCACCGA CACGACCACC AATGTGATCT CCCTCGGCAC GCTCGCGCTG
CTGGACAACC CGGACCAGTG GGCGGCCCTG CGCGACAACC CGGACGAGCT GATCCCCGGC
GCGGTCGAGG AGATCCTGCG GTACACATCA CTGATCGAGG CGTTCGCCCG CGTCGCGGTG
TCGGACATCG AGTTGAACGG TGCTGTCATC AAGGAGGGCG AGGGCATCCT GATCAGCTCC
GCGGGCGTCA ACTTCGACCC GGCGCTGGCA CCGGACCCGG GCCGGTTCGA CATCCGCCGC
CCACCCCGCC CAAGCTTCTC GTTCAGCCAC GGCATCCACC GCTGCCCAGG CGACAACCTG
GCCCGCCTCG AACTCGAGAT TGCGTTTCGG AGCCTGGTCA CCCGCATGCC GAACCTCCGC
ACCGCCAAGC CGATCGACCA GATTCCCAGC AACAACAACG ACGGGACGTT GCAGCGGCTG
TACGAGCTCC CGGTTGTCTG GTAG
 
Protein sequence
MTKSMPVQDL PAFPIPRECP YRPSAQHVSL RSGGPMAKVR LYNGRTAWLV TDSAHARAVL 
SDYRRVSIKP YHGNYPLLNE EFEKVVDSGY ADVLFGVDPP EHTRQRQMIM PSFTLRRTAV
LRPDIQRIVD EKLDEMMRHG APGDLVTEFA QPVPSMVMSF LLGVPWEDHE EFETPAHKLF
VPELAEEATT ELGAYLERLI QKKEQPGGTP GGTGLLDDLI RDHLRAGALS RDELVHIAMA
MLVAGTDTTT NVISLGTLAL LDNPDQWAAL RDNPDELIPG AVEEILRYTS LIEAFARVAV
SDIELNGAVI KEGEGILISS AGVNFDPALA PDPGRFDIRR PPRPSFSFSH GIHRCPGDNL
ARLELEIAFR SLVTRMPNLR TAKPIDQIPS NNNDGTLQRL YELPVVW
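
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the blocks might be joined and the gene translated to compare against the listed protein. It assumes the sequence is given 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits (not needed here, since the gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACGAAAT CAATGCCGGT TCAGGACCTG CCGGCGTTCC CGATCCCCCG GGAGTGCCCG"
print(translate(clean_sequence(first_line)))  # prints: MTKSMPVQDLPAFPIPRECP
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to the same functions would allow the listed protein and the GC content field to be checked in the same way.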