Gene Sare_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2024 
Symbol 
ID5704461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2314564 
End bp2315838 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID641271514 
Productcytochrome P450 
Protein accessionYP_001536885 
Protein GI159037632 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00720801 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0287993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGT ACCAGGACCG GCCGACCGGC GACCAGCCCG GCGCTCCGGT GCCGTCAGGG 
TCGACCGATC CGGGCATCGG CGCGTTTCCG CTGCCCCGGC GCTGCCCCTT CAGCCCACCT
GCCGAGTACG CCCGACTACG CGCCGAGCAT CCGGTCGTCC GACTGCCGAT GCTCGGCGGT
GACACGGCCT GGGTGGTCTC CCGGCATGCC GACGTCCGGC AAGTGCTCAG CGATCCCCGG
ATGAGCGCGG ACCGACGTCG ACCCGGTTTC CCGAAGTTCG CGCCGACGAC GGAGGGCCAG
CGGCAGGCGT CGTTCGCGAA CTTCCGCCCA CCGTTGAACT GGCTGGACCC GCCGGAGCAC
GCCATCTGTC GGCGACAGAT CGTCGACGAG TTCTCCGTGC GCCGGGTCCG GCAGTCACGG
GCGTTGGTCG AACGGGTCGT CGACACGCAC CTCGACGCGT TGACCGCCGC CGCGCCCGGC
GCCGACCTGG TGTCGACGTT CGCCTACCCG GTCCCATCAC AGGTGATCTG TGAGGTACTC
GGCGTGCCCT ACGGCGAGCA CGAGTTCTTC GAGCGCCGTT CGACGCTGAT GTTCCGCCGG
AGCACGCCGG CCGACGAACG CGCCCGCTGC GCCCGGGAGA TCCGCGATTT TCTCGACGTG
GTGGTCACCG ACAAGGAGCG CCGTCCCGGC GACGACGTGC TCAGCCGGCT GCTGTACCGG
CAGCGCCGCG CCGGTGGCGT GGACCACGAG GCCGTGGTGA GCATGGCCTT CGTGCTGCTG
GTCGCCGGGC ATGTCACCAC GTCCAATATG CTCGCGTTGA GCGTGCTGGC CCTGCTCACT
CATCCGGCAC GGCTGGCCCG GCTACGCGCC GAACCGGAAC GGTTCCCGGC CGCCGTGGAG
GAACTGCTGC GGTACTTCAC CGTGGTCGAG GCGGCGACAG CCCGCACCAC CACCGCCGAG
GTCACGATCG GCGGGGTGAC CATCGCGGCG GGAGAGGGGG TGGTGGCGCT GGGGCAGGCG
GCGAACCGTG ATCCGAGGGT GTTCGAACAC CCCGACGAGT TCGACCCCGA CCGGGACGCC
CGTGCGCACC TCGCCTTCGG CCACGGCCGG CACATCTGCC CGGGTCAGCA TCTCGCTCGG
TTGGAGATGG AGGTCGCGCT CAGTCGCCTG TTCCGGCGGC TGCCCGGCCT AAGACTCACG
ATGGAGGTTT CCGACCTGCC CCTCAAGGAG GACAGCAACA TCTTCGGGTT GTACGCCCTA
CCGGTCGCCT GGTGA
 
Protein sequence
MTGYQDRPTG DQPGAPVPSG STDPGIGAFP LPRRCPFSPP AEYARLRAEH PVVRLPMLGG 
DTAWVVSRHA DVRQVLSDPR MSADRRRPGF PKFAPTTEGQ RQASFANFRP PLNWLDPPEH
AICRRQIVDE FSVRRVRQSR ALVERVVDTH LDALTAAAPG ADLVSTFAYP VPSQVICEVL
GVPYGEHEFF ERRSTLMFRR STPADERARC AREIRDFLDV VVTDKERRPG DDVLSRLLYR
QRRAGGVDHE AVVSMAFVLL VAGHVTTSNM LALSVLALLT HPARLARLRA EPERFPAAVE
ELLRYFTVVE AATARTTTAE VTIGGVTIAA GEGVVALGQA ANRDPRVFEH PDEFDPDRDA
RAHLAFGHGR HICPGQHLAR LEMEVALSRL FRRLPGLRLT MEVSDLPLKE DSNIFGLYAL
PVAW