Gene Sare_2032 details

Gene Information

Locus tag: Sare_2032
Symbol:
ID: 5705686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2325839
End bp: 2327044
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 72%
IMG OID: 641271522
Product: cytochrome P450
Protein accession: YP_001536893
Protein GI: 159037640
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0674155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGGCCG ATGCCGTGAC CGCGTTCGAT CCGACCGCCG TCGACGTTCG GCGCGACCCG 
TACCCGTCGT ACCACTGGCT GCTCCGCCAC GATCCGGTGC ACCGGGGCGC CCACCAGGTC
TGGTACGTCT CGCGTTTCGC CGACGTCCGC GCGGTACTCG GCGACGAACG GTTCGCCCGT
ACCGGCATCC GCCGGTTCTG GACCGATCTG GTCGGGCCCG GCCTGCTCAG CCAGATCGTC
GGCGACATCA TCCTGTTCCA GGACGAGCCA GACCACGGCC GGCTACGTGG TGTGGTCGGC
CCGGCCTTCT CCCCGTCGGC GCTGCGCCGC CTGGAACCGA CGATCGAGGC CACCGTCAAC
GACCTGTTGC GCCCGGCGCG GGCCCTCGGC GCGATGGATG TGGTGGCCGA CCTGGCGTAC
CCGCTGGCGC TGCGCGCGGT GCTCGAGCTG CTCGGCCTAC CGGCCGGCGA CGCCAACACG
GTCGGCCGCT GGTCGCGTGC GGTGGGCCGG ACACTGGACC GGGGCGCCAC CGCCGAGGAC
ATGCGGCGGG GACACGCGGC CATCGCCGAG TTCGCCGACT ACGTGGAACG GGTGCTGGCC
GAGCGCCGCG AGGACGGTGC GGACCTGCTG GCCCTGATGC TCGCCGCCCA CCGGAGCCAG
CTGATGAGCC GCAACGAGAT CGTCAGCACC GTGGTCACCT TCATCTTCAC CGGTCATGAG
ACGGTGGCCA GCCAGCTGGG CAACGGCCTG CTCAGCCTCC TGGACCACCC GGAGCAGATG
GAGTTGATGC GCCGACAGCC GCACCTGCTA CCACACGCGG TCGAGGAATG CCTGCGCTTC
GACCCGGCGG TGCAGTCGAA CACCCGACAG TTGGCGGCCG ACGTCGAGCT GCACGGCCGG
CGGCTGCGCC GCGACGACGT CGTGGTGGTC CTCGCCGGCG CGGCCAACCG GGACCCCGGG
CGGTACGACC GGCCCGACGA GCTCGACATC CGCCGCGACC CCGTCCCGTC GATGTCCTTC
GGGGCGGGCA TGCGCTACTG CCTCGGGTCG TACCTGGCCC GGCTTCAGCT GCGTACCGCT
CTCGGCGCCA TGGTCGCGCT GCCGGACCTG CGCTTGGTCT GCAGCCCGAA CGAACTGGCC
TACCAGCCTC GCACGATGTT CCGTGGTCTC ACGAGGCTGC CGGTCGCGTT CACGCCGGCC
GGCTGA
 
Protein sequence
MLADAVTAFD PTAVDVRRDP YPSYHWLLRH DPVHRGAHQV WYVSRFADVR AVLGDERFAR 
TGIRRFWTDL VGPGLLSQIV GDIILFQDEP DHGRLRGVVG PAFSPSALRR LEPTIEATVN
DLLRPARALG AMDVVADLAY PLALRAVLEL LGLPAGDANT VGRWSRAVGR TLDRGATAED
MRRGHAAIAE FADYVERVLA ERREDGADLL ALMLAAHRSQ LMSRNEIVST VVTFIFTGHE
TVASQLGNGL LSLLDHPEQM ELMRRQPHLL PHAVEECLRF DPAVQSNTRQ LAADVELHGR
RLRRDDVVVV LAGAANRDPG RYDRPDELDI RRDPVPSMSF GAGMRYCLGS YLARLQLRTA
LGAMVALPDL RLVCSPNELA YQPRTMFRGL TRLPVAFTPA G
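
The numeric fields in this record can be cross-checked against the coordinates and sequences listed above. The sketch below is illustrative only and is not part of the database record: the function names (clean, gene_length, gc_content, translate_cds) are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the first codon is read as Met when it is one of the initiator codons of translation table 11. That last assumption is why a gene sequence starting with GTG corresponds to a protein sequence starting with M.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons of table 11; at the first position they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Drop the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, dropping a trailing stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# Start bp and End bp from the record give the listed gene length.
print(gene_length(2325839, 2327044))
# 1206 bp is 402 codons; without the stop codon that leaves 401 residues.
print(gene_length(2325839, 2327044) // 3 - 1)
# First 30 bases of the gene sequence, followed by a stop codon.
print(translate_cds("GTGTTGGCCG ATGCCGTGAC CGCGTTCGAT TGA"))
```

Pasting the full gene sequence into gc_content and translate_cds allows the same comparison against the GC content and protein sequence fields.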