Gene Sare_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1053 
Symbol 
ID5708332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1180408 
End bp1181913 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content67% 
IMG OID641270569 
Productall-trans-retinol 13,14-reductase 
Protein accessionYP_001535953 
Protein GI159036700 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0671242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC ACTACGACGT GATCGTCATC GGTAGCGGGC TCGGCGGCCT CGCCGCAGCC 
ACCACCCTGC AGCGTGGGGG CAGACGAACG TTGCTGTTGG AACGGCACAG CGTTCCCGGC
GGCGCGGCCA CCTCCTTCGT ACGGGGCCGC TTCGAATTCG AGGTATCGCT GCACCAGCTG
GGCGGCATGG GCGGGCACGG TCCCCTGAAG GCGGTGCTGG ACGAGCTGGA CGTGACGCGC
CGATTGGAGT GGCTGGAAGA TCGTGATCTG TGCCGCACGT TCGTGCCGGG TGAGGTCGAT
GTCCGGCTGC CCCACGACTG GACGGGTCTG GCGGACGTGC TGGAGGGCCT CAGCCCCGGC
AACCGGCCAC AGATCGTGCG ATTCCTGGAA ATCGTCCGGG AGACCGGGCT GTGGCAGCTG
ACCGCGCGGG TCAACCTGCA CCGGATTCCG GAACAGTTGG AGTGGCTGAA GACCCTTCCC
GACGTGCGCC GTTACGCACT GCGGACCTTC CGTGAGGTGC TGGACGAGTT CTTCACCGAC
GAACGGCTGA AGCTCGTGTT GTCCTCGTAC TGGAGCTACA ACGGTCAGCT TCCGCACCGA
ATCGCCTTCG TGGACATGGC CCGACTCCTG ACGCTGTATG TGGAGACCAG GCCGTACCAG
GTCGTCGGCG GCGGTCAGGC GCTCTCCTCC GCGCTCGTGG AGTCCTTCGA GGAGGCCGGG
GGCGAGCTGC GGCTGAACAC CAACGTCGTA CAGATTCTCA CCCACGGGGG ATCCGCGGTG
GGAGTGCGGT TGGAAAACGG TGACACGGTC GGCGCCAAGC TGGTGGTCTC CAACGCTCCG
TCGACGACGA CGTACACCCG GCTGTTGGAC GACCCGGGGG TGGTACCCGC CCATGTTCTG
CAGGGTCTGC GTGCCCGGCG GCCCGGGGCG TCGGCATCGT GCCTGTTCCT CGGAGTGGAC
GCGTCGCCGA AGGAGTTGGG CTTTACCGCG GCGACGACGT TCCTCTCCGC CAGTCTGGAC
GAGCAGTCGG TGCTGCGTGG GGCCTACTCC CTGACCGAGC CCTGCCCATT CCTGATCGTC
ACCTGCTACG ACGTGCAGCC GACCGGTTTC GCCCCCAAGG GCTGCACCCA GGTTGTGCTC
TCCGCGATCC AGTACGCCGA GCCGTGGGAA TCGCTGGCGC CGGAGGACTA CGCGGCGGCA
AAGGCCTCCT ACGCGGAGTC CCAACTGGAC CTGGCGGAGA CCCTCGTACC CGGCCTCCGG
GACGCGATCG AGGAGGCCGA GCTGGCCACA CCGCTGACGT TCAAGCGCTA CACCAACCAG
CCCGGCGGCG CCATCTACGG CTTCGACCAG GACATCACCG ACAGCTGGCT GTTCCACGAC
GAGGACCTCA AGCAGAACGT GCCGGGCCTG CTGCTGACCA GCAACTGGAC GACGGCCGGC
GGTTACAACT CCAACCTCGT GACAGCTTCC CGGCGCTGCC AGGGGCTGCT CGCCATGGGC
CAGTGA
 
Protein sequence
MSTHYDVIVI GSGLGGLAAA TTLQRGGRRT LLLERHSVPG GAATSFVRGR FEFEVSLHQL 
GGMGGHGPLK AVLDELDVTR RLEWLEDRDL CRTFVPGEVD VRLPHDWTGL ADVLEGLSPG
NRPQIVRFLE IVRETGLWQL TARVNLHRIP EQLEWLKTLP DVRRYALRTF REVLDEFFTD
ERLKLVLSSY WSYNGQLPHR IAFVDMARLL TLYVETRPYQ VVGGGQALSS ALVESFEEAG
GELRLNTNVV QILTHGGSAV GVRLENGDTV GAKLVVSNAP STTTYTRLLD DPGVVPAHVL
QGLRARRPGA SASCLFLGVD ASPKELGFTA ATTFLSASLD EQSVLRGAYS LTEPCPFLIV
TCYDVQPTGF APKGCTQVVL SAIQYAEPWE SLAPEDYAAA KASYAESQLD LAETLVPGLR
DAIEEAELAT PLTFKRYTNQ PGGAIYGFDQ DITDSWLFHD EDLKQNVPGL LLTSNWTTAG
GYNSNLVTAS RRCQGLLAMG Q