Gene Sare_3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3475 
Symbol 
ID5703538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4007465 
End bp4008952 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content75% 
IMG OID641272902 
Productzeta-phytoene desaturase 
Protein accessionYP_001538268 
Protein GI159039015 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.419175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0366285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGGA TCGTGGTCGT CGGCGCCGGG GTGGGTGGCC TGGCCACCGC CGCCCGGCTG 
GCCACCACCG GGCACGAGGT CACCGTCCTT GAGCAGGGCG ACACGGTGGG TGGCAAGCTT
GGCCGGTACG TACGCGACAC GCCCGCCGGG CCCTTCCACT TCGACACCGG CCCCAGCCTG
CTGACCCTGC CCCAGGTGTT CCACGACCTG TTCGAGGCCA CCGGGGCGAA ACTCGACGAA
TACCTGGACC CGGTCCCGCT CGACCCGATC GTGCGACACG TCTTCGCCCG GGGCGGGCCG
ACGGTGGACT CGTGCGCCGA CCCCGACGAG TTCGCGACCC GGGTCGGCGC GGCCTTCGGT
GAACGGGCCG CCGCCGAGTG GCGACGCCTC TGGCGACGCG CCGAGCGGGT CTGGAGGGCC
TCGGAACGCG ACGTCCTGCG CCGCCGGGTC GACTCTCCCC GGGACCTGGC GGTGCTGGCC
TGGCGGCTGG GTGACCTGGC CGCCATCGGG CCGGGTCGCA CGCTACGTGG GCTGGGCCGC
GCCCACCTGT CCGACCCCCG GCTGCGAATG CTGCTCGACC GGTACGCCAC GTACGCCGGC
ACCGACCCAC GCCGGGCGCC GGCGGCCCTT GTCGCCGTCC CCTACGCGGA GCTGACCTTC
GGTGGGTGGT ATCTGCGGGG CGGCCTGGCC TCCCTCGCCG ACGCCCTGCT GACCCGCTGC
CTGGATCTCG GGGTGGTGGT ACGGACCGGT GTGCGGGTCA CCCGGATCGA CGCGACGGGC
GGGCGGGTGC ATGGCGTACG CCTCGCCGAC GCGACCGCCC CCGTGCCGGC CGACGTGGTG
GTGTCCAACG TCGACGCGCT CACCCTCTAC CGGGACCTGC TACCCACACC GCGACGGCTG
GCCACGCTGA CCGACCGCAG CCTGGCCGGC TTCGTGCTGC TGCTCGGCGT ACGTGGGGAT
TCCGGGCTGG CCCACCACAA CGTCTTCTTC CCCGACGCCT ACGACGCCGA GTTCGACGCG
GTCTTCGGGG CACCCGGACG GGGGATCCGG GCACGTCCGG CCCCGGACCC GACGGTCTTC
GTCACCGTAG CCGCCGACGA GACGGTCCGT CCGGCCGGAC ACGAGGCGTG GTTCGTGCTG
GTCAACGCCG CGCGACAGGG CACCGCCGCC GGTGCCGTCG ACTGGCGGCG TCCCGGCCTC
GCCGACGCGT ACGCGGACCG GATCCTGGAC GTGCTGGCCG CGCGAGGGGT GGACGTACGG
GACCGACTGC TGTTTCGGGA GGTCCGCACG CCGGCCGACC TGGAGGCCAC CGCGGACACG
CCGGGCGGAG CAATCTACGG CACCGCGGGC GGCCTGCTCC GCCCGCCCAA CCGGGGGCCG
GCCCGTGGGC TGTGGCTGGT CGGCGGCTCC TGCCATCCGG GCGGTGGCCT GCCGATGGTC
GCCCTCTCCG CCCAGATCGT CGCCGACTCC ATTGGTCCGG CCTGGTAG
 
Protein sequence
MARIVVVGAG VGGLATAARL ATTGHEVTVL EQGDTVGGKL GRYVRDTPAG PFHFDTGPSL 
LTLPQVFHDL FEATGAKLDE YLDPVPLDPI VRHVFARGGP TVDSCADPDE FATRVGAAFG
ERAAAEWRRL WRRAERVWRA SERDVLRRRV DSPRDLAVLA WRLGDLAAIG PGRTLRGLGR
AHLSDPRLRM LLDRYATYAG TDPRRAPAAL VAVPYAELTF GGWYLRGGLA SLADALLTRC
LDLGVVVRTG VRVTRIDATG GRVHGVRLAD ATAPVPADVV VSNVDALTLY RDLLPTPRRL
ATLTDRSLAG FVLLLGVRGD SGLAHHNVFF PDAYDAEFDA VFGAPGRGIR ARPAPDPTVF
VTVAADETVR PAGHEAWFVL VNAARQGTAA GAVDWRRPGL ADAYADRILD VLAARGVDVR
DRLLFREVRT PADLEATADT PGGAIYGTAG GLLRPPNRGP ARGLWLVGGS CHPGGGLPMV
ALSAQIVADS IGPAW