Gene Haur_3185 details

Gene Information

Locus tag: Haur_3185
Symbol:
ID: 5735060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4028975
End bp: 4030453
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 50%
IMG OID: 641280331
Product: zeta-phytoene desaturase
Protein accession: YP_001545950
Protein GI: 159899703
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02734] phytoene desaturase
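
The coordinate and length fields above can be cross-checked against one another. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4028975, 4030453)
print(length, protein_length(length))  # prints: 1479 492
```

Under these assumptions the computed values match the listed Gene Length and Protein Length.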


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00134863
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAATTG TTGTGATTGG CAGTGGGTTT GGCGGTTTGG GTGCGGCGAT TCGGTTGCAA 
GCCAAAGGCC ACGATGTGAC GATTTTGGAG AAGCGCGATA AGCCTGGTGG TCGGGCTTAT
GTTTATGAGC AAGATGGTTT TAAATTTGAT GGCGGGCCAA CTGTGATCAC CGCGCCATTT
ATGATTGATG ATTTATTTAC TTTGGCGGGC CGCAAAACCG AAGATTATGT CACGATGATG
CCAGTCAGTC CGTTTTATCG CATCTTTTTT CATGATCAAA GCCATTTCGA TTACTCCGAT
GACCTGCCAA GCATGGTGCG CCAAATTCGC GAGTGGAACC CTGCTGATGT TGATGGCTAT
TTGGAATTTG TGCGCCGCGC CGAGCAGATT TTCAACAAAG GCTTCACCGA ATTAGCCGAT
CAGCCTTTTT TGAAATTAGC TGATATGCTC AAAATCGTGC CCGATATGAT CAAACTTGAG
TCGTATCGCA CAGTTTATGG CTTTGTTTCG AAATTTGTGC AAGATGAGCG TTTGCGCCAA
GTGCTCAGTT TTCACCCGTT GCTGGTCGGC GGCAACCCCT TCCAAACCAC CAGCATTTAT
ACCTTGATCA ACTTCCTCGA ACGCAAATGG GGCGTGTGGT TTGCCAAAGG CGGCACAGGG
GCGTTAGTTC AAGCCTTGGT TAAATTATTT GAAGATTTGG GCGGCAAAAT CGAACTCAAT
CGCGATGTAG CGGAAATTAG CACCTACAAC GGCAAGGCTA CGGGCGTGCG CCTGCGTGAT
GGCAGCGAAA TCAAAGCCGA TGCGGTGGTT TGCAATAGCG AAGTTGCTTG GGCCTATCAA
AATCTCTTGC CCAGCAGCAT GCGCAAAAAA TACACCGACC GCAAGCTCGC TCGGATGCGC
TATTCAATGT CGCTGGTCGT GATCTATTTC GGCACCGATC GTCAATATCG CGGCCAAGAT
GGGCCAAAAC TCGCACATCA CGATATTATT TTGGGGCCAC GCTACCAGCC CTTGCTTGAT
GATATTTTCA CGAAAAAACA GCTGGCCGAT GATTTCTCGC TGTATTTGCA TATGCCCACC
TTGACCGATC CATCGTTAGC ACCAGAAGGC TGCGAGGCTT TTTATGTGCT TTCGCCAGTG
CCGCACCTTG GTTCGGGCAC CGATTGGCGG CAGACAGCCA AGCCATATCG CGACCGAATT
ATGAACTTTT TAGAAGATCG CTACCTGCCC AATTTATCCA AACATATTGT CAGCGAGCAT
ATGATTACGC CCTTGCATTT TGCCGAAACC CTGAATAGCT ACCAAGGCAG CGCCTTCTCG
GTCGAGCCAA TTCTGACGCA ATCGGCCTGG TTCCGTCCGC ACAATCGCTC GGAAGAAATT
CCTAATCTCT ATTTTGTCGG GGCTGGAACC CACCCCGGGG CTGGCTTGCC GGGCGTATTA
TCCAGCGCCA AAATCGTCGA TGATTTGATT GGAGCCTAA
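
The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal Python sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and the GC calculation simply counts G and C over all characters, assuming the sequence has no ambiguity codes.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, as displayed (trailing space included).
first_line = "ATGCGAATTG TTGTGATTGG CAGTGGGTTT GGCGGTTTGG GTGCGGCGAT TCGGTTGCAA "
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` gives a string whose length and `gc_fraction` can be compared with the Gene Length and GC content fields in the record.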
 
Protein sequence
MRIVVIGSGF GGLGAAIRLQ AKGHDVTILE KRDKPGGRAY VYEQDGFKFD GGPTVITAPF 
MIDDLFTLAG RKTEDYVTMM PVSPFYRIFF HDQSHFDYSD DLPSMVRQIR EWNPADVDGY
LEFVRRAEQI FNKGFTELAD QPFLKLADML KIVPDMIKLE SYRTVYGFVS KFVQDERLRQ
VLSFHPLLVG GNPFQTTSIY TLINFLERKW GVWFAKGGTG ALVQALVKLF EDLGGKIELN
RDVAEISTYN GKATGVRLRD GSEIKADAVV CNSEVAWAYQ NLLPSSMRKK YTDRKLARMR
YSMSLVVIYF GTDRQYRGQD GPKLAHHDII LGPRYQPLLD DIFTKKQLAD DFSLYLHMPT
LTDPSLAPEG CEAFYVLSPV PHLGSGTDWR QTAKPYRDRI MNFLEDRYLP NLSKHIVSEH
MITPLHFAET LNSYQGSAFS VEPILTQSAW FRPHNRSEEI PNLYFVGAGT HPGAGLPGVL
SSAKIVDDLI GA