Gene Haur_0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0609 
Symbol 
ID5732507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp701931 
End bp703418 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content52% 
IMG OID641277736 
Productzeta-phytoene desaturase 
Protein accessionYP_001543385 
Protein GI159897138 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.392694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCATG TGGTGATTGT TGGCGGCGGT TTAGGTGGTT TGGCAGCAGC CTTGCGGCTA 
CGGGCGGCTG GTGTCGCCGT AACTTTATTT GAAAAAAACA CGGCCCTTGG TGGCAAAATG
GCCCAAGTTG TGCAACATGG CTTTCGCTTT GATACAGGAC CATCGCTGTT TACGATGCCA
TGGGTGGTGG AAGAATTGCT CGCCAGTGTT GGGCGTGATC TCGCCAGCGA ACTGACAATT
AAGCCGGTTG ATCCAACCTG CCGTTATCAA TGGCCTGATG GCACACGCCT CGATGCTTGG
AGCGATCTGC CCAAGCTTTT GAAAGAAATT GAACGCCTCG AACCAGCCGA TGTTGCAGGC
TTTTTGGGCT TTATGGCATT TAGCGCCCAG ATTTATCAAG CAGCGGCTGA GCCATTTTTG
CTTGAGCCAT TTCAGGGCTT GCGCACAATG CTGCAACCAC GTTTACTGCG CGATGTTTGG
AAAATTGCCC CCTTGAAAAC TGTTGATCAA GCGGTGCGCC ACTATTTCAA ACACCCCTAT
CTGCGCCAGT TGTTCAATCG CTATGCTACC TATAATGGCT CCTCGCCCTA TCGTTCGCCC
GCGACGTTTT GTATTATTCC TTATGTTGAA ATCGCCCAAG GCGGCTGGTA TATCGACGGT
GGTATGTATC AATTGGCGGC GACGCTAGGC CGAATTGCTG GCGAAATGGG TGTCGATATT
CAGTTAAATA GTTTGGTTAG CGAAGTTATT GTAACCAACA AAACCGCTAC TGGTGTGCGT
TTGAGCGATG GCCGCACGAT TAATGCCGAT TATGTGATTG TGAATGCTGA TGCGATGTAT
GCGCTTGATC AGCTGATTCC AACCACCAAA GCGCCAACCC ACGAATTGGC GTGCTCAGGT
TTTGTGTTGC TGTTGGGGGT TAATCGCGAT TACGCTCAAT TGCAGCATCA CAACATTTTC
TTTAGCGGCG ATTATGCCGC CGAATTTCGG GCAATTTTTG AGCATGGTGT GCCAGCAGTT
GACCCGACAA TCTATATCGC TGCAACCTGC CGCTCGAACC CTGAGCATGC CCCGCCTGGA
ATGCTGAATT TATTTGTGCT GGTCAATGCC CCAGTCACAG GTCGGGTGAA TTGGCAACGC
GAGGCGGCGG CCTATCGTGA TTTAGTGGTG CGCCGTTTGG AGCAGCATGG CTTGGTTGGG
CTGAATCAAG CGATTATCAG CGAAACTATG CTCACGCCTG CTGATCTGGC GAGCATGACC
AATGCCCAGC GTGGCTCGTT GTATGGCCCT GCCTCGCATG GCTTACAGGC AGCCTTTTTG
CGGCCAGCCA ATCAGCCAGC AGGCTTGCAG AATTTGGCCC TTGTTGGTGG GGCGACCCAT
CCTGGTGGTG GAATTCCGCT GGTGTTGTTG TCGGGCAAAG CTGGGGCACG CTGGGCATTG
CAAAGATTAG GAGTTGCTGA GAAGCCCAAA CTTGGGACTA TTTTCTGA
 
Protein sequence
MQHVVIVGGG LGGLAAALRL RAAGVAVTLF EKNTALGGKM AQVVQHGFRF DTGPSLFTMP 
WVVEELLASV GRDLASELTI KPVDPTCRYQ WPDGTRLDAW SDLPKLLKEI ERLEPADVAG
FLGFMAFSAQ IYQAAAEPFL LEPFQGLRTM LQPRLLRDVW KIAPLKTVDQ AVRHYFKHPY
LRQLFNRYAT YNGSSPYRSP ATFCIIPYVE IAQGGWYIDG GMYQLAATLG RIAGEMGVDI
QLNSLVSEVI VTNKTATGVR LSDGRTINAD YVIVNADAMY ALDQLIPTTK APTHELACSG
FVLLLGVNRD YAQLQHHNIF FSGDYAAEFR AIFEHGVPAV DPTIYIAATC RSNPEHAPPG
MLNLFVLVNA PVTGRVNWQR EAAAYRDLVV RRLEQHGLVG LNQAIISETM LTPADLASMT
NAQRGSLYGP ASHGLQAAFL RPANQPAGLQ NLALVGGATH PGGGIPLVLL SGKAGARWAL
QRLGVAEKPK LGTIF