Gene Information |
Locus tag | Haur_3185 |
Symbol | |
ID | 5735060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4028975 |
End bp | 4030453 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280331 |
Product | zeta-phytoene desaturase |
Protein accession | YP_001545950 |
Protein GI | 159899703 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02734] phytoene desaturase |
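The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values in this record, but neither is stated by it). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon, which is not translated


# Values copied from the record for Haur_3185.
start_bp, end_bp = 4028975, 4030453

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

With these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.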
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00134863 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATTG TTGTGATTGG CAGTGGGTTT GGCGGTTTGG GTGCGGCGAT TCGGTTGCAA GCCAAAGGCC ACGATGTGAC GATTTTGGAG AAGCGCGATA AGCCTGGTGG TCGGGCTTAT GTTTATGAGC AAGATGGTTT TAAATTTGAT GGCGGGCCAA CTGTGATCAC CGCGCCATTT ATGATTGATG ATTTATTTAC TTTGGCGGGC CGCAAAACCG AAGATTATGT CACGATGATG CCAGTCAGTC CGTTTTATCG CATCTTTTTT CATGATCAAA GCCATTTCGA TTACTCCGAT GACCTGCCAA GCATGGTGCG CCAAATTCGC GAGTGGAACC CTGCTGATGT TGATGGCTAT TTGGAATTTG TGCGCCGCGC CGAGCAGATT TTCAACAAAG GCTTCACCGA ATTAGCCGAT CAGCCTTTTT TGAAATTAGC TGATATGCTC AAAATCGTGC CCGATATGAT CAAACTTGAG TCGTATCGCA CAGTTTATGG CTTTGTTTCG AAATTTGTGC AAGATGAGCG TTTGCGCCAA GTGCTCAGTT TTCACCCGTT GCTGGTCGGC GGCAACCCCT TCCAAACCAC CAGCATTTAT ACCTTGATCA ACTTCCTCGA ACGCAAATGG GGCGTGTGGT TTGCCAAAGG CGGCACAGGG GCGTTAGTTC AAGCCTTGGT TAAATTATTT GAAGATTTGG GCGGCAAAAT CGAACTCAAT CGCGATGTAG CGGAAATTAG CACCTACAAC GGCAAGGCTA CGGGCGTGCG CCTGCGTGAT GGCAGCGAAA TCAAAGCCGA TGCGGTGGTT TGCAATAGCG AAGTTGCTTG GGCCTATCAA AATCTCTTGC CCAGCAGCAT GCGCAAAAAA TACACCGACC GCAAGCTCGC TCGGATGCGC TATTCAATGT CGCTGGTCGT GATCTATTTC GGCACCGATC GTCAATATCG CGGCCAAGAT GGGCCAAAAC TCGCACATCA CGATATTATT TTGGGGCCAC GCTACCAGCC CTTGCTTGAT GATATTTTCA CGAAAAAACA GCTGGCCGAT GATTTCTCGC TGTATTTGCA TATGCCCACC TTGACCGATC CATCGTTAGC ACCAGAAGGC TGCGAGGCTT TTTATGTGCT TTCGCCAGTG CCGCACCTTG GTTCGGGCAC CGATTGGCGG CAGACAGCCA AGCCATATCG CGACCGAATT ATGAACTTTT TAGAAGATCG CTACCTGCCC AATTTATCCA AACATATTGT CAGCGAGCAT ATGATTACGC CCTTGCATTT TGCCGAAACC CTGAATAGCT ACCAAGGCAG CGCCTTCTCG GTCGAGCCAA TTCTGACGCA ATCGGCCTGG TTCCGTCCGC ACAATCGCTC GGAAGAAATT CCTAATCTCT ATTTTGTCGG GGCTGGAACC CACCCCGGGG CTGGCTTGCC GGGCGTATTA TCCAGCGCCA AAATCGTCGA TGATTTGATT GGAGCCTAA
|
Protein sequence | MRIVVIGSGF GGLGAAIRLQ AKGHDVTILE KRDKPGGRAY VYEQDGFKFD GGPTVITAPF MIDDLFTLAG RKTEDYVTMM PVSPFYRIFF HDQSHFDYSD DLPSMVRQIR EWNPADVDGY LEFVRRAEQI FNKGFTELAD QPFLKLADML KIVPDMIKLE SYRTVYGFVS KFVQDERLRQ VLSFHPLLVG GNPFQTTSIY TLINFLERKW GVWFAKGGTG ALVQALVKLF EDLGGKIELN RDVAEISTYN GKATGVRLRD GSEIKADAVV CNSEVAWAYQ NLLPSSMRKK YTDRKLARMR YSMSLVVIYF GTDRQYRGQD GPKLAHHDII LGPRYQPLLD DIFTKKQLAD DFSLYLHMPT LTDPSLAPEG CEAFYVLSPV PHLGSGTDWR QTAKPYRDRI MNFLEDRYLP NLSKHIVSEH MITPLHFAET LNSYQGSAFS VEPILTQSAW FRPHNRSEEI PNLYFVGAGT HPGAGLPGVL SSAKIVDDLI GA