Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0361 |
Symbol | |
ID | 5732271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 432016 |
End bp | 432876 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277484 |
Product | 2,5-didehydrogluconate reductase |
Protein accession | YP_001543140 |
Protein GI | 159896893 |
COG category | [R] General function prediction only |
COG ID | [COG0656] Aldo/keto reductases, related to diketogulonate reductase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0479945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATCA TCCCAACAGT GAGCTTAAAT GCGGGTATTG CAATGCCGAT CCTCGGGTTT GGTGTGTTTC AGATTACCGA TGCAGCGGAA TGCGAACGCG CCGTTAGTGA TGCGATTCAG ACGGGCTATC GGCTAATTGA CACTGCTGCC GTCTATGGCA ATGAAGCAGC GGTTGGAGCC GCGATCGCCC GAAGTGGTGT GGCCCGCGAA GATCTATTCG TGACAACCAA GCTGTGGATT CAGGACACGG GCTACGAGCA GACCAAAGCT GCTTTTGAGC GTTCGCTCCA ACGCTTAGGC CTCGATTATC TCGACCTGTA TTTGATTCAC CAACCATACG GAGATATTTA TGGTGCGTGG CGGGCTATGG AAGAGTTATA TCAGGCAGGC CGGATTCGCG CCATTGGGGT CAGCAATTTT CACCCCGATC GGGTAATGGA TTTGATCGTT CATAACCAAG TTCCGCCTGC GGTCAACCAG ATTGAAACGC ATCCATTCCA GCAGCAAATC GATACTCAAC AGTTCTTACA GCAGCAGGGT GTACAGATTG AGTCGTGGGG GCCATTTGCT GAAGGTAAAC ATGCTATTTT CCAGAATGAG CTACTAGCAG GTATTGCCGC TACCCATCAA AAAACTGTTG CACAGGTGAT TCTGCGCTGG TTAACTCAAC GTGGGGTTGT GGCAATTCCC AAATCGGTAC GCAAAGAACG CATGGAAGAA AACTTCAACG TGTTCGATAT TACCCTTAGC CCAGAAGAAA TGGCGGCGAT TAGCACATTG GATACAAAAA CTAGCAGCTT CTTTGACCAC CGCGACCCAG CCGTCGTGAA GATGTTAGGT GAAGCAAACC GCCCAACCTA G
|
Protein sequence | MTIIPTVSLN AGIAMPILGF GVFQITDAAE CERAVSDAIQ TGYRLIDTAA VYGNEAAVGA AIARSGVARE DLFVTTKLWI QDTGYEQTKA AFERSLQRLG LDYLDLYLIH QPYGDIYGAW RAMEELYQAG RIRAIGVSNF HPDRVMDLIV HNQVPPAVNQ IETHPFQQQI DTQQFLQQQG VQIESWGPFA EGKHAIFQNE LLAGIAATHQ KTVAQVILRW LTQRGVVAIP KSVRKERMEE NFNVFDITLS PEEMAAISTL DTKTSSFFDH RDPAVVKMLG EANRPT
|
| |