Gene Information |
Locus tag | Haur_0347 |
Symbol | |
ID | 5732257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 418341 |
End bp | 419447 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277470 |
Product | 2-alkenal reductase |
Protein accession | YP_001543126 |
Protein GI | 159896879 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
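The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon (the gene sequence in this record ends in TAA). The helper names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
# Values copied from the record above.
start_bp = 418341
end_bp = 419447


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1107 368
```

Under these assumptions the computed values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields of the record.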
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000181204 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATC CCAAACGTAC CGTCGGCATC GGCGGATTGA TTATTGCCGC CCTCGCACTT TCAGGGGTTG GTGGTGGTTT AGTCGGTGGT GTCGCAGGCT ATCGCTTAGC CCAAGTTGAT CAAACGGCAA CGGCAGTGGC GGTCACGCCA ACCAGCGCCA CCCAACAAAC GACCGATAAT CTGACTGCGC AACCAGTTGT TGCCAGCACG AATGATGCAG TTGTCCAAAC CGTTGCCAAG GCCTCACCAG CAGTCGTAAA AATTATTAGT ACGGTGGTTG GCGGTCAGGC CAGCGGTTCA GGCGCAATTA TCAATGAAGA TGGCTATATC ATCACCAATA ATCATGTGGT CGAAGGTCAA AGTCGCTTGC AAGTAATTTA TTCTGATGGA ACCAGCCACA ATGCCGAGTT GATCGGCACT GATGCTTTTT CGGATATCGC CGTGATTCGG GTGCTCGATG CCGTGCCAGC CACAATTTCC TTAGGCGATT CGGATAGTTT GCAGCCTGGC GAAACGGTCG TGGCCATTGG CAGTCCGCTT GGTAAATTCC AAAATAGTGT GACGGTTGGA GTGGTCAGCG CTCTTGACCG CACAATTGAT AGCATGGAAG GCCTAATTCA AACTGACGCA GCGATTAACC ATGGCAATAG TGGCGGCCCA CTGATCAACC TCAAAGGCGA AATTGTTGGG ATCAATACGT TGGTTGTGCG CGGTGATATT GGTAGTATCG ATGAAGCTCA AGGCCTAGGT TTTGCTGTCC CATCAAATAT TGTGCGCGAA GTCAGCGATG CTTTGATTGC CAATGGCCAA GTTATTCGAC CATATATAGG CATTCGCTAC GAATTGCTTT CGCCCGAAAC CGCCGAACTG GGGATTGCCA ATGATAAAGG TGCTTTCGTG ACCAATGTTG ATGAAGGTAC ACCTGCTCGG CGAGCTGGCA TTAGCCGTGG CGATATTATT CTCGCAGTCA ACGGCGAAGA AATTACTCAA CGCCACTCAT TGCAGCGCCT GCTGTTGCAA TATAAACCAG GCGATACCGT AACGGTCACA ATCGAGCGTA ATGACCAACA ACAGGAAGTG CAAATTACCC TCGCTGAACG CCCATAA
|
Protein sequence | MSNPKRTVGI GGLIIAALAL SGVGGGLVGG VAGYRLAQVD QTATAVAVTP TSATQQTTDN LTAQPVVAST NDAVVQTVAK ASPAVVKIIS TVVGGQASGS GAIINEDGYI ITNNHVVEGQ SRLQVIYSDG TSHNAELIGT DAFSDIAVIR VLDAVPATIS LGDSDSLQPG ETVVAIGSPL GKFQNSVTVG VVSALDRTID SMEGLIQTDA AINHGNSGGP LINLKGEIVG INTLVVRGDI GSIDEAQGLG FAVPSNIVRE VSDALIANGQ VIRPYIGIRY ELLSPETAEL GIANDKGAFV TNVDEGTPAR RAGISRGDII LAVNGEEITQ RHSLQRLLLQ YKPGDTVTVT IERNDQQQEV QITLAERP