Gene Information |
Locus tag | Haur_4219 |
Symbol | |
ID | 5736073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5376302 |
End bp | 5377279 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281374 |
Product | aldo/keto reductase |
Protein accession | YP_001546979 |
Protein GI | 159900732 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
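The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the record does not state either convention, but both agree with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Haur_4219
length = gene_length_bp(5376302, 5377279)
print(length)                     # 978
print(protein_length_aa(length))  # 325
```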
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTATC GCAAATTAGG CCGAACTGGA CTCAAAGTTT CGGAATTATG TTTGGGAACA ATGACCTTTG GCTGGAGTGT TGAAGAGCCA GAATCGTTCG CGATTATGAG CCAAGCGCTG GATGCAGGCA TTAATTTCTT CGATACTGCC GATGTTTATT CGCGTTGGGA CCCCCGTTCA TATGCTGGCA AAACCGAGGA AATTATCGGG CGTTGGTTGG CCGAGGAGCC AAGTCGTCGC GATAAAGTGG TGTTAGCAAC CAAAGTTCGT GGCCAAATGG GCGATCAGCC GAACGATCAA GGGCTGTCGC GCAAACATAT TATTGCAGCG GTTGATGCCA GCTTGCGGCG CTTGCAAACA GATTATATCG ACCTCTATCA AACCCACTTT CCCGATAGCG AAGTGCCAAT CGAAGAAACC TTGCGGGCGC TTGATGATTT GGTACGAGCT GGTAAAGTGC GCTATATCGG CTGCTCGAAT TACCGCGCAT GGCAATTGAT CGAAGCCCTG TGGACAAGCG ATAAACTGAA TCTCGCCCGT TACGATTCGC TGCAACCACA TTATCATTTG CTCAATCGAG CCGAGTTTGA GCGCGAATTA CAACCAGTTT GCGCCAAGTA TGGCGTGGGC GTGATTGTCT ATAGCCCGTT GGCGGGTGGC TTATTGACTG GCAAGTTTCG GCGTGGCCAG CCTATCCCCG AGGGCACACG GGTTGCTGAA CGTGGCCGTG ACGACAAACG CCTGAGCGAA CCATTGTGGA AATTAATCGA TCAGCTGGAA GTGTTAGCTG AGAAGCACCA AAAAACCATT CCCCAACTGG CAGTGGCATG GGTCACGGGT GCACCAGCGA TCACCTCGGC GATTATTGGG GCAACCAGCA GTGGGCAACT CCACGATACC TTGGGTGCAG CCGATTTTGA GTTGAGTAGC GAAGATCGGG CCATTTTAGA CAACCTAAGT GCTTGGGAGA ACGTCTAA
|
Protein sequence | MRYRKLGRTG LKVSELCLGT MTFGWSVEEP ESFAIMSQAL DAGINFFDTA DVYSRWDPRS YAGKTEEIIG RWLAEEPSRR DKVVLATKVR GQMGDQPNDQ GLSRKHIIAA VDASLRRLQT DYIDLYQTHF PDSEVPIEET LRALDDLVRA GKVRYIGCSN YRAWQLIEAL WTSDKLNLAR YDSLQPHYHL LNRAEFEREL QPVCAKYGVG VIVYSPLAGG LLTGKFRRGQ PIPEGTRVAE RGRDDKRLSE PLWKLIDQLE VLAEKHQKTI PQLAVAWVTG APAITSAIIG ATSSGQLHDT LGAADFELSS EDRAILDNLS AWENV
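The relation between the two sequence fields, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for every position after the start codon (table 11 differs only in the start codons it permits, and this gene begins with ATG). The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the usual NCBI ordering: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1 and drop a single trailing stop codon, if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence in the record
print(translate("ATGCGCTATC GCAAATTAGG CCGAACTGGA"))  # MRYRKLGRTG
print(translate("GCTTGGGAGA ACGTCTAA"))               # AWENV
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field with its spaces removed, and `gc_content` on the same input can be compared with the rounded GC content value in the record.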