Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4586 |
Symbol | |
ID | 5736431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5867850 |
End bp | 5868842 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281748 |
Product | aldo/keto reductase |
Protein accession | YP_001547345 |
Protein GI | 159901098 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00281248 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAAGC GAACAATTGG CAAAAGTGGT ATTGAAGTCA GTGCTTTGGG TTTAGGCTGT TGGGCCATCG GCGGCCCATT CAATCACAAT GGCAATCCCT CAGGTTGGGG CCAAGTTGAC GATGCTGAAT CAATTCAGGC AATTCATGCA GCGCTTGAGC TTGGAGTCAA TTTTCTCGAT ACTGCCAATG TCTATGGGTG TGGCCACAGC GAGCAAATTA TTGCCCAAGC GATCGCTGGC CGCCGTGATC AGGTGATTCT CGCCACTAAA TTTGGCAATA GCTGGGCTGA AGGCAGCAAA GATTCGTTTG AGCAAATTCC AATTAGCCCT GCCGAAATTC GCCAACAATT GGAAGCCAGC CTCAAACGGC TCAATACCGA TTATGTCGAT TTGTTACAAT TTCACTTGTG GGGCTATCCG GTCGAAGATG CCGCCCCAGT GCGCGATACG CTTGAAAGTT TGGTGAGCGA GGGCAAAATT CGCGGCTATG GTTGGAGCAC CGATCGGCTT GATGCGATTA AGCTCTTTGC TGAAGGCCCA CATTGTATTG CTGTGCAACA ACAATTGAAT TTATTGCATG GTCATAGCAC GGGCGAGAGC GATGCAATTA TCGCCTTTTG CGAAGCCCAC AATTTAGCCA GCATCAATCG TGCACCCTTA GCGATGGGTT TGTTGACAGG AAAATTCACC CCCACTACCA GCTTTAGCAG CGATGATGTA CGCAGCAAAG TTGCATGGTT TGAGGGGTTT CAGGCCGGCA AGCCCAATCC TGAATGGCTC AAAAAGCTCG AAGCCTTGCG CGAAGTGTTG ACCAGCGAAG GCCGCACGCT GACTCAAGGT GCTTTAGCTT GGTTGTGGGG TCGCAGCAGC CAAACCCTGC CAATCCCAGG CTTCCGTACC GTCGCTCAAG CCAGCGAAAA CGCCAAAGCC TTACAATTCG GCCCATTGAA CCCCAGCCAA ATGCAGCAAA TCGCTAGCAT TTTGCAGGGC TAG
|
Protein sequence | MLKRTIGKSG IEVSALGLGC WAIGGPFNHN GNPSGWGQVD DAESIQAIHA ALELGVNFLD TANVYGCGHS EQIIAQAIAG RRDQVILATK FGNSWAEGSK DSFEQIPISP AEIRQQLEAS LKRLNTDYVD LLQFHLWGYP VEDAAPVRDT LESLVSEGKI RGYGWSTDRL DAIKLFAEGP HCIAVQQQLN LLHGHSTGES DAIIAFCEAH NLASINRAPL AMGLLTGKFT PTTSFSSDDV RSKVAWFEGF QAGKPNPEWL KKLEALREVL TSEGRTLTQG ALAWLWGRSS QTLPIPGFRT VAQASENAKA LQFGPLNPSQ MQQIASILQG
|
| |