Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1976 |
Symbol | |
ID | 4711105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2178256 |
End bp | 2179710 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856449 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001003542 |
Protein GI | 121998755 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATT CACCGCTACT GCCCGAATTC AACGGCTACA TTGCCGGGGA GTGGGTGGCC GCCGACAACG GCGCCACCTT CCAGATCACC AACCCGGCCA ACGGCGAGCT GCTCGGCGAA CTGCCGGCGA TGGGCGCCCA CGAGGCCGGG CGTGCCGTCG ATGCCGCCCA CACCGCCCTG GAAGAGACCC CGGACCTGGA GACCCGGCGC GGCTGGCTGC AGGCCATCGA CACGGCCCTG CGCGAGCACC AGGAGGAGCT GGGGCGAATC CTCACCCTGG AGCACGGCAA GCCCCACGCC GAGGGCCAGG GCGAAGTGCT CTACGCCGCC GGCTTCTTCG CCTACGCGGC GCGCAACCTC GACGCGCTCG CGCCGCGCAC CCTGGACGAA CAGCCCCGCG GCTGCACCTG GACCGTGCAC AATCGCCCGG CGGGCGCCGT GGCCCTGGTC ACGCCGTGGA ACTTCCCCAT CGGCATGATC GCCAAGAAGC TCTCCGCGTC GCTGGCCGCC GGCGCGCCCG CCGTGATCAA ACCCTCCTCG AAGACGCCGC TGACCATGAC GGCCCTGTTC ACCCTGCTCC ACCGGGAGCT CGACCTGCCC GCGGGCATGG TCAACCTGGT CACCGGCGCT GCTGGCCCCA TCGGCGATGC CCTGCTGACC GACCGGCGGA TCCGGGTGTT CAGCTTCACC GGCTCCACCG ACGTCGGCCA GTCCCTGATC CGCGACAGCG CGGACGACTG CACGCGCCTG GCCTTGGAGC TCGGCGGCAA CGCCCCGTTC ATTGTCTTCG CCGACGCCGA CCTGGACTGG GCCGCGGATC AGCTCATGGC CAACAAGTTC CGGGGCGGCG GCCAGACCTG CGTCTGCGCC AACCGGATCC TGGTAGAGGA CTCGGTCATC GACCGCTTCT CGGAGAAGGT CGCCGAGCGC GCCGCCCGAT TGACCGTCGG CGACGGCATG AGCCCCGGCG TGGATCTGGG CCCGCTGATC GACCGCAGCG GCTACGACAA GGTCCGCCGC CACTTCCTGG ACGCCGTCGA ACAGGGTGCC ACCCCGGTAC TGGGCAGCGA CCCGGGCCCG CTGGAGCAGG AGCACGGCGC CTACTTCCCG CCCACGGTGG TCCGCGGCGT GACCGCCACG ATGGCCTGCT GGCAGGAGGA GACGTTTGGG CCGCTGGTGC CCATGGCCGC CTTCGGCGAC GAGGCTGAGG CGCTGGCCAT GGCCAACGAC ACCGAGTTCG GGCTCGCCGC CTACCTGTTC ACCGGCGACG ACGCCCGGGC GGAGCGGTTC ATTCCGCGCC TATCCTTCCC CCACGTGGGC TGGAACACCG GCAGCGGCCC CACCCCGGAG GCGCCCTTCG GCGGTATGAA GCTCTCCGGC TACGGCCGCG AGGGCGGCCT GGAAGGGCTG TTCGAGTTCA TCGACACCCA GACCGTGCCG CGGGTCCAGG GGTAA
|
Protein sequence | MIDSPLLPEF NGYIAGEWVA ADNGATFQIT NPANGELLGE LPAMGAHEAG RAVDAAHTAL EETPDLETRR GWLQAIDTAL REHQEELGRI LTLEHGKPHA EGQGEVLYAA GFFAYAARNL DALAPRTLDE QPRGCTWTVH NRPAGAVALV TPWNFPIGMI AKKLSASLAA GAPAVIKPSS KTPLTMTALF TLLHRELDLP AGMVNLVTGA AGPIGDALLT DRRIRVFSFT GSTDVGQSLI RDSADDCTRL ALELGGNAPF IVFADADLDW AADQLMANKF RGGGQTCVCA NRILVEDSVI DRFSEKVAER AARLTVGDGM SPGVDLGPLI DRSGYDKVRR HFLDAVEQGA TPVLGSDPGP LEQEHGAYFP PTVVRGVTAT MACWQEETFG PLVPMAAFGD EAEALAMAND TEFGLAAYLF TGDDARAERF IPRLSFPHVG WNTGSGPTPE APFGGMKLSG YGREGGLEGL FEFIDTQTVP RVQG
|
| |