Gene Information |
Locus tag | Haur_1159 |
Symbol | |
ID | 5733052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1330754 |
End bp | 1332274 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278299 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001543935 |
Protein GI | 159897688 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTCTACA CCAATCCCAA TCAGCCGGGC AGTAAAGTCA GTTTTAAGTC ACGGTATGGC AACTACATCA ATGGCGAGTT TGTTGAGCCA GTCAAGGGCA TGTATTTTGA AAATATCAGC CCAGTTAATG GCAAGCCGTT CTGCGAAATT CCTCGTTCGA CCGCCGAAGA CATCGAAAAA GCGCTCGATG CGGCCCATGC TGCCAAAGCT GCTTGGGGTG CAACCTCACC CGCCCAACGC GCCAATATTC TGAATAAGAT TGCCGATCGC ATGGAAGCCA ACTTGGAGAT GCTGGCGGTC GCCGAAACAT GGGAAAATGG CAAGCCTGTG CGCGAAACTC TCGCCGCTGA TTTACCCTTG GCGATCGATC ACTTCCGCTA TTTTGCTGGG GTAATTCGGG CGCAAGAAGG CAGTGCCGCC ACAATCGACG AAAATACAAT TGCCTACCAT TTTTATGAGC CGCTGGGCGT GGTGGGCCAA ATTATTCCGT GGAACTTCCC GCTGTTGATG GCAACTTGGA AATTGGCTCC GGCCCTCGCT GCTGGCAATT GTGTGGTGCT CAAGCCCGCC GAACAAACGC CCAGCACAAT TTTGGTCTTG ATGGAATTGA TCGGCGATTT GATTCCAGCG GGCGTGGTCA ATGTGGTCAA TGGCTTTGGG ATTGAAGCTG GCAAGCCGTT GGCGAGCAGC AATCGGATCG CCAAAATAGC CTTTACGGGC GAAACCACCA CTGGCCGTTT GATTATGCAA TATGCCTCAG AAAATATCAT TCCTGTGACC TTAGAGCTTG GTGGCAAATC GCCCAACATC TTCTTCGAGG ATGTTTTGAG CAAGCAAGAT TCGTTTGTTG ATAAAGCGCT CGAAGGCTTC ACCATGTTTG CCTTGAACCA AGGCGAAGTT TGTACCTGTC CATCACGGGC ATTAATTCAA AAATCGATCT ATGGTGAGTT TTTAGAGCGA GCGGTCGAAC GCACCAAACG CTGCATTCAG GGCAATCCAC TTGATCCAGC CACAATGGTG GGGGCACAAG CCTCCAATGA TCAATTCGAG AAAATTTTGT CGTATTTGGC GATTGGCCGC GACGAAGGGG CTAAAGTGCT GGTTGGCGGA GCCAAAGCTG AGCTTAGCGG CGATTTAGCT GAAGGCTATT ATGTTCAGCC GACGATCTTT GCTGGCAACA ACCGCATGCG AATCTTCCAA GAAGAAATTT TCGGGCCAGT CGTTTCGGTG ACTTCGTTCG ATGATTTTGA CGATGCCTTG AGTATTGCCA ATGATACCTT GTATGGCTTG GGCGCTGGTT TGTGGACTCG CGATATGAAC ACAGCCTATC GCATGGGTCG GGCAATTCAA GCAGGCCGTG TTTGGACCAA CTGTTACCAC TTGTATCCAG CCCATGCTGC GTTCGGTGGC TACAAACACT CGGGGATTGG CCGCGAAAAC CATAAGATGA TGCTCAACCA TTATCAACAA GTTAAAAACC TGTTGGTCAG CTACGATCCC AATCCAATGG GCTTCTTTTA G
|
Protein sequence | MVYTNPNQPG SKVSFKSRYG NYINGEFVEP VKGMYFENIS PVNGKPFCEI PRSTAEDIEK ALDAAHAAKA AWGATSPAQR ANILNKIADR MEANLEMLAV AETWENGKPV RETLAADLPL AIDHFRYFAG VIRAQEGSAA TIDENTIAYH FYEPLGVVGQ IIPWNFPLLM ATWKLAPALA AGNCVVLKPA EQTPSTILVL MELIGDLIPA GVVNVVNGFG IEAGKPLASS NRIAKIAFTG ETTTGRLIMQ YASENIIPVT LELGGKSPNI FFEDVLSKQD SFVDKALEGF TMFALNQGEV CTCPSRALIQ KSIYGEFLER AVERTKRCIQ GNPLDPATMV GAQASNDQFE KILSYLAIGR DEGAKVLVGG AKAELSGDLA EGYYVQPTIF AGNNRMRIFQ EEIFGPVVSV TSFDDFDDAL SIANDTLYGL GAGLWTRDMN TAYRMGRAIQ AGRVWTNCYH LYPAHAAFGG YKHSGIGREN HKMMLNHYQQ VKNLLVSYDP NPMGFF
|
| |
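The coordinate, length, and sequence fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the source database: it assumes the start and end coordinates are 1-based and inclusive, and that the listed gene sequence includes the stop codon. All constants are copied from the record above; the function name `check_record` is hypothetical. The start and stop codon sets are those of NCBI translation table 11 (bacterial, archaeal and plant plastid code), under which GTG is an accepted initiation codon that is read as methionine at the start of a protein.

```python
# Values copied from the record above.
START_BP = 1330754
END_BP = 1332274
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506
FIRST_CODON = "GTG"  # first three bases of the listed gene sequence
LAST_CODON = "TAG"   # last three bases of the listed gene sequence

# Initiation and termination codons of NCBI translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def check_record() -> dict:
    """Recompute derived fields and compare them with the listed values.

    Assumes 1-based inclusive coordinates and a gene sequence that
    includes the terminal stop codon (both are assumptions).
    """
    span = END_BP - START_BP + 1   # inclusive coordinate span
    codons = span // 3             # total codons, stop included
    return {
        "span_bp": span,
        "span_matches_gene_length": span == GENE_LENGTH_BP,
        "length_is_multiple_of_3": span % 3 == 0,
        "expected_protein_aa": codons - 1,  # stop codon is not translated
        "protein_length_matches": codons - 1 == PROTEIN_LENGTH_AA,
        "valid_start_codon": FIRST_CODON in TABLE_11_STARTS,
        "valid_stop_codon": LAST_CODON in TABLE_11_STOPS,
    }


if __name__ == "__main__":
    for field, value in check_record().items():
        print(f"{field}: {value}")
```

Under these assumptions every check passes: the 1521 bp span corresponds to 507 codons, one of which is the stop codon, leaving the 506 residues listed under Protein Length.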