Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4636 |
Symbol | |
ID | 5736483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5924475 |
End bp | 5925803 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281800 |
Product | dehydrogenase catalytic domain-containing protein |
Protein accession | YP_001547395 |
Protein GI | 159901148 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.695457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGA AACTAGAAAT GCCCAAAATG GGCTACGATA TGGTCGAAGG TACTTTGGCC AAATGGTTGA AAAAGCCAGG CGATGAGGTT TCGCGTGGTG AACCAATTGC TGAAGTCGAA ACCGATAAGG TCACGATTGA AATCGAGGCT TTTGAGGCTG GGACAATCTT AAAGTTCTTG GTCAACGAAG GCGAAACCGT GCCAGTTGGT GCGCCAATCG CCGAAATTGA CGATGGCTCA GGCGATGACG AGGCCGAAGC AGCCAATGCC AGCGTTACGC CTTCCAGCGA TGCTCCAGCA GTTGGCGAGG GTGGCGAGGC CGCTCCGCCG GCTCCTGCCG TGGTCGCTCA ACCAGAAAAA GTTGAGGCTA CACCAGCAGC CAGTGCTCCG GCAACCAGCA CTGGGCGCTT GTTTGCAACT CCAGCTGCTC GCGGTTTGGC CGAACAACGC GGCGTAGATT TGGCTGGCCT CAAGGGTTCT GGCCCTGATG GCCGAATTGT TAAGGCCGAT GTATTGGCTG CTGCCGTTGC ACCAAAGGCT GCACCTGCTG CTACCCCAGC CGCTGCGCCA GCTGCTGCAC AAGCGGCTCC AGTTGCATCA CCAGTGCCAG CACCAGTTGG CTTGATCTTC GCGCCACCAG CACCAAATTC GGTCTACACC GAGGAGCCAC TCTCGCGCTT ACGCCAAACC GCTGCCAAGC GCATGGTCGA AAGCCAACAA CAAGTGCCAC CATTCTTCGT TACTTCAACG ATTGAAATGG ATGCGATTCA AGCCTTGTTG CCTAAGTTGC GTGAAGCTCA TGGTGGCAAA CTTTCAGTGA CTGAATTGTT GCTGAAGGCT TGTGCTATCG CCTTGAAGAA GTTCCCCGCA CTCAACTCGA CCTTCGCTGG CGATAAGTTG TTGGTTCACA AAGATGTTCA CATCAGCGTG GCTGTAGCAA CCGATGCTGG CTTGTTGGCT CCAGTCGTGC GCAACTGCGA TAGCTTGAGC CTCGGCGCAA TCTCCAACCA AATGCGCGAT GTGATTGGCC GCACCCGCGA TGGCAAAGCT GGCCTCGACG ATCTCCAAGG CGGCACGTTT ACCGTCAGCA ACTTGGGGAT GTTCGATGTC ACCAACTTCA TCGCGATTAT CACGCCACCC CAAAGCGCAA TTTTGGCAGT TGGCAGCACA ATTGCCACTC CAGTTGTCCG CGATGGTGAA ATTGTGATTC GTCAATTGAT GAATGTCACG GTTTCAGCTG ACCACCGCGC CACTGATGGA GCAAGCGTTG CCCAGTTCTT GGTTGAACTC AAGAACTTGC TGCAAAACCC ATTCAAGCTC TTGCTCTAA
|
Protein sequence | MAKKLEMPKM GYDMVEGTLA KWLKKPGDEV SRGEPIAEVE TDKVTIEIEA FEAGTILKFL VNEGETVPVG APIAEIDDGS GDDEAEAANA SVTPSSDAPA VGEGGEAAPP APAVVAQPEK VEATPAASAP ATSTGRLFAT PAARGLAEQR GVDLAGLKGS GPDGRIVKAD VLAAAVAPKA APAATPAAAP AAAQAAPVAS PVPAPVGLIF APPAPNSVYT EEPLSRLRQT AAKRMVESQQ QVPPFFVTST IEMDAIQALL PKLREAHGGK LSVTELLLKA CAIALKKFPA LNSTFAGDKL LVHKDVHISV AVATDAGLLA PVVRNCDSLS LGAISNQMRD VIGRTRDGKA GLDDLQGGTF TVSNLGMFDV TNFIAIITPP QSAILAVGST IATPVVRDGE IVIRQLMNVT VSADHRATDG ASVAQFLVEL KNLLQNPFKL LL
|
| |