Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4634 |
Symbol | |
ID | 5736481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5922400 |
End bp | 5923377 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281798 |
Product | pyruvate dehydrogenase (acetyl-transferring) |
Protein accession | YP_001547393 |
Protein GI | 159901146 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGC AAGATTTACT TGCCGATTAC CGCACGATGG TGCTGATTCG CTCATTTGAA GAACATTGCC AGCAACAGTA CACCCGTGCT CGGATCGGCG GCTTCTTGCA CTTATATGTT GGGCAAGAAG CAGTCGCGGT TGGTGCGATT GGTGCCTTGA AAGCACAAGA TCATTTAGTC ACTCACTATC GCGACCATGG CCACGCCCTT GCTCGTGGCT TGGAACCCAA ACCTCTGATG GCTGAATTGT TTGGCCGCAG CACTGGCACC GGTAAAGGCA AAGGCGGCTC AATGCACTTT GCTGATAAAA ATAAAAATTT CTGGGGCGGT TACGCCATCG TTGGTGCCCA CTTGCTGTTG GCCATGGGGA TTGCCTACTC GATCAAATAC AAGCGCGAAG TGCTTGGCCA AGCTGATCAA GATGGTGTTG TCATGTGTTT CTTTGGCGAT GGCGCAACCA ATGGCGGCGA ATTCTACGAA GCCGTCAGTA TGGCCGCATT ATATAAATTG CCAATCGTTT TCCTATGCGA AAACAACGAA TTTGCCATGG GTACGCCGCT CAGCGTGCAC ACCTCGGTCA CCGAAATTCA CAAAAAAGCT TCGCCATTTA TGCCTGGCGA ACGGGTGAAT GGCAACGACG TTGAAGAAAT GCGTGCTCGC GCCCTTTACG CCGTCAACCA TGCCCGCACC GAAGGCCCAT ATTTCTTAGA AGCGATGACC TATCGTCTCC GTGGTCACTC GGCTGCCGAC CCTCAAATGT ATCGAACTCG CGACGATATT AATGCTCGGC GTTCCGGCGA CCCAATTGCT TTGCTCAAGC AAAAACTGAT CGATCAAAAC TTGTTGACTG AAAAACAAGC CAAGCAAATC GATAAAGAAG TTGAAAAGGA AATGGATGTA GTGGTGCAAT TTGCCGAAGA AAGCCCTGCC CCAGACCTGA GCGAAGCATG GACCGAAATC TATTCGAAGC CGCTCTAA
|
Protein sequence | MEKQDLLADY RTMVLIRSFE EHCQQQYTRA RIGGFLHLYV GQEAVAVGAI GALKAQDHLV THYRDHGHAL ARGLEPKPLM AELFGRSTGT GKGKGGSMHF ADKNKNFWGG YAIVGAHLLL AMGIAYSIKY KREVLGQADQ DGVVMCFFGD GATNGGEFYE AVSMAALYKL PIVFLCENNE FAMGTPLSVH TSVTEIHKKA SPFMPGERVN GNDVEEMRAR ALYAVNHART EGPYFLEAMT YRLRGHSAAD PQMYRTRDDI NARRSGDPIA LLKQKLIDQN LLTEKQAKQI DKEVEKEMDV VVQFAEESPA PDLSEAWTEI YSKPL
|
| |