Gene Haur_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4634 
Symbol 
ID5736481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5922400 
End bp5923377 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content50% 
IMG OID641281798 
Productpyruvate dehydrogenase (acetyl-transferring) 
Protein accessionYP_001547393 
Protein GI159901146 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGC AAGATTTACT TGCCGATTAC CGCACGATGG TGCTGATTCG CTCATTTGAA 
GAACATTGCC AGCAACAGTA CACCCGTGCT CGGATCGGCG GCTTCTTGCA CTTATATGTT
GGGCAAGAAG CAGTCGCGGT TGGTGCGATT GGTGCCTTGA AAGCACAAGA TCATTTAGTC
ACTCACTATC GCGACCATGG CCACGCCCTT GCTCGTGGCT TGGAACCCAA ACCTCTGATG
GCTGAATTGT TTGGCCGCAG CACTGGCACC GGTAAAGGCA AAGGCGGCTC AATGCACTTT
GCTGATAAAA ATAAAAATTT CTGGGGCGGT TACGCCATCG TTGGTGCCCA CTTGCTGTTG
GCCATGGGGA TTGCCTACTC GATCAAATAC AAGCGCGAAG TGCTTGGCCA AGCTGATCAA
GATGGTGTTG TCATGTGTTT CTTTGGCGAT GGCGCAACCA ATGGCGGCGA ATTCTACGAA
GCCGTCAGTA TGGCCGCATT ATATAAATTG CCAATCGTTT TCCTATGCGA AAACAACGAA
TTTGCCATGG GTACGCCGCT CAGCGTGCAC ACCTCGGTCA CCGAAATTCA CAAAAAAGCT
TCGCCATTTA TGCCTGGCGA ACGGGTGAAT GGCAACGACG TTGAAGAAAT GCGTGCTCGC
GCCCTTTACG CCGTCAACCA TGCCCGCACC GAAGGCCCAT ATTTCTTAGA AGCGATGACC
TATCGTCTCC GTGGTCACTC GGCTGCCGAC CCTCAAATGT ATCGAACTCG CGACGATATT
AATGCTCGGC GTTCCGGCGA CCCAATTGCT TTGCTCAAGC AAAAACTGAT CGATCAAAAC
TTGTTGACTG AAAAACAAGC CAAGCAAATC GATAAAGAAG TTGAAAAGGA AATGGATGTA
GTGGTGCAAT TTGCCGAAGA AAGCCCTGCC CCAGACCTGA GCGAAGCATG GACCGAAATC
TATTCGAAGC CGCTCTAA
 
Protein sequence
MEKQDLLADY RTMVLIRSFE EHCQQQYTRA RIGGFLHLYV GQEAVAVGAI GALKAQDHLV 
THYRDHGHAL ARGLEPKPLM AELFGRSTGT GKGKGGSMHF ADKNKNFWGG YAIVGAHLLL
AMGIAYSIKY KREVLGQADQ DGVVMCFFGD GATNGGEFYE AVSMAALYKL PIVFLCENNE
FAMGTPLSVH TSVTEIHKKA SPFMPGERVN GNDVEEMRAR ALYAVNHART EGPYFLEAMT
YRLRGHSAAD PQMYRTRDDI NARRSGDPIA LLKQKLIDQN LLTEKQAKQI DKEVEKEMDV
VVQFAEESPA PDLSEAWTEI YSKPL