Gene Hlac_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0139 
Symbol 
ID7401660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp147209 
End bp148336 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID643707203 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_002564815 
Protein GI222478578 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATGAAG AGGTGGGTTC GACCGTGAGC GTGTTCGACA GAGCGTACGA CGATCCGGTC 
CGCGTTCTCG ACGAGGGCGG CGAGGTCGTC GGCGACGTGC CGGACCTCGA CGACGAGGCG
CTCGTCGAGA TGTACCGGGA CATGCGGCTG GCGCGGCACT TCGACGGTCG CGCCGTGAGC
CTCCAGCGGC AGGGGCGGAT GGGGACGTAT CCGCCGCTGT CCGGACAGGA GGGCGCGCAG
ATCGGCTCCG CGACCGCGCT CGACGAGGAC GACTGGATGG TCCCCTCGTA CCGCGAGCAC
GGCGCGGCGC TCGTTCGTGG ACTCCCGCTG AAACAGACCC TACTCTACTG GATGGGCCAC
GAGGCGGGCA ACGCGACGCC CGAGGGCGTG AACGTGTTCC CGGTCGCGGT CCCCATCGCC
TCGCAGGTCC CGCACGCCAC CGGTGCGGCG TGGGCGTCGA AGCTCCGCGG CGAGAACGAC
GCGTTCCTCT GTTACTTCGG GGACGGCGCG ACCAGCGAGG GCGACTTCCA CGAGGGGGTC
AACTTCGCGG GCGTGTTCGA TACGCCGACC GTCTTCTTCT GTAACAACAA CCAATGGGCC
ATCTCCGTGC CCCGCGAGCG ACAGACGCGG AGTGCGACGC TGGCCCAGAA GGCGGAGGCG
TACGGGATCG ACGGGGTACA GGTCGACGGG ATGGACCCGT TGGCGGTGTA CAGCGTCACG
AAGGCAGCCG TCGAGAAGGC GCGTGACCCC GAGACCGACC GACCTCGCCC GACGCTGATC
GAGGCGATCC AGTATCGGTT CGGCGCGCAC ACGACCGCCG ACGATCCGAC GGTCTACCGC
GACGACGACG AGGTCGAGAG CTGGAAACGG AAGGACCCGA TCCCGCGACT CGAACGCTAC
CTCCGGTCCG AGGGCGTGCT CGACGACGAG CGCGTCGCGG AGATCGAGAC CGCCGTCGAG
ACACGGGTGG CAGAGGCCAT CGAGGCGGCC GAGTCGGAGG TGCGGCCGAA GCCACAAGAG
ATGTTCGAGC ACGCGTACGC GGAGCTCCCA CCCGAGCTAG AGCGGCAGTA CGAGGAGTTC
GCGGCGTTCC GCGAGGCACA CGGCGACGAA GCATTCTTGG AGGAGTGA
 
Protein sequence
MYEEVGSTVS VFDRAYDDPV RVLDEGGEVV GDVPDLDDEA LVEMYRDMRL ARHFDGRAVS 
LQRQGRMGTY PPLSGQEGAQ IGSATALDED DWMVPSYREH GAALVRGLPL KQTLLYWMGH
EAGNATPEGV NVFPVAVPIA SQVPHATGAA WASKLRGEND AFLCYFGDGA TSEGDFHEGV
NFAGVFDTPT VFFCNNNQWA ISVPRERQTR SATLAQKAEA YGIDGVQVDG MDPLAVYSVT
KAAVEKARDP ETDRPRPTLI EAIQYRFGAH TTADDPTVYR DDDEVESWKR KDPIPRLERY
LRSEGVLDDE RVAEIETAVE TRVAEAIEAA ESEVRPKPQE MFEHAYAELP PELERQYEEF
AAFREAHGDE AFLEE