Gene Hlac_0141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0141 
Symbol 
ID7401662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp149326 
End bp150945 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content70% 
IMG OID643707205 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_002564817 
Protein GI222478580 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.652081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTCA AAGAGTTCAA ACTGCCCGAC GTGGGCGAAG GAGTCGCCGA GGGCGAGCTG 
GTCACTTGGC TGGTCGCGCC CGGCGACCGC GTCGAGGAGG ATCAGCCGGT CGCGGAGGTC
GAGACCGACA AGGCGCTCGT CGAGGTCCCC TCGCGGTACG ACGGGACCGT CGAGGAGCTG
TTCGTCGAGG AGGGAGATAT CGTCCCCGTC GGCGACGTGA TCATCTCGTT CCGCGTCGGA
GAGGACGGTG AAGACGTCGA GGCGGGAGGA GACGACTCCG CAGAGACGGG AGCCGACGCC
ACGGAGCCGG AGCCGGAGAC GGACATCGGC GCCGAGACCG ACGCGGAATC CGACGCCGAG
ACGGAGCCCG ACACCCCGCC CGGCCGGACC TTCGCGCCGC CGTCGGCCCG TCGACTCGCC
CGCGAACTGG GCGTCGACAT CGCCGTCGTC GACGGGAGCG GTCCCGGCGG TCGGATCGGC
GAGGCCGACG TGCGGGCGCA CGCGGAGGGC GGTGGCGACC ACGCTGGCGC CGACGCGGGC
GATTCTGGCT CCGACAAGGC ACCGGCCCCG ACCCCGACCG ACGTGGGATC CAGTGACCGG
AAATCCGCCG TACACAAGCG CGGTGACGAC GGATCCGCCG AATCCTCTGC GGACGCCCCG
TCCGCTGCCG GGGCTCCCGA GTCGGCCGGC CGCGAGACGA CGCTCGCGAC GCCCGCGACC
CGGAAGGTCG CCCGCGAACT GGGCGTCGAC ATCGACGACG TGCCGACCGA CGAGACCCGG
GACGGCGAGG CGTTCGTCAC CGGCGAGATC GTTCGGGCCT ACGCGGAGGC GCTGGAGTCG
GGGGCGTCGC CCGCGGCGGA CGCGGTCGAT ACGTCGGCGC CCGAACCGAA ATCCGCCGAT
GCCTCGCTGA GCGCCCCTGG CTCGGCCGAC GAGACCATCC CCTATCGCGG GGTGCGGCGT
ACCATCGGAA AGCAGATGGA GCGGTCGAAG TACACCGCTC CGCACGTCAG CCACCACGAT
ACCGCCGAGG TCGACGGGCT GGTGGCGGCG CGGGAAGAAC TGAAACGGCG CGCGGAGGAG
CAGGGCGTGA AGCTCACCTA CATGCCGTTC GTGATGAAGG CGATCGTCGC GGGGCTGAAG
GAGTACCCGT CCCTCAACAG TGAGCTTCGC GAGGACGACG AGGAAATCGT GTTGAAGGGC
GACTACAACC TCGGGATCGC CGTCGCGACC GACGCCGGGC TGATGGTGCC GGTCGTCGAG
AACGTCGACG AGAAGGGGCT CTTCGAGCTG GCGGAGGAGG TCAGAGATCT CGCTTCGCGC
GCCCGCGAGC GCAAGCTCAC GCCGGCGGAG ATGAAGGGCG GCACGTTCTC GATCACCAAC
TTCGGCGCCA TCGGCGGGGA GTACGCCACG CCGATCATCA ACTATCCCGA GACCGCGATC
CTCGGGCTGG GCGCCATCGA GGAGCGCCCG GTCGTGCGCG ACGGTGAGGT CGTCGCGGCG
CCGACGCTTC CGCTTTCGCT GTCGATCGAC CATCGCGTGA TCGACGGCGC GGTCGCGGCC
GAGTTCGCGA ACACCGTGAT GGAACACCTT GAACATCCGC TGCTACTGTT GACTCAATAA
 
Protein sequence
MPVKEFKLPD VGEGVAEGEL VTWLVAPGDR VEEDQPVAEV ETDKALVEVP SRYDGTVEEL 
FVEEGDIVPV GDVIISFRVG EDGEDVEAGG DDSAETGADA TEPEPETDIG AETDAESDAE
TEPDTPPGRT FAPPSARRLA RELGVDIAVV DGSGPGGRIG EADVRAHAEG GGDHAGADAG
DSGSDKAPAP TPTDVGSSDR KSAVHKRGDD GSAESSADAP SAAGAPESAG RETTLATPAT
RKVARELGVD IDDVPTDETR DGEAFVTGEI VRAYAEALES GASPAADAVD TSAPEPKSAD
ASLSAPGSAD ETIPYRGVRR TIGKQMERSK YTAPHVSHHD TAEVDGLVAA REELKRRAEE
QGVKLTYMPF VMKAIVAGLK EYPSLNSELR EDDEEIVLKG DYNLGIAVAT DAGLMVPVVE
NVDEKGLFEL AEEVRDLASR ARERKLTPAE MKGGTFSITN FGAIGGEYAT PIINYPETAI
LGLGAIEERP VVRDGEVVAA PTLPLSLSID HRVIDGAVAA EFANTVMEHL EHPLLLLTQ