Gene Hlac_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0140 
Symbol 
ID7401661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp148337 
End bp149323 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content67% 
IMG OID643707204 
ProductTransketolase central region 
Protein accessionYP_002564816 
Protein GI222478579 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA CACAGAACCT TACCGTGGTA CAGGCGGTTC GAGACGGATT GTACACCGAG 
ATGCGCGAGG ACGACGACGT CCTCGTGTTG GGCCAAGATG TGGGGAAGAA CGGCGGCGTC
TTCCGCGCGA CCGAGGGCCT GTTCGACGAG TTCGGCGGCG ACCGCGTCGT CGACACCCCG
CTTGCAGAGT CGGGGATCGT CGGTGCCGCC GTCGGCATGG CCGCGATGGG ACTCAAACCC
GTTCCCGAGA TCCAGTTTTC GGGGTTCATG TATCCCGGTT TCGACCAGAT CGTCTCCCAC
ATGGCCCGCT TCCGGACGCG AAGCCGGGGG CGATTCAACC TGCCGATGAC CCTCCGCGCC
CCGTACGGTG GCGGAATTCG GGCGCCGGAG CACCACTCCG AGTCGAAGGA GGCGTTTTAC
GCCCACGAGG CCGGGCTGAA GGTCGTCATC CCCTCGACCC CGTACGACGC GAAGGGGCTG
CTCGCGGCGT CGATTCGCGA CCCCGACCCG GTGATCTTCC TCGAACCGAA GCTCATCTAC
CGGGCGTTCC GCGGCGAGGT GCCCGAGGAG CCGTACACCG TTCCCATCGG TGAGGCGGTC
ACCCGCCGTG AGGGCGGCGA CGTGGCGGTG TTCACCTACG GCGCCATGAC GCGCCCGACG
CTCGAGGCCG CTGAGACCCT CGCCGAGGAG GGGATCGATT GCGAGGTCGT CGACCTCCGA
ACCGTCTCAC CGCTCGACCG CGAGGCGATC ATCGAGGCGT TCGAGGCCAC CGGGCGTGCC
GTCGTCGTCC ACGAAGCCCC GAAGACGGGG GGGCTCGCCG GCGAGATCAC GGCGATCATT
CAGGAGGAGG CGCTCCTGTA TCAGGAGGCG CCCGTGAAGC GCGTCACCGG ATTCGACGTG
CCGTACCCGC TGTACGCGCT GGAGGACTAC TACCTCCCGA CCGCGACCCG CATCGAGGAG
GGTATCAGAG AGGCGGTGGA GTTCTGA
 
Protein sequence
MSETQNLTVV QAVRDGLYTE MREDDDVLVL GQDVGKNGGV FRATEGLFDE FGGDRVVDTP 
LAESGIVGAA VGMAAMGLKP VPEIQFSGFM YPGFDQIVSH MARFRTRSRG RFNLPMTLRA
PYGGGIRAPE HHSESKEAFY AHEAGLKVVI PSTPYDAKGL LAASIRDPDP VIFLEPKLIY
RAFRGEVPEE PYTVPIGEAV TRREGGDVAV FTYGAMTRPT LEAAETLAEE GIDCEVVDLR
TVSPLDREAI IEAFEATGRA VVVHEAPKTG GLAGEITAII QEEALLYQEA PVKRVTGFDV
PYPLYALEDY YLPTATRIEE GIREAVEF