Gene Hlac_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1238 
Symbol 
ID7399506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1249702 
End bp1251867 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content65% 
IMG OID643708302 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_002565900 
Protein GI222479663 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.768724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0266527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACTG AGGGAGACGA GCCGGTGAAG ACCATCTGCC CGTACTGCGG CGTCGGCTGC 
GGGATCAAAG TGAACCAGGG CGACGACCCC GGCGACGTGA GTTTCATGCC GTGGGGGGAG
GCGCCGGTCA ACGAGGGGCG GGTGTGTATC AAGGGCGGCG CGGCGACGCA GGTCGTCGAC
CACGAGGACC GCCTGACGGA GCCGCTGATA AAGGAAGATG GCGAGTTCCG CGAGGCGACG
TGGGAGGAAG CCTACTCGCG GATCGTCTCG GAGATGGAGC GGATCCGCGA CGAGAACGAC
CCAGACGCGA TGGGATTCTT CGGCTCCTCG AAGACGATGA ACGAGGAGAA CTACCTCCTC
CAGAAGATCG CGCGCCGATA CGGCACGAAC AACGTCGACA ACTGCACGCG GATGTGCCAC
GCCTCGACGG TGTGGGCGCT CCGGAGGAGC TTGGGGGCGG GCGCGATGAC GAACAGCATG
GTCGACCTAG AGGAATCGGC CGACGTGTTC TGGATCCAAG GGGCGAATCC CGGCGAACAA
CACCCGATCG CCAACAGCCA GTACTTCCGG CAGGCCGTCT TGGAGGGTGC GACCGTCATC
CAGGTCGACC CGCACGCCAA CAAGACCACC CGGTCGTTCA AGATCGGCGA GACCGACCGG
CACATGCACC TTCAGGTGAA CCCCGGCGCC GATATTCCCC TGCTCAACAT CGTCTTGAAG
ACGATCCTTG AACGCCACGA GGAAGAGCCG GACGCGGGCT GGATCGACGA GGCGTTCATC
GACGAGCGCA CCGAGGGGTT CGATCACTTG AAAGAGACCC TCGAAGACTT CGACAAGGAG
GCGGCCGCGG AGGAGGCCGG CGTCCCCCTC GAAGACATCG AACTCGCCGC CGAGAAGTAC
GCGATGGCGA ACAACGCCGC CATCTTCACC GGGATGGGGA TGAGCCAGCA CACCTGCGGC
GTCGACAACG TGCAGAACGA GATCAACCTC GCGCTGATCA CTGGGAACCT CGGGAAGCCC
GGCACCGGCG TCAACCCGCT TCGTGGACAG AACAACGTCC AAGGGACCAG CGACGTGGGT
GCGATGCCGA ACGTCCTCCC CGGCTACCAG CCCGTCAACG ACGACGAGGC CCGCGGGAGC
GTCGAGGACG TGTGGGGGTT CGAGGTGCCC GACGAGCCCG GGCTCACCAA CGTGGAGATT
TCCCACGAGG CGGGTCACTC GGTGAAGGGG CTGTACGTGA TGGGCGAGAA CCCGATCATG
AGCGAGCCCG ACGGCAACGA GGTCGAAGAG CGCTTGAAGT CGCTGGAGTT CATGGTCGCA
CAGGACATCT TCATGACCGA GACCGCGGAG TTCGCGGACG TGGTCCTCCC GGCGACGACG
TGGGCGGAAC GCGGCGGCAC AGTCACCAAC ACCGACCGCC GAGTCCAACG CATGCGCGGC
GCCGAGATGG TCCACGAGAA CACGAAACAC GACCTCGACA TCCTGATGGA GGTCGGGAGC
CGCCTCTTTA GCGAAGACGA GTTCCGCTTC GACGACGTGG AGGCCGTCTT CGAGGAGCTG
CGCGAGGTGT GTCCGATCTA CCACGGGATG ACCTACGACG CGCTCGGCGA GACCGGGATC
CAGTGGCCCT GCTACGAGGA GGGCGACCAG GGCGACCAGT ACCTCTACGA GGACTCCTTC
GACACAGAGA GCGGGCTCGG ACATATCGAG GGCGTCCGCC ACCAGCCACC GGCGGAGGTG
CCCGACGAGG AGTACCCGCT GATCCTCACC ACTGCGCGGC TCGAAGAGCA CTACAACACG
GGGACGATGA GCCGGCGCTC GCCGACGCTG ATGCGACAGC ACCCGGAGAA CTTCGTCGAC
GTGCACCCGA ACGACGCCGA AGAGTACGGG ATCGAGGACG GCGACATGGT GACGCTCCGG
TCGCGACGCG GCGAGATCGA AGTGAAAGCG CAGGTGACCG AGGACATCAA GGAGGGCGTC
GTCTGGACGA CGCCGCACTT CGCGGCCGCC TCCGCGAACC GGCTCACGAA CGACGTGCTC
GACGAGCGGG CGAAGATACC CGAGTACAAG GCCGCGGCGG CGGACATCGC GGTCACCGTC
TCTGACGGCG GGGAACGCGT GGACGACGCT GAGCCCGACG CGGGTTCGGA GCCGGGCGAC
GACTGA
 
Protein sequence
MSTEGDEPVK TICPYCGVGC GIKVNQGDDP GDVSFMPWGE APVNEGRVCI KGGAATQVVD 
HEDRLTEPLI KEDGEFREAT WEEAYSRIVS EMERIRDEND PDAMGFFGSS KTMNEENYLL
QKIARRYGTN NVDNCTRMCH ASTVWALRRS LGAGAMTNSM VDLEESADVF WIQGANPGEQ
HPIANSQYFR QAVLEGATVI QVDPHANKTT RSFKIGETDR HMHLQVNPGA DIPLLNIVLK
TILERHEEEP DAGWIDEAFI DERTEGFDHL KETLEDFDKE AAAEEAGVPL EDIELAAEKY
AMANNAAIFT GMGMSQHTCG VDNVQNEINL ALITGNLGKP GTGVNPLRGQ NNVQGTSDVG
AMPNVLPGYQ PVNDDEARGS VEDVWGFEVP DEPGLTNVEI SHEAGHSVKG LYVMGENPIM
SEPDGNEVEE RLKSLEFMVA QDIFMTETAE FADVVLPATT WAERGGTVTN TDRRVQRMRG
AEMVHENTKH DLDILMEVGS RLFSEDEFRF DDVEAVFEEL REVCPIYHGM TYDALGETGI
QWPCYEEGDQ GDQYLYEDSF DTESGLGHIE GVRHQPPAEV PDEEYPLILT TARLEEHYNT
GTMSRRSPTL MRQHPENFVD VHPNDAEEYG IEDGDMVTLR SRRGEIEVKA QVTEDIKEGV
VWTTPHFAAA SANRLTNDVL DERAKIPEYK AAAADIAVTV SDGGERVDDA EPDAGSEPGD
D