Gene Hlac_1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1837 
Symbol 
ID7400029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1845060 
End bp1846229 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content65% 
IMG OID643708907 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002566486 
Protein GI222480249 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.607752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.577621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCTG TTGTATACCA AGGCCCGTAC GACGTGGCTA TCGAAGAGGT AGATGACCCG 
GAGATCGAAC ACCCGAACGA CGTCGTGATC GACATCACGA CGTCGTGTAT CTGTGGCTCC
GACCTCCACA TGTACGAGGG GCGAACCGCG GCCGAGGAGG GAATCGTGTT CGGCCACGAG
AATATGGGGA TCGTGAGCGA AGTCGGTGAG GCCGTCTCGA CGCTAGAGGA AGGTGACCGG
GTTGTGATGC CGTTTAACGT CTCCTGTGGG TTCTGTCAGA ACTGCGAAGA GGGATACACC
GGTTTCTGTA CCAACGTCAA CCCCGGCTTC GCGGGCGGCG CCTACGGCTA CGTCGCCATG
GGGCCGTATC CGGGCGGACA GGCGGAGAAG CTTCGCGTGC CGTACGCCGA CCACAACGCC
CTGAAACTCC CGGAGGGCGA CGAGCACGAG GACGCGTTCT CACTGCTCGC GGACATCTTC
CCGACCGGCT GGCACGGCAC GGAACTGGCG AACCTCCAGC CGGGCGAGTC GGTGGCCATC
TTCGGGGCCG GCCCGGTCGG ACTGATGGCG GCGTACAGCG CGAAGATCAA GGGCGCGGCA
GAGATATACG TCGTCGACCA GGTCCCGAGC CGGCTGGAAC TGGCCGAGGA GAACTGCGAC
GCCACCGCCA TCGACTTCTC CGAGGGCGAC CCGGTCGACC AGATCATCGA GGAACACGGC
GGGATGGTCG ACAAGGGCGT CGACGCGGTC GGGTATCAGG CGACCGACCC CGAGGACGTC
GACACCGAAT CCGAGGACTA CTCGTACGAC CCAGCCAAGG AGAATCCGGC GGTCGTGATC
AACAGCCTCA TTCGCGTGGT CCGTCCGACC GGCGAGCTCG GTATCCCCGG CCTGTACGTG
CCGGAGGACC CGGGCGCACC GGACGACATG GCCGCGCAGG GGCGACTCGG AATCGACTTC
GGGAAGTTCT TCGAGAAGGG ACTCAAGTGC GGTACGGGCC AGTGTAACGT CAAGTCGTAC
AACCGCTACC TCCGCGACAT GATCATCGAG GGGCGCGCCG ACCCGAGTTG GGTCGTCTCC
CACCGCGTCA ACCTCGACGA GGCGCCGGAG ATGTACGAGG CGTTCGACGC CCGCGAGGAG
GGCGTCACGA AGGTCCTGCT CGAACCCTGA
 
Protein sequence
MRAVVYQGPY DVAIEEVDDP EIEHPNDVVI DITTSCICGS DLHMYEGRTA AEEGIVFGHE 
NMGIVSEVGE AVSTLEEGDR VVMPFNVSCG FCQNCEEGYT GFCTNVNPGF AGGAYGYVAM
GPYPGGQAEK LRVPYADHNA LKLPEGDEHE DAFSLLADIF PTGWHGTELA NLQPGESVAI
FGAGPVGLMA AYSAKIKGAA EIYVVDQVPS RLELAEENCD ATAIDFSEGD PVDQIIEEHG
GMVDKGVDAV GYQATDPEDV DTESEDYSYD PAKENPAVVI NSLIRVVRPT GELGIPGLYV
PEDPGAPDDM AAQGRLGIDF GKFFEKGLKC GTGQCNVKSY NRYLRDMIIE GRADPSWVVS
HRVNLDEAPE MYEAFDAREE GVTKVLLEP