Gene Hlac_0215 details

Gene Information

Locus tag: Hlac_0215
Symbol:
ID: 7402144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 230762
End bp: 231802
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 70%
IMG OID: 643707278
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_002564890
Protein GI: 222478653
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
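
The start, end, gene length, and protein length values in the table above are consistent with one another. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted as a residue; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 230762
END_BP = 231802


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1041 346
```

The printed values match the recorded gene length of 1041 bp and protein length of 346 aa.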


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0696766
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCTG CCATTTTCAA CGGCCCCGGA GACATCGGCG TCGAAGAGCG TCCGCGCCCC 
GAGATCGAGG CCCCGACCGA CGCGATCGTC CGCGTGACCC ACACCGCGAT CTGCGGCTCG
GACCTGTGGT TCTACCGCGG CGACAGCGAC CGCGACGAGG GCTCGCCGGT CGGCCACGAG
CCGATGGGGA TCGTCGAGGA AGTCGGCAAC GAGGTCACCT CGGTCGCGCC CGGAGACCGC
GTGCTCGCGC CCTTCGCCAT CTCCTGTGGC GAGTGCGAGT TCTGCCGCAA GGGACTCCAT
ACCTCCTGCG AGAACGGGGA CTCGTGGGGC GGCGACAACG GCGGCGGGCA GGGCGAGTAC
GTTCGATCGA CTCACGCCGA CGGCACCCTC GTTCGAGTCC CCGATCGGTT TGCCGACGAC
GAGGAGACGC TCCGGTCACT GCTCCCGCTG ACCGACGTGA TGGGAACCGG TCACCACGCG
GCCGTCAATG CGGGCGTCGA GGCGGGTTCG ACCGTGGTCG TGATCGGCGA CGGCGCAGTC
GGCCTCTGCG GCGTGCTCGC GGCCCGCCGA CTCGGCGCCG AGCGGATCAT CGCGGTGGGC
CACCACGAGG ACCGACTCGA ACTCGCCGAG GAGTTCGGCG CCACGGAGAC CGTCTCGGAG
CGCGGCGAGG CCGCCGTCGA ACGGATCCAA GAGCTCACCC ACGGCGGGCC GAACCACGTG
ATGGAGTGCG TCGGCGCCGC AAGCGCGATG AACACCGCCA TCGACGTGGT CCGGCCAGGC
GGCACGATCG GCTACGTCGG CGTCCCCTAC GGCGTCGAGG AGGAGGGCCT CAACGTGTTC
GGAATGTTCG GCGACAACGT CACACTTGCA GGCGGCGTCG CGCCCGTCCG CGCGTACGCT
GAGGAACTGA TGGCGGACGT ACTGCAGGGC ACCCTCGACC CCGCGCCGGT CTTCACCGAG
ACGGTCGGCC TTGACGAGGT CGACGAGGGG TACCGCATGA TGGACGAGCG CGAGGCGATC
AAGGTGCTTG TGAAGCTCTG A
 
Protein sequence
MRAAIFNGPG DIGVEERPRP EIEAPTDAIV RVTHTAICGS DLWFYRGDSD RDEGSPVGHE 
PMGIVEEVGN EVTSVAPGDR VLAPFAISCG ECEFCRKGLH TSCENGDSWG GDNGGGQGEY
VRSTHADGTL VRVPDRFADD EETLRSLLPL TDVMGTGHHA AVNAGVEAGS TVVVIGDGAV
GLCGVLAARR LGAERIIAVG HHEDRLELAE EFGATETVSE RGEAAVERIQ ELTHGGPNHV
MECVGAASAM NTAIDVVRPG GTIGYVGVPY GVEEEGLNVF GMFGDNVTLA GGVAPVRAYA
EELMADVLQG TLDPAPVFTE TVGLDEVDEG YRMMDEREAI KVLVKL
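
The sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for comparison with the recorded GC content; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as sample input only.
sample = "ATGCGCGCTG CCATTTTCAA CGGCCCCGGA GACATCGGCG TCGAAGAGCG TCCGCGCCCC"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a figure to compare against the 70% listed in the record.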