Gene Hlac_0797 details


Gene Information

Locus tag: Hlac_0797
Symbol:
ID: 7400762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 799084
End bp: 800154
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 67%
IMG OID: 643707862
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_002565466
Protein GI: 222479229
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
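
The length fields in this record follow from the coordinates: with inclusive 1-based coordinates, the gene length is End bp minus Start bp plus one, and the protein length is the codon count minus the stop codon. The following is a minimal Python sketch of that consistency check. It assumes inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed in the record.
    length = gene_length(799084, 800154)
    print(length, protein_length(length))  # prints "1071 356"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.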


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCGA TCGCCGTGTA CGACGGGGCA GACGAACCGG TCGTCACAGA GAAGCCGCGA 
CCGGAGCCAG CGCCCGGGGA GGCGCTGGTT CGAACCCTCC GGGTCGGCGT CGACGGGACC
GACCACGAGG TCATCTCCGG GAGCCACGGC GGCTCTCCCG AAGGCGAGGA TCACCTCGTG
TTGGGTCACG AGGCAGTCGG CGTGGTTGAA GAGCCGACCG ACACGCCGTT CGAGGTCGGA
GATATTGTGG TACCGACGGT CAGGCGACCG CCCAACGGGG CGAACGAGTA CTTCGCTCGC
GGCGAGCCAG ACATGGCGCC GGACGGGCAG TACCACGAGC GCGGCATCGT CGGGGCCCAC
GGATTCATGG CGGAGTACTT CACCAGTCCC GCGGAATTCC TCGTCGAGAT CCCGCCGGCG
CTGGCTGAGT GGGGGTTCCT CGTCGAACCC GTCTCTATCG CGGAGAAGGC GATCGAACAC
GCCTACGCCA GCCGGTCCGC GTTCCACTGG GAGCCGGAGT CGGCGTTGAT TCTCGGAAAC
GGCTCGCTCG GGCTGCTGAC GGTCGCGACT CTCGACGACG GGTTTGACCG GATCTACTGT
CTCGGCCGCC GCGAGCGCCC GGACCCGACG ATCGATATCA TCGAGTCGCT CGACGCGACG
TACGTCAACT CCAACGAGAC GCCCGTCCCC TCGGTGCCGG CGGCCCACGA GCCGATGGAC
TTCGTCTTCG AGGCGACCGG CTACGCCCCG CACGCCTTCG AGACGATCGA GGCGCTCGCG
CCGAACGGGG TGGGCGCGCT GCTCGGGGTC CCGGGCGACT GGGAGTTCGA GATCGACGGC
GGCCGACTCC ACCGGGAGTT CGTCCTTCAC AACAAGGCGC TCGTCGGCAG CGTCAACTCC
GGCTACGAGC ACTTCGAGGC CGCCGTCGAC TCGCTGTCCC GCTTTTCCGA GACGTTCCTC
GACGATCTCG TCACGGGCGT GCACGGGCTC GACGAGTTCG AGGCCGCGTT CGCGGATGAC
GACACGACTA TTAAAACGGC GGTCGAATTC GGTACATATG AAGAACGTTG A
 
Protein sequence
MNAIAVYDGA DEPVVTEKPR PEPAPGEALV RTLRVGVDGT DHEVISGSHG GSPEGEDHLV 
LGHEAVGVVE EPTDTPFEVG DIVVPTVRRP PNGANEYFAR GEPDMAPDGQ YHERGIVGAH
GFMAEYFTSP AEFLVEIPPA LAEWGFLVEP VSIAEKAIEH AYASRSAFHW EPESALILGN
GSLGLLTVAT LDDGFDRIYC LGRRERPDPT IDIIESLDAT YVNSNETPVP SVPAAHEPMD
FVFEATGYAP HAFETIEALA PNGVGALLGV PGDWEFEIDG GRLHREFVLH NKALVGSVNS
GYEHFEAAVD SLSRFSETFL DDLVTGVHGL DEFEAAFADD DTTIKTAVEF GTYEER
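
Both sequences are printed in blocks of ten characters, so the spacing has to be removed before any downstream use. The following is a minimal Python sketch for joining the blocks and computing a GC fraction. It assumes whitespace is the only formatting present; the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as listed in the record.
    first_line = (
        "ATGAATGCGA TCGCCGTGTA CGACGGGGCA "
        "GACGAACCGG TCGTCACAGA GAAGCCGCGA"
    )
    seq = clean_sequence(first_line)
    print(len(seq), seq[:3])  # prints "60 ATG"
    print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

The GC fraction of a single line will generally differ from the whole-gene GC content field; passing the full gene sequence to `clean_sequence` gives the figure for the complete gene.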