Gene Hlac_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1547 
Symbol 
ID7401479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1568041 
End bp1568988 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content70% 
IMG OID643708614 
ProductD-isomer specific 2-hydroxyacid dehydrogenase NAD-binding 
Protein accessionYP_002566205 
Protein GI222479968 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC CAGATATCGT CGTGTTGCGA CAGACGATCC ACGGCTCGGG CGGCGCGGAG 
CTGGCCGCGG CGATCCGCGA GCGACTCTCC GACCGATCTG TTGCGCTGGC GCGGACGCCA
GCCGAGGAGC GCGAGCTGCT CGAAACAGCC CGGATCGCCG TCGGGCTCGA TATCGACGAG
GGGCAGTTGG CCGCCGCCGA GAACCTAGAG CTGTTCGCGT GCGTATTCGC CGGGACCGGC
CACCTCCCCC GAGACGCGCT CGCGGACCAC GGCGTCGCCC TGACGAACGC CTCCGGGGTC
CACGGACCGA ACATCGCCGA ACACGTTCTT GGGTCGATGA TCACGCACGC CCGGCAGTGG
GCGCGCGCGC ACCGCCAGCA GGAGCGCCGG GAGTGGCGGA GCTACGAGAC GACCGAGATG
TACGGCTCGA CCGTCGCCGT TGTCGGGCTC GGCGCGATCG GCTCGGCCAT CGTCGACCGG
CTGGAGCCGT TCGACGTGGA CACGGTCGGC GTCCGGTACT CGCCCGAGAA GGGCGGCCCG
ACCGACGAGG TGTACGGGTT CGACGCGTTC CACGACGCGA TCGCGGACGC CGAGTACGTG
GTGCTCGCGT GCCCGCTCAC GGAGACGACC CGCGGGCTCG TCGACGCCGA GGCGCTCCGG
ACGATGCGGG CCGATGCGAT CCTCATCAAC ATCGCGCGCG GACCGATCGT CGACACCGAC
GCACTCGTCT CCGAACTCCG GAACAACCGC ATCCGCGGGG CCGCACTCGA CGTGACCGAC
CCCGAACCAC TGCCCGAAGA CCACCCGCTG TGGGGGCTCG GTAACGTCAC GATCACCCCG
CACAACGCCG GCCACACGCC CCACTACTAC GAGCGCGTCG CCGACATCCT CGCGGAGAAC
GTCGGTCGGC TCGACGACGG CGACGACCTG AAAAACCGGG TCCTGTGA
 
Protein sequence
MSDPDIVVLR QTIHGSGGAE LAAAIRERLS DRSVALARTP AEERELLETA RIAVGLDIDE 
GQLAAAENLE LFACVFAGTG HLPRDALADH GVALTNASGV HGPNIAEHVL GSMITHARQW
ARAHRQQERR EWRSYETTEM YGSTVAVVGL GAIGSAIVDR LEPFDVDTVG VRYSPEKGGP
TDEVYGFDAF HDAIADAEYV VLACPLTETT RGLVDAEALR TMRADAILIN IARGPIVDTD
ALVSELRNNR IRGAALDVTD PEPLPEDHPL WGLGNVTITP HNAGHTPHYY ERVADILAEN
VGRLDDGDDL KNRVL