Gene Hlac_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1074 
Symbol 
ID7400146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1073388 
End bp1074365 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content67% 
IMG OID643708140 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002565739 
Protein GI222479502 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG AGTCGGGACG CGATACGAGC GTCCTCATCA CCGGCGGCTG TGGCTACATC 
GGGAGCGCGT TGATCCCGCG GTTGCTGGCG GACGAGCGCG TCGGCGACGT GGTAGTCCTC
GACTCGCTCT CGTCGGGATC GCCCGCGAAC CTCGCGGGGA GTATCCGCGA CGATCTGACC
TTTCGACGCG GCGACGTGCG GGAGTACGGT GCGGTTGAGG GAGCGGTTCG CGGCGTCGAC
GCCGTGATCC ATCTCGCGGC GATCACCGGG GCGGCCTCGA CGCACGACCG GAAGGCCGAG
ACGTTCGCGG TGAATCGCGA CGGGACCGAG AACGTGCTCA CGGCCGCGGG CAAGTTCGAT
GTCGAGAACG TCGTGGTCGC TTCCTCGTGT AACAACTACG GGCGCGCCGC GAGCACCGAC
ATCGACGAGG AGACCGAGCA GAACCCGCTG AACCCCTACG CCGAGTCGAA GGTCGCGTGC
GAGCGTCTGC TCAACGAGGC GATCGAAGCG CACGGCTTCG ACGGCACTGC CCTGCGGATG
AGCACGAACT ACGGCTGGTC GCTCGGGGTC CGGTTCAACC TCGTGGTGAA CCACTTCGTC
TTCCGCGGGC TCACCGACCG CCCCTTGACG GTCTACGGCG ACGGATCGAA CTGGCGCCCG
TTTATCCACG TGCGCGACGC GGCGCGGGCG TACGCCGACG CCGCGCTCTC CCCCGACGCG
TGGGACGAAC GCGTGTACAA CGTCGGGTCG AACGACGGCA ACTACCGGAT CGCGGAGATC
GCGGAGATCG TTCGTGAGGA GCTCGATCGC GACCTCGACG TGACGTATCT GGAGGACGAA
CAGCCCGGCC CCTCCTACCA CGTGAACTTC GACCGACTCG CCGAGACCGG CTTCGAGACG
GAGTGGACGC TGCGGGAGGG AATCCGCGAC ATCGCAGACG AGTTGCGCGG AGACGCGCGC
GAAGAGGTGA CCGCATGA
 
Protein sequence
MSEESGRDTS VLITGGCGYI GSALIPRLLA DERVGDVVVL DSLSSGSPAN LAGSIRDDLT 
FRRGDVREYG AVEGAVRGVD AVIHLAAITG AASTHDRKAE TFAVNRDGTE NVLTAAGKFD
VENVVVASSC NNYGRAASTD IDEETEQNPL NPYAESKVAC ERLLNEAIEA HGFDGTALRM
STNYGWSLGV RFNLVVNHFV FRGLTDRPLT VYGDGSNWRP FIHVRDAARA YADAALSPDA
WDERVYNVGS NDGNYRIAEI AEIVREELDR DLDVTYLEDE QPGPSYHVNF DRLAETGFET
EWTLREGIRD IADELRGDAR EEVTA