Gene Hlac_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1104 
Symbol 
ID7400913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1109885 
End bp1110817 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content64% 
IMG OID643708170 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002565769 
Protein GI222479532 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.949606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGATC AGCGCGTTCT CGTGACGGGT GGCGCGGGCT TTATCGGCTC GAATCTCGCG 
AACCGGCTCG CGGCCGACAA CGACGTGATC GCCGTCGACG ACACCTATCT CGGGACGCCC
GAGAATCTCG ACGACAACGT GGAGTTCGTC GAGGCCGATG TGATCGACGA CGACTTCCCC
GCCGATGTCG ACGTTCTGTT CCACCTCGCC GCGCTCTCCT CGCGGAACAT GCACGAGAAC
GACCCTCAGC GCGGCTGCCG CGTCAACGTC GAGGGGTTCG TCAACGCGGT TGAGCGCGCT
CGACAGGAAG GGTGCGAGAC CGTCGTCTAC GCCTCGACCT CCTCGATCTA CGGGAACCGC
ACCGAGCCCT CCCCTGTGGA CATGGATGTC GAGGCGCGCA CCGCCTACGA GGCCTCGAAG
CTCGCCCGCG AGCGCTACGC TGAGTACTAC GGCAATTATC ACGACATGGC GATGGCCGGC
CTCCGGTTCT TTTCCGTCTA CCAAGGATTT GGGGGCAACG AGAAACACAA AGGCGAGTAC
GCGAACACGG TTGCGCAGTT CGCGGACGCG ATCGCGAACG GCGAGGCCCC CGAGCTGTTC
GGCGACGGCA GCCAGACGCG CGACTTCACG CATGTCTCGG ACGTGGCGCG TGCCTGCGAA
CTCGCCGCCG ACCACGAGCT AACGGGCGTG TACAACGTCG GTACCGAGGA AGCCTACTCG
TTCAACGAGA TGGTAGCCAT GATCAACGAC GCGCTCGGCA CCGACATCGA TCCGGTGTAC
ATCGAGTGCC CCTTCGACGG CTACGTCCAC GACACCATGG CGGACTACTC GACGTTCCAC
GAGGCGACCG GCTGGGAACC CGAGATCGGT TTCGAGGAGG GCGTCGAACT CGTGTGTGAG
CCGTACTCCG AGCGCGCTCC CGAGGAGGTG TAG
 
Protein sequence
MHDQRVLVTG GAGFIGSNLA NRLAADNDVI AVDDTYLGTP ENLDDNVEFV EADVIDDDFP 
ADVDVLFHLA ALSSRNMHEN DPQRGCRVNV EGFVNAVERA RQEGCETVVY ASTSSIYGNR
TEPSPVDMDV EARTAYEASK LARERYAEYY GNYHDMAMAG LRFFSVYQGF GGNEKHKGEY
ANTVAQFADA IANGEAPELF GDGSQTRDFT HVSDVARACE LAADHELTGV YNVGTEEAYS
FNEMVAMIND ALGTDIDPVY IECPFDGYVH DTMADYSTFH EATGWEPEIG FEEGVELVCE
PYSERAPEEV