Gene Hlac_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1073 
Symbol 
ID7400145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1072354 
End bp1073391 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content64% 
IMG OID643708139 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002565738 
Protein GI222479501 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.688029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC AAGAATCCGA TATCGAGAGT GAGACGGAGC AGACGACGGA ACCGAACGGC 
GACGGGTCGA CTCCGACGAT CGCGGTCACC GGCGCGGCCG GATACATCGG AAGCCGCGTG
ATCGTGGAGT TCCAAGAGGC GCACCCCGAC TGGGAGATCG TCGCGATCGA CAACCAGTAC
CGCGGGCAGG TGGATTCGGT CGGTGAGGTG GAGATTCAGC ACGTCGACAT CCGAAACCGC
GACCGGCTGG AGGACGCGCT CGCGGGCGCG GACGTGGTGT GTCACCTCGC GGCGATAAGC
GGCGTCGACG ACTGCGAGGA GAACGCCGAC CTCGCGTACG AGGTGAACGT CACCGGGACG
AACAACGTCG CGTGGTTCTG TCGGAAGACC GGTGCGGCGC TGGCGTTCCC GTTCAGCATG
GCAGTATTGG GGGACCCGCA GTCGTTCCCG ATCACGGCCG ACCAGCCGCG CGACCCGTTG
AACTGGTACG GGCGGACGAA GCTGCTCGGC GAGCGCGCGA TCGAGACGTT CGCCGACGGC
GCGTTCCCCG CGCACCTCTT TTTGAAGTCG AACCTCTACG GCGAGCACGT CGTCGACGGG
ACGACGGTGA GCAAGCCGAC CGTGATCAAT TTCTTCGTGA ACCGGGCGCT CGCGGGCGAA
ACGCTGACCG TCTACGAGCC CGGCACGCAG GCACGGAACT TCGTCCACGT GAAGGACGTG
GCGCGCGTGT ACGTCCGGAG CGCGGAGCGG CTGCTGGAGC AGCTCGCGAG TGGGGAGACT
GGAACCGAAA CGTTCGAAAT CGCGAGTGAG GAGGACATGA GCGTGATGGA GGTCGCGGAG
ATCGTGCGGG AGGTGGCGCA CGAGGAGCGC GAGATCGACG TCGACGTGGA GTTGGTCGAG
AATCCGCGAA GTGCGGAGAC GATGGTTGAG GAGTTTGGGG TGGATATTTC GGCGGCGGGG
GAACGGTTAG GATGGGCACC AAGCGAGAGT GTGAACGAGT CAGTTCGACA TCTGTTGACT
CCAAAATCTG ATTCGTAG
 
Protein sequence
MTDQESDIES ETEQTTEPNG DGSTPTIAVT GAAGYIGSRV IVEFQEAHPD WEIVAIDNQY 
RGQVDSVGEV EIQHVDIRNR DRLEDALAGA DVVCHLAAIS GVDDCEENAD LAYEVNVTGT
NNVAWFCRKT GAALAFPFSM AVLGDPQSFP ITADQPRDPL NWYGRTKLLG ERAIETFADG
AFPAHLFLKS NLYGEHVVDG TTVSKPTVIN FFVNRALAGE TLTVYEPGTQ ARNFVHVKDV
ARVYVRSAER LLEQLASGET GTETFEIASE EDMSVMEVAE IVREVAHEER EIDVDVELVE
NPRSAETMVE EFGVDISAAG ERLGWAPSES VNESVRHLLT PKSDS