Gene Hlac_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1075 
Symbol 
ID7400147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1074358 
End bp1075506 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content66% 
IMG OID643708141 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002565740 
Protein GI222479503 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCC TACTCACCGG TGCAGACGGG TACCTCGGAT GGCCGACCGC GCTTCGACTG 
GCAGACCGGC TCGACGAACG GATCGTCTGC GTCGACAACT TCGCGCGACG CAGTTGGGTC
GCCGAGTCGG GGAGCGTCTC CGCGACGCGG GTCGAGAGCC CCGAGGAGCG ATTCGACGCG
GTGGAGAACC TGAGTCTCGT GGAAGGCGAC CTGGCCGACC GCGACTTCGT ACTCCAGCTG
TTGGAGACGT ACGAGCCGGA CACCGTGCTG CACGCGGCCG CGCAGCCGAG CGCGCCCTAC
TCGTCGATCA ACGGCGAGCG CGCGCTGTAC ACCCAGCGGA ACAACGTCTC GATGAACCTC
AACCTGCTCC ACGGGCTCGC CGAGTGCGGG CTCGACGACA CGCACTTCAT CGAGACGACG
ACGACGGGCA TCTACGGCGC CCCGCACTTC CCGATCCCGG AGGGCGGGCT GGAGGTCGAG
CGGAAAGACG GCAGCGACGA GGTCCCGTTC CCGGCGATGG GCGGGAGCTG GTACCACCAG
ACGAAGTCGT TCGACGCGGC GAACATGCGG CTCGCGGAGT CGCAGTTCGA GTTCCCGATG
AGCGAGGTTC GGACCGCGAT CGTGTACGGG ACGGAGACCG AAGAGACACA GGCGCACGAG
AGCCCGACGC GGTTCGACTT CGACTACTAC TTCGGCACGG TCGTGAACCG CTTCTGCGCG
CAGGCGGTCG CCGGCTACCC GATCACCGTC TACGGCAAGG GCGAACAGCG CAAGCCGATG
GTGAGCCTCG AAGACACCGT CGAGAGCCTC GTCCGGCTCG TCGAGGAGGG ACACTCCGGC
GACGACGGGA TCGACATCTA CAATCAGGTC ACCCGCCCGG TCGCCATCGT CGAGCTCGCG
GAGACGATCG CCGAGGTCGG CGACGAGTTC GACCTCGACG CCGCGGTGAA ACACTACGAG
AACCCGCGCA ACGAGGACGA GGAACACAAG ATGGAGATGG AGAACGACCG GTTCCTCGAT
CTGGTCGGCG GACAGCAGCA GACCTTGGAA GAGGGGATCC GCGATGTGCT CGGAACGCTC
GTCGACGAGC AGGACCGGAT CGCGGCCCAC GAGGACCGGT TCCTGCCCGG CGTGTTGACT
GATGAGTGA
 
Protein sequence
MTVLLTGADG YLGWPTALRL ADRLDERIVC VDNFARRSWV AESGSVSATR VESPEERFDA 
VENLSLVEGD LADRDFVLQL LETYEPDTVL HAAAQPSAPY SSINGERALY TQRNNVSMNL
NLLHGLAECG LDDTHFIETT TTGIYGAPHF PIPEGGLEVE RKDGSDEVPF PAMGGSWYHQ
TKSFDAANMR LAESQFEFPM SEVRTAIVYG TETEETQAHE SPTRFDFDYY FGTVVNRFCA
QAVAGYPITV YGKGEQRKPM VSLEDTVESL VRLVEEGHSG DDGIDIYNQV TRPVAIVELA
ETIAEVGDEF DLDAAVKHYE NPRNEDEEHK MEMENDRFLD LVGGQQQTLE EGIRDVLGTL
VDEQDRIAAH EDRFLPGVLT DE