Gene Hlac_1919 details

Gene Information

Locus tag: Hlac_1919
Symbol:
ID: 7399871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1922259
End bp: 1923518
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 72%
IMG OID: 643708990
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_002566567
Protein GI: 222480330
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
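
The coordinate and length fields above agree with one another, and this can be checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1922259, 1923518)
print(length, protein_length(length))  # prints: 1260 419
```

Under these assumptions the result matches the listed Gene Length (1260 bp) and Protein Length (419 aa).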


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.266603
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGA CGACGCGCGA GGCGGTTCGA ACGCAAGCGC GATCGCACCG GGCACGGCTG 
CTTTCGGTCC TGCGCGTGCA GTACCCCGAC GCGCTCGCCG ACCGCGGGTA TCGCCTCATT
CACCCGACGA CCGGAGATCC CTACGCCGGA GACCGGCGGC ACCTCATCGC GACCTGCCGA
TCGGTCGCCA ACTTCGCGGT CGGCGCGCTC GCCGACGGCC CCGACTGGTG TCTCGACGCC
GCCGAGCACG GCCTGCAATT CCTCCGGGAG GCGCACCGCG CCGACGACGG GGGGTATCAC
CTCGTCGTCG ACGCGGAGGG CGAGCCGGTG GATCGGACGC GGTCGGCGTA CGGCCACGCG
TTCGTTCTAC TGGCGTACGC CCGCGCGGTC GACGCCGGGA TCGAGGGCGC CGAGCGCGAC
CTCGACGCGA CCCGAGAGCT GATCGACGAC CGGTTCCGCG ACGACCGGGG ACTCCTCCGG
AGCGACTGCG ACGCCGACTG GACCGAGCGA GAGCCGTACC GCGGCCAGAA CGCGAACATG
CACGCCTGCG AGGCGTTCCT CGCCGCCTAC GAGGCGACGG ACGAGGCGAG ATACCTCGAC
CGCGCGCGTC ACATCGCCGA GGCGATCACG GTCGACCTCG CCGCCGAGAC CGACGGTCTG
CTGTGGGAGC ACTACACCGC CGACTGGGAG CACGACTTCG CGTACAACGT GGACGAGCCG
CGCCACCAGT TCCGGCCGCC GGGGTACCAG CCGGGCCACC ACGCGGAGTG GGCGAAGCTC
CTCGCGCTGC TCGACCGGTA CGAGGGCGAG GAGGGTGAGG ATGGAGAGAG TGAGGATCCA
GCCGCGACCA TCGACTGGTA CACCCGCGCC CGCGAACTGT TCGACGCCGC AGTCGACCGC
GGCTGGTCGG AGAACGGATT CGTGTACACC CACGCGGCCG ACGGGTCGCC GATCGTCGCC
GATCGATACG GGTGGGCGCT CGCGGAGGCG ATCGGCGCGT CCGCGGCACT GGCCGAGCGT
GCGGCGGCTC GCGGCGACGC CGACGAGGCC GATCGGCTCC GGAACTGGCA TCGGCGGTTC
CTCGTTCGGA CCGACCTGTT CCGCGGCCCG GCGGGCGTCT GGTACGAGAA GCGCCTGCCC
GCGGACGCCG ACGGCGACCT CGTCGCACAG GACCCGCCCG GCGTCGAACC CGACTACCAC
CCGGCCGGCG CGTTCTTCGA GGGGTGGCGC TCCGCGCGGG GAGAGCTGTC TGACGGGTGA
 
Protein sequence
MNETTREAVR TQARSHRARL LSVLRVQYPD ALADRGYRLI HPTTGDPYAG DRRHLIATCR 
SVANFAVGAL ADGPDWCLDA AEHGLQFLRE AHRADDGGYH LVVDAEGEPV DRTRSAYGHA
FVLLAYARAV DAGIEGAERD LDATRELIDD RFRDDRGLLR SDCDADWTER EPYRGQNANM
HACEAFLAAY EATDEARYLD RARHIAEAIT VDLAAETDGL LWEHYTADWE HDFAYNVDEP
RHQFRPPGYQ PGHHAEWAKL LALLDRYEGE EGEDGESEDP AATIDWYTRA RELFDAAVDR
GWSENGFVYT HAADGSPIVA DRYGWALAEA IGASAALAER AAARGDADEA DRLRNWHRRF
LVRTDLFRGP AGVWYEKRLP ADADGDLVAQ DPPGVEPDYH PAGAFFEGWR SARGELSDG
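
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how that cleaning, a start/stop codon check, and a GC-fraction calculation might be done. The helper names are hypothetical, and only the first two blocks and the last block of the gene sequence are used as demo input; paste the full sequence to compare against the listed GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks and last block of the gene sequence, used only as a demo.
first_blocks = clean_sequence("ATGAACGAGA CGACGCGCGA")
last_block = clean_sequence("TGACGGGTGA")

print(first_blocks.startswith("ATG"))  # True: begins with a start codon
print(last_block.endswith("TGA"))      # True: ends with a stop codon
print(gc_fraction(first_blocks))       # GC fraction of the demo fragment only
```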