Gene Hlac_2866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2866 
Symbol 
ID7399102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp125506 
End bp126654 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content64% 
IMG OID643706686 
Productgalactonate dehydratase 
Protein accessionYP_002564312 
Protein GI222475791 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACATTA CTGACTACGA GCTGTTCGAA GTACCGCCCC GCTGGCTATT CTTGAAGTTG 
ACGACGAGTG ACGGCACCGT TGGTTGGGGA GAGCCAGTCG TGGAGGGGCG GTCTCACACG
GTCGTTGCCG CCGTCGAGGA GTTGCTCGAC AACTACCTAC TCGGTGAGGA TCCGGCACGG
ATCGAGGATC ACTGGCAGGC GATGTACCGC GGCGGCTTCT ACCGCGGCGG CCCGGTGCTG
ATGAGCGCCA TCGCCGGCGT TGACCAAGCG CTGTGGGACA TCAAGGGAAA GCAGTTCGGC
GCTCCCGTCC ACGAACTTCT CGGCGGCCGT GCGCGCGACC GAATCCGCGT GTACCAGTGG
ATCGGCGGAG ACCGACCAGC AGATGTCGGC GACGCCGCTC GTGAGAAGGT CGACGCAGGC
TTCACAGCGC TGAAAATGAA CGCCACCGCA GAGATGCGGC CGATCGACAC TCCGGCCACT
GTGGCTGACG CGGTGGACCG GATCGCGGCC GTTCGGGAGT CGGTCGGCGA CGAGGTCGAT
ATTGGGGTTG ACTTTCACGG GCGCGTGTCG AAGCCGATGG TGCGGCGGCT CGCGGCCGCA
CTGGAGCCGT ACGACCCGAT GTTCATCGAG GAGCCGGTGT TACCGGAACA TAACGACGCG
CTCCCAGCGA TCCGTCAGTC GACGACGACC CCGATCGCGA CGGGCGAGCG GATGTACTCG
CGGTGGGACT TCAAGGAGGT GTTCGAGACG GACGCAGTGG ACCTGATCCA GCCGGATGTC
TCGCACGCAG GCGGGATCAC CGAGCTGAAG AAGATCGCGT CGATGGCAGA GGCGTACGAC
GTATCGGTCG CGCCGCACTG CCCGCTCGGG CCGATCGCGC TCGCGTCTTG TATCCAGGTT
GATGCCTGCA CGCCGAACGT GCTGATCCAA GAGCAGTCTC TCGATATTCA CTACAACGAA
ACGAGTGACG TGCTGGACTA CCTCGCCGAC CCGACGGTGT TCGAGTACCG CGACGGTTTC
GTCGATATCC CCGACGGTGA CGGACTCGGG ATCAAGATCG ACGAAGAGCA CGTCCGCGAA
CAGCGCGGAA ACGTTGATTG GCACAACCCC GTCTGGCGGC ACGACGACGG CTCTGTCGCG
GAGTGGTGA
 
Protein sequence
MYITDYELFE VPPRWLFLKL TTSDGTVGWG EPVVEGRSHT VVAAVEELLD NYLLGEDPAR 
IEDHWQAMYR GGFYRGGPVL MSAIAGVDQA LWDIKGKQFG APVHELLGGR ARDRIRVYQW
IGGDRPADVG DAAREKVDAG FTALKMNATA EMRPIDTPAT VADAVDRIAA VRESVGDEVD
IGVDFHGRVS KPMVRRLAAA LEPYDPMFIE EPVLPEHNDA LPAIRQSTTT PIATGERMYS
RWDFKEVFET DAVDLIQPDV SHAGGITELK KIASMAEAYD VSVAPHCPLG PIALASCIQV
DACTPNVLIQ EQSLDIHYNE TSDVLDYLAD PTVFEYRDGF VDIPDGDGLG IKIDEEHVRE
QRGNVDWHNP VWRHDDGSVA EW