Gene Hlac_0587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0587 
Symbol 
ID7401723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp605333 
End bp606991 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content66% 
IMG OID643707653 
Productalpha amylase catalytic region 
Protein accessionYP_002565259 
Protein GI222479022 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.504326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACC GGGACTGGTA CGAAGACGCG ACGATCTACT CGCTCGATAT CAAGACGTTC 
AACGACAGCG ACGGGGACGG GTGGGGGGAC TTCCGTGGCG CGATCGAGCG GCTCGACCAC
CTCGACGACC TCGGCGTCGA CGCCGTGTGG ATCCGCCCGT TCTACCCCAG CCCGCTCCGG
GACAACGGGT ACGACGTGGC CGACTACCGC GGCGTCGACG AGCGGCTCGG CACCCTCGAC
GACTTCCGCG AGTTCGCGGA CCGAGCCCAC GAGCGCGGGA TCCGCGTGCT CACCGATCTC
GTGTTCAACC ACACGTCGAA CGAACACGAG TGGTTCCAAC GGGCGTGCGA GGACCCCGAA
TCGGAGTACC ACGACTACTA CCTGTGGACG AGCCACGTCG ACGACGCGCA CAACCGACAG
AACATCTTCC CCGAGTACGA GGACGGCGTC TGGTCGTACG ACGAAACTGC CGACAAACAC
TACTTCCACC AGTTCTACGG CCACCAGCCC GACCTTAACG TCGCGAATCC CGCCGTCCGC
GAGGAGCTGT ACGACGTGCT CCGGTTTTGG CTTGATCAGG GCGCCGACGG GTTCCGGATC
GACGCCGCTC ACCCCATGCT GCTGCCGAAG GGTCACAACG CGTCGACGCT CCACGACACC
GACCTCGACG AGCCCATCGA CCTGTTCAAG CGGATGCGCG AGGTCGTCGA GGCGGAGCAG
TCGGACGCGG TCTTACTCGC CGAGGCCGAC GACGAGCCCG AGAACCTCGA CTACTACTTC
GGCGACGGAG AGGCGTTCCA CCTCCAGTTC AACTTCGTGA TGAACGCCCA CCTCACGTAC
GGGGTCGGGG TGACGGACAC GTGGCCGCTC GACCGCGCCG AGGAGCTCCT CCCGGACGTC
TCCGGCGTGG GCGGGTGGGT GAACTTCCTG CGGAACCACG ACGAGTGGAA CCTGTTGAAG
CTCCCGCAGG AGTCGTTCGA TCACGCCCGC GAGTACTTCG GCGACGACGC CGGCAACTCG
TGGATCTTCG AGCGCGGCCA CCGGCTCCGG CTCGCAGACT TGTACGCCGG GGACCACGAT
CGGATCGCGG TGGCTCACAG CCTGCTGTTC TCCCTGCCGG GATCGGTCGC CCTCCAGTCC
GGCGACGAGA TCGGGATGGG CGCCGACCTC TCCTTACCCG AGCGCGAGGC CGTCCGCACC
CCGATGCAGT GGGACGACTC GGCGAACGGC GGCTTCTCGA CGGCCAACCA GGACGACTGT
TACAACCCCG TTATCGACGA GGGCGAATAC GCCTACGAGC GAATAAATGC CGCCGCACAG
CGCGACGACC CCGACTCGCT GCTCTCTCGA GTCCGGGACC TCTCGGCGGC CCGCGATGAC
TGCCCGGCGA TCGCTCGAGG TTCGTACTCA CTCCCCGAGC CCGACCACAA GGAAACGCGC
GTCCACCGGT TCGACCACGG GGAAGGCGAG TCCGAGACCG TCCTGCTCTG CGCGCACAAC
CTCGCGGACG GCTACCGCGA GGAGGTAGTC GGGTTCGACG TCGACCCCGA CACGGTCGAA
CGCGTCGTCG GCGACGGCGG CTATCACGTC GCTGAGGGCG GCGTCACCTT CTTGCTCGAC
GAGTGCGATT ACGTCTGGCT GCGCGGCGAG AAGCGGTAG
 
Protein sequence
MSDRDWYEDA TIYSLDIKTF NDSDGDGWGD FRGAIERLDH LDDLGVDAVW IRPFYPSPLR 
DNGYDVADYR GVDERLGTLD DFREFADRAH ERGIRVLTDL VFNHTSNEHE WFQRACEDPE
SEYHDYYLWT SHVDDAHNRQ NIFPEYEDGV WSYDETADKH YFHQFYGHQP DLNVANPAVR
EELYDVLRFW LDQGADGFRI DAAHPMLLPK GHNASTLHDT DLDEPIDLFK RMREVVEAEQ
SDAVLLAEAD DEPENLDYYF GDGEAFHLQF NFVMNAHLTY GVGVTDTWPL DRAEELLPDV
SGVGGWVNFL RNHDEWNLLK LPQESFDHAR EYFGDDAGNS WIFERGHRLR LADLYAGDHD
RIAVAHSLLF SLPGSVALQS GDEIGMGADL SLPEREAVRT PMQWDDSANG GFSTANQDDC
YNPVIDEGEY AYERINAAAQ RDDPDSLLSR VRDLSAARDD CPAIARGSYS LPEPDHKETR
VHRFDHGEGE SETVLLCAHN LADGYREEVV GFDVDPDTVE RVVGDGGYHV AEGGVTFLLD
ECDYVWLRGE KR