Gene Hlac_0545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0545 
Symbol 
ID7401680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp568062 
End bp570266 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content71% 
IMG OID643707610 
Productglucan 14-alpha-glucosidase 
Protein accessionYP_002565217 
Protein GI222478980 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.495812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA ATTACACCAA ACCTTCATAC ACGGAGGGGC CGTTTACTCG CGTAATGCAA 
CTCCGCGACG CGCTCGATGA CTACAAGCGT AACACCGGGC ACGACACCCG GTTCCCCGGC
GAACGCCGGA CGGTCACCGG TCGGTTCTCC GGCGGGGAGG GCCGGCTCGT CCACGTCGAC
GCCGACGGCG AACTCCGCGA CTTCGGCTAC CCGCTGACCG GTCAGACGGG ACTCGTCCGC
TCGCGCGTCG GCCTCGCGGT CGGCGACGAG GTCACGTGGC TCGACGAGGT CGAGACCCAG
CAGCGCTACG TCGGCGATAC GACCCTGATC GAGACCGTCC ACGAGACCGA CCGAGCGACC
GTGACTCGTC ACGACGTGAC GCTCGGCGAC GCCCACCTCA CACGCGTGAC GATCGATGTC
GGTGACGACG GGCATGCGAA CGTCGACCCT GCCGACCTCT CGCTCGTCGT GTACGCCCGG
TTCGCGCCTG ACGGGCGCGA CAACCGGGTC GGGCAGCTCC GATACGACGA CGCCATCGAG
GTGTACCACG CCGACGAGCA CGACTTCCTC GCGAGCGACA CCGGCTTTAC CGACCTGCGC
GGCCAGCTCC CGACGACGTT CCCCGAGCTG CTCGACGACG CGCCTACCGA CCTCCCCCGC
GACCGCGACG GCGACCGCTA CGAGGAAGAG CGTCTCTCCG GTGAGGTCAT CGTCGACGCC
CCCTTCGAGG ACGGCGTCGC GACCGTCGGC ACCCTGCTGA CCGACCGCGA GGAGACGACG
CGCGAGGCCG CCCGCGACCG TCTCGCGAAG CTGTTCGACA CCCTTGACGA CCCCGACGCG
CTCGCTGCGG CCGCGGGCGA GACGATTCCC TCGGTCCCTG AGTCGGTCCC GGCCCAGGAG
TCGGTCGTCG CCGACCTTCG GGTGCTCTCG CTGCTCTCCG CCGAATCCGG GCTCCGGATC
GCCGGCCCCG ACTTCGACCC CTTCTACGCG ACCTCCGGCG GCTACGGCTA CACCTGGTTC
CGCGACGACG CGGAGATATC GACGTTCGTG CTCGGCGCCG ACGACCGGAT CGGACTCGGG
CTTGACGACT GGCACGCCCG CTCGGCCGAG ATGTACGTCG CCACCCAGCG CCCCGACGGG
TCGTGGCCGC ACCGGGTGTG GCCCCGGGAC GGCGCGCTCG CGCCCGGCTG GGCGAACGCG
CGCATCGAGG ACGGCCCCGA CGTTGACTAC CAGGCCGACC AGACCGGCAG CGTCATCGCC
TACCTCGCGC AGGCTCGCGC CGCGGGAGTC GACGTGGAGA AGCTCGACGC GACTCTCGTC
GCCGCGCTCG CCGGACTCGA CGAGACCCTC GCGGCCGACG GCCGCCCCGT CGTCTGCCAG
AACGCGTGGG AGGACAGCGC TGGGCGATTC GCGCACACGA CCGCGACGTT CTTAGAGGCG
TACAGCGAGC TTGCGCTTCA CGGCGACGGG CTGGACGCGA GCGCGCTCGA TCGGAACGGG
GACGCGGATG CGCCCGGCCG CGACCTCGAC CCCGCCATCC TCCCGGACGA CCTCGCCGCA
CACGCCCGCG ACCAAGCAGT TCGGGTGTAC GACGCTCTCG ACGACCTCTG GGTGCCCGAA
CGCGGCTGTT ACGCGCTCCG CGAGACGCCC GAGGGCACGA CCGACGACCG GCTCGACTCC
TCGACGCTGG CGCTGGCGAG CGCGCACCGC TCGTTCGACG CGCTCGGCGG CGACGGCGAG
GCGGAAGGCC ATAGCGGCGC TGTCGACGCC GAGCGACTCG ACCGGCTGGT CTCGCACGTC
GAGACGGTCG TCGACGGGCT CGCTCACGAG AGCGACGAGA TTTCCGGACT GATCCGGTAC
GAGGGCGACG GCTGGCGGCG GGCTGGCCAG CTCTCGGAGA AGGTATGGAC GGTCTCGACC
GCGTGGGGCG CGAACGCCTG CGCCGAGCTG GCGGTCCTCC TCGCTGACCG CGACGACCCG
CGTGCGGCCG AGATGGTCGA ACGCGCCCGG GAGCTGCTCG CGCACGTCTC GCCCGGCGGG
ACGCTCTGTG AGCCGACGAG CTATCTCCCC GAGCAGTTCT TCGACGACGG AACGCCGGAC
AGCGCGACGC CGCTCGGGTG GCCCCACGCG ATCCGGCTGG CGACCGTGGC GCTGCTCGAC
GACGAGCTTC CCCAGTTTAC CGACCGCGCC GTCGCCGACG ACTGA
 
Protein sequence
MNNNYTKPSY TEGPFTRVMQ LRDALDDYKR NTGHDTRFPG ERRTVTGRFS GGEGRLVHVD 
ADGELRDFGY PLTGQTGLVR SRVGLAVGDE VTWLDEVETQ QRYVGDTTLI ETVHETDRAT
VTRHDVTLGD AHLTRVTIDV GDDGHANVDP ADLSLVVYAR FAPDGRDNRV GQLRYDDAIE
VYHADEHDFL ASDTGFTDLR GQLPTTFPEL LDDAPTDLPR DRDGDRYEEE RLSGEVIVDA
PFEDGVATVG TLLTDREETT REAARDRLAK LFDTLDDPDA LAAAAGETIP SVPESVPAQE
SVVADLRVLS LLSAESGLRI AGPDFDPFYA TSGGYGYTWF RDDAEISTFV LGADDRIGLG
LDDWHARSAE MYVATQRPDG SWPHRVWPRD GALAPGWANA RIEDGPDVDY QADQTGSVIA
YLAQARAAGV DVEKLDATLV AALAGLDETL AADGRPVVCQ NAWEDSAGRF AHTTATFLEA
YSELALHGDG LDASALDRNG DADAPGRDLD PAILPDDLAA HARDQAVRVY DALDDLWVPE
RGCYALRETP EGTTDDRLDS STLALASAHR SFDALGGDGE AEGHSGAVDA ERLDRLVSHV
ETVVDGLAHE SDEISGLIRY EGDGWRRAGQ LSEKVWTVST AWGANACAEL AVLLADRDDP
RAAEMVERAR ELLAHVSPGG TLCEPTSYLP EQFFDDGTPD SATPLGWPHA IRLATVALLD
DELPQFTDRA VADD