Gene Hlac_0544 details

Gene Information

Locus tag: Hlac_0544
Symbol:
ID: 7401679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 565736
End bp: 568012
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 71%
IMG OID: 643707609
Product: alpha amylase catalytic region
Protein accession: YP_002565216
Protein GI: 222478979
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
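
The coordinate and length fields in this record are related by simple arithmetic, and the short Python sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself. The function names `span_length` and `coding_codons` are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 565736
END_BP = 568012
GENE_LENGTH_BP = 2277
PROTEIN_LENGTH_AA = 758


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of translated codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons print True if the record is internally consistent
# under the assumptions stated above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```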


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAAC CCGGCCCGCC GCGCACGACG AGTGTCGGCG AATCCGTCGA GCTGGCCCCG 
CGGAGCCCCG ATCCCGACGG GACCTACGAG TGGACCCTCC GCGACGCGCC GGCCGAGAGC
GACGCGCGCG TGGACGGGCA GCCAGTGAGC GGGAGCAGTG ATGACGGGGA ACTCCTCGTT
CCCGGCGAGA CGAGCCGACC GGACGCACCG GTCGTCCACC TCCGTCCCGA CGCCCCCGGA
ACGTACGTCC TGACGCTCGA CGCCCCGGAC GGAACGCACC GCCAGCGGGT GCGCGCGTAC
CCCGACGAGC GGCAGTCGGT CGAACTTCGC GTCCCCGCCG CGGACCTCCC GGTGGACGAC
GGCGACGTGG ACCGCGTCTC GGTGATGTGG CCCCACAACG ACCGCCTGCT CGCGCGCGAC
CGTCCCGAGC GCGACGGCGA CGATTGGATC TACGAGGTCC AGATTCCGCC GGGTCGCCAC
GGGTTCAGCT TCGTCGCCAA CGACGACCCC GGCAACGAGC ACCGCGACGA GGTGACCGTG
CCGGGTCCGG GGCGCCCGCG CGTCTCGATG TCGGCGACCG TCGTCGAGGG TGGGGAGGGA
GAGGGGGACG CCGACGCCTC ATCCGTCCGG ATCGTTGCCG ACACCGAGGC GCCGCCCGAC
CTCGACGGCG AGGGCGATGC CGGGAGACGC ACCGGCGAGT CCCCCGTCGC CGTCGACTTC
CTCGTCGACG ATCGCGATGC GGACCCCGAG ACAGTCGCGC GGATCGAGTC GCTGGCGGCG
GGCGACACAC TCACGATCCC GCTTGCGGAG CTTTCCGACG ACCTCTCTGG TGGGATCCGG
GTCCACGCAG TCCCGAACGC CGACCGGTAC GGCGCGGCGG AGACGATCCG GATCGAGCGG
GACGAAGAGG GGGCCGTGGG CGCCGGCGAG GGCGGCTCCG TGACCGTTTC CGACCCCCAT
GCACCCCCCG AGTGGGCCGA CTCGCCGACG ATCTACGAGG TGTTCGTCCG GTCGTTCGCC
GGCGACACGC TCCCGACGAC GTTCCGCGAG ATCGAGCGTC GAGTCCCCTA CATAGAGAGT
CTCGGCGTCG ACACGCTCTG GCTCACGCCC GTGCTCGCCT CGCCGACGGA ACACGGCTAC
CACGTCACCG ACTACTACGA CACCGCCGCC GACCTCGGCT CGCGCGAGGC GTTCGAGTCC
CTGGTCGCGG CCTGCCACGA GGCGGGGATC AAGGTCGTGT TCGATCTGGT GATCAACCAC
ACCTCCCGCG ATCATCCCGT CTTCCAGATG CACGCCGCCG GCGTCGACGC GTACGCCGAT
CACTACCGCC GGGCTGACGG CGACTTCGAC GTGACAGACA CCGACTGGGC GGAGCTGGCA
GCGGGCGATA TGCCGGAGTA CTACTTCAAC TGGCGCCGGA TCCCGAACCT CAACTTCGAC
AGCCCGGCGG TCCGCGAGTG GCTGCTCGAC GTGGTCGACG AGTGGAGCGC GGTCGTCGAC
GGCTTCCGCG CCGACGTGGC GTGGGGCGTG CCCCACGGCT TCTGGAAGGA GGTCAGTGAG
CGCGTGCCCG ACGACTTCCT CCTGCTCGAC GAGACGCTCC CCCACGACCC CTTCTACGGC
GAGGGGGAGT TCGACGTCCA CTACGACACC TCGCTGTACG ACGCGCTCCG GGATGTCGGG
GCGGGCGACG CGCCCGCGGA CGCGATCGCC GACGCGTTCG CCCGCGCCGA GTGGCTCGGG
TTCGACGACC CGGGCGTCCA GATGCGCTAC GTCGAGAACC ACGACGAGGA GCGCTACCTC
GCCGAGTACG GGCGGGAGGC GCTGAAGGCG GCCGCCGCGA CCGTCTTCAC CCTCCCCGGC
GCGCCGATGA TCTACGCCGG ACAGGAGCGC GGCAACGAGA CGTACCGCGG ACCCGTACGC
TGGCACGACG GCGACAACGA CCTCACCGAC TTCCACCGCG ACCTCGCCGC GCTCCGCGAG
CGCGAGCCGC TCCTCCGGGA CGGCGCCGTC GACTTCGAGG GCCGCGCCGC TGACGTGTCG
GTCGTCGACG GCGATCCCGA GCGCGTAACG GCGTACGAGC GAACGCCGGC GACGGACGAT
TTCGAGGGCG ACGATCACGA CGCGGACCCC GACCCGCTCC TCGTCGTCGT CAACTTCGCA
GACCGGCCGG TGACGGTGGA GGTTCCGGCG GGGTTCGAGA CCGACCTGTT CGCGGGTGAC
GGGTCTGGCG CGGTCGACGA GGGAGTCACG GTCGAGAGCG TCGCCGTCCT CCGGTAG
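
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The Python sketch below shows one way to strip the layout, compute a GC fraction, and check the start and stop codons. The helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first and last displayed lines are pasted in to keep the example short; the full sequence would have to be supplied to compare against the reported GC content of 71%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons under NCBI translation table 11 (the table named in this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last lines of the gene sequence as displayed in this record.
first_line = "ATGCACGAAC CCGGCCCGCC GCGCACGACG AGTGTCGGCG AATCCGTCGA GCTGGCCCCG"
last_line = "GGGTCTGGCG CGGTCGACGA GGGAGTCACG GTCGAGAGCG TCGCCGTCCT CCGGTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3])                  # first codon of the CDS
print(tail[-3:] in STOP_CODONS)  # whether the last codon is a stop codon
print(round(gc_fraction(head), 2))  # GC fraction of the first 60 bases only
```

Applying `gc_fraction` to the cleaned full sequence, rather than to one line, is what would be needed to reproduce the record's GC content field.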
 
Protein sequence
MHEPGPPRTT SVGESVELAP RSPDPDGTYE WTLRDAPAES DARVDGQPVS GSSDDGELLV 
PGETSRPDAP VVHLRPDAPG TYVLTLDAPD GTHRQRVRAY PDERQSVELR VPAADLPVDD
GDVDRVSVMW PHNDRLLARD RPERDGDDWI YEVQIPPGRH GFSFVANDDP GNEHRDEVTV
PGPGRPRVSM SATVVEGGEG EGDADASSVR IVADTEAPPD LDGEGDAGRR TGESPVAVDF
LVDDRDADPE TVARIESLAA GDTLTIPLAE LSDDLSGGIR VHAVPNADRY GAAETIRIER
DEEGAVGAGE GGSVTVSDPH APPEWADSPT IYEVFVRSFA GDTLPTTFRE IERRVPYIES
LGVDTLWLTP VLASPTEHGY HVTDYYDTAA DLGSREAFES LVAACHEAGI KVVFDLVINH
TSRDHPVFQM HAAGVDAYAD HYRRADGDFD VTDTDWAELA AGDMPEYYFN WRRIPNLNFD
SPAVREWLLD VVDEWSAVVD GFRADVAWGV PHGFWKEVSE RVPDDFLLLD ETLPHDPFYG
EGEFDVHYDT SLYDALRDVG AGDAPADAIA DAFARAEWLG FDDPGVQMRY VENHDEERYL
AEYGREALKA AAATVFTLPG APMIYAGQER GNETYRGPVR WHDGDNDLTD FHRDLAALRE
REPLLRDGAV DFEGRAADVS VVDGDPERVT AYERTPATDD FEGDDHDADP DPLLVVVNFA
DRPVTVEVPA GFETDLFAGD GSGAVDEGVT VESVAVLR