Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0544 |
Symbol | |
ID | 7401679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 565736 |
End bp | 568012 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643707609 |
Product | alpha amylase catalytic region |
Protein accession | YP_002565216 |
Protein GI | 222478979 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGAAC CCGGCCCGCC GCGCACGACG AGTGTCGGCG AATCCGTCGA GCTGGCCCCG CGGAGCCCCG ATCCCGACGG GACCTACGAG TGGACCCTCC GCGACGCGCC GGCCGAGAGC GACGCGCGCG TGGACGGGCA GCCAGTGAGC GGGAGCAGTG ATGACGGGGA ACTCCTCGTT CCCGGCGAGA CGAGCCGACC GGACGCACCG GTCGTCCACC TCCGTCCCGA CGCCCCCGGA ACGTACGTCC TGACGCTCGA CGCCCCGGAC GGAACGCACC GCCAGCGGGT GCGCGCGTAC CCCGACGAGC GGCAGTCGGT CGAACTTCGC GTCCCCGCCG CGGACCTCCC GGTGGACGAC GGCGACGTGG ACCGCGTCTC GGTGATGTGG CCCCACAACG ACCGCCTGCT CGCGCGCGAC CGTCCCGAGC GCGACGGCGA CGATTGGATC TACGAGGTCC AGATTCCGCC GGGTCGCCAC GGGTTCAGCT TCGTCGCCAA CGACGACCCC GGCAACGAGC ACCGCGACGA GGTGACCGTG CCGGGTCCGG GGCGCCCGCG CGTCTCGATG TCGGCGACCG TCGTCGAGGG TGGGGAGGGA GAGGGGGACG CCGACGCCTC ATCCGTCCGG ATCGTTGCCG ACACCGAGGC GCCGCCCGAC CTCGACGGCG AGGGCGATGC CGGGAGACGC ACCGGCGAGT CCCCCGTCGC CGTCGACTTC CTCGTCGACG ATCGCGATGC GGACCCCGAG ACAGTCGCGC GGATCGAGTC GCTGGCGGCG GGCGACACAC TCACGATCCC GCTTGCGGAG CTTTCCGACG ACCTCTCTGG TGGGATCCGG GTCCACGCAG TCCCGAACGC CGACCGGTAC GGCGCGGCGG AGACGATCCG GATCGAGCGG GACGAAGAGG GGGCCGTGGG CGCCGGCGAG GGCGGCTCCG TGACCGTTTC CGACCCCCAT GCACCCCCCG AGTGGGCCGA CTCGCCGACG ATCTACGAGG TGTTCGTCCG GTCGTTCGCC GGCGACACGC TCCCGACGAC GTTCCGCGAG ATCGAGCGTC GAGTCCCCTA CATAGAGAGT CTCGGCGTCG ACACGCTCTG GCTCACGCCC GTGCTCGCCT CGCCGACGGA ACACGGCTAC CACGTCACCG ACTACTACGA CACCGCCGCC GACCTCGGCT CGCGCGAGGC GTTCGAGTCC CTGGTCGCGG CCTGCCACGA GGCGGGGATC AAGGTCGTGT TCGATCTGGT GATCAACCAC ACCTCCCGCG ATCATCCCGT CTTCCAGATG CACGCCGCCG GCGTCGACGC GTACGCCGAT CACTACCGCC GGGCTGACGG CGACTTCGAC GTGACAGACA CCGACTGGGC GGAGCTGGCA GCGGGCGATA TGCCGGAGTA CTACTTCAAC TGGCGCCGGA TCCCGAACCT CAACTTCGAC AGCCCGGCGG TCCGCGAGTG GCTGCTCGAC GTGGTCGACG AGTGGAGCGC GGTCGTCGAC GGCTTCCGCG CCGACGTGGC GTGGGGCGTG CCCCACGGCT TCTGGAAGGA GGTCAGTGAG CGCGTGCCCG ACGACTTCCT CCTGCTCGAC GAGACGCTCC CCCACGACCC CTTCTACGGC GAGGGGGAGT TCGACGTCCA CTACGACACC TCGCTGTACG ACGCGCTCCG GGATGTCGGG GCGGGCGACG CGCCCGCGGA CGCGATCGCC GACGCGTTCG CCCGCGCCGA GTGGCTCGGG TTCGACGACC CGGGCGTCCA GATGCGCTAC GTCGAGAACC ACGACGAGGA GCGCTACCTC GCCGAGTACG GGCGGGAGGC GCTGAAGGCG GCCGCCGCGA CCGTCTTCAC CCTCCCCGGC GCGCCGATGA TCTACGCCGG ACAGGAGCGC GGCAACGAGA CGTACCGCGG ACCCGTACGC TGGCACGACG GCGACAACGA CCTCACCGAC TTCCACCGCG ACCTCGCCGC GCTCCGCGAG CGCGAGCCGC TCCTCCGGGA CGGCGCCGTC GACTTCGAGG GCCGCGCCGC TGACGTGTCG GTCGTCGACG GCGATCCCGA GCGCGTAACG GCGTACGAGC GAACGCCGGC GACGGACGAT TTCGAGGGCG ACGATCACGA CGCGGACCCC GACCCGCTCC TCGTCGTCGT CAACTTCGCA GACCGGCCGG TGACGGTGGA GGTTCCGGCG GGGTTCGAGA CCGACCTGTT CGCGGGTGAC GGGTCTGGCG CGGTCGACGA GGGAGTCACG GTCGAGAGCG TCGCCGTCCT CCGGTAG
|
Protein sequence | MHEPGPPRTT SVGESVELAP RSPDPDGTYE WTLRDAPAES DARVDGQPVS GSSDDGELLV PGETSRPDAP VVHLRPDAPG TYVLTLDAPD GTHRQRVRAY PDERQSVELR VPAADLPVDD GDVDRVSVMW PHNDRLLARD RPERDGDDWI YEVQIPPGRH GFSFVANDDP GNEHRDEVTV PGPGRPRVSM SATVVEGGEG EGDADASSVR IVADTEAPPD LDGEGDAGRR TGESPVAVDF LVDDRDADPE TVARIESLAA GDTLTIPLAE LSDDLSGGIR VHAVPNADRY GAAETIRIER DEEGAVGAGE GGSVTVSDPH APPEWADSPT IYEVFVRSFA GDTLPTTFRE IERRVPYIES LGVDTLWLTP VLASPTEHGY HVTDYYDTAA DLGSREAFES LVAACHEAGI KVVFDLVINH TSRDHPVFQM HAAGVDAYAD HYRRADGDFD VTDTDWAELA AGDMPEYYFN WRRIPNLNFD SPAVREWLLD VVDEWSAVVD GFRADVAWGV PHGFWKEVSE RVPDDFLLLD ETLPHDPFYG EGEFDVHYDT SLYDALRDVG AGDAPADAIA DAFARAEWLG FDDPGVQMRY VENHDEERYL AEYGREALKA AAATVFTLPG APMIYAGQER GNETYRGPVR WHDGDNDLTD FHRDLAALRE REPLLRDGAV DFEGRAADVS VVDGDPERVT AYERTPATDD FEGDDHDADP DPLLVVVNFA DRPVTVEVPA GFETDLFAGD GSGAVDEGVT VESVAVLR
|
| |