Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1504 |
Symbol | |
ID | 7400332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1515522 |
End bp | 1517708 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643708566 |
Product | alpha amylase catalytic region |
Protein accession | YP_002566162 |
Protein GI | 222479925 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.496352 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCATC CCGGACCCCC GCGATTCCTC GCGACCGGCG AGACCACGGA GCTTGCACCG CGAGATCCGG ACCCGAACGG ATCCTACGAG TGGCGCGTTG TCGACGCGCC GCCAGACAGT GAAGCGACGG TCGGAACCGA CCCGGTGACC GAGTTCACCC CAGACGTACC CGGACGCTAC TGGATCGGGC TCGACGCCCC CGACGGTGAT CACCGGCTGA CGGTCCACGC GTTCCCGTCG AGCTACGAGG GTGTCGACGT CGAGGGCGGA AGCGGGACGG AGATCCGCGA CCGCACCGCT GGCAACGCCC CGGTCGACTA CGCCGAGCCC CGCGGCGATG GGGGCGTGGG GCGGCCGCGA ATGCGTCTGG ACGCGAGCGT GGAGTTAGGG GAGACCGACG GGGACGAGGA GGGCGGTGAC GAGAAGGGCA GCGACGAGAA GAGCAGCGAC GAGAAGAGCA GCGACGGGAC TGGCAAGCCC GAAATCGTCG TCCGAGCGAC ACCGACACCG AATCCCCATT CCTCGCTCGG CGCGGGCGAC CTACGAGTGA CGTTCATCGT CGACGACCGC GACGTCGAGA GCGCGGTCGC AGAGGGGCGG AGGAACCCAC GTGACGCGCT CCGGACGAGC GACGACGGGC GGGAGCTTCG CGTGCCCGCG GCCGCGGTCG CGGATCGACT TCGGGTCCAC GGGGTCGCGG TCGCGGCGGA GCCGGGGCAG GAGCCGCGGG TTAGCGTCGC CGACGCGGTC GCGGTCGATC GAAGCGACGG CGGCGTGGGC AGGCACGACG AGGGCGACAA AACGAGCGAT CGGAACAACG GCGATGACGG ACGCATCGGC GGCCCGACAC GCCCCGCCTT CGAGACGGTT CGGCTTAACG ACCCGCCGAC GTGGACGCAC GACGCAACCG TCTACGAGGT GTACGTCCGG ACGTTCGCCG ACGAGGGGAA AGGGGAGACG TTCGGCTCGA TCGCCGACCG GATCCCGGCG ATTGCGGAAC TCGGCGTCGA CACGCTGTGG CTCACGCCGG TCCTCCAGCA CGACGGGAAG CCGCACGGCT ACAACATCAC CGACTTCTTC GACGTGGCCG AGGACCTCGG CGAGCGCGAC GACTACGAGG CGCTGGTCGA GACGGCCCAC GACCACGGGA TGCGGGTGCT GTTCGACTTC GTCGCCAATC ACACCGCGCG CGACCACGAG TGGTTCGAGG ACGCCTACCA GAATCCGGAC TCCCCGTACC GCGACCGCTA CGAGTGGCAA GAGTCGGGCG AGCCGGGGAC GTACTTCGAC TGGGAGCTGA TCGCGAACCT GAACCACTCG AACCTCGAGG TTCGGCGATT CCTCCTCGAC GTGGTCGACG AGTGGGCCCC GCTCGTGGAC GGATTCCGGT GCGACATGGC GTGGGCCGTG CCGGACTCCT TCTGGCGCGA GCTTCGCGAC CGGGTGAAGG ACATCGACCG GGAGTTCCTG TTGATGGACG AGACGATTCC GTACATCCCC GGCTTCCACG AGGGGATGTT CGACGTTCAC TTCGACGCGA CGCTGTACTT CCAGCTCCGC CAGATCGGCC GGGGCGTCGA CCCCGCCATG TCGCTTTTGG ACGCGATCGA CCAGCGCGCC GAGATCGGCT TCCCGGACCA CGCCGAGTTC CTCCAGTACA TTGAGAATCA CGACGAGACG CGCTACCGCG TCGAGTGCGG CGACGCCGCC GCGGCCGCGG CGGGGGCGGC GATATTCACG TTGCCCGGCG TGCCGATGAT CTACGCCGGT CAGGAGATCG GCCAGCGCGG CCGGCGCGAC GCGATCGCGT GGGATCACGC CCGCGAAGGA GTCCGCGACC GCTACGAGCG GCTGATCGCG GTTCGCGAGG CTCACCCGGC GCTCGGACCC GAGGGCGACC TCGACCGAGT AGGGTACCAC GTCGCCAGCG GTGACGTCTC GGAGCGACCG ATCGTCGCCA GCGGCGACGT TCACCCCGAC GACGTGGTGG CGTTCCGTCG GAGCGACGAG GACGAAGAGC TCGTCGTCGT CCTCAACTTC GCACCCGAAC CCGCGAGCGT CTCGGTCGGC GTCGACCACG CCGATCGGGA CCTCGTGTCG GGCGATCCCT GCGTGGTTGT CGACGGCGAT GGGACCGAAC GAATTCGCGT CGACGACGTG GCGGTCGCCC GGGTCGAGGG GCGCTGA
|
Protein sequence | MHHPGPPRFL ATGETTELAP RDPDPNGSYE WRVVDAPPDS EATVGTDPVT EFTPDVPGRY WIGLDAPDGD HRLTVHAFPS SYEGVDVEGG SGTEIRDRTA GNAPVDYAEP RGDGGVGRPR MRLDASVELG ETDGDEEGGD EKGSDEKSSD EKSSDGTGKP EIVVRATPTP NPHSSLGAGD LRVTFIVDDR DVESAVAEGR RNPRDALRTS DDGRELRVPA AAVADRLRVH GVAVAAEPGQ EPRVSVADAV AVDRSDGGVG RHDEGDKTSD RNNGDDGRIG GPTRPAFETV RLNDPPTWTH DATVYEVYVR TFADEGKGET FGSIADRIPA IAELGVDTLW LTPVLQHDGK PHGYNITDFF DVAEDLGERD DYEALVETAH DHGMRVLFDF VANHTARDHE WFEDAYQNPD SPYRDRYEWQ ESGEPGTYFD WELIANLNHS NLEVRRFLLD VVDEWAPLVD GFRCDMAWAV PDSFWRELRD RVKDIDREFL LMDETIPYIP GFHEGMFDVH FDATLYFQLR QIGRGVDPAM SLLDAIDQRA EIGFPDHAEF LQYIENHDET RYRVECGDAA AAAAGAAIFT LPGVPMIYAG QEIGQRGRRD AIAWDHAREG VRDRYERLIA VREAHPALGP EGDLDRVGYH VASGDVSERP IVASGDVHPD DVVAFRRSDE DEELVVVLNF APEPASVSVG VDHADRDLVS GDPCVVVDGD GTERIRVDDV AVARVEGR
|
| |