Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0545 |
Symbol | |
ID | 7401680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 568062 |
End bp | 570266 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643707610 |
Product | glucan 14-alpha-glucosidase |
Protein accession | YP_002565217 |
Protein GI | 222478980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.495812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA ATTACACCAA ACCTTCATAC ACGGAGGGGC CGTTTACTCG CGTAATGCAA CTCCGCGACG CGCTCGATGA CTACAAGCGT AACACCGGGC ACGACACCCG GTTCCCCGGC GAACGCCGGA CGGTCACCGG TCGGTTCTCC GGCGGGGAGG GCCGGCTCGT CCACGTCGAC GCCGACGGCG AACTCCGCGA CTTCGGCTAC CCGCTGACCG GTCAGACGGG ACTCGTCCGC TCGCGCGTCG GCCTCGCGGT CGGCGACGAG GTCACGTGGC TCGACGAGGT CGAGACCCAG CAGCGCTACG TCGGCGATAC GACCCTGATC GAGACCGTCC ACGAGACCGA CCGAGCGACC GTGACTCGTC ACGACGTGAC GCTCGGCGAC GCCCACCTCA CACGCGTGAC GATCGATGTC GGTGACGACG GGCATGCGAA CGTCGACCCT GCCGACCTCT CGCTCGTCGT GTACGCCCGG TTCGCGCCTG ACGGGCGCGA CAACCGGGTC GGGCAGCTCC GATACGACGA CGCCATCGAG GTGTACCACG CCGACGAGCA CGACTTCCTC GCGAGCGACA CCGGCTTTAC CGACCTGCGC GGCCAGCTCC CGACGACGTT CCCCGAGCTG CTCGACGACG CGCCTACCGA CCTCCCCCGC GACCGCGACG GCGACCGCTA CGAGGAAGAG CGTCTCTCCG GTGAGGTCAT CGTCGACGCC CCCTTCGAGG ACGGCGTCGC GACCGTCGGC ACCCTGCTGA CCGACCGCGA GGAGACGACG CGCGAGGCCG CCCGCGACCG TCTCGCGAAG CTGTTCGACA CCCTTGACGA CCCCGACGCG CTCGCTGCGG CCGCGGGCGA GACGATTCCC TCGGTCCCTG AGTCGGTCCC GGCCCAGGAG TCGGTCGTCG CCGACCTTCG GGTGCTCTCG CTGCTCTCCG CCGAATCCGG GCTCCGGATC GCCGGCCCCG ACTTCGACCC CTTCTACGCG ACCTCCGGCG GCTACGGCTA CACCTGGTTC CGCGACGACG CGGAGATATC GACGTTCGTG CTCGGCGCCG ACGACCGGAT CGGACTCGGG CTTGACGACT GGCACGCCCG CTCGGCCGAG ATGTACGTCG CCACCCAGCG CCCCGACGGG TCGTGGCCGC ACCGGGTGTG GCCCCGGGAC GGCGCGCTCG CGCCCGGCTG GGCGAACGCG CGCATCGAGG ACGGCCCCGA CGTTGACTAC CAGGCCGACC AGACCGGCAG CGTCATCGCC TACCTCGCGC AGGCTCGCGC CGCGGGAGTC GACGTGGAGA AGCTCGACGC GACTCTCGTC GCCGCGCTCG CCGGACTCGA CGAGACCCTC GCGGCCGACG GCCGCCCCGT CGTCTGCCAG AACGCGTGGG AGGACAGCGC TGGGCGATTC GCGCACACGA CCGCGACGTT CTTAGAGGCG TACAGCGAGC TTGCGCTTCA CGGCGACGGG CTGGACGCGA GCGCGCTCGA TCGGAACGGG GACGCGGATG CGCCCGGCCG CGACCTCGAC CCCGCCATCC TCCCGGACGA CCTCGCCGCA CACGCCCGCG ACCAAGCAGT TCGGGTGTAC GACGCTCTCG ACGACCTCTG GGTGCCCGAA CGCGGCTGTT ACGCGCTCCG CGAGACGCCC GAGGGCACGA CCGACGACCG GCTCGACTCC TCGACGCTGG CGCTGGCGAG CGCGCACCGC TCGTTCGACG CGCTCGGCGG CGACGGCGAG GCGGAAGGCC ATAGCGGCGC TGTCGACGCC GAGCGACTCG ACCGGCTGGT CTCGCACGTC GAGACGGTCG TCGACGGGCT CGCTCACGAG AGCGACGAGA TTTCCGGACT GATCCGGTAC GAGGGCGACG GCTGGCGGCG GGCTGGCCAG CTCTCGGAGA AGGTATGGAC GGTCTCGACC GCGTGGGGCG CGAACGCCTG CGCCGAGCTG GCGGTCCTCC TCGCTGACCG CGACGACCCG CGTGCGGCCG AGATGGTCGA ACGCGCCCGG GAGCTGCTCG CGCACGTCTC GCCCGGCGGG ACGCTCTGTG AGCCGACGAG CTATCTCCCC GAGCAGTTCT TCGACGACGG AACGCCGGAC AGCGCGACGC CGCTCGGGTG GCCCCACGCG ATCCGGCTGG CGACCGTGGC GCTGCTCGAC GACGAGCTTC CCCAGTTTAC CGACCGCGCC GTCGCCGACG ACTGA
|
Protein sequence | MNNNYTKPSY TEGPFTRVMQ LRDALDDYKR NTGHDTRFPG ERRTVTGRFS GGEGRLVHVD ADGELRDFGY PLTGQTGLVR SRVGLAVGDE VTWLDEVETQ QRYVGDTTLI ETVHETDRAT VTRHDVTLGD AHLTRVTIDV GDDGHANVDP ADLSLVVYAR FAPDGRDNRV GQLRYDDAIE VYHADEHDFL ASDTGFTDLR GQLPTTFPEL LDDAPTDLPR DRDGDRYEEE RLSGEVIVDA PFEDGVATVG TLLTDREETT REAARDRLAK LFDTLDDPDA LAAAAGETIP SVPESVPAQE SVVADLRVLS LLSAESGLRI AGPDFDPFYA TSGGYGYTWF RDDAEISTFV LGADDRIGLG LDDWHARSAE MYVATQRPDG SWPHRVWPRD GALAPGWANA RIEDGPDVDY QADQTGSVIA YLAQARAAGV DVEKLDATLV AALAGLDETL AADGRPVVCQ NAWEDSAGRF AHTTATFLEA YSELALHGDG LDASALDRNG DADAPGRDLD PAILPDDLAA HARDQAVRVY DALDDLWVPE RGCYALRETP EGTTDDRLDS STLALASAHR SFDALGGDGE AEGHSGAVDA ERLDRLVSHV ETVVDGLAHE SDEISGLIRY EGDGWRRAGQ LSEKVWTVST AWGANACAEL AVLLADRDDP RAAEMVERAR ELLAHVSPGG TLCEPTSYLP EQFFDDGTPD SATPLGWPHA IRLATVALLD DELPQFTDRA VADD
|
| |