Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1052 |
Symbol | |
ID | 8383326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1018643 |
End bp | 1020667 |
Gene Length | 2025 bp |
Protein Length | 674 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644972117 |
Product | alpha amylase catalytic region |
Protein accession | YP_003129968 |
Protein GI | 257052135 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCACC CCGGCCCACC CCACTTCGTC GCCGTCGGCG AAGCAATCGA ACTCGCCCCC CGCGATCCGG ACCCGGAAGC GACCTACGCG TGGGACGTGA CCGCCCGACC GGACGGATCG ACAGCGACCG TCGGCGACGA CCCCGTCGAG CATCTCGAAC CCGACGTCGC GGGAACCTAC GTCGTTCACC TCGCGGCCCC CGACGGCGGC CACGACCTCA CCGTTCGCGC GTTCGCGTCG GAACTGACCC CCTCGACGGG CGGCGTCTCC GGGGCATCGG GCGTTTCGGC GGGCGAGAGT GGATTCGAGA GCGGGGGATC CGGATCGGGC GGTCGATCGG GCAGTGCTCG CGCCGACGCC GTAACCGGTG ACGGGGGCCG ACCCCGGCTC ACCCTCGAAC CCGCGATCGA GGGGGACGAA GCCGTCGTCC GGGCCGATCC GCTCCCGCAT CCCGACGGAC CGGAGACGTC GGCCGACCTG GCCGTCGAAT TTCTGCTCGA CGACCGGGAC AATGTCGACC GCGAGGCGGT GACCATCGAT GAGACCGAGC TCCGGGTTCC GCTCTCGGCG ATCGACGACA GACTACGGGT GCACGCCGTC TCGGTCGGGG ATCGAGGCTA CAGCGTCCCG GACGCCGTCG AGTTCGCGCG GGAGGACCCG ACCGGGTCGG CAGTCACGGA TGGGAGCGTG ACTGCCAGTC ACATCTACGA GCCACCCGCG TGGGCCGAGG ACACGATCAT CTACGAGATC TACGTCCGGA CCTTCGCGGG CCAGGCAGGT GATCAGCGGA GCGGTGGCGA TGGAGCGGCT GCGGGCGAGG GGGAAACCGA GCGCTCGGCC TTCGACGCGA TCGTCGACCG GCTGGACTAC ATCGAGTCCC TCGGAGTGGA TACCCTCTGG CTGACGCCGG TCCTGGAGAA CGACCACGCG CCCCACGGGT ACAACATCAC GGACTTCTTC TCGATCGCCG AGGATCTTGG GTCACGGGCC GATTACGAGC GCCTCATCGC GGCCGCCCAC GACCGCGGGA TGAACGTCCT GTTCGACCTC GTGTGCAATC ACTCCGCGCG AACCCATCCG CACTTCCAGG CGGCCGTCGC CGATCCGGAC AGCGAGTACC ACGAGTGGTA CGAGTGGCGC GGCCCCGGCG AGCCCGAGAC GTACTTCGAG TGGGAGCACA TCGCGAACTT CGACTTCACG CACCTGCCGG TCCGGCGACA CCTCCTCGAC GCGATCGATC AGTGGGCCCC ACTGGTCGAC GGCTTCCGGA TCGACATGGC GTGGGCCGTG CCGAACAACT TCTGGCGGGA GGTCCACGAC CGGGCGAAAG CCATCGACAG CGAGTTCCTG CTACTCGACG AGACGATCCC GTACATTCCG GACTTCCAGG GCGGGTGTTT CGACATGCAC TTCGACTCGA CCACGTACGC GGCGCTGCGT CGGGTGGGCA ACGGCGCGCC GGCCGCGGAA GTGCTCGATG CTGTCGACGA ACGCGCAGCG ATCGGCTTTC CGCCACATGC CGGGTTCATG CTGTACGCGG AGAACCACGA CGAGACGCGC TATCTTGTCG AATGTGGTCG TGCGGCCGCC CGTGCCGCCA CGGGCGCGCT GTTCACGCTG CCGGGGTCCC CACTGGTGTA CGCCGGCCAG GAGTTCGGCC AGCGCGGCAA GCGCGACGAC CTCGCGTGGG AACACGCCGA CGAGGACCTC CAGGCGCACG TCCGACAGCT CGCGGCGGCG CGTCGCGACG TGACGGCGCT GGAATCGGCC GCGACACTCC ATCGCATCGA GTGGACCGTT CAGTCGGGTG CGGCCGACCG TGTCGTCGCG TTCGGACGCG TTCGTGGCGA CGATGCCGTC GTCGTCGTCC TGAACTTCGG GCCGGAGACG GCGACAGTCG AACTACCGAT AGCGACCGGG ACGACCGACG CAGTGTCCGG GAAGGCTGTC GGAACTGGCG AGGGGGGACT GCGGGTCGAC AACGTTCTCG TCGTGCCGGC CGAGTCGGAG ACGATCAAGG GGTAG
|
Protein sequence | MHHPGPPHFV AVGEAIELAP RDPDPEATYA WDVTARPDGS TATVGDDPVE HLEPDVAGTY VVHLAAPDGG HDLTVRAFAS ELTPSTGGVS GASGVSAGES GFESGGSGSG GRSGSARADA VTGDGGRPRL TLEPAIEGDE AVVRADPLPH PDGPETSADL AVEFLLDDRD NVDREAVTID ETELRVPLSA IDDRLRVHAV SVGDRGYSVP DAVEFAREDP TGSAVTDGSV TASHIYEPPA WAEDTIIYEI YVRTFAGQAG DQRSGGDGAA AGEGETERSA FDAIVDRLDY IESLGVDTLW LTPVLENDHA PHGYNITDFF SIAEDLGSRA DYERLIAAAH DRGMNVLFDL VCNHSARTHP HFQAAVADPD SEYHEWYEWR GPGEPETYFE WEHIANFDFT HLPVRRHLLD AIDQWAPLVD GFRIDMAWAV PNNFWREVHD RAKAIDSEFL LLDETIPYIP DFQGGCFDMH FDSTTYAALR RVGNGAPAAE VLDAVDERAA IGFPPHAGFM LYAENHDETR YLVECGRAAA RAATGALFTL PGSPLVYAGQ EFGQRGKRDD LAWEHADEDL QAHVRQLAAA RRDVTALESA ATLHRIEWTV QSGAADRVVA FGRVRGDDAV VVVLNFGPET ATVELPIATG TTDAVSGKAV GTGEGGLRVD NVLVVPAESE TIKG
|
| |