Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0587 |
Symbol | |
ID | 7401723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 605333 |
End bp | 606991 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707653 |
Product | alpha amylase catalytic region |
Protein accession | YP_002565259 |
Protein GI | 222479022 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.504326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC GGGACTGGTA CGAAGACGCG ACGATCTACT CGCTCGATAT CAAGACGTTC AACGACAGCG ACGGGGACGG GTGGGGGGAC TTCCGTGGCG CGATCGAGCG GCTCGACCAC CTCGACGACC TCGGCGTCGA CGCCGTGTGG ATCCGCCCGT TCTACCCCAG CCCGCTCCGG GACAACGGGT ACGACGTGGC CGACTACCGC GGCGTCGACG AGCGGCTCGG CACCCTCGAC GACTTCCGCG AGTTCGCGGA CCGAGCCCAC GAGCGCGGGA TCCGCGTGCT CACCGATCTC GTGTTCAACC ACACGTCGAA CGAACACGAG TGGTTCCAAC GGGCGTGCGA GGACCCCGAA TCGGAGTACC ACGACTACTA CCTGTGGACG AGCCACGTCG ACGACGCGCA CAACCGACAG AACATCTTCC CCGAGTACGA GGACGGCGTC TGGTCGTACG ACGAAACTGC CGACAAACAC TACTTCCACC AGTTCTACGG CCACCAGCCC GACCTTAACG TCGCGAATCC CGCCGTCCGC GAGGAGCTGT ACGACGTGCT CCGGTTTTGG CTTGATCAGG GCGCCGACGG GTTCCGGATC GACGCCGCTC ACCCCATGCT GCTGCCGAAG GGTCACAACG CGTCGACGCT CCACGACACC GACCTCGACG AGCCCATCGA CCTGTTCAAG CGGATGCGCG AGGTCGTCGA GGCGGAGCAG TCGGACGCGG TCTTACTCGC CGAGGCCGAC GACGAGCCCG AGAACCTCGA CTACTACTTC GGCGACGGAG AGGCGTTCCA CCTCCAGTTC AACTTCGTGA TGAACGCCCA CCTCACGTAC GGGGTCGGGG TGACGGACAC GTGGCCGCTC GACCGCGCCG AGGAGCTCCT CCCGGACGTC TCCGGCGTGG GCGGGTGGGT GAACTTCCTG CGGAACCACG ACGAGTGGAA CCTGTTGAAG CTCCCGCAGG AGTCGTTCGA TCACGCCCGC GAGTACTTCG GCGACGACGC CGGCAACTCG TGGATCTTCG AGCGCGGCCA CCGGCTCCGG CTCGCAGACT TGTACGCCGG GGACCACGAT CGGATCGCGG TGGCTCACAG CCTGCTGTTC TCCCTGCCGG GATCGGTCGC CCTCCAGTCC GGCGACGAGA TCGGGATGGG CGCCGACCTC TCCTTACCCG AGCGCGAGGC CGTCCGCACC CCGATGCAGT GGGACGACTC GGCGAACGGC GGCTTCTCGA CGGCCAACCA GGACGACTGT TACAACCCCG TTATCGACGA GGGCGAATAC GCCTACGAGC GAATAAATGC CGCCGCACAG CGCGACGACC CCGACTCGCT GCTCTCTCGA GTCCGGGACC TCTCGGCGGC CCGCGATGAC TGCCCGGCGA TCGCTCGAGG TTCGTACTCA CTCCCCGAGC CCGACCACAA GGAAACGCGC GTCCACCGGT TCGACCACGG GGAAGGCGAG TCCGAGACCG TCCTGCTCTG CGCGCACAAC CTCGCGGACG GCTACCGCGA GGAGGTAGTC GGGTTCGACG TCGACCCCGA CACGGTCGAA CGCGTCGTCG GCGACGGCGG CTATCACGTC GCTGAGGGCG GCGTCACCTT CTTGCTCGAC GAGTGCGATT ACGTCTGGCT GCGCGGCGAG AAGCGGTAG
|
Protein sequence | MSDRDWYEDA TIYSLDIKTF NDSDGDGWGD FRGAIERLDH LDDLGVDAVW IRPFYPSPLR DNGYDVADYR GVDERLGTLD DFREFADRAH ERGIRVLTDL VFNHTSNEHE WFQRACEDPE SEYHDYYLWT SHVDDAHNRQ NIFPEYEDGV WSYDETADKH YFHQFYGHQP DLNVANPAVR EELYDVLRFW LDQGADGFRI DAAHPMLLPK GHNASTLHDT DLDEPIDLFK RMREVVEAEQ SDAVLLAEAD DEPENLDYYF GDGEAFHLQF NFVMNAHLTY GVGVTDTWPL DRAEELLPDV SGVGGWVNFL RNHDEWNLLK LPQESFDHAR EYFGDDAGNS WIFERGHRLR LADLYAGDHD RIAVAHSLLF SLPGSVALQS GDEIGMGADL SLPEREAVRT PMQWDDSANG GFSTANQDDC YNPVIDEGEY AYERINAAAQ RDDPDSLLSR VRDLSAARDD CPAIARGSYS LPEPDHKETR VHRFDHGEGE SETVLLCAHN LADGYREEVV GFDVDPDTVE RVVGDGGYHV AEGGVTFLLD ECDYVWLRGE KR
|
| |