Gene Information |
Locus tag | Hlac_1664 |
Symbol | |
ID | 7400421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1685033 |
End bp | 1686472 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643708733 |
Product | peptidase M28 |
Protein accession | YP_002566319 |
Protein GI | 222480082 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
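
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, although they agree with the values listed). The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1685033, 1686472)
    print(length, protein_length(length))  # 1440 479
```

With the values in this record, the sketch reproduces the listed gene length (1440 bp) and protein length (479 aa).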
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.617767 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCGG AGCCGAACGA GACGAACGCG GCGGTCGATC CGGCTGCCGT CGAGCGCGTC CGCGAGCGAC GCACCGAACT CGCACCGGCG CTCGGCCGGA CGTGGACCGA CGGTGACCCG TGGCGCTTCC TCACCGACCT CACCGCGATC GGGAGCCGGA TGGCCGGTAG CGAGGGCGAG CGCCGGGCCG CCGAGATCGT CGCCGACGCG TTCGAGCGGG CGGGGCTCTC CGCGGTCGAG ACGCGCCCGT TCGAGATGGC GGCGTGGGAG CGCGGGAGCG CGACGCTCCG CGTGACGGCG CCCGGACGCG ACGGCGCGGC GGCGACCCGC GAGTTCGAGG CGCTCGCGCT GCCGTACTCG CCGGGCGGGA GTGTCACTGG GGAGCTCGTG GACGTGGGGT ACGGCACTCC CGCCGAGATC GACGAGCGGG AGGTTGAGGG CCGGATCGCA GTCGCGTCGA CGACGACCCC GGAGGGCGGT CGGTTCGTCC ACCGGATGGA GAAGTTCGGG TACGCGCTCG ACGCGGGCGC GGTCGGCTTC GTCTTCGTCA ACCACCTCGA CGGCCAGCTT CCCCCCACCG GATCCCTGAC CTTCGGCGAG GAGGCCGAGG CCGTCGCCGT CGGCGTCTCG AAGGAGACCG GCGCGTGGCT CCGGGAGTAC GCGGCCGGAG GGGACGGCGG GGTCGCCGCC GAATCGAGCC CCGCTGCGCA GGCCGAGCTG TCGGTGACGG CGACGACCGA GCCGGGCGAG AGCCGGAACG TAGTCGGTCA CGCGGGACCG GACACCGACG AGCGGCTCCT CCTGCTCGCG CACTACGACG CCCACGACAT CGCGGAGGGC GCGCTCGATA ACGGTTGCGG GATCGCGACC GTCGCGACCG CCGCGGGAAT CCTGACCGAG GCGGACCTCC CGCTCGGCGT CGACGTGGTC GCGGTCGGGG CGGAGGAGGT GGGGCTCCTC GGTTCGGAGC AGTTGGCAGA GCGGCTCGAC CTCGACCGGG TGAAGGGAGT GATCAACGTC GACGGCGCGG GGCGGTTCCG CGACCTCGTG GCGCTGGCGC ACGCCTCCGA GACGGCTGCG TCGGTCGCCG AGGCGGTGTC GACGGCGACG AACCAGCCGA TCGCTGTGGA CGCGGAGCCG CACCCGTTCT CCGACCAGTG GCCGTTCGTC CGGCGCGGGG TGCCGGCGAT CCAGCTACAC AGCGACTCCG GCGATCGGGG ACGCGGCTGG GGACACACCC ACGCCGACAC CCGCGACAAG GTCGACGACC GAAATGTTCG GGAACACGCG ATGCTCATCG CCCTGCTCGT CGCCGAGTTC GCAGCCCCCG AGCGCGACGC GCCCCGCCTC GACCGCGACG ACCTGATCGC GGCGTTCCGG GACGCCGACT TCGAGACGGG CATGCGCGCG GCCGACCTCT GGCCGGCCGG CTGGGAGTAG
|
Protein sequence | MHSEPNETNA AVDPAAVERV RERRTELAPA LGRTWTDGDP WRFLTDLTAI GSRMAGSEGE RRAAEIVADA FERAGLSAVE TRPFEMAAWE RGSATLRVTA PGRDGAAATR EFEALALPYS PGGSVTGELV DVGYGTPAEI DEREVEGRIA VASTTTPEGG RFVHRMEKFG YALDAGAVGF VFVNHLDGQL PPTGSLTFGE EAEAVAVGVS KETGAWLREY AAGGDGGVAA ESSPAAQAEL SVTATTEPGE SRNVVGHAGP DTDERLLLLA HYDAHDIAEG ALDNGCGIAT VATAAGILTE ADLPLGVDVV AVGAEEVGLL GSEQLAERLD LDRVKGVINV DGAGRFRDLV ALAHASETAA SVAEAVSTAT NQPIAVDAEP HPFSDQWPFV RRGVPAIQLH SDSGDRGRGW GHTHADTRDK VDDRNVREHA MLIALLVAEF AAPERDAPRL DRDDLIAAFR DADFETGMRA ADLWPAGWE
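
The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third fastest.
# Table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing (any whitespace) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the sequence."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three display blocks of the gene sequence in this record.
    first_blocks = "ATGCACTCGG AGCCGAACGA GACGAACGCG"
    print(translate(first_blocks))  # MHSEPNETNA
```

Applied to the first three blocks of the gene sequence, the sketch yields the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence would give a value to compare against the GC content field.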
|
| |