Gene Information |
Locus tag | Hlac_2666 |
Symbol | |
ID | 7400872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2650630 |
End bp | 2651751 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643709739 |
Product | peptidase M29 aminopeptidase II |
Protein accession | YP_002567307 |
Protein GI | 222481070 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
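The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. Both are assumptions, though they agree with the lengths in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
start_bp, end_bp = 2650630, 2651751

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1122
print(expected_protein_length_aa(length))  # 373
```

Under these assumptions the computed values match the Gene Length (1122 bp) and Protein Length (373 aa) fields.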
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACGAAC GCGTACACGA GCACGCCGAA GTGCTCGTCG ATTGGAGCGC ACGCATTGAG GCCGGCGACG ACGTTGTCGT GAGCGTCGCC GAGGATGCCC ACGACCTCGG CGTCGCAGTC GTCGAGGCGC TCGGCGAGCG GGGCGCGAAC GCCACGACAC TGTACGGGTC GGCGGAGATC TCGCGGGCGT ATTTAAAAGG GAGTGAGCAG GGCAGTCACG GCTTCGACGA CGATCCGGCC GTCGAGCGCG CGCTGTTCGA GGCCGCGGAC GCCTACCTCC GGATCGGCGG CGGCCGCAAC ACCACCGCGA CCGCAGACGT GTCGAGCGAG ACGCGGCAGG CGTACGCGAA GGCGCGGAAG GACGTGCGCG AGGCGCGGAT GGACACCGAC TGGGTGTCGA CGGTCCACCC CACGCGCTCG CTCGCCCAGC AGGCCGGTAT GGCCTACGAG GAGTATCAAG AGTTCGTCTA CGACGCCGTC CTCCGCGACT GGGAGGCACT TGCCGACGAG ATGGCGGCGA TGAAGGAGGC CCTCGACGCG GGTGAGGAGG TCCGGATCGT CACCGATCGC GACGACGCCC CCGACACCGA TATTTCGATG TCGATCGCGG GCCGGACCGC GGTCAACTCT GCCGCGTCGG TCGCGTACGA CTCACACAAT CTCCCCTCCG GTGAGGTGTT CACCGCCCCC TACGACACCG AGGGCGAGGC GTTCTTCGAC GTGCCGATGA CGATCGACGC GACCCGCGTT CGGGACGTGC ACCTCGTCTT CGAGGACGGC GAGGTCGTCG ACTTCTCGGC GGGCGCCGGC GAGGACGCCC TCGCAAGCGT GCTCGACACC GACCCCGGAG CCCGGCGACT CGGTGAACTC GGTATCGGGA TGAACCGCGG CATCGATCGG TTCACCGACT CGATCCTCTT CGACGAGAAG ATGGGCGACA CGATCCACCT CGCGGTGGGA CGCGCCTACG ACGCCTGCCT GCCGGAGGGC GAATCGGGCA ACGACAGCGC GGTCCACGTC GACATGATCA GCGACGTGAG CGAGAATTCT CGAATGGAGA TCGACGGCGA GGTCGTTCAG CGCAACGGTA CGTTCCGGTG GGAAGACGGG TTCGACGGCT GA
Protein sequence | MDERVHEHAE VLVDWSARIE AGDDVVVSVA EDAHDLGVAV VEALGERGAN ATTLYGSAEI SRAYLKGSEQ GSHGFDDDPA VERALFEAAD AYLRIGGGRN TTATADVSSE TRQAYAKARK DVREARMDTD WVSTVHPTRS LAQQAGMAYE EYQEFVYDAV LRDWEALADE MAAMKEALDA GEEVRIVTDR DDAPDTDISM SIAGRTAVNS AASVAYDSHN LPSGEVFTAP YDTEGEAFFD VPMTIDATRV RDVHLVFEDG EVVDFSAGAG EDALASVLDT DPGARRLGEL GIGMNRGIDR FTDSILFDEK MGDTIHLAVG RAYDACLPEG ESGNDSAVHV DMISDVSENS RMEIDGEVVQ RNGTFRWEDG FDG
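The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below is a minimal illustration of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene, so its result is not expected to match the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above (illustrative subset only).
prefix = clean_sequence("ATGGACGAAC GCGTACACGA")

print(len(prefix))              # 20
print(prefix.startswith("ATG")) # True
print(gc_fraction(prefix))      # 0.55
```

Applying the same two helpers to the full gene sequence field would give a value to compare against the reported GC content.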