Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0342 |
Symbol | |
ID | 7399734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 364129 |
End bp | 364926 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643707406 |
Product | protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_002565016 |
Protein GI | 222478779 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2518] Protein-L-isoaspartate carboxylmethyltransferase |
TIGRFAM ID | [TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.461338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.049584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAGG CGTCGCTTCG GGCCGACATG ATCGAGGGGC TCGAACACCA GATCGGCGAG ACGCTGGATG AGCCCGTACT GACGGCGCTT CAGCGCGTCT CCCGGGACCC GTTCGTCGAC GACGGGGCGA CTGCGGCCGG GGGCGAGACC GGCCGAGGTG GCGGGAGCGG TGGGAACGGA GACCGGTCGC TCACGCTGGC GACGGTGGTC CGACTGATAT CCGCACTCGA CGCCGACGCG GGCGACGAGG TGCTCGTCGT GGGCGCGGGC GTCGGCTACT CGGTCGCGCT GCTCGCCGAG ATAGCGGGTG CAAGACACAT TCACGCGATC GATATCGACC GAGAGGCGGT GGCGATCGCC CGATCGAACC TCTCGACAGC CGGATACGAC GCCGTCCTCG TCGACCGACG TGACGGAGTG AACGGGCTTC CCGAGTACGC CCCGTACGAC CGAATCCTTC TCGAAGCGAG CGTCGTCAAG CCGCCGCGGG CACTCCGCGA GCAGCTGGCC GAGGGGGGTC GGATCGTCTA CCCCCGCGGG ACGGCAGTCC AGACGATCGC CGCGATCGAA CCCGACTCAG GGGGAGACTC GGCGGACGAC GGTGACGACG CTCCGACCGG GTTCCGGACG ACCGAGACGG CCGGTCCGAC GCGGCTCCAG CCAATGCTCG TCGACGGCGA GCAGCGCGGC GTCGAGCGGA ACCGCACGCG CCGCGAAGAC GCCGAGCGCG CCGAGCAAGG GCACTTCGCA CGTCACGGCT GGGAGCAAGA CTGGATCGAC TGGGACGACC GGATTTGA
|
Protein sequence | MDEASLRADM IEGLEHQIGE TLDEPVLTAL QRVSRDPFVD DGATAAGGET GRGGGSGGNG DRSLTLATVV RLISALDADA GDEVLVVGAG VGYSVALLAE IAGARHIHAI DIDREAVAIA RSNLSTAGYD AVLVDRRDGV NGLPEYAPYD RILLEASVVK PPRALREQLA EGGRIVYPRG TAVQTIAAIE PDSGGDSADD GDDAPTGFRT TETAGPTRLQ PMLVDGEQRG VERNRTRRED AERAEQGHFA RHGWEQDWID WDDRI
|
| |