Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0531 |
Symbol | |
ID | 7400412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 553235 |
End bp | 554110 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643707596 |
Product | apurinic endonuclease Apn1 |
Protein accession | YP_002565203 |
Protein GI | 222478966 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.888127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTCA AGGTCGGTGC GCACGTCTCG ATCTCCAGTT CGAGAGCGTC CAACGATCCC GAAACGCCGC CGTACGGAAA CATCGCGAAT GCGGTGTTCA GACAGAAGCA GTTCGGCGGC AACTGCGGAC AGATATTCAC CCACTCCCCA CAGGTGTGGC AGGATCCGAA CATCGGCGAC GAGGAGGCCG AGCGGTTCCG AGAGGGTACC GAACGCGACC TGGAGGGCCC GTGGGTGATC CACACCTCCT ACCTCGTCAA CCTCTGTACA CCCAAGGAGG GGCTCCGCGA GAAGTCGCTC GACTCGATGC AGAAGGAGGT CGACGCGGCC GACACGCTCG GGATTCCGTA CGTCAACGTC CACCTCGGCG CGCACACGGG GGCGGGCGTC GAGGGCGGGC TCGACAACGC GGCGTCGGTG ATCGACGACA TCGACGTGCC CGACGGCGTC ACGATCCTGA TCGAGAGCGA CGCGGGCGCT GGCACGAAGC TCGGTGGCGA GTTCGAGCAC CTCGCGGGGA TCATCGATCG CACAGAGACC GACATCGACG TCTGCGTCGA CACCGCCCAC GCGTTCGCGG CGGGCTACGA TCTTTCGACA CCCGAAGCCG TCGACGAAAC GATCGCGGAG TTCGACGACG TGGTCGGGCT GGAACACCTC AAGTACATCC ACCTCAACGA CTCGAAGCAC GCCTGCGGCA CCAACAAAGA CGAGCATGCC CACATCGGCG AGGGACTCAT CGGCGAAGAC GGGATGGAGC GATTCCTCAA CCACCCGGAC CTGATCGACG TGCCGCTCGC GCTGGAGACA CCCACCGAAG ACGGGAAGAG CTTCGCGTGG AATATCGATC GCGTGCGCGA GCTTCGCGCG GAGTGA
|
Protein sequence | MSLKVGAHVS ISSSRASNDP ETPPYGNIAN AVFRQKQFGG NCGQIFTHSP QVWQDPNIGD EEAERFREGT ERDLEGPWVI HTSYLVNLCT PKEGLREKSL DSMQKEVDAA DTLGIPYVNV HLGAHTGAGV EGGLDNAASV IDDIDVPDGV TILIESDAGA GTKLGGEFEH LAGIIDRTET DIDVCVDTAH AFAAGYDLST PEAVDETIAE FDDVVGLEHL KYIHLNDSKH ACGTNKDEHA HIGEGLIGED GMERFLNHPD LIDVPLALET PTEDGKSFAW NIDRVRELRA E
|
| |