Gene Information |
Locus tag | Haur_1627 |
Symbol | |
ID | 5733499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1886872 |
End bp | 1887753 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278766 |
Product | apurinic endonuclease Apn1 |
Protein accession | YP_001544398 |
Protein GI | 159898151 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
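The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the reported 882 bp) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1886872
end_bp = 1887753
protein_length_aa = 293

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue, plus one stop codon that is not translated.
expected_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp)           # 882
print(expected_protein_length)  # 293
```

Both results match the Gene Length and Protein Length fields in the record.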
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00209687 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAATT TAGGAGCGCA TGTTTCAACC GTTGGTGGGT TGCAGACCGC CTTTGAGCGA GCGCAAGCCA CGACCTGTCA TGTGATTCAA ATTTTTACCA AAAGCCAACG CCAATGGAAT GCCAAACCGC TGACTGCTGC CGATTGTGCC AATTTTCAAG CGGCCCATCG TGACGCTGGC ATGCCACCCT TGATTGCCCA CGATTCCTAT TTGATCAACT TGGGCAGCCC CGACAATGGC TTATGGGAAA AATCGATTGC AGCCTTTCGC GTTGAGCTAG AGCGTTGTCA GCAATTGGGA GTTGGCTCGT TGGTGACCCA TCCTGGGGCG CATGTTGGCT CAGGCGAGGC CGCTGGCCTT GATCGGGTTG GCGCAGCCTT GCGGCGTTTA CTGGCCGAAG ACGTTGGCGG CGAAACCCAG ATTTTGCTCG AAATTACTGC TGGACAAGGC ACGGCCTTAG GTCATAGTTT CGAGCATTTG GCTCGCTTGA TCGAGCTGTG CGATGGTCAT CCACGGCTTG GAATTTGCTT TGATACCTGC CATGGCTTAG CTGCTGGCTA CGATTTTCGC ACTGCTGAAG GCTACCAAAC CACCTTTGAT CATTTTGATC GTTTAATTGG GATTGATCGG CTGCGGGCGT TTCATCTTAA CGATTCGAAA AACGATCTTG GCAGTCGGGT TGATCGGCAT ACCCATATTG GTGAGGGATT TGTGGGCTTA GAAGGGTTTG AACTCATTAT GAACGATCCT CGCTTTCAAA CGATTCCGAT GGCCTTAGAG ACTCCCAAAG AACCCGATGA AAGCGCAGAT ATTCGCAATT TGGCAACGTT GCGCGGTTTG CGCCGCGCAA GCGTTAGTGC ATCACCACAA AGCGAGGTAT AA
|
Protein sequence | MPNLGAHVST VGGLQTAFER AQATTCHVIQ IFTKSQRQWN AKPLTAADCA NFQAAHRDAG MPPLIAHDSY LINLGSPDNG LWEKSIAAFR VELERCQQLG VGSLVTHPGA HVGSGEAAGL DRVGAALRRL LAEDVGGETQ ILLEITAGQG TALGHSFEHL ARLIELCDGH PRLGICFDTC HGLAAGYDFR TAEGYQTTFD HFDRLIGIDR LRAFHLNDSK NDLGSRVDRH THIGEGFVGL EGFELIMNDP RFQTIPMALE TPKEPDESAD IRNLATLRGL RRASVSASPQ SEV
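The relationship between the two sequence fields can be illustrated with a short translation sketch. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), so the standard codon table is used here. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names for this example, not part of any database tooling, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard codon table in NCBI order (first base varies slowest, T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
prefix = "ATGCCCAATT TAGGAGCGCA TGTTTCAACC"
print(translate(prefix))  # MPNLGAHVST
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be compared in the same way.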