Gene Information |
Locus tag | Haur_0069 |
Symbol | |
ID | 5731942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 88237 |
End bp | 91098 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277191 |
Product | DNA polymerase I |
Protein accession | YP_001542849 |
Protein GI | 159896602 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
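The start, end, gene length, and protein length values in the table above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither assumption is stated in the record itself, although both are consistent with the listed lengths. The function names are hypothetical and chosen only for illustration.

```python
# Coordinates copied from the Gene Information table.
START_BP = 88237
END_BP = 91098


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2862, matching "Gene Length"
print(protein_length(length_bp))  # 953, matching "Protein Length"
```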
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.512123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAGCAGC GACCCACGCT CGTCTTGGTT GATGGCCATG CACTGGCATT TCGCGCTTTT TTTGCGCTTC GCGACACAGG CATGTCGGTT CGCGCCACGG GCGAGCCAAC CTATGCCGTG CAAGGCTTTT TATCAATTTT GCTCAACTTG TTACGTGAGC GCCAACCAGA GTATGTGGCG GTTTCGTTTG ATATTGGTCG AACCTTCCGC GATGATCTGT ACCCCGATTA TAAGGCTGGC CGCGCTGAAA CACCCGCCGA TTTCCACCCC CAACTTGAAC GGATCAAGCA AATTATCAAT GCTTTGAATA TTCCGATCTA CACTGCCGAG AACTATGAGG CCGATGATGT GATTGGGACG TTGTGTCGTC AAGCTGAGGC GCAGGGTGTC GATACCTTGA TTATCACAGG CGATACCGAC ACCCTGCAAT TGGTCAACGA CTATACCAAG GTGCTGTTGG CCAATCCTTA TGGCAAGGGC AATGTTTCGC TCTACGATGA AGCTCAAGTG CGCGAACGCT ACAAAGGCTT GGCTCCCAAC CAACTGGCCG ATTTGCGCGG CCTCAAGGGC GATACCTCCG ACAACATTCC TGGGGTCAAG GGCATTGGTG AGGCTGGGGC AATCAGCATG CTCAATGAAT GGGGCAGCGT CGAAAATATC TACGCCAACC TCGATAAAGT CGCCAATCGC TATCGTTCAA AGCTCGATGG CCAGCAAGAA GCCGCGCGAT TTAGCACTCA CTTAGCAACT ATCGTTACTA ACGCCCCGGT AACCTTGGAT CTTGAAGCCA CCAAAGTGCA CGATTATGAT CGTGATACGG TCTTGGCCTT GTTCAGCCAG CTTGAATTTC GCAAGTTGGT CGATAAGCTG CCACTTTCCA GCCAAGTGAG CGCTGTCCAA GTTGTTGCCA TTCCGCAAAC GACCAATCCT AACCAATTAA CCATGTTCGA TGATGCCACG CCCCCAAGTG CTGAGCCGAT CGCTCAATCT GGCGATTATC AAGCGGTGAC TACCAGCGAA CAATTGGCCG AATTGGTAAG AATTTTGACG GCAGCTGAAC GTTTAATCTT CGATGTGGAA ACCAATAGTT TGAATTTGTT CTCGCCTGTG CCCGCCAAGG TTGTCGGCAT CGCCCTGACC CATACTGCTG GCTGCGGTTG GTATATTCCA TTGGGCCATC GCAGCGGCCA GCAATTGCCG ATAGCCGAAG TTGTCGCGGC ACTGCAACCA TTGTTTAGCG ACCCACAAAA AGCTGTGGTA GCGCATAATG GCAAGTTTGA TATGAGCGCC TTGAGCTTGA TTGGGCTTGA TGTGCCCCAT TTGAGTTTCG ATACGGCCAT CGCCGCCGCC TTGTTGGGCA AACGCCAGAG CCTCAAAGAT TTGGCCTTTG CCGAATTACG CGATGCCGAT GATCGCCCAA TTGAGATGAC CCGGATCGAA ACACTGATCG GTACTGGCAA AAAGCAAATC ACCATGGATC AGGTGGCGAT TGAACAAGTA ACGCCTTATG CAAGCGCCGA TGTTGATATG ACGGCGCGTT TATTGGCGCT ATTCATGCCG CAACTTGGGG CAATTCCGGC GGTACGCGAG GTGTTTGAGC AAATCGAAAT GCCGCTTAGC CCTGTGTTGA TGCGCATGGA AGCCTGTGGC ATTGGGCTTG ATCGGGCGCA ATTGGTGCAA CAAGGTCAAG TGTTAGGTCA AAGTTTACGC GAAATCGAGC AACATATTGC CGATTTTGTC GGCGAACCAC TCAACATTAA TTCGCGCTTC GATTTAAATG ATCTGTTGTT TATTCGCTTG 
AAGTTGCCAA CCGCCAATCT CAAGCGTTTG GCTGGTACAA CTCGCAGTGG CGGCGCGGTT TACTCAGTTA ACGCTGAAAC CTTGGAAGAT TTACAAACTC ACGATCAAAG CGGGATTGTA GCCATGATTT TGCGCTATCG CCGTTTGTCG AAGCTCAAAT CGACCTATGT TGATGCCTTG ATTGAGTTGA TCAACCAGCA AACAGGCCGC GTGCATACCC AATATCGCCA AATTGGCGCG GAAACTGGGC GGCTTAGCTC CGACTCACCC AACTTGCAAA ATATTCCGGT GCGCAGCGAG GAAGGCCGCG AAATTCGACG GGCTTTTGTT GCGCGGCCAG GCCATGTACT GATGACCGCC GACTATTCAC AGATTGAACT ACGAGTCTTG GCTCATATCA CCGCCGATCC AGCCTTAGTC GAAGTGTTTA AAACTGGCCA AGATATTCAC GCAGCCACCG CTGCCCGTTT GTTTGATATT CCCATGGATG AAGTCAGCAA AAATCAGCGG CGGATCGCCA AAATGACGGT CTTTGGTATT ATTTACGGCA TTAGCAGCTT TGGCTTGGCC GCTCGCACGG CGCTTTCACG CACCGAAGCC CAACAAATGA TCAACGGCTT GTTTGCTCAA TACCCAGGCC TGAAGAGCTA TATCGAACGA ACATTGGAGC GAGTTAAGGC AGTTGGCTAT GTTGAAACCT TGTTTGGCCG CCGCCGCTAC TTCCGCGAAT TGCAAGACGG TGGCGTAACT GGGCCTCGCC GTAGCGCCTT TGAGCGTGAA GCGACCAACG CTGGGATTCA AGGCACAGCC GCCGATTTGA TCAAGTTGGC CATGATTCGG CTGGAACAAG CATTAATTGC TGGCGGCTAT CAGGCCAAAA TGCTGCTGCA AGTGCATGAC GAATTGGTTT TGGAAGTGCC TGAGGATGAG CGTGATGCCG TAGCCCAATT AGTTTGTGAT ACGATGACCC AAGTCTATCC CGATTTGGCC GTGCCATTGG AAGTAAACGT TGAAACTGGG CTGAATTGGG ATCAGCTTCA GCGCTGGCAT GCCCCAGCCT AG
|
Protein sequence | MQQRPTLVLV DGHALAFRAF FALRDTGMSV RATGEPTYAV QGFLSILLNL LRERQPEYVA VSFDIGRTFR DDLYPDYKAG RAETPADFHP QLERIKQIIN ALNIPIYTAE NYEADDVIGT LCRQAEAQGV DTLIITGDTD TLQLVNDYTK VLLANPYGKG NVSLYDEAQV RERYKGLAPN QLADLRGLKG DTSDNIPGVK GIGEAGAISM LNEWGSVENI YANLDKVANR YRSKLDGQQE AARFSTHLAT IVTNAPVTLD LEATKVHDYD RDTVLALFSQ LEFRKLVDKL PLSSQVSAVQ VVAIPQTTNP NQLTMFDDAT PPSAEPIAQS GDYQAVTTSE QLAELVRILT AAERLIFDVE TNSLNLFSPV PAKVVGIALT HTAGCGWYIP LGHRSGQQLP IAEVVAALQP LFSDPQKAVV AHNGKFDMSA LSLIGLDVPH LSFDTAIAAA LLGKRQSLKD LAFAELRDAD DRPIEMTRIE TLIGTGKKQI TMDQVAIEQV TPYASADVDM TARLLALFMP QLGAIPAVRE VFEQIEMPLS PVLMRMEACG IGLDRAQLVQ QGQVLGQSLR EIEQHIADFV GEPLNINSRF DLNDLLFIRL KLPTANLKRL AGTTRSGGAV YSVNAETLED LQTHDQSGIV AMILRYRRLS KLKSTYVDAL IELINQQTGR VHTQYRQIGA ETGRLSSDSP NLQNIPVRSE EGREIRRAFV ARPGHVLMTA DYSQIELRVL AHITADPALV EVFKTGQDIH AATAARLFDI PMDEVSKNQR RIAKMTVFGI IYGISSFGLA ARTALSRTEA QQMINGLFAQ YPGLKSYIER TLERVKAVGY VETLFGRRRY FRELQDGGVT GPRRSAFERE ATNAGIQGTA ADLIKLAMIR LEQALIAGGY QAKMLLQVHD ELVLEVPEDE RDAVAQLVCD TMTQVYPDLA VPLEVNVETG LNWDQLQRWH APA
|
| |
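The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method: it builds a codon table from the standard NCBI amino acid string, which translation table 11 shares with the standard code for internal codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 and last 15 bases of the gene sequence are used as examples.

```python
from itertools import product

# NCBI codon ordering: bases cycle through T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGCAGCAGC GACCCACGCT CGTCTTGGTT"))  # MQQRPTLVLV
# Last 15 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CATGCCCCAGCCTAG"))  # HAPA*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence, and `gc_fraction` on the full sequence can be compared with the listed GC content.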