Gene Information |
Locus tag | HS_0718 |
Symbol | polA |
ID | 4240207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 767061 |
End bp | 769916 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638104271 |
Product | DNA polymerase I |
Protein accession | YP_718930 |
Protein GI | 113460863 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
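The coordinate and length fields above are related by simple arithmetic, and the following minimal sketch checks that they agree. It assumes 1-based inclusive coordinates and that the CDS span includes a single stop codon; both assumptions fit the values in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the span ends with one stop codon, which is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
gene_len, protein_len = cds_lengths(767061, 769916)
print(gene_len, "bp;", protein_len, "aa")
```

With the Start bp and End bp listed above, this gives 2856 bp and 951 aa, matching the Gene Length and Protein Length fields.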
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAA TTGCTGAAAA TCCGCTTGTT CTTGTAGATG GTTCTTCTTA TTTATATCGT GCATTCTATG CTTTTCCTCC ATTAACAAAT TCTCTTGGCG AACCGACAGG TGCAATGTAT GGCGTATTGA ATATGCTAAA AAGTTTAATT GCTCAGGTTC AACCGACCCA TATAGCGGTT GTATTTGATG CAAAGGGAAA AACGTTCCGT GATGAAATAT TTGAACAATA TAAATCCCAT CGTCCCCCTA TGCCAGATGA TTTACGTCAA CAAATTCAGC CATTACACAA TATTATCAAA GCACTAGGAA TTCCGCTGTT GTCTGTTGAA GGTGTCGAGG CTGATGATGT AATCGGTACA TTAGCGGTTC AGGCCTCTCA GCAGGGAAAA AACGTTCTAA TTAGCACCGG TGACAAAGAT ATGGCACAAT TGGTTGATGA TAATATTATG TTGATCAATA CCATGAATAA CAGTTTACTG GATAGAGATG GTGTAATAGA CAAATATGGT ATTCCGCCTG AATTAATTGT TGATTATTTA GCACTAATGG GCGACAGCTC CGATAATATT CCGGGAGTTA GCGGTGTCGG TGAAAAAACA GCATTGGGAT TGTTGCAAGG CATTGGGAGT ATGGCGGAAA TTTATGCAAA TTTAGAAAAA GTGGCTGGAT TATCAATTCG AGGAGCAAAA AAACTCGGGG AGAAACTATC CTCCGCCAAG GCAGATGCGG ATTTATCTTA CCTTTTAGCA ACAATAAAAA CCAACGTTGA GTTAGATATT ACACCGGATC AACTCGTGTT AGGGCAAAAT AACCAAGATC AATTGATTGA ATACTTTGCT CGTTATGAAT TTAAACGTTG GCTAAACGAA GTGATAAGCG GTGAAAGCTC ACTTACCAAA GCAACTCAGC AAGAAATTAA ACTTGATAAA ACTCAAACAA ATAGTGCCTC TGAGTCGAAA AGTGCGGTCA AAAAAACGAT TAAAATCGAC CGCACTTCAT ATGAAACTAT TGATACTCAA GAGAAATTAA ATAGTTGGCT GGAAAAACTT CAACAGGCAG ATCTTATTGC TGTTGATACG GAAACCGATG CTTTAGATCC TATGCGTGCT AATCTTGTCG GTATTTCCTT TGCACTGACC AACAGCGAGG CTTGCTATAT TCCTCTGGCT CATAAACAAG CTGTAAAAGA AATGACACAA ACAGATTTAT TCACTGAATC AGAACAAAAT GCTGAACAAT TTGAGCTTGT AAAAAATCAA TTAAATAAAG AGACGTGTCT AGCACAATTA AAGCCGTTGC TAGAAAATCC GTCTATTCAA AAAGTTGGAC AAAATATTAA ATATGATTTA ACTATTTTTG CCAATCATCA TATTCAATTA AACGGTGTCT GTTTTGATAC GATGCTACAA TCCTATGTGC TTGACAGCAC AGGACGACAT AATATGGGTG CCTTATCGGA ACGATATTTA GGACATCAAG TTATTGAATT TGAAAGTATT GCCGGTAAAG GAAAAAAACA AGTAACGTTT GATAAAATTG CCATTGCTCA AGCGACAGAA TATGCTGCTG AAGATGCGGA TATTACAATG AAATTACATC AAGTCCTCTG GCAAGAATTA CAACAATCGC CAAGTTTAGT GAAGGTATTT AATGATATTG AGTTGCCTTT GGTAAAAGTA TTATCAAAGA TGGAACGCAA TGGTGTACTA ATTGATAGTC AGGCATTGTT AAAGCAATCT GAAAAAATTG CGAGAAGATT GACCGCACTT GAGCAACAAG TTTATCAAGA AGCGGGAGCA 
GAATTTAATT TAGCTTCAAC CAAACAACTA CAAGAAATTC TCTTTACTAA ATTAGCCTTA CCTATCATTG CAAAAACCCC AAAAGGTGCA CCTTCAACAA ATGAAGATGT ATTGGAAACG CTAGCTCAAC AGGGACATAT TGTGCCTAAA TTATTGATGG AACATCGTGG ATTAGCAAAA TTAAAATCTA CTTACACTGA TAAATTGCCT TCAATGGTGA ACAAAAAAAC CGGACGAGTG CATACATCTT ATCATCAAGC AGTAACAGCA ACGGGACGTT TATCTTCCAG TGATCCCAAT TTACAAAATA TTCCGATTAA AAATGAAGAA GGTCATTGTA TTCGCCAAGC CTTTATTGCA CGTAAGGGGT ATAAAATTAT TGCCGCAGAC TATTCACAAA TTGAATTACG AATTATGGCA CATTTATCTC AAGATAATGG TTTGATTATG GCATTTAATG AGGGAAAAGA TATTCATCGT TCTACGGCAG CTGAGATTTT CGGTATTCCA TTAAATGAAG TAACAAGTGA ACAACGTCGA AGTGCAAAAG CAATTAACTT TGGCTTGATT TATGGAATGA GTTCTTTTGG TTTATCTAAT CAACTTGGTA TTTCACGGGC AAATGCTCAA AAATATATGG ATCTATATTT CCAACGTTAT CCGGGAGTGC AAACTTTTAT GGTTGATATT CGAGAAAAAG CAAAAGAGCA GGGCTATGTA GAAACATTAT TTGGACGCCG TCTATATTTG CCTGAAATCA ATTCATCTAA TGCAATACGC CGTAAAGGAG CTGAACGAGT GGCGATAAAT GCACCAATGC AGGGTACTGC GGCAGATATT ATTAAATTAG CTATGATTGC TATTCATAAC GAAATCAAAA AAAATGATAT TCAAATGATC ATGCAGGTTC ATGATGAATT GGTATTTGAA GTGAAAGAAA ATAAGGTAGA TGAATATTCT GCATTAATTA AGTCTTTAAT GGAAAATGCG GCACATTTAT CAGTCCCATT AATTGTGGAT ATTGGTGTGG GTGATAATTG GGATGAGGCA CATTAA
|
Protein sequence | MAKIAENPLV LVDGSSYLYR AFYAFPPLTN SLGEPTGAMY GVLNMLKSLI AQVQPTHIAV VFDAKGKTFR DEIFEQYKSH RPPMPDDLRQ QIQPLHNIIK ALGIPLLSVE GVEADDVIGT LAVQASQQGK NVLISTGDKD MAQLVDDNIM LINTMNNSLL DRDGVIDKYG IPPELIVDYL ALMGDSSDNI PGVSGVGEKT ALGLLQGIGS MAEIYANLEK VAGLSIRGAK KLGEKLSSAK ADADLSYLLA TIKTNVELDI TPDQLVLGQN NQDQLIEYFA RYEFKRWLNE VISGESSLTK ATQQEIKLDK TQTNSASESK SAVKKTIKID RTSYETIDTQ EKLNSWLEKL QQADLIAVDT ETDALDPMRA NLVGISFALT NSEACYIPLA HKQAVKEMTQ TDLFTESEQN AEQFELVKNQ LNKETCLAQL KPLLENPSIQ KVGQNIKYDL TIFANHHIQL NGVCFDTMLQ SYVLDSTGRH NMGALSERYL GHQVIEFESI AGKGKKQVTF DKIAIAQATE YAAEDADITM KLHQVLWQEL QQSPSLVKVF NDIELPLVKV LSKMERNGVL IDSQALLKQS EKIARRLTAL EQQVYQEAGA EFNLASTKQL QEILFTKLAL PIIAKTPKGA PSTNEDVLET LAQQGHIVPK LLMEHRGLAK LKSTYTDKLP SMVNKKTGRV HTSYHQAVTA TGRLSSSDPN LQNIPIKNEE GHCIRQAFIA RKGYKIIAAD YSQIELRIMA HLSQDNGLIM AFNEGKDIHR STAAEIFGIP LNEVTSEQRR SAKAINFGLI YGMSSFGLSN QLGISRANAQ KYMDLYFQRY PGVQTFMVDI REKAKEQGYV ETLFGRRLYL PEINSSNAIR RKGAERVAIN APMQGTAADI IKLAMIAIHN EIKKNDIQMI MQVHDELVFE VKENKVDEYS ALIKSLMENA AHLSVPLIVD IGVGDNWDEA H
|