Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0617 |
Symbol | polA |
ID | 8709537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 682074 |
End bp | 685151 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 646482726 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_003373852 |
Protein GI | 283783098 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATACCG ACATCGTTGA AGAACATTCT TCGGAACTTA AGAATGAGAA AGGAACGTTG TTGGTTGTTG ATGGCCATTC TCTTGCGTTC CGTGCTTTTT TTGCAATATC TGCGGATAAT TTTTCTACTC GCTCTGGGCA AGCAACAAAT TCCATTTGGG GATTTATTAC TATGCTTTCA CAGGTTGTAA AAAAAGAGAA GCCTGACCGC TTAGCTATTG CTTTTGACGT TAAAGGTGGA ACTTTTAGAA ATACTATGCT GCCTCAATAT AAAGGTACTC GTGATGCTGC TCCACAGGAG CTTCTTTCGC AGCTTCCTAT CATTCAACAG ATGCTTAAAG CTCTTGGTGT CACTTATATT GAGAAACCCG GTTATGAAGG TGATGATGTT ATTGGCACTC TTGCAGTAAT GGGTTCGAAT GCCGGTTATA GAACTTTAGT ACTTTCTGGC GATCGTGATG CATTCCAACT GATAGATGAC AATATTACTG TTTTATATCC AGGACAGCAT TTTAAAGACT TAAAAGAAAT GGATAAGAAT GCTGTTTATG AAAAATATCA TGTTACGCCT AAGCAATATC CTGATTTAGC AGCGTTAAGA GGAGAAACTG CAGATAATAT TCCAGGAGTT CCAGGTGTTG GAGATGGGTA CGCTGCTAAA TGGATTAATC AATTCGGCAG CCTAGAAAAT ATTATTGAAC ATTCGAATGA GATTAGCGGT AAAAAGGGAG AAGCTTTAAG AGAAAATATT GATCAAGTTC GTTTAAACCG TAAAGTTAAT GCTTTAGTTT GCGATTTGGA TTTAGGCGTA ACTGTAGATG ATTTGTGCTT TGGTGATGTT GATGCTGCAA AATTAGGAAT CTTGCTGTCT CAACTTGAAT TTGGCGAATT AAAAAAGAGA AAGATTCTCA AAACTTTTAG CAACGGTATG CAACAATCTA TCGAAAATAA TATGCGTATA ATGTTTTCGT CTACGTATGA AGGTGACGCT AATGACGATA AGCTAGCTCA AAGTGCTCAA TCAAATATTG ACGCAGAAAA TTTTCAAGAA ATGCTAGCAG GCATTGACTT ACATCATGTT AAAACTGTAG AGGACTTTAT TGATTGGGAG AATTCAGTTT CTTCTTTGCT TTATACAAAA ATAAAGAAAG ATTCAAAAAA TACTGAAGAT ACTTTCGATC AAGTAAGTTC TGAGGAAGAG AATTTTGAAG AATTAAATTC CGAAAAAATT ATAAATAATT CTCGTAAATG TGCTGAACCT ATACTGAAAA ATCAAGAAGT AGTGATATAC GCTCATGAGT GCAATAAGAA TTCAAAACAT CATTTTGAGT TGATTGCAGA TTGTTTAATA ATTCTTATTG CTAATAAAGC TGCAATATTA TATACGACTG ATTTGCAATC GTCAGATCAA GCTTTGCTTG AGATGTTGCG AAAGTTCTTG CAAAAAAATT CCTCAATAAC AGTATTGCAT GACTATAAAA AGCATTTGGG ATTGCTAAAA TCGCTAAAAT TATTCGATTC GAATAATTCA TTTGGCACTC CACTTCTTGA TACTCGATTG GCATCATATT TATTGGAGCC AGATTTTCGA GCTGAGACTT TAGAAAATAT GGCTGAGCAT TATTTAGATA TTCAGTCAGT TGAAGATAAT TTGGGTGAAA ATGAAGCTAA GCAAGGTGAG CTTGATCTTC TTATTGACAA TGAAGTGTCA TCTAGTGAAG AAGAAACTAA TGAAAAGCAA ACTCAAGCAC GTGAAGCAAA ATTCTTGAAA ACTGCGCTTC TTATTCGTCT TTTGGCTCTT CATTTTGCTT CAACACTTGA CGATAGGAAA CAAACTGGTT TGATGACTGA TTTGGAAATG CCTATTTCGC GAGTTTTGCG TGGCATGGAA GACGTCGGTG CTGCAGTCGA TATGCAACGT TTAAGTGGTA TGCTTAAGCA ATTTTCTGCA GATGCTGAGT TTGCGCAAAA TATGGCTTGG AAAGTTGCAG GTAGTTCTGT TAATTTACAA AGCCCTAAGC AGTTGCAGCA AATACTTTTT GAGCATTTTG GTCTTAAACC TACTCGTCGC ACAAAGAATG GCTCGTACAC GACTAATGCC GAAGCTTTGC AAGCTATGTA CGTTAAACTC GATCCCGAAG AGCCTGCAAG TCAATTCTTG GGTGCTTTAC TTAAACATCG CGAAACTAAT AAATTGAAGC AGATTGTGCA AAGTTTAATT GATGCGGCTC ATTATGACGA TAGGCGCATC CATACAACTT TTGAGCAGAC TATTGCAGCA ACTGGTCGTT TAAGTTCTGC TGATCCTAAT CTTCAAAATA TTCCTAATAG GAATGCTGCT GGTCGTGAAA TTCGTGCTGC TTTTATTCCA GGAGAAGAAT TTGAATATTT AATGAGTTGT GATTATTCAC AAGTTGAGCT TCGTATTATG GCTCATGCAT CTCATGATAA AACTCTTATT GAAGCATTCC GTTCCGGCGT AGATTTTCAT AAATATGTTG CTAGCTTAGT TTATAAGATT CCTGTTGAAC AAGTAACGCC GGATCAACGT TCTCATGTAA AAGCTATGAG CTATGGATTG GCATATGGAT TGAGTACTTA TGGGCTGGCA AAACAGCTTT CTATTAGTCC GGCTGAAGCT GAAATGTTAA AAAATAAGTA TTTTGATACT TTTGGCAATG TTCATGACTA TTTGGAATCT CTTGTGTCGA AGGCAAAAGA ACTTGGGTAC ACAGAAACTA TGTTTGGTCG ACGTAGGTAT TTCCCGCAAC TTTCTTCCAA TAATCGTGCA TCTCGTGAAG CTGCTGAGCG CGGAGCTCTT AATGCTCCAA TACAAGGTAC TGCTGCAGAT ATTATGAAAC TTGCTATGCT TCGTGTTGAT TTGGGATTGC GTGAAGCAAA AGTGCGCAGC CGCGTAATAT TGCAAATACA TGATGAATTG ATTCTGGAAA TTTCACATGG TGAGCAAGCG CAGGTAGAAA ATATTGTGCG TAAAGCTATG GAAAATGCTG TGCATTTAGA CGTTCCTCTT AGTGTTTCTA CTGGAGTTGG TGTAGATTGG CAACTAGCAG CGCATTAG
|
Protein sequence | MNTDIVEEHS SELKNEKGTL LVVDGHSLAF RAFFAISADN FSTRSGQATN SIWGFITMLS QVVKKEKPDR LAIAFDVKGG TFRNTMLPQY KGTRDAAPQE LLSQLPIIQQ MLKALGVTYI EKPGYEGDDV IGTLAVMGSN AGYRTLVLSG DRDAFQLIDD NITVLYPGQH FKDLKEMDKN AVYEKYHVTP KQYPDLAALR GETADNIPGV PGVGDGYAAK WINQFGSLEN IIEHSNEISG KKGEALRENI DQVRLNRKVN ALVCDLDLGV TVDDLCFGDV DAAKLGILLS QLEFGELKKR KILKTFSNGM QQSIENNMRI MFSSTYEGDA NDDKLAQSAQ SNIDAENFQE MLAGIDLHHV KTVEDFIDWE NSVSSLLYTK IKKDSKNTED TFDQVSSEEE NFEELNSEKI INNSRKCAEP ILKNQEVVIY AHECNKNSKH HFELIADCLI ILIANKAAIL YTTDLQSSDQ ALLEMLRKFL QKNSSITVLH DYKKHLGLLK SLKLFDSNNS FGTPLLDTRL ASYLLEPDFR AETLENMAEH YLDIQSVEDN LGENEAKQGE LDLLIDNEVS SSEEETNEKQ TQAREAKFLK TALLIRLLAL HFASTLDDRK QTGLMTDLEM PISRVLRGME DVGAAVDMQR LSGMLKQFSA DAEFAQNMAW KVAGSSVNLQ SPKQLQQILF EHFGLKPTRR TKNGSYTTNA EALQAMYVKL DPEEPASQFL GALLKHRETN KLKQIVQSLI DAAHYDDRRI HTTFEQTIAA TGRLSSADPN LQNIPNRNAA GREIRAAFIP GEEFEYLMSC DYSQVELRIM AHASHDKTLI EAFRSGVDFH KYVASLVYKI PVEQVTPDQR SHVKAMSYGL AYGLSTYGLA KQLSISPAEA EMLKNKYFDT FGNVHDYLES LVSKAKELGY TETMFGRRRY FPQLSSNNRA SREAAERGAL NAPIQGTAAD IMKLAMLRVD LGLREAKVRS RVILQIHDEL ILEISHGEQA QVENIVRKAM ENAVHLDVPL SVSTGVGVDW QLAAH
|
| |