Gene HMPREF0424_0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0617 
SymbolpolA 
ID8709537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp682074 
End bp685151 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content37% 
IMG OID646482726 
ProductDNA-directed DNA polymerase 
Protein accessionYP_003373852 
Protein GI283783098 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATACCG ACATCGTTGA AGAACATTCT TCGGAACTTA AGAATGAGAA AGGAACGTTG 
TTGGTTGTTG ATGGCCATTC TCTTGCGTTC CGTGCTTTTT TTGCAATATC TGCGGATAAT
TTTTCTACTC GCTCTGGGCA AGCAACAAAT TCCATTTGGG GATTTATTAC TATGCTTTCA
CAGGTTGTAA AAAAAGAGAA GCCTGACCGC TTAGCTATTG CTTTTGACGT TAAAGGTGGA
ACTTTTAGAA ATACTATGCT GCCTCAATAT AAAGGTACTC GTGATGCTGC TCCACAGGAG
CTTCTTTCGC AGCTTCCTAT CATTCAACAG ATGCTTAAAG CTCTTGGTGT CACTTATATT
GAGAAACCCG GTTATGAAGG TGATGATGTT ATTGGCACTC TTGCAGTAAT GGGTTCGAAT
GCCGGTTATA GAACTTTAGT ACTTTCTGGC GATCGTGATG CATTCCAACT GATAGATGAC
AATATTACTG TTTTATATCC AGGACAGCAT TTTAAAGACT TAAAAGAAAT GGATAAGAAT
GCTGTTTATG AAAAATATCA TGTTACGCCT AAGCAATATC CTGATTTAGC AGCGTTAAGA
GGAGAAACTG CAGATAATAT TCCAGGAGTT CCAGGTGTTG GAGATGGGTA CGCTGCTAAA
TGGATTAATC AATTCGGCAG CCTAGAAAAT ATTATTGAAC ATTCGAATGA GATTAGCGGT
AAAAAGGGAG AAGCTTTAAG AGAAAATATT GATCAAGTTC GTTTAAACCG TAAAGTTAAT
GCTTTAGTTT GCGATTTGGA TTTAGGCGTA ACTGTAGATG ATTTGTGCTT TGGTGATGTT
GATGCTGCAA AATTAGGAAT CTTGCTGTCT CAACTTGAAT TTGGCGAATT AAAAAAGAGA
AAGATTCTCA AAACTTTTAG CAACGGTATG CAACAATCTA TCGAAAATAA TATGCGTATA
ATGTTTTCGT CTACGTATGA AGGTGACGCT AATGACGATA AGCTAGCTCA AAGTGCTCAA
TCAAATATTG ACGCAGAAAA TTTTCAAGAA ATGCTAGCAG GCATTGACTT ACATCATGTT
AAAACTGTAG AGGACTTTAT TGATTGGGAG AATTCAGTTT CTTCTTTGCT TTATACAAAA
ATAAAGAAAG ATTCAAAAAA TACTGAAGAT ACTTTCGATC AAGTAAGTTC TGAGGAAGAG
AATTTTGAAG AATTAAATTC CGAAAAAATT ATAAATAATT CTCGTAAATG TGCTGAACCT
ATACTGAAAA ATCAAGAAGT AGTGATATAC GCTCATGAGT GCAATAAGAA TTCAAAACAT
CATTTTGAGT TGATTGCAGA TTGTTTAATA ATTCTTATTG CTAATAAAGC TGCAATATTA
TATACGACTG ATTTGCAATC GTCAGATCAA GCTTTGCTTG AGATGTTGCG AAAGTTCTTG
CAAAAAAATT CCTCAATAAC AGTATTGCAT GACTATAAAA AGCATTTGGG ATTGCTAAAA
TCGCTAAAAT TATTCGATTC GAATAATTCA TTTGGCACTC CACTTCTTGA TACTCGATTG
GCATCATATT TATTGGAGCC AGATTTTCGA GCTGAGACTT TAGAAAATAT GGCTGAGCAT
TATTTAGATA TTCAGTCAGT TGAAGATAAT TTGGGTGAAA ATGAAGCTAA GCAAGGTGAG
CTTGATCTTC TTATTGACAA TGAAGTGTCA TCTAGTGAAG AAGAAACTAA TGAAAAGCAA
ACTCAAGCAC GTGAAGCAAA ATTCTTGAAA ACTGCGCTTC TTATTCGTCT TTTGGCTCTT
CATTTTGCTT CAACACTTGA CGATAGGAAA CAAACTGGTT TGATGACTGA TTTGGAAATG
CCTATTTCGC GAGTTTTGCG TGGCATGGAA GACGTCGGTG CTGCAGTCGA TATGCAACGT
TTAAGTGGTA TGCTTAAGCA ATTTTCTGCA GATGCTGAGT TTGCGCAAAA TATGGCTTGG
AAAGTTGCAG GTAGTTCTGT TAATTTACAA AGCCCTAAGC AGTTGCAGCA AATACTTTTT
GAGCATTTTG GTCTTAAACC TACTCGTCGC ACAAAGAATG GCTCGTACAC GACTAATGCC
GAAGCTTTGC AAGCTATGTA CGTTAAACTC GATCCCGAAG AGCCTGCAAG TCAATTCTTG
GGTGCTTTAC TTAAACATCG CGAAACTAAT AAATTGAAGC AGATTGTGCA AAGTTTAATT
GATGCGGCTC ATTATGACGA TAGGCGCATC CATACAACTT TTGAGCAGAC TATTGCAGCA
ACTGGTCGTT TAAGTTCTGC TGATCCTAAT CTTCAAAATA TTCCTAATAG GAATGCTGCT
GGTCGTGAAA TTCGTGCTGC TTTTATTCCA GGAGAAGAAT TTGAATATTT AATGAGTTGT
GATTATTCAC AAGTTGAGCT TCGTATTATG GCTCATGCAT CTCATGATAA AACTCTTATT
GAAGCATTCC GTTCCGGCGT AGATTTTCAT AAATATGTTG CTAGCTTAGT TTATAAGATT
CCTGTTGAAC AAGTAACGCC GGATCAACGT TCTCATGTAA AAGCTATGAG CTATGGATTG
GCATATGGAT TGAGTACTTA TGGGCTGGCA AAACAGCTTT CTATTAGTCC GGCTGAAGCT
GAAATGTTAA AAAATAAGTA TTTTGATACT TTTGGCAATG TTCATGACTA TTTGGAATCT
CTTGTGTCGA AGGCAAAAGA ACTTGGGTAC ACAGAAACTA TGTTTGGTCG ACGTAGGTAT
TTCCCGCAAC TTTCTTCCAA TAATCGTGCA TCTCGTGAAG CTGCTGAGCG CGGAGCTCTT
AATGCTCCAA TACAAGGTAC TGCTGCAGAT ATTATGAAAC TTGCTATGCT TCGTGTTGAT
TTGGGATTGC GTGAAGCAAA AGTGCGCAGC CGCGTAATAT TGCAAATACA TGATGAATTG
ATTCTGGAAA TTTCACATGG TGAGCAAGCG CAGGTAGAAA ATATTGTGCG TAAAGCTATG
GAAAATGCTG TGCATTTAGA CGTTCCTCTT AGTGTTTCTA CTGGAGTTGG TGTAGATTGG
CAACTAGCAG CGCATTAG
 
Protein sequence
MNTDIVEEHS SELKNEKGTL LVVDGHSLAF RAFFAISADN FSTRSGQATN SIWGFITMLS 
QVVKKEKPDR LAIAFDVKGG TFRNTMLPQY KGTRDAAPQE LLSQLPIIQQ MLKALGVTYI
EKPGYEGDDV IGTLAVMGSN AGYRTLVLSG DRDAFQLIDD NITVLYPGQH FKDLKEMDKN
AVYEKYHVTP KQYPDLAALR GETADNIPGV PGVGDGYAAK WINQFGSLEN IIEHSNEISG
KKGEALRENI DQVRLNRKVN ALVCDLDLGV TVDDLCFGDV DAAKLGILLS QLEFGELKKR
KILKTFSNGM QQSIENNMRI MFSSTYEGDA NDDKLAQSAQ SNIDAENFQE MLAGIDLHHV
KTVEDFIDWE NSVSSLLYTK IKKDSKNTED TFDQVSSEEE NFEELNSEKI INNSRKCAEP
ILKNQEVVIY AHECNKNSKH HFELIADCLI ILIANKAAIL YTTDLQSSDQ ALLEMLRKFL
QKNSSITVLH DYKKHLGLLK SLKLFDSNNS FGTPLLDTRL ASYLLEPDFR AETLENMAEH
YLDIQSVEDN LGENEAKQGE LDLLIDNEVS SSEEETNEKQ TQAREAKFLK TALLIRLLAL
HFASTLDDRK QTGLMTDLEM PISRVLRGME DVGAAVDMQR LSGMLKQFSA DAEFAQNMAW
KVAGSSVNLQ SPKQLQQILF EHFGLKPTRR TKNGSYTTNA EALQAMYVKL DPEEPASQFL
GALLKHRETN KLKQIVQSLI DAAHYDDRRI HTTFEQTIAA TGRLSSADPN LQNIPNRNAA
GREIRAAFIP GEEFEYLMSC DYSQVELRIM AHASHDKTLI EAFRSGVDFH KYVASLVYKI
PVEQVTPDQR SHVKAMSYGL AYGLSTYGLA KQLSISPAEA EMLKNKYFDT FGNVHDYLES
LVSKAKELGY TETMFGRRRY FPQLSSNNRA SREAAERGAL NAPIQGTAAD IMKLAMLRVD
LGLREAKVRS RVILQIHDEL ILEISHGEQA QVENIVRKAM ENAVHLDVPL SVSTGVGVDW
QLAAH