Gene HMPREF0424_0365 details


Gene Information

Locus tag: HMPREF0424_0365
Symbol:
ID: 8709855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gardnerella vaginalis 409-05
Kingdom: Bacteria
Replicon accession: NC_013721
Strand:
Start bp: 393202
End bp: 396273
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 45%
IMG OID: 646482481
Product: putative ATP-dependent DNA helicase PcrA
Protein accession: YP_003373615
Protein GI: 283782861
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID: [TIGR01073] ATP-dependent DNA helicase PcrA
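
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(393202, 396273)
print(length)                      # 3072
print(protein_length_aa(length))   # 1023
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.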


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTAA GTATGGGCGC AAGTATGAAT GCAAATATGA GTGCAAAGGA TAGCGATTAC 
GGATATGTAA GTGATTTGTT GGGGAATCCA GTTGATTCGT TCGATTCTGA CGATCGTTTT
TACGAGAATG TTGCGGATGC GAATGATTAT GCCAACTACA GTTACGACGA GTATGCTGAC
GCTTACGCTA AAGAGCCTGA TTTTACGCAG CAAGAATATG CGTCTGATCA AGAATCAGTG
CCTCCTCAAG AATCAGTGCC TCCTCAAGAA TCAGTGCCTC CTCAAGAATC AGTGCCTTCC
CAAGAATCAG TGCCTTCCCA ACAAGAAGCT CTTGCCCAAC AAGAACTGCT TCCTCAAGAA
TCTCAAGAAT ATTCCCAAAA TTTCGCACAT TCTAGAGTTG CAGTTGACGC GCAAAAACTA
CTCGACGGCT TAAATCCGCA ACAATCTCAA GCAGTCCAGT ACGACGGACC GGCACTGCTT
ATTGGAGCAG GAGCTGGAAG CGGCAAAACG CGAGTTCTTA CGCGAAGAAT AGCTTGGATA
TTAAGCCAAA AAGGTGCCTG GCCAAGCCAA ATTCTTGCCA TCACTTTTAC AAACAAAGCA
GCTGCCGAAA TGCGTGAACG TTTAAGCAAA CTTATTGGTA GCGAAGCAAA TACTATGTGG
GTTTCGACGT TCCACTCTGC ATGTGTGAAA ATACTTCGCA GATCAGGACA GTATATTGGC
TTAAAATCAG GCTTTTCTAT TTACGATACT TCCGATTGCG AGCGATTAGT AAAAATTATT
GCAACAGAGC TTAACGTTGA TATTAAGCGT TTTACGCCAC GCAGCATTTT GGGCAAGATT
TCGGATTGCA AAAATAGTTT AATCACTTGG CGTGAGCAAC TAGACATGTA TGCTAACGAT
TACAAGCCTG GTGTTGCAGG TCAACAAATT GCGCACGCTG GCAATTCTGA AGCTGTTTAC
GCAACTATTT ATGCGGAATA TCAGCATCGT TTATCTCAAG CAAATGCGGT TGATTTTGAC
GATTTAATTA TGCGAACTGT GCAACTCATG CGCGAAGTAC CAGAAGCTGC GCAATATTAC
CGCCATAAGT TCCGCTATAT TCTTGTAGAC GAGTACCAGG ATACGAATCA CGCTCAATAC
GAGCTTATTC GCGAGCTTGC TGGAGTGGAT GTAAAGCAAA ATAGTGCAAA TCCATCTCAG
CAAAATAACA CTCCTGCATC AATCACAGTT GTAGGCGACT CCGACCAGTC GATTTACGCT
TTCCGCGGAG CAGATATTCG CAATATTCAA GACTTTGAAA AAGATTTCCC AAACGCTACA
ACTATTATGC TTGAGCAAAA TTATCGCTCT ACGCAAACTA TTTTGGATGC TGCAAATGCA
GTTATTTCGC ATAATCAAGG ACGAAAGCCT AAAAAATTGT GGACGGCATT AGGTAAGGGC
ACTCCAATTA CTGGTTATGC AGCAGACAGT GCTCAACAGG AAGCAGCTTG GGTTGCGCAA
GAAATAGCGC GATTAGCAGG TGAAGAGGGC GTTGCTTATT CGGATATGGC GATTATGTAT
CGCGCTAATG CACAGTCTCG TTCGCTGGAA GATGCGCTCG TTAAAGCAGG TCTTCCGTAT
CAACTTGTTG GCGGCACGAA GTTTTACGAG CGTCGCGAAG TAAAAGATGC GCTCGCATAC
TTGCAATCTA TGGCAAACCC AGATGACGAT GTGAATATGC GCCGAATTTT GAATGTGCCT
AAGCGTGGGC TTGGAGCGCG TGCGGAGGCT TTGCTAACGC TTTACGCTCA AACACAAGGA
GTTAGCTTCT GGCAAGCTTT GGAGCATTTG GAGTCAATCG AAGGCATGCC AACTCGCACT
GCAACCAAGC TTAAAGAGTT CCGCGAGCTT ATGCACAATT TAATTACTTG TATGCAAAAC
GATGACTCGA AACCTTCTAA AGTTATTGAC AGCATACTGA ACGACAGTGG TTTGCTGGAG
GATTTGCAAC GTTCTGAAGA TCCTCAAGAT GCCGCTCGCG TGGATAATTT GTCACAATTG
CAGTCGGTTG CAGCTGAGTA TGAGCAGAAT ACGCCGGATG CTAGTGTTGC TGGTTTCCTT
GAAACGACGG CGCTTGTGGC TGATTCGGAT CAATTGCCGG ATGAAAATGA GGACACTGGT
AAAGTTACGT TGATGACGCT TCACACGGCT AAGGGTTTGG AGTATCCGGT TGTGTTCCTA
ACTGGCATGG AGCAGGGAAC TTTTCCGCAT TCTCGCGCTT TGGAAGACGA TGGCGAGCTT
TGCGAGGAGC GTCGTCTTGC TTATGTTGGT ATTACGCGAG CTAAGCAGCG TTTGTATGTT
ACGCGTGCTG CTGTTCGTGC GCAATGGGGG CAGGCTCAAG ATATGCTTCC AAGCCAGTTT
TTGGATGAGA TTCCAGATAA TTTGATTGAT TGGAAGCGCA GAGAGTCTGA TGTTGAGCGT
TGGGGTGCTT CGTCTAGACG CGGCGATGAT TTCGATAGCG ATTTTGATAG TGATTTTGGC
GGCGATTTTG ATAGCGACTT CGGCGGCTGG GATGACGATT ATAGCGATGC GTTTAGTGAA
AAAACAACTT ATGGTGGAAG CAGTTACAGT TCTGAAAAAT CGCATGGCTC TAAGTCTTAC
GGAAGAAATA AATCGTATGG CTCGAGTTCT TACGGAAGAA ATAAATCGTA TGGCTCGAGT
TCTTACGGAT CTTCTTACGG AAAGTCTTAT AGCAAGTCTT ATGGCTCTTC TTACGGTTCT
TCTTACGGCT CGTCTTATTC TAAATCGAAT TGGTCTAAGT CAAGTGCAGT AAAAACGCGC
AAAGTTACCG CAAAAACTGC GGCTTCAACA AACGCAACAA AAGTTGGTTC AACTTCCGCA
GTTATGAATA AAGACAATCA TCTAAATATT CAAGACTTCC AAGTTGGCGA CAAAATTACT
CACGATACTT ACGGCTTAGG AACTGTGCTA GCAACGCAAG ATAAGGGCAG AAACTCAATT
ATTACAGTCG ATTTCGGCTC CGATGGAGTA AAGCGACTAA TGCTTAGAGT TGCACCAATC
GAGAAATTGT AA
 
Protein sequence
MSVSMGASMN ANMSAKDSDY GYVSDLLGNP VDSFDSDDRF YENVADANDY ANYSYDEYAD 
AYAKEPDFTQ QEYASDQESV PPQESVPPQE SVPPQESVPS QESVPSQQEA LAQQELLPQE
SQEYSQNFAH SRVAVDAQKL LDGLNPQQSQ AVQYDGPALL IGAGAGSGKT RVLTRRIAWI
LSQKGAWPSQ ILAITFTNKA AAEMRERLSK LIGSEANTMW VSTFHSACVK ILRRSGQYIG
LKSGFSIYDT SDCERLVKII ATELNVDIKR FTPRSILGKI SDCKNSLITW REQLDMYAND
YKPGVAGQQI AHAGNSEAVY ATIYAEYQHR LSQANAVDFD DLIMRTVQLM REVPEAAQYY
RHKFRYILVD EYQDTNHAQY ELIRELAGVD VKQNSANPSQ QNNTPASITV VGDSDQSIYA
FRGADIRNIQ DFEKDFPNAT TIMLEQNYRS TQTILDAANA VISHNQGRKP KKLWTALGKG
TPITGYAADS AQQEAAWVAQ EIARLAGEEG VAYSDMAIMY RANAQSRSLE DALVKAGLPY
QLVGGTKFYE RREVKDALAY LQSMANPDDD VNMRRILNVP KRGLGARAEA LLTLYAQTQG
VSFWQALEHL ESIEGMPTRT ATKLKEFREL MHNLITCMQN DDSKPSKVID SILNDSGLLE
DLQRSEDPQD AARVDNLSQL QSVAAEYEQN TPDASVAGFL ETTALVADSD QLPDENEDTG
KVTLMTLHTA KGLEYPVVFL TGMEQGTFPH SRALEDDGEL CEERRLAYVG ITRAKQRLYV
TRAAVRAQWG QAQDMLPSQF LDEIPDNLID WKRRESDVER WGASSRRGDD FDSDFDSDFG
GDFDSDFGGW DDDYSDAFSE KTTYGGSSYS SEKSHGSKSY GRNKSYGSSS YGRNKSYGSS
SYGSSYGKSY SKSYGSSYGS SYGSSYSKSN WSKSSAVKTR KVTAKTAAST NATKVGSTSA
VMNKDNHLNI QDFQVGDKIT HDTYGLGTVL ATQDKGRNSI ITVDFGSDGV KRLMLRVAPI
EKL
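
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below strips it, computes GC content, and translates a nucleotide string. It is a minimal illustration, not the database's own method: the helper names are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares (table 11 differs in which codons may serve as start codons, a point this sketch ignores because the gene begins with ATG).

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCGTAA GTATGGGCGC AAGTATGAAT GCAAATATGA GTGCAAAGGA TAGCGATTAC"
print(translate(first_line))  # MSVSMGASMNANMSAKDSDY
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.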