Gene HMPREF0424_1273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1273 
SymboltopA 
ID8709654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1519537 
End bp1522383 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content45% 
IMG OID646483361 
ProductDNA topoisomerase 
Protein accessionYP_003374463 
Protein GI283783709 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.188892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCAC AGAATAAGCT AGTCATTGTG GAGTCTCCCA CGAAAGCGCG AAAAATTGGC 
GGGTATTTAG GGAATGGCTA CACCGTCATG GCTTCAGTTG GGCATATTCG CGATCTCGCT
CAGCCAAGCC AAGTTCCAGC CTCACGCAAA GCAGCGTTTG GCAAGTTTGG TGTAGATGTC
GATCATGGTT TTGCGCCATA TTACGTAGTT GGCTCAGATA AAAAGAAAAC TGTTTCCGAT
TTAAAGTCTG CGCTCGCAAA AGCTGACGAA TTATATCTGG CAACTGATGA GGATCGCGAA
GGTGAAGCTA TTGCGTGGCA CTTGGTAGAA GCGTTGAAGC CAACTGTGCC AGTAAAGCGT
ATGGTGTTTC ATGAGATTAC TAAGGATGCA ATTCAAGCCT CGCTTAGTAA TACTCGAAAT
GTTGACGACA ATATGGTTGA TGCGCAAGAA ACTCGTCGTG TGTTAGACCG TTTGTATGGA
TATGAGCTTT CTCCTGTTTT GTGGAGGAAA GTTGGTCCTG GTCTTTCTGC TGGACGCGTG
CAGTCTGTTG CTACTAGGTT GATTGTTGAG CGCGAACGCG AGCGTATGGC GTTTACTAAG
GCGTCTTACT GGGATATTAG CGCGGTTTTA AGTTCTAAAG GCTCAGATGG CGAGAGTGTT
GATTTTGAAG CGCGCATGAG CGAGCTTTCT GGTCGTCGTT TGGCTGGTTC TAAAGATTTC
AATTCAAAAG GTGAGTTGGT CTCTACAAAA GATGAATCAC AAAAAGCTTT GCATGTTGAT
GCTGATTTTG CATCTAAGCT TTCTAAAGCT CTTGAGAATT CAGATTTTGT AGTTGACTCT
ATGGAAACGA AGCCGTATCG TCGTCGTCCT TTGCCACCTT TTACTACATC AACTTTGCAG
CAAACTGCTG GAAACCGACT TTCAATGAGC TCTCGCCAAA CTATGCGAGC TGCACAATCT
CTATACGAAA ACGGCTATAT CACTTACATG CGTACGGATT CCGTAACGCT TTCCAAAGAA
GCTATCGAAG CTGCGCGTAG TGCTGCTCGT GCAGCATTTG GCGACGAATA TGTTTCGCAA
TCTCCTAAGC AGTACGCAAC TACGTCTGCA GGAGCGCAGG AAGCTCATGA ATGTATTCGC
CCTGCTGGAG CGCGTTTCTT AAGCCCGGAC GAGCTTGCAG ATAAATTACC TGCGGATCAG
CTAAAATTGT ATACGCTTAT TTGGCAACGA ACCCTTGCGT CACAAATGGC TGATGCTACA
GGTTTTACAG CAACTGTTAA GTTAAATGCT TCTGCTGGAG AATATGGAGA AGCTTTGTTC
CAAGCTTCTG GAACAGTAAT TACTTTTGCT GGTTTTATGA AGGTTTTTGG CAATGCGCAT
GCATCTGAAG GTGAAAGCGA TAAGGCACTT CCTCAAATGC AAGCTGGGGA TGTTCTTGAA
GCTAAAAGTG TTAGTGCGGA TTCTCACGAA ACTCAGTCTC CTGCAAGATA TACTGAAGCT
TCTTTAGTTA AAACTTTGGA AGCTAAGGAA ATAGGACGCC CTTCTACTTA CGCAACGATT
ATTTCTACAA TTATTGATCG CGGATACGTG TATGAGCGTG GACGTGCGTT AATTCCTTCT
TGGCTTGCTT TTGCTGTAAT TAAACTTTTG GAAGCGAATT TCCCAAAGTA TGTTGATTAC
GCGTTTACTG CTGATATGGA AAATGGTTTG GACAGAATTG CGCACGGTGA AGAAACCGGT
CGCGATTGGC TAACTAGATT CTATTTCGGT TCCGGTGAGG GTGCTGCTAA TTCTGCTGAT
GAAGCTCATA TTGGTTTGCA ACAGCAGGTT GCAGAGCTTG GTGAAATTGA TGCGCGTGAA
ATAAATACCA TAGACATTGG CGATGGTTTG CATGTGCGTA TTGGACGCTA TGGTCCGTAC
TTGGAAGACA TTAAGAATCT TGATGCTGAA GGCAACCCTC GTCATGCTTC TTTGCCAGAA
ACTTTAGCTC CAGATGAGTT AACTGTTGAT GCTGCTCGAG AATTGCTTGA GAATAATGCT
GAAGGTCCAC GTGTGCTTGG AGTAGATCCA GAAACGGGTG GGAACGTAGA AGTGCGCAAT
GGTCGTTTTG GTCCGTACGT GGCACTGGTA GAAGAGCAGG ATAACGCGGA AGATTCTAAG
TCTTCTAAGG CTTCTAAAGC TCGTCCAAAA ATGGCTTCTT TGTTTAAAAC CATGGATCCA
GCGACTTTGA CTTTGCAAGA AGCGTTGCAA CTCTTGAATT TGCCACGTTT GGTTGGTGAG
TATGAAGAAG TTGATGCCGA AGGCGTTGTA AAACTAGCTC GTATCGAAGC AAATAATGGT
CGTTACGGTC CATATTTAAC TAAAACATAT TCTGCTGTAG ATACTTCTGC TGGAGAAACT
GTGGAATCTA AGCCGGATAC GCGATCCCTT TCTAGTGAAG ATGCTATTTT TACTGTTACT
TTGCAAGAAG CGAAAGATTT ATTTGCGCAA CCAAAGTACG TTAAGCGTAC TCGTGGTGCC
GCCAAGCCGC CTCTTCGTGA GCTTGGCGCA GATCCTGAAA CTGGAAAGCC AGTAGTGATT
AAGGATGGTT TCTACGGAGC TTATATTACT GATGGTGAAA CGAATCGCAC TTTGCCAAAG
CAGTATACGC CTGAATCGAT TGATCCGCAG GATGCGTTTG CACTTTTGGC GCAAAAGCGT
GCCGCAGGTC CCGTAAAACG CAAAAAGCGC GCGACTAAGA GCACTGCAAA ATCTTCTGAA
AAGAAGTCTA CTGCGAAAAA ATCTACCGCA AAGAAAACTT CTACAAAGAA GACTACTTCA
AAGAAATCTA CCGCTAAAAA AGCATAG
 
Protein sequence
MAAQNKLVIV ESPTKARKIG GYLGNGYTVM ASVGHIRDLA QPSQVPASRK AAFGKFGVDV 
DHGFAPYYVV GSDKKKTVSD LKSALAKADE LYLATDEDRE GEAIAWHLVE ALKPTVPVKR
MVFHEITKDA IQASLSNTRN VDDNMVDAQE TRRVLDRLYG YELSPVLWRK VGPGLSAGRV
QSVATRLIVE RERERMAFTK ASYWDISAVL SSKGSDGESV DFEARMSELS GRRLAGSKDF
NSKGELVSTK DESQKALHVD ADFASKLSKA LENSDFVVDS METKPYRRRP LPPFTTSTLQ
QTAGNRLSMS SRQTMRAAQS LYENGYITYM RTDSVTLSKE AIEAARSAAR AAFGDEYVSQ
SPKQYATTSA GAQEAHECIR PAGARFLSPD ELADKLPADQ LKLYTLIWQR TLASQMADAT
GFTATVKLNA SAGEYGEALF QASGTVITFA GFMKVFGNAH ASEGESDKAL PQMQAGDVLE
AKSVSADSHE TQSPARYTEA SLVKTLEAKE IGRPSTYATI ISTIIDRGYV YERGRALIPS
WLAFAVIKLL EANFPKYVDY AFTADMENGL DRIAHGEETG RDWLTRFYFG SGEGAANSAD
EAHIGLQQQV AELGEIDARE INTIDIGDGL HVRIGRYGPY LEDIKNLDAE GNPRHASLPE
TLAPDELTVD AARELLENNA EGPRVLGVDP ETGGNVEVRN GRFGPYVALV EEQDNAEDSK
SSKASKARPK MASLFKTMDP ATLTLQEALQ LLNLPRLVGE YEEVDAEGVV KLARIEANNG
RYGPYLTKTY SAVDTSAGET VESKPDTRSL SSEDAIFTVT LQEAKDLFAQ PKYVKRTRGA
AKPPLRELGA DPETGKPVVI KDGFYGAYIT DGETNRTLPK QYTPESIDPQ DAFALLAQKR
AAGPVKRKKR ATKSTAKSSE KKSTAKKSTA KKTSTKKTTS KKSTAKKA