Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1273 |
Symbol | topA |
ID | 8709654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | - |
Start bp | 1519537 |
End bp | 1522383 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 646483361 |
Product | DNA topoisomerase |
Protein accession | YP_003374463 |
Protein GI | 283783709 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.188892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCAC AGAATAAGCT AGTCATTGTG GAGTCTCCCA CGAAAGCGCG AAAAATTGGC GGGTATTTAG GGAATGGCTA CACCGTCATG GCTTCAGTTG GGCATATTCG CGATCTCGCT CAGCCAAGCC AAGTTCCAGC CTCACGCAAA GCAGCGTTTG GCAAGTTTGG TGTAGATGTC GATCATGGTT TTGCGCCATA TTACGTAGTT GGCTCAGATA AAAAGAAAAC TGTTTCCGAT TTAAAGTCTG CGCTCGCAAA AGCTGACGAA TTATATCTGG CAACTGATGA GGATCGCGAA GGTGAAGCTA TTGCGTGGCA CTTGGTAGAA GCGTTGAAGC CAACTGTGCC AGTAAAGCGT ATGGTGTTTC ATGAGATTAC TAAGGATGCA ATTCAAGCCT CGCTTAGTAA TACTCGAAAT GTTGACGACA ATATGGTTGA TGCGCAAGAA ACTCGTCGTG TGTTAGACCG TTTGTATGGA TATGAGCTTT CTCCTGTTTT GTGGAGGAAA GTTGGTCCTG GTCTTTCTGC TGGACGCGTG CAGTCTGTTG CTACTAGGTT GATTGTTGAG CGCGAACGCG AGCGTATGGC GTTTACTAAG GCGTCTTACT GGGATATTAG CGCGGTTTTA AGTTCTAAAG GCTCAGATGG CGAGAGTGTT GATTTTGAAG CGCGCATGAG CGAGCTTTCT GGTCGTCGTT TGGCTGGTTC TAAAGATTTC AATTCAAAAG GTGAGTTGGT CTCTACAAAA GATGAATCAC AAAAAGCTTT GCATGTTGAT GCTGATTTTG CATCTAAGCT TTCTAAAGCT CTTGAGAATT CAGATTTTGT AGTTGACTCT ATGGAAACGA AGCCGTATCG TCGTCGTCCT TTGCCACCTT TTACTACATC AACTTTGCAG CAAACTGCTG GAAACCGACT TTCAATGAGC TCTCGCCAAA CTATGCGAGC TGCACAATCT CTATACGAAA ACGGCTATAT CACTTACATG CGTACGGATT CCGTAACGCT TTCCAAAGAA GCTATCGAAG CTGCGCGTAG TGCTGCTCGT GCAGCATTTG GCGACGAATA TGTTTCGCAA TCTCCTAAGC AGTACGCAAC TACGTCTGCA GGAGCGCAGG AAGCTCATGA ATGTATTCGC CCTGCTGGAG CGCGTTTCTT AAGCCCGGAC GAGCTTGCAG ATAAATTACC TGCGGATCAG CTAAAATTGT ATACGCTTAT TTGGCAACGA ACCCTTGCGT CACAAATGGC TGATGCTACA GGTTTTACAG CAACTGTTAA GTTAAATGCT TCTGCTGGAG AATATGGAGA AGCTTTGTTC CAAGCTTCTG GAACAGTAAT TACTTTTGCT GGTTTTATGA AGGTTTTTGG CAATGCGCAT GCATCTGAAG GTGAAAGCGA TAAGGCACTT CCTCAAATGC AAGCTGGGGA TGTTCTTGAA GCTAAAAGTG TTAGTGCGGA TTCTCACGAA ACTCAGTCTC CTGCAAGATA TACTGAAGCT TCTTTAGTTA AAACTTTGGA AGCTAAGGAA ATAGGACGCC CTTCTACTTA CGCAACGATT ATTTCTACAA TTATTGATCG CGGATACGTG TATGAGCGTG GACGTGCGTT AATTCCTTCT TGGCTTGCTT TTGCTGTAAT TAAACTTTTG GAAGCGAATT TCCCAAAGTA TGTTGATTAC GCGTTTACTG CTGATATGGA AAATGGTTTG GACAGAATTG CGCACGGTGA AGAAACCGGT CGCGATTGGC TAACTAGATT CTATTTCGGT TCCGGTGAGG GTGCTGCTAA TTCTGCTGAT GAAGCTCATA TTGGTTTGCA ACAGCAGGTT GCAGAGCTTG GTGAAATTGA TGCGCGTGAA ATAAATACCA TAGACATTGG CGATGGTTTG CATGTGCGTA TTGGACGCTA TGGTCCGTAC TTGGAAGACA TTAAGAATCT TGATGCTGAA GGCAACCCTC GTCATGCTTC TTTGCCAGAA ACTTTAGCTC CAGATGAGTT AACTGTTGAT GCTGCTCGAG AATTGCTTGA GAATAATGCT GAAGGTCCAC GTGTGCTTGG AGTAGATCCA GAAACGGGTG GGAACGTAGA AGTGCGCAAT GGTCGTTTTG GTCCGTACGT GGCACTGGTA GAAGAGCAGG ATAACGCGGA AGATTCTAAG TCTTCTAAGG CTTCTAAAGC TCGTCCAAAA ATGGCTTCTT TGTTTAAAAC CATGGATCCA GCGACTTTGA CTTTGCAAGA AGCGTTGCAA CTCTTGAATT TGCCACGTTT GGTTGGTGAG TATGAAGAAG TTGATGCCGA AGGCGTTGTA AAACTAGCTC GTATCGAAGC AAATAATGGT CGTTACGGTC CATATTTAAC TAAAACATAT TCTGCTGTAG ATACTTCTGC TGGAGAAACT GTGGAATCTA AGCCGGATAC GCGATCCCTT TCTAGTGAAG ATGCTATTTT TACTGTTACT TTGCAAGAAG CGAAAGATTT ATTTGCGCAA CCAAAGTACG TTAAGCGTAC TCGTGGTGCC GCCAAGCCGC CTCTTCGTGA GCTTGGCGCA GATCCTGAAA CTGGAAAGCC AGTAGTGATT AAGGATGGTT TCTACGGAGC TTATATTACT GATGGTGAAA CGAATCGCAC TTTGCCAAAG CAGTATACGC CTGAATCGAT TGATCCGCAG GATGCGTTTG CACTTTTGGC GCAAAAGCGT GCCGCAGGTC CCGTAAAACG CAAAAAGCGC GCGACTAAGA GCACTGCAAA ATCTTCTGAA AAGAAGTCTA CTGCGAAAAA ATCTACCGCA AAGAAAACTT CTACAAAGAA GACTACTTCA AAGAAATCTA CCGCTAAAAA AGCATAG
|
Protein sequence | MAAQNKLVIV ESPTKARKIG GYLGNGYTVM ASVGHIRDLA QPSQVPASRK AAFGKFGVDV DHGFAPYYVV GSDKKKTVSD LKSALAKADE LYLATDEDRE GEAIAWHLVE ALKPTVPVKR MVFHEITKDA IQASLSNTRN VDDNMVDAQE TRRVLDRLYG YELSPVLWRK VGPGLSAGRV QSVATRLIVE RERERMAFTK ASYWDISAVL SSKGSDGESV DFEARMSELS GRRLAGSKDF NSKGELVSTK DESQKALHVD ADFASKLSKA LENSDFVVDS METKPYRRRP LPPFTTSTLQ QTAGNRLSMS SRQTMRAAQS LYENGYITYM RTDSVTLSKE AIEAARSAAR AAFGDEYVSQ SPKQYATTSA GAQEAHECIR PAGARFLSPD ELADKLPADQ LKLYTLIWQR TLASQMADAT GFTATVKLNA SAGEYGEALF QASGTVITFA GFMKVFGNAH ASEGESDKAL PQMQAGDVLE AKSVSADSHE TQSPARYTEA SLVKTLEAKE IGRPSTYATI ISTIIDRGYV YERGRALIPS WLAFAVIKLL EANFPKYVDY AFTADMENGL DRIAHGEETG RDWLTRFYFG SGEGAANSAD EAHIGLQQQV AELGEIDARE INTIDIGDGL HVRIGRYGPY LEDIKNLDAE GNPRHASLPE TLAPDELTVD AARELLENNA EGPRVLGVDP ETGGNVEVRN GRFGPYVALV EEQDNAEDSK SSKASKARPK MASLFKTMDP ATLTLQEALQ LLNLPRLVGE YEEVDAEGVV KLARIEANNG RYGPYLTKTY SAVDTSAGET VESKPDTRSL SSEDAIFTVT LQEAKDLFAQ PKYVKRTRGA AKPPLRELGA DPETGKPVVI KDGFYGAYIT DGETNRTLPK QYTPESIDPQ DAFALLAQKR AAGPVKRKKR ATKSTAKSSE KKSTAKKSTA KKTSTKKTTS KKSTAKKA
|
| |