Gene Information |
Locus tag | PG0754 |
Symbol | topA |
ID | 2552750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 801083 |
End bp | 803449 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637149499 |
Product | DNA topoisomerase I |
Protein accession | NP_905022 |
Protein GI | 34540543 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT TAGTCATCGT AGAGTCTCCG GCAAAAGCCA AAACGATAGG ACGTTTCCTC GGTTCGGATT ATACAGTACT CTCGAGTTAC GGACACATCC GCGACCTCAA GCCCAATAAA TTCAGTGTAG ATATACAAAA CAACTACGAG CCGGAATACG AGATCCCTGC CGATAAACGT CCGGTAGTCA AGGAACTGAA ATCCCAAGCC GACCGATCGG ATTTCATCTG GCTGGCTTCC GATGAGGATC GCGAAGGAGA GGCCATCGCA TGGCATTTGT ACGAAGCATT AGGGCTGAAA AACAAACAGA CCAAGCGAAT TGTATTTCAC GAAATCACCG AGACAGCCAT CAGAGCTGCT ATCGAAAATC CACGAGATAT AGACATCAAT CTGGTCGATG CCCAACAGGC GAGGCGCGTC CTCGACCGTA TCGTCGGCTT CGAACTTTCT CCCGTTTTAT GGAGACGTAT TCGTCCTTCT CTTTCGGCAG GGCGTGTACA GTCCGTTGCA CTGCGTCTTA TCGTTGAGAG AGAGCGTGAG ATAAATGCTT TCGTGCCGGA AGCATCCTTC CGCTGTACGA TAGAGTTCGT CCTTCCCGAC GGTAGAATGC TGACGGCGGA ATTGCAGAAA CGATTCAAGA CGAAGGAGGA AGCCCGATAT TTCTTGGAGC AATGTATGGA TGCCCACTTT CATATAACAG ACGTTACGAA GCGTCCGGGC AAACGTTCTC CGGCCACTCC TTTTACTACC TCGACGCTCC AGCAGGAAGC AGCCCGAAAA CTCGGCTACG GTGTGGCACA GACCATGCGT ATCGCTCAGA AGTTGTACGA AGAAGGGCTT ATCACCTATA TGCGTACAGA CTCTGTGAAT CTGTCGGATA TGGCCCTCGG TGCACTCAAA AAGGAAATAA CCGAACATTG GGGTGAGCAA TACTACCGGT TCCGCAGGTA CAAGACCAAA ACCAAAGGGG CACAAGAAGC CCACGAAGCC ATCCGTCCTA CTTATATACA TAGAGCAGAG ATAGACGGCA CCCCGCAAGA GCAAAAGCTA TATCAACTGA TCCGCCGCAG AACCATTGCT TCCCAGATGG CGGATGCCAT CCTCGAAAAA ACGACGATAA CAATCGGAAC CGACAAGTTT GCAGAGACCC TCAGCTCGCA AGGAGAGGTG ATCGTATTCG ACGGGTTCCT CGGAGTCTAT CGTGAAGATT CGGACGAAGA ACATGGCTCT GCAAATACCG AGGAGCAGCT ATTGCCCTCT GTCAAAGCCG GCGATACGCT CTCTCTGCAT CATGCAAAAG CAACGGAGAG CTTCACGCAG CGTCCGGCAC GCTATACGGA GGCCAGCCTC GTTCGTAAAA TGGAGGAGTT GGGCATCGGC AGGCCATCCA CCTATGCCCC CACTATCCAA ACCATACAGA ATCGCGAATA CGTAGTACGC GGCGACAAAC CGGGCAAAAC ACGCGAATAC ATCCTGCTGG AATACCACAA GGGGAAAGCC ATAACGGAGA CGATCAAAAC GGAACTGAAC GGACAGGACC GGAATAAGCT CCTCCCCACT GATATGGGGC TTGTGGTGAA CGACTTTCTC GTGGCTTCGT TCCCTCAGGT GATCGATTAC AACTTCACGG CCAAAGTGGA AAAAGAATTT GACCAAATAG CCGAAGGAAA ACTGCAATGG CAGAAGCAGA TCGGTCGGTT CTACAACAAA TTCCACCCGT TGGTGGCAGA AGCATGCGAG TTCGATCCCG ACCAGAAGAT CGGTGAAAGA ATGCTGGGTA CGGATCCTGT GACCGGAGAA 
TCTGTGGTAG CAAAAATGGG ACGCTACGGA GCGATGGTGC AGAAAGGTCG CACGGATAAG GAAAACGGTA TCAAGGCGCA GTTTGCCTCG CTCCAGCCGG GACAGTCCAT CGAATCCATC ACGCTGGAGG AAGCTTTGGA ACTATTCCTC CTGCCCAAGA AACTGGGACA ATATGAGGAT GCGGATGTAA TGGTAGCCGT AGGACGCTTC GGCCCTTATA TCAAGCATGC AGGCAAGTTT GTAGGGTTGC CAAAAGATAC CGAACCCCTT TCCGTTTCGC TTGACGATGC CATCAAGTAT ATCGCCGACA AACGCGAGAA GGAGGAAAAA AGCCTGATCA AAGGATTTGC AGAAGATCCG GAGATGGAGA TCCGCACAGG GCGTTTCGGC GTTTATATCA AATACAAAGG GAAAAACTAC AAAGTCCCCA AAACGGTGGA AGACCCGGAG AAACTCACCC TCGAAGAATG TCTGAAATAC GTGGAAGAGG GAGAGACGAA ACCGGCCAAG GGAAAGAAAA AAGCTCCGGC CAAAAAGACA TCGGCAAAGA AGACTGCCAA GAAATAA
|
Protein sequence | MKNLVIVESP AKAKTIGRFL GSDYTVLSSY GHIRDLKPNK FSVDIQNNYE PEYEIPADKR PVVKELKSQA DRSDFIWLAS DEDREGEAIA WHLYEALGLK NKQTKRIVFH EITETAIRAA IENPRDIDIN LVDAQQARRV LDRIVGFELS PVLWRRIRPS LSAGRVQSVA LRLIVERERE INAFVPEASF RCTIEFVLPD GRMLTAELQK RFKTKEEARY FLEQCMDAHF HITDVTKRPG KRSPATPFTT STLQQEAARK LGYGVAQTMR IAQKLYEEGL ITYMRTDSVN LSDMALGALK KEITEHWGEQ YYRFRRYKTK TKGAQEAHEA IRPTYIHRAE IDGTPQEQKL YQLIRRRTIA SQMADAILEK TTITIGTDKF AETLSSQGEV IVFDGFLGVY REDSDEEHGS ANTEEQLLPS VKAGDTLSLH HAKATESFTQ RPARYTEASL VRKMEELGIG RPSTYAPTIQ TIQNREYVVR GDKPGKTREY ILLEYHKGKA ITETIKTELN GQDRNKLLPT DMGLVVNDFL VASFPQVIDY NFTAKVEKEF DQIAEGKLQW QKQIGRFYNK FHPLVAEACE FDPDQKIGER MLGTDPVTGE SVVAKMGRYG AMVQKGRTDK ENGIKAQFAS LQPGQSIESI TLEEALELFL LPKKLGQYED ADVMVAVGRF GPYIKHAGKF VGLPKDTEPL SVSLDDAIKY IADKREKEEK SLIKGFAEDP EMEIRTGRFG VYIKYKGKNY KVPKTVEDPE KLTLEECLKY VEEGETKPAK GKKKAPAKKT SAKKTAKK
|
| |
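The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a complete, unspliced CDS ending in a single stop codon, and the helper names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 801083
END_BP = 803449
GENE_LENGTH_BP = 2367
PROTEIN_LENGTH_AA = 788


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence, ignoring spaces and line breaks."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both checks should hold for the values listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full gene sequence string to `gc_percent` gives a value to compare against the listed GC content; the function skips the spaces that separate the 10-base blocks.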