Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_04981 |
Symbol | topA |
ID | 4720011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 445118 |
End bp | 447730 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640080173 |
Product | DNA topoisomerase I |
Protein accession | YP_001010814 |
Protein GI | 123965733 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.225706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATCACA CACTTGTTAT TGTTGAAAGT CCCACAAAGG CAAAAACTAT AAGAAGGTTT TTGCCTCCTA ATTATGAAGT TTTAGCTTCT ATGGGGCATG TAAGAGATCT TCCAAAAGGA GCTGCTGAAA TCCCTGCATC AGTAAAAAAA GAAAAATGGT CAAGAATAGG TGTAAACACT ACAGAAGATT TTGAACCACT TTATATCGTT CCTAAAGAAA AAAAGAAAGT TGTTAAAGAT TTAAAAAATG CCTTAAAAGA TGCTACCCAA TTACTATTAG CAACTGATGA AGATAGGGAG GGAGAGAGTA TAAGTTGGCA CTTAATGCAA ATACTTAAAC CCAAGATTCC AACTAAACGG ATGGTTTTTC ATGAAATTAC TAAAAAGGCT ATTAATAAGG CTCTAGATGA AACTAGAGAA ATTGATATGG AATTAGTTCA GGCCCAAGAA ACAAGAAGAA TTCTTGACAG GCTTTTTGGA TATGAACTTT CCCCATTACT TTGGAAAAAG GTAGCCCCTA GGCTTTCTGC AGGTCGTGTT CAGTCTGTAT CTGTAAGATT GCTTGTTAAT AAAGAAAGAG AAAGGAGAGC TTTTAAAAAA GCTACTTATT GGGGATTAAA AGCCACTTTA CTTAAGGATA ATGTTTTTTT TGACAGTAAG TTATCTAGCC TATCTGGTCA GAAAATCTCT AACGGATCTG ATTTTGATGA GAAAACCGGT AAATTAAAAA ATGGTAATAA ATCTTTAATT TTGAACGAAG AGAGAGCGAA GAGTTTGCTC AAGTCTTTAT CTGAAGAAAA GTGGAGAGTT ATAAAAATAG AGAAAAAACC AACAACTAGG AAGCCTGTTC CACCATTTAC TACTAGTACC TTGCAGCAGG AGGCTAATCG AAAACTTCGA TTATCAGCAA GAGAAACAAT GAGATGTGCA CAAGGCTTGT ATGAGAGAGG TTTTATTACA TATATGAGAA CTGATTCGGT ACATCTCTCA GAACAAGCAA TAAGAGCGGC GAGAGAATGT GTGCAGTCAA GATACGGAAA AGAATATTTA TCAAGTTCAG TTCGTCAATT TAATTCAAAA GAAAGGAATG CTCAAGAAGC TCATGAAGCT ATTAGACCTG CCGGAGAAGT ATTTAAAACA CCCAGCGATA CAGACTTGGC GGGAAGGGAT CTTTCCCTAT ATGAATTAAT TTGGAAAAGA ACTGTTGCCA GTCAAATGGC TGAAGCAAGA TTAACTATGG TTAATGCTGA AATAGAAGTG GGAGAAGGAT TATTTAAGTC CAGTGGAAAA AGTATTGATT TTGCAGGATT TTTTAGAGCT TACGTCGAAG GTAGTGATGA TCCAAGTGCA TCATTGGAAC AGCAAGAAGT AATTCTCCCA AATTTAACTC TTGGGTCTAC TCTTGAGGTG GCCAGTAAAG AGGCTACTTA TCATGAGACT AAATCTCCAG CCAGATATAC TGAGGCTGCA TTAGTTAAAG TTCTAGAAAA AGAGGGCATA GGCAGACCTT CTACTTATGC AAGTATTATT GGAACAATAG TTGATAGGGG TTATGCAAAT ATTTCTTCAA ACTCTTTATC TCCTACCTTT ACTGCTTTTG CTGTTACGGC ATTACTAGAG GAGCACTTCC CTGATCTAGT AGATACTACT TTTACGGCAA AAATGGAATC TTCATTAGAT GAAATTTCTT CAGGTAATCT AGAATGGCTA CCATATTTAG AAACTTTTTA TAAAGGTAAA AACGGTCTTG AGGTGAAAGT TCAGAAAACA GAAGGTGATA TTGATGGGAA AGCATATAGA CAAGTTGATT TTGATGATCT CCCTTGTGTA GTTAGAATCG GATCAAACGG GCCATGGCTA GAGGGAGTAA AAATAGATGA ATCAGGAAAT GAAATACAAG CAAAAGGAAA TCTCCCTATG GATATTACTC CTGGAGATTT AGATAAAAAG AAAGTTGATC AAATACTAAG TGGCCCCTCA GATCTTGGGA CTGATCCAAA AACAGGTGAG CAAGTATTTT TGAGATTTGG TCCTTATGGA CCTTATGTTC AGCTAGGTAA TATTGAAGAG GGTAAAGCCA AGCCAAGAAG GGCTAGTTTA CCCAAAGACT TGAAAACTGA TGACTTGTCA TTATCGGAAG CTCTAGAACT ATTAAGCTTA CCAAAATTGC TTGGAGAGCA TCCCGAGGGT GGAATTGTTG AGGCTGACAG AGGAAGATTT GGTCCTTATA TTAAATGGAT CAAAGATGAA GATACTTCTG AAAATAGATC TTTAAAAAAA GAAGATGATG TATTTAAGGT TGATCTTAAG CGTGCATTAG AAATCCTCGC GATGCCAAAA TTAGGGAGAG GGGGACAAGA AGTAATAAAG GATTTTGGGA AACCTAAAGA GTTAAATGAT AAAGTTCAAG TTTTAAATGG AAAATATGGA ATATATGTGA AGTGTGGGAA AATAAATGTT TCTCTGCCTA AAGATACTGA TTTAGAAAAA TTTACAATTG AAAATGCCTT GATTCTTTTA GAAGAGAAAA TGAAAGATAA AAAGGTTTCT GTTTTCAAAA AAAATAAAGT AATTAGTAAA AAGACCAAAA AAAATAAAAA AATTAAAAAA TAA
|
Protein sequence | MDHTLVIVES PTKAKTIRRF LPPNYEVLAS MGHVRDLPKG AAEIPASVKK EKWSRIGVNT TEDFEPLYIV PKEKKKVVKD LKNALKDATQ LLLATDEDRE GESISWHLMQ ILKPKIPTKR MVFHEITKKA INKALDETRE IDMELVQAQE TRRILDRLFG YELSPLLWKK VAPRLSAGRV QSVSVRLLVN KERERRAFKK ATYWGLKATL LKDNVFFDSK LSSLSGQKIS NGSDFDEKTG KLKNGNKSLI LNEERAKSLL KSLSEEKWRV IKIEKKPTTR KPVPPFTTST LQQEANRKLR LSARETMRCA QGLYERGFIT YMRTDSVHLS EQAIRAAREC VQSRYGKEYL SSSVRQFNSK ERNAQEAHEA IRPAGEVFKT PSDTDLAGRD LSLYELIWKR TVASQMAEAR LTMVNAEIEV GEGLFKSSGK SIDFAGFFRA YVEGSDDPSA SLEQQEVILP NLTLGSTLEV ASKEATYHET KSPARYTEAA LVKVLEKEGI GRPSTYASII GTIVDRGYAN ISSNSLSPTF TAFAVTALLE EHFPDLVDTT FTAKMESSLD EISSGNLEWL PYLETFYKGK NGLEVKVQKT EGDIDGKAYR QVDFDDLPCV VRIGSNGPWL EGVKIDESGN EIQAKGNLPM DITPGDLDKK KVDQILSGPS DLGTDPKTGE QVFLRFGPYG PYVQLGNIEE GKAKPRRASL PKDLKTDDLS LSEALELLSL PKLLGEHPEG GIVEADRGRF GPYIKWIKDE DTSENRSLKK EDDVFKVDLK RALEILAMPK LGRGGQEVIK DFGKPKELND KVQVLNGKYG IYVKCGKINV SLPKDTDLEK FTIENALILL EEKMKDKKVS VFKKNKVISK KTKKNKKIKK
|
| |