Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04911 |
Symbol | topA |
ID | 4781077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 445813 |
End bp | 448719 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640083766 |
Product | DNA topoisomerase I |
Protein accession | YP_001014318 |
Protein GI | 124025202 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.624674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACTG ACCATACTCT GGTAATTGTT GAAAGTCCTA CAAAGGCAAA AACTATTAGA GGGTTTTTGC CTAAGGACTT TCAGGTTCTT GCGTCAATGG GTCACATAAG AGACTTGCCT AACAATGCAT CTGAGATCCC TGCGAAGCAC AAAGGCGAAA AGTGGGCAAC GATTGGAGTT AATACAACTG CTGATTTTGA TCCTTTGTAC GTCGTACCCA AAGACAAGAA AAAAATTGTC AAGGAATTAA AACAATCTTT GAAGGGTGCT AGTGAATTGT TGCTTGCGAC TGATGAAGAT AGAGAAGGAG AAAGTATAAG TTGGCATTTA ATGAATGTGC TTGACCCGAA AATCCCTGTG AAGAGGATGG TCTTTCATGA GATAACTAAA GAAGCTATTT CCAAAGCTCT ATCGAAAACA AGAGCAATTG ATATGGAATT AGTTCATGCC CAAGAGACAA GGAGGATCTT AGACAGATTA GTTGGGTACA CGCTTTCTCC TCTTTTATGG AAGAAAGTTT CATGGGGGTT ATCTGCAGGA AGAGTTCAAT CAGTTGCAGT AAGGTTGCTA GTTCTGAGAG AGAGAGCAAG GAGAGCTTTC AAAAGCGGGA GTTATTGGGA CTTAAAAGCA AAATTAGAGA AAGAAGGTAG TGAATTTGAG GTGAAAATGA CCTCAATTGG TGGAAAAAGA ATTGCTACAG GTAGTGATTT TGATGAGTCA ACGGGATTAT TGAAATCTGG CCGAAATGTC ATATTACTCA AGGAAGAGGA GTCTAAGGAA CTTGCAAAAA ATTTAACTAC TGATAAATGG AAAGTTGTTA ATGTCGAGGA AAAGCCGTCA ATCCGTAAAC CAGTTCCTCC TTTTACAACA AGCACATTAC AACAAGAGGC TAATAGAAAA CTTCGATTAT CAGCTAGGGA GACTATGAGA TGTGCTCAGG GTTTGTATGA AAGAGGTTTT ATTACATATA TGAGAACAGA TTCTGTTCAT CTGTCTGATC AGGCAATTAA TGCCTCACGA AATTGTGTTG AATCAAAATA TGGTGTTGAA TATTTAAGTA AAAAGCCCCG ACAATTCTCC AATAAGACGA GAAATGCTCA AGAAGCCCAT GAAGCAATAC GTCCTTCTGG TGAGAGCTTT AAAACACCCA AAGAGTCAAA CTTGCAAGGT AGGGATCTTT CTTTATACGA ACTTATTTGG AAACGGACAG TTGCTAGTCA AATGGCCGAT GCAAGGTTGA CAATGCTTGG AGTCGAATTA AAAGCATCGG ATGTATCTTT TCGGGCTAGT GGTAAACGAA TAGATTTCCC TGGATTCTTT AGAGCTTATG TTGAAGGTAC TGATGATCCT GATAGTGCAC TTGAAGGACA AGAAGTGCTT TTGCCTAAAT TAGCGGTAGG AGATTCTCCA ACAGCTAAGA ATGTAGAGGC ATTGGGGCAT CAGACTCAAC CTCCAGCTAG ATATAGCGAA GCTTCATTAG TTAAAACACT TGAGAAAGAA GGCATAGGTC GTCCGTCAAC TTATGCAAGC ATTATAGGAA CAATTGTAGA TCGAGGTTAT TCAGTCCTAA ATAACAATTC TTTAACTCCA AGCTTTACAG CATTTGCTGT GACGGCACTT CTTGAAGAAC ATTTTCCTGA TCTTGTAGAT ACCAGTTTTA CTGCTCGAAT GGAATCTACA CTTGATGAGA TCTCAACAGG AAAAGTGAGT TGGCTTCCAT ACCTTAAGGG CTTTTATAAG GGTGATACTG GCCTAGAGAA TCAGGTGCAA CAAAGGGAAG GGGATATTGA TGGAGGCGAG TTTAGAGCTG TTTCCTTGGA GGGACTTTCA TCTCTAGTTA GGTTGGGCAA ATTTGGAACA TATCTGGAAT CAAAGCAACT GGGTGAAAAT GGCAAGCCCA TAACAGCTAC TCTTCCACAG GAAATTACTC CCGCAGATTT GGATGAGGAT ATCGCAGAGA TGATTTTAAA ACAAAAAGCT GAGGGTCCTG AATCACTTGG GGTTGACCCT GATAGTGGAC AGAATCTATA TCTATTAAAT GGTAGATATG GTCATTTTGT TCAAAGGGGA TTAGTAGTCG AATTGAAAGA TCTTGGGATT CCAAAAGGTA AGAAGAAATT AGGAAATCTT CGCTTGTTCA AAAGCAGTCA ATATGGACTC TATTTGAAGC AGGATTCATC AAAGGTTCAG GTTTTGTTAC CAGAGAATAT AAAAGAGGAA GAGATAGATG TTGAAAAAGC ACTTGAATAT TTAGATGATA AATCATTAAA AAAAGCTCCA AATCCAAAAA GGACTTCCTT ACCAAAGAGT CTAAAACCAG AGGACTTGAC ATTTGAGAAG GCCCTTGGAT TAATCCAATT ACCACGTCTA CTTGGAGAGC ACCCAGAGGG AGGTAAAGTT CAATCAAGCT TGGGTAGATT TGGTCCGTAT GTGGTTTGGA GTAAAAATGG TGGTGAAAAA GATTATCGCT CAATTAAGGG GGAAGATGAC GTTCTTCAAG TAAGCCTAGA AAGAGCTCTT GAGCTTTTAT CAATACCAAA AAGAGGAAGA GGCGGAAGAA CTGCGTTGAA AGAACTTGGT ATCCCAGATG GAGAAAAAGA AACTATCCAA TTATTTGATG GTCCTTATGG TTTATATGTT AAACAGGGTA AAGTAAATGC TTCTCTACCA GAGGGAAAAA CCGCTGAAGA TATTACTATT GAGGTAGCTA TTGAATTATT GGCAGCTAAG AAATCAAGTA AAAAGACAAC ATCTAAGAAA AGAAAATCTA CACAAAAGAC AACCAAGTCA ACAAAGAAAG ATTTAAACTC ATCAGCATCA AAAAAAAGTA GTACTCAAAA AGCGCCCTCT ACAACTAAAA CAGGACGTCT AAGAGCCAGT AAAGTAAGGG TAATTAAAAC AAAATAA
|
Protein sequence | MPTDHTLVIV ESPTKAKTIR GFLPKDFQVL ASMGHIRDLP NNASEIPAKH KGEKWATIGV NTTADFDPLY VVPKDKKKIV KELKQSLKGA SELLLATDED REGESISWHL MNVLDPKIPV KRMVFHEITK EAISKALSKT RAIDMELVHA QETRRILDRL VGYTLSPLLW KKVSWGLSAG RVQSVAVRLL VLRERARRAF KSGSYWDLKA KLEKEGSEFE VKMTSIGGKR IATGSDFDES TGLLKSGRNV ILLKEEESKE LAKNLTTDKW KVVNVEEKPS IRKPVPPFTT STLQQEANRK LRLSARETMR CAQGLYERGF ITYMRTDSVH LSDQAINASR NCVESKYGVE YLSKKPRQFS NKTRNAQEAH EAIRPSGESF KTPKESNLQG RDLSLYELIW KRTVASQMAD ARLTMLGVEL KASDVSFRAS GKRIDFPGFF RAYVEGTDDP DSALEGQEVL LPKLAVGDSP TAKNVEALGH QTQPPARYSE ASLVKTLEKE GIGRPSTYAS IIGTIVDRGY SVLNNNSLTP SFTAFAVTAL LEEHFPDLVD TSFTARMEST LDEISTGKVS WLPYLKGFYK GDTGLENQVQ QREGDIDGGE FRAVSLEGLS SLVRLGKFGT YLESKQLGEN GKPITATLPQ EITPADLDED IAEMILKQKA EGPESLGVDP DSGQNLYLLN GRYGHFVQRG LVVELKDLGI PKGKKKLGNL RLFKSSQYGL YLKQDSSKVQ VLLPENIKEE EIDVEKALEY LDDKSLKKAP NPKRTSLPKS LKPEDLTFEK ALGLIQLPRL LGEHPEGGKV QSSLGRFGPY VVWSKNGGEK DYRSIKGEDD VLQVSLERAL ELLSIPKRGR GGRTALKELG IPDGEKETIQ LFDGPYGLYV KQGKVNASLP EGKTAEDITI EVAIELLAAK KSSKKTTSKK RKSTQKTTKS TKKDLNSSAS KKSSTQKAPS TTKTGRLRAS KVRVIKTK
|
| |